JaxMARL: Multi-Agent RL Environments and Algorithms in JAX

Advanced search
Browse by:

Department | Year

UCL Theses | Latest

Deposit your research

JaxMARL: Multi-Agent RL Environments and Algorithms in JAX

Rutherford, A; Ellis, B; Gallici, M; Cook, J; Lupu, A; Ingvarsson, G; Willi, T; ... Foerster, J; + view all (2024) JaxMARL: Multi-Agent RL Environments and Algorithms in JAX. In: Globersons, Amir and Mackey, Lester and Belgrave, Danielle and Fan, Angela and Paquet, Ulrich and Tomczak, Jakub M and Zhang, Cheng, (eds.) Advances in Neural Information Processing Systems. NeurIPS Green open access

Preview

Text
NeurIPS-2024-jaxmarl-multi-agent-rl-environments-and-algorithms-in-jax-Paper-Datasets_and_Benchmarks_Track.pdf - Published Version
Download (4MB) | Preview

Abstract

Benchmarks are crucial in the development of machine learning algorithms, with available environments significantly influencing reinforcement learning (RL) research. Traditionally, RL environments run on the CPU, which limits their scalability with typical academic compute. However, recent advancements in JAX have enabled the wider use of hardware acceleration, enabling massively parallel RL training pipelines and environments. While this has been successfully applied to single-agent RL, it has not yet been widely adopted for multi-agent scenarios. In this paper, we present JaxMARL, the first open-source, Python-based library that combines GPU-enabled efficiency with support for a large number of commonly used MARL environments and popular baseline algorithms. Our experiments show that, in terms of wall clock time, our JAX-based training pipeline is around 14 times faster than existing approaches, and up to 12500x when multiple training runs are vectorized. This enables efficient and thorough evaluations, potentially alleviating the evaluation crisis in the field. We also introduce and benchmark SMAX, a JAX-based approximate reimplementation of the popular StarCraft Multi-Agent Challenge, which removes the need to run the StarCraft II game engine. This not only enables GPU acceleration, but also provides a more flexible MARL environment, unlocking the potential for self-play, meta-learning, and other future applications in MARL. The code is available at https://github.com/flairox/jaxmarl.

Type:	Proceedings paper
Title:	JaxMARL: Multi-Agent RL Environments and Algorithms in JAX
Event:	NeurIPS 2024
Open access status:	An open access version is available from UCL Discovery
DOI:	10.52202/079017-1612
Publisher version:	https://papers.nips.cc/paper_files/paper/2024
Language:	English
Additional information:	This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords:	Reinforcement learning, multi-agent, multi-agent reinforcement learning, jax
UCL classification:	UCL UCL > Provost and Vice Provost Offices > UCL BEAMS UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI:	https://discovery.ucl.ac.uk/id/eprint/10216730

Downloads since deposit

1Download

Download activity - last month

Download activity - last 12 months

Downloads by country - last 12 months

Archive Staff Only

View Item