UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

MAESTRO: Open-Ended Environment Design for Multi-Agent Reinforcement Learning

Samvelyan, M; Khan, A; Dennis, M; Jiang, M; Parker-Holder, J; Foerster, J; Raileanu, R; (2023) MAESTRO: Open-Ended Environment Design for Multi-Agent Reinforcement Learning. In: Proceedings of the Eleventh International Conference on Learning Representations. : Kigali, Rwanda. Green open access

[thumbnail of 2303.03376v1.pdf]
Preview
Text
2303.03376v1.pdf - Accepted Version

Download (3MB) | Preview

Abstract

Open-ended learning methods that automatically generate a curriculum of increasingly challenging tasks serve as a promising avenue toward generally capable reinforcement learning agents. Existing methods adapt curricula independently over either environment parameters (in single-agent settings) or co-player policies (in multi-agent settings). However, the strengths and weaknesses of co-players can manifest themselves differently depending on environmental features. It is thus crucial to consider the dependency between the environment and co-player when shaping a curriculum in multi-agent domains. In this work, we use this insight and extend Unsupervised Environment Design (UED) to multi-agent environments. We then introduce Multi-Agent Environment Design Strategist for Open-Ended Learning (MAESTRO), the first multi-agent UED approach for two-player zero-sum settings. MAESTRO efficiently produces adversarial, joint curricula over both environments and co-players and attains minimax-regret guarantees at Nash equilibrium. Our experiments show that MAESTRO outperforms a number of strong baselines on competitive two-player games, spanning discrete and continuous control settings.

Type: Proceedings paper
Title: MAESTRO: Open-Ended Environment Design for Multi-Agent Reinforcement Learning
Event: The Eleventh International Conference on Learning Representations, ICLR 2023
Open access status: An open access version is available from UCL Discovery
Publisher version: https://iclr.cc/Conferences/2023
Language: English
Additional information: This version is the version of record. For information on re-use, please refer to the publisher’s terms and conditions.
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI: https://discovery.ucl.ac.uk/id/eprint/10216733
Downloads since deposit
0Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item