UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Evolving Curricula with Regret-Based Environment Design

Parker-Holder, J; Jiang, M; Dennis, M; Samvelyan, M; Foerster, J; Grefenstette, E; Rocktäschel, T; (2022) Evolving Curricula with Regret-Based Environment Design. In: Proceedings of Machine Learning Research. (pp. pp. 17473-17498). Proceedings of Machine Learning Research (PMLR) Green open access

[thumbnail of parker-holder22a.pdf]
Preview
PDF
parker-holder22a.pdf - Published Version

Download (6MB) | Preview

Abstract

Training generally-capable agents with reinforcement learning (RL) remains a significant challenge. A promising avenue for improving the robustness of RL agents is through the use of curricula. One such class of methods frames environment design as a game between a student and a teacher, using regret-based objectives to produce environment instantiations (or levels) at the frontier of the student agent's capabilities. These methods benefit from theoretical robustness guarantees at equilibrium, yet they often struggle to find effective levels in challenging design spaces in practice. By contrast, evolutionary approaches incrementally alter environment complexity, resulting in potentially open-ended learning, but often rely on domain-specific heuristics and vast amounts of computational resources. This work proposes harnessing the power of evolution in a principled, regret-based curriculum. Our approach, which we call Adversarially Compounding Complexity by Editing Levels (ACCEL), seeks to constantly produce levels at the frontier of an agent's capabilities, resulting in curricula that start simple but become increasingly complex. ACCEL maintains the theoretical benefits of prior regret-based methods, while providing significant empirical gains in a diverse set of environments. An interactive version of this paper is available at https://accelagent.github.io.

Type: Proceedings paper
Title: Evolving Curricula with Regret-Based Environment Design
Event: 38th International Conference on Machine Learning (ICML)
Open access status: An open access version is available from UCL Discovery
Publisher version: https://proceedings.mlr.press/v162/
Language: English
Additional information: This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third-party material in this article are included in the Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI: https://discovery.ucl.ac.uk/id/eprint/10173887
Downloads since deposit
0Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item