Marris, Luke;
Muller, Paul;
Lanctot, Marc;
Tuyls, Karl;
Graepel, Thore;
(2021)
Multi-Agent Training beyond Zero-Sum with Correlated Equilibrium Meta-Solvers.
In: Meila, M and Zhang, T, (eds.)
Proceedings of the 38th International Conference on Machine Learning.
(pp. pp. 7480-7491).
Proceedings of Machine Learning Research
Preview |
Text
marris21a.pdf - Published Version Download (2MB) | Preview |
Abstract
Two-player, constant-sum games are well studied in the literature, but there has been limited progress outside of this setting. We propose Joint Policy-Space Response Oracles (JPSRO), an algorithm for training agents in n-player, general-sum extensive form games, which provably converges to an equilibrium. We further suggest correlated equilibria (CE) as promising meta-solvers, and propose a novel solution concept Maximum Gini Correlated Equilibrium (MGCE), a principled and computationally efficient family of solutions for solving the correlated equilibrium selection problem. We conduct several experiments using CE meta-solvers for JPSRO and demonstrate convergence on n-player, general-sum games.
Type: | Proceedings paper |
---|---|
Title: | Multi-Agent Training beyond Zero-Sum with Correlated Equilibrium Meta-Solvers |
Event: | The 38th International Conference on Machine Learning |
Open access status: | An open access version is available from UCL Discovery |
Publisher version: | https://proceedings.mlr.press/v139/marris21a.html |
Language: | English |
Additional information: | © The Authors 2023. Original content in this paper is licensed under the terms of the Creative Commons Attribution 4.0 International (CC BY 4.0) Licence (https://creativecommons.org/licenses/by/4.0/). |
UCL classification: | UCL UCL > Provost and Vice Provost Offices > UCL BEAMS UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science |
URI: | https://discovery.ucl.ac.uk/id/eprint/10187244 |




Archive Staff Only
![]() |
View Item |