Mean Field Multi-Agent Reinforcement Learning

Yang, Y; Luo, R; Li, M; Zhou, M; Zhang, W; Wang, J; (2018) Mean Field Multi-Agent Reinforcement Learning. In: Dy, J and Krause, A, (eds.) Proceedings of the 35th International Conference on Machine Learning. (pp. 5571-5580). PMLR. Green open access.

Abstract

Existing multi-agent reinforcement learning methods are typically limited to a small number of agents. When the number of agents grows large, learning becomes intractable due to the curse of dimensionality and the exponential growth of agent interactions. In this paper, we present Mean Field Reinforcement Learning, where the interactions within the population of agents are approximated by those between a single agent and the average effect of the overall population or of neighboring agents. The interplay between the two entities is mutually reinforcing: the learning of the individual agent’s optimal policy depends on the dynamics of the population, while the dynamics of the population change according to the collective patterns of the individual policies. We develop practical mean field Q-learning and mean field Actor-Critic algorithms and analyze the convergence of the solution to a Nash equilibrium. Experiments on Gaussian squeeze, the Ising model, and battle games demonstrate the learning effectiveness of our mean field approaches. In addition, we report the first result on solving the Ising model via model-free reinforcement learning methods.
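
The central idea of the abstract, replacing an agent's many pairwise interactions with a single interaction against the average effect of its neighbors, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the one-hot action encoding, the linear Q function `W`, and the Boltzmann temperature are all assumptions made for illustration only.

```python
import numpy as np


def one_hot(action, n_actions):
    """Encode a discrete action index as a one-hot vector."""
    v = np.zeros(n_actions)
    v[action] = 1.0
    return v


def mean_action(neighbor_actions, n_actions):
    """Average the neighbors' one-hot actions into an empirical distribution."""
    if len(neighbor_actions) == 0:
        # Assumption: fall back to a uniform distribution with no neighbors.
        return np.full(n_actions, 1.0 / n_actions)
    return np.mean([one_hot(a, n_actions) for a in neighbor_actions], axis=0)


def boltzmann_policy(q_values, temperature=1.0):
    """Turn Q-values into action probabilities with a numerically stable softmax."""
    z = (q_values - np.max(q_values)) / temperature
    p = np.exp(z)
    return p / p.sum()


# Hypothetical stateless, linear Q function: Q(a, a_bar) = W[a] @ a_bar.
# The agent conditions on its own action and the mean action only,
# not on the joint action of every neighbor.
rng = np.random.default_rng(0)
n_actions = 3
W = rng.normal(size=(n_actions, n_actions))

neighbors = [0, 2, 2, 1]                  # actions taken by four neighbors
a_bar = mean_action(neighbors, n_actions)  # [0.25, 0.25, 0.5]
q = W @ a_bar                              # one Q-value per own action
policy = boltzmann_policy(q)
```

The size of the input to the Q function here depends on the number of actions rather than on the number of neighbors, which is the kind of saving the abstract attributes to the mean field approximation.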

Type:	Proceedings paper
Title:	Mean Field Multi-Agent Reinforcement Learning
Event:	The 35th International Conference on Machine Learning
Location:	Stockholm, Sweden
Dates:	10th-15th July 2018
Open access status:	An open access version is available from UCL Discovery
Publisher version:	http://proceedings.mlr.press/v80/yang18d.html
Language:	English
Additional information:	This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
UCL classification:	UCL; UCL > Provost and Vice Provost Offices > UCL BEAMS; UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science; UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI:	https://discovery.ucl.ac.uk/id/eprint/10066100
