UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

LIGS: Learnable Intrinsic-Reward Generation Selection for Multi-Agent Learning

Mguni, D; Jafferjee, T; Wang, J; Slumbers, O; Perez-Nieves, N; Tong, F; Yang, L; ... Wang, J; + view all (2022) LIGS: Learnable Intrinsic-Reward Generation Selection for Multi-Agent Learning. In: ICLR 2022 - 10th International Conference on Learning Representations. ICLR Green open access

[thumbnail of 1038_ligs_learnable_intrinsic_rewar.pdf]
Preview
Text
1038_ligs_learnable_intrinsic_rewar.pdf - Published Version

Download (1MB) | Preview

Abstract

Efficient exploration is important for reinforcement learners to achieve high rewards. In multi-agent systems, coordinated exploration and behaviour is critical for agents to jointly achieve optimal outcomes. In this paper, we introduce a new general framework for improving coordination and performance of multi-agent reinforcement learners (MARL). Our framework, named Learnable Intrinsic-Reward Generation Selection algorithm (LIGS) introduces an adaptive learner, Generator that observes the agents and learns to construct intrinsic rewards online that coordinate the agents' joint exploration and joint behaviour. Using a novel combination of MARL and switching controls, LIGS determines the best states to learn to add intrinsic rewards which leads to a highly efficient learning process. LIGS can subdivide complex tasks making them easier to solve and enables systems of MARL agents to quickly solve environments with sparse rewards. LIGS can seamlessly adopt existing MARL algorithms and, our theory shows that it ensures convergence to policies that deliver higher system performance. We demonstrate its superior performance in challenging tasks in Foraging and StarCraft II.

Type: Proceedings paper
Title: LIGS: Learnable Intrinsic-Reward Generation Selection for Multi-Agent Learning
Event: ICLR 2022 - 10th International Conference on Learning Representations
Open access status: An open access version is available from UCL Discovery
Publisher version: https://openreview.net/forum?id=CpTuR2ECuW
Language: English
Additional information: This version is the version of record. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords: reinforcement learning, multi-agent, exploration
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI: https://discovery.ucl.ac.uk/id/eprint/10167463
Downloads since deposit
15Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item