UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Meta reinforcement learning with latent variable Gaussian processes

Sæmundsson, S; Hofmann, K; Deisenroth, MP; (2018) Meta reinforcement learning with latent variable Gaussian processes. In: Elidan, G and Kersting, K, (eds.) Proceedings of 34th Conference on Uncertainty in Artificial Intelligence (uai 2018). (pp. pp. 642-652). Association for Uncertainty in Artificial Intelligence (AUAI): Monterey, CA, USA. Green open access

[thumbnail of Deisenroth_permitted VoR_235.pdf]
Preview
Text
Deisenroth_permitted VoR_235.pdf - Published Version

Download (801kB) | Preview

Abstract

Learning from small data sets is critical in many practical applications where data collection is time consuming or expensive, e.g., robotics, animal experiments or drug design. Meta learning is one way to increase the data efficiency of learning algorithms by generalizing learned concepts from a set of training tasks to unseen, but related, tasks. Often, this relationship between tasks is hard coded or relies in some other way on human expertise. In this paper, we frame meta learning as a hierarchical latent variable model and infer the relationship between tasks automatically from data. We apply our framework in a modelbased reinforcement learning setting and show that our meta-learning model effectively generalizes to novel tasks by identifying how new tasks relate to prior ones from minimal data. This results in up to a 60% reduction in the average interaction time needed to solve tasks compared to strong baselines.

Type: Proceedings paper
Title: Meta reinforcement learning with latent variable Gaussian processes
Event: 34th Conference on Uncertainty in Artificial Intelligence (uai 2018), 6-10 August 2018, Monterey, CA, USA
Open access status: An open access version is available from UCL Discovery
Publisher version: http://auai.org/uai2018/proceedings/papers/235.pdf
Language: English
Additional information: This version is the version of record. For information on re-use, please refer to the publisher’s terms and conditions.
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI: https://discovery.ucl.ac.uk/id/eprint/10083560
Downloads since deposit
Loading...
20Downloads
Download activity - last month
Loading...
Download activity - last 12 months
Loading...
Downloads by country - last 12 months
Loading...

Archive Staff Only

View Item View Item