Human-Aware Reinforcement Learning for Fault Recovery Using Contextual Gaussian Processes

Advanced search
Browse by:

Department | Year

UCL Theses | Latest

Deposit your research

Human-Aware Reinforcement Learning for Fault Recovery Using Contextual Gaussian Processes

McGuire, Steve; Furlong, P Michael; Heckman, Christoffer; Julier, Simon; Ahmed, Nisar; (2021) Human-Aware Reinforcement Learning for Fault Recovery Using Contextual Gaussian Processes. Journal of Aerospace Information Systems , 18 (7) 10.2514/1.i010921. Green open access

[thumbnail of AIAA2020_GaussianProcessesHumanLearning_11Apr.pdf]

Preview

Text
AIAA2020_GaussianProcessesHumanLearning_11Apr.pdf - Accepted Version
Download (756kB) | Preview

Abstract

This work addresses the iterated nonstationary assistant selection problem, in which over the course of repeated interactions on a mission, an autonomous robot experiencing a fault must select a single human from among a group of assistants to restore it to operation. The assistants in our problem have a level of performance that changes as a function of their experience solving the problem. Our approach uses reinforcement learning via a multi-arm bandit formulation to learn about the capabilities of each potential human assistant and decide which human to task. This study, which is built on our past work, evaluates the potential for a Gaussian-process-based machine learning method to effectively model the complex dynamics associated with human learning and forgetting. Application of our method in simulation shows that our method is capable of tracking performance of human-like dynamics for learning and forgetting. Using a novel selection policy called the proficiency window, it is shown that our technique can outperform baseline selection strategies while providing guarantees on human use. Our work offers an effective potential alternative to dedicated human supervisors, with application to any human–robot system where a set of humans is responsible for overseeing autonomous robot operations.

Type:	Article
Title:	Human-Aware Reinforcement Learning for Fault Recovery Using Contextual Gaussian Processes
Open access status:	An open access version is available from UCL Discovery
DOI:	10.2514/1.i010921
Publisher version:	https://doi.org/10.2514/1.I010921
Language:	English
Additional information:	This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords:	Gaussian Mixture Models, Gaussian Process, Robots, Autonomous Robot, Space Exploration, Analysis of Variance, Defense Advanced Research Projects Agencies, Autonomous Systems, Planetary Surfaces, Federal Aviation Administration
UCL classification:	UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science UCL > Provost and Vice Provost Offices > UCL BEAMS UCL
URI:	https://discovery.ucl.ac.uk/id/eprint/10155800

Downloads since deposit

0Downloads

Download activity - last month

Download activity - last 12 months

Downloads by country - last 12 months

Archive Staff Only

View Item