UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Docking Control of an Autonomous Underwater Vehicle Using Reinforcement Learning

Anderlini, E; Parker, GG; Thomas, G; (2019) Docking Control of an Autonomous Underwater Vehicle Using Reinforcement Learning. Applied Sciences , 9 (17) , Article 3456. 10.3390/app9173456. Green open access

[thumbnail of Anderlini_OA_applsci-09-03456-v2.pdf]
Preview
Text
Anderlini_OA_applsci-09-03456-v2.pdf - Published Version

Download (1MB) | Preview

Abstract

To achieve persistent systems in the future, autonomous underwater vehicles (AUVs) will need to autonomously dock onto a charging station. Here, reinforcement learning strategies were applied for the first time to control the docking of an AUV onto a fixed platform in a simulation environment. Two reinforcement learning schemes were investigated: one with continuous state and action spaces, deep deterministic policy gradient (DDPG), and one with continuous state but discrete action spaces, deep Q network (DQN). For DQN, the discrete actions were selected as step changes in the control input signals. The performance of the reinforcement learning strategies was compared with classical and optimal control techniques. The control actions selected by DDPG suffer from chattering effects due to a hyperbolic tangent layer in the actor. Conversely, DQN presents the best compromise between short docking time and low control effort, whilst meeting the docking requirements. Whereas the reinforcement learning algorithms present a very high computational cost at training time, they are five orders of magnitude faster than optimal control at deployment time, thus enabling an on-line implementation. Therefore, reinforcement learning achieves a performance similar to optimal control at a much lower computational cost at deployment, whilst also presenting a more general framework.

Type: Article
Title: Docking Control of an Autonomous Underwater Vehicle Using Reinforcement Learning
Open access status: An open access version is available from UCL Discovery
DOI: 10.3390/app9173456
Publisher version: https://doi.org/10.3390/app9173456
Language: English
Additional information: © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/ licenses/by/4.0/).
Keywords: autonomous underwater vehicle; reinforcement learning; optimal control
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Mechanical Engineering
URI: https://discovery.ucl.ac.uk/id/eprint/10080330
Downloads since deposit
110Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item