UCL Discovery

Adaptive and extendable control of unmanned surface vehicle formations using distributed deep reinforcement learning

Wang, S; Ma, F; Yan, X; Wu, P; Liu, Y; (2021) Adaptive and extendable control of unmanned surface vehicle formations using distributed deep reinforcement learning. Applied Ocean Research, 110, Article 102590. 10.1016/j.apor.2021.102590. Green open access

APOR_RL_USV.pdf - Accepted Version


Abstract

Future ocean exploration will be dominated by the large-scale deployment of marine robots such as unmanned surface vehicles (USVs). Without the involvement of human operators, USVs can explore the oceans, especially complex marine environments, in an unprecedented way and with increased mission efficiency. However, the current autonomy level of USVs is still limited, and the majority of vessels are remotely controlled. To address this issue, artificial intelligence (AI) techniques such as reinforcement learning can equip USVs with high-level intelligence and consequently achieve fully autonomous operation. In addition, with the adoption of multi-agent intelligence, the future trend in USV operations is to deploy them as a formation fleet. Current research in USV formation control is largely based on classical control methods such as PID, backstepping and model predictive control, and the impact of advanced AI technologies remains unclear. This paper therefore paves the way in this area by proposing a distributed deep reinforcement learning algorithm for USV formations. More importantly, with the proposed algorithm USV formations can learn two critical abilities, adaptability and extendibility, which enable formations to arbitrarily increase the number of USVs or change formation shape. The effectiveness of the algorithm has been verified and validated through a number of computer-based simulations.

Type: Article
Title: Adaptive and extendable control of unmanned surface vehicle formations using distributed deep reinforcement learning
Open access status: An open access version is available from UCL Discovery
DOI: 10.1016/j.apor.2021.102590
Publisher version: https://doi.org/10.1016/j.apor.2021.102590
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords: Unmanned surface vehicles (USVs), USV formation control, Deep reinforcement learning, Deep deterministic policy gradient (DDPG), Extendable reinforcement learning
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Mechanical Engineering
URI: https://discovery.ucl.ac.uk/id/eprint/10123464