UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Reinforcement learning control of constrained dynamic systems with uniformly ultimate boundedness stability guarantee

Han, M; Tian, Y; Zhang, L; Wang, J; Pan, W; (2021) Reinforcement learning control of constrained dynamic systems with uniformly ultimate boundedness stability guarantee. Automatica , 129 , Article 109689. 10.1016/j.automatica.2021.109689. Green open access

[thumbnail of 1-s2.0-S0005109821002090-main.pdf]
Preview
Text
1-s2.0-S0005109821002090-main.pdf - Published Version

Download (2MB) | Preview

Abstract

Reinforcement learning (RL) is promising for complicated stochastic nonlinear control problems. Without using a mathematical model, an optimal controller can be learned from data evaluated by certain performance criteria through trial-and-error. However, the data-based learning approach is notorious for not guaranteeing stability, which is the most fundamental property for any control system. In this paper, the classic Lyapunov’s method is explored to analyze the uniformly ultimate boundedness stability (UUB) solely based on data without using a mathematical model. It is further shown how RL with UUB guarantee can be applied to control dynamic systems with safety constraints. Based on the theoretical results, both off-policy and on-policy learning algorithms are proposed respectively. As a result, optimal controllers can be learned to guarantee UUB of the closed-loop system both at convergence and during learning. The proposed algorithms are evaluated on a series of robotic continuous control tasks with safety constraints. In comparison with the existing RL algorithms, the proposed method can achieve superior performance in terms of maintaining safety. As a qualitative evaluation of stability, our method shows impressive resilience even in the presence of external disturbances.

Type: Article
Title: Reinforcement learning control of constrained dynamic systems with uniformly ultimate boundedness stability guarantee
Open access status: An open access version is available from UCL Discovery
DOI: 10.1016/j.automatica.2021.109689
Publisher version: https://doi.org/10.1016/j.automatica.2021.109689
Language: English
Additional information: © 2021 The Author(s). Published by Elsevier Ltd. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
Keywords: Data-based control, Reinforcement learning, Constrained dynamic system, Uniformly ultimate boundedness stability, Lyapunov's method
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI: https://discovery.ucl.ac.uk/id/eprint/10131098
Downloads since deposit
Loading...
100Downloads
Download activity - last month
Loading...
Download activity - last 12 months
Loading...
Downloads by country - last 12 months
Loading...

Archive Staff Only

View Item View Item