UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Accessibility-Based Clustering for Efficient Learning of Locomotion Skills

Zhang, C; Yu, W; Li, Z; (2022) Accessibility-Based Clustering for Efficient Learning of Locomotion Skills. In: 2022 International Conference on Robotics and Automation (ICRA). (pp. pp. 1600-1606). IEEE: Philadelphia, PA, USA. Green open access

[thumbnail of Accessibility_Based_Clustering_for_Efficient_Learning_of_Locomotion_Skills__2nd_sub_.pdf]
Preview
PDF
Accessibility_Based_Clustering_for_Efficient_Learning_of_Locomotion_Skills__2nd_sub_.pdf - Accepted Version

Download (9MB) | Preview

Abstract

For model-free deep reinforcement learning of quadruped locomotion, the initialization of robot configurations is crucial for data efficiency and robustness. This work focuses on algorithmic improvements of data efficiency and robustness simultaneously through automatic discovery of initial states, which is achieved by our proposed K-Access algorithm based on accessibility metrics. Specifically, we formulated accessibility metrics to measure the difficulty of transitions between two arbitrary states, and proposed a novel K-Access algorithm for state-space clustering that automatically discovers the centroids of the static-pose clusters based on the accessibility metrics. By using the discovered centroidal static poses as the initial states, we can improve data efficiency by reducing redundant explorations, and enhance the robustness by more effective explorations from the centroids to sampled poses. Focusing on fall recovery as a very hard set of locomotion skills, we validated our method extensively using an 8-DoF quadrupedal robot Bittle. Compared to the baselines, the learning curve of our method converges much faster, requiring only 60% of training episodes. With our method, the robot can successfully recover to standing poses within 3 seconds in 99.4% of the test cases. Moreover, the method can generalize to other difficult skills successfully, such as backflipping.

Type: Proceedings paper
Title: Accessibility-Based Clustering for Efficient Learning of Locomotion Skills
Event: 2022 International Conference on Robotics and Automation (ICRA)
Dates: 23 May 2022 - 27 May 2022
ISBN-13: 9781728196817
Open access status: An open access version is available from UCL Discovery
DOI: 10.1109/ICRA46639.2022.9812113
Publisher version: https://doi.org/10.1109/ICRA46639.2022.9812113
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords: Measurement, Training, Automation, Clustering algorithms, Focusing, Reinforcement learning, Robustness
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI: https://discovery.ucl.ac.uk/id/eprint/10159086
Downloads since deposit
30Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item