UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Gait-Conditioned Reinforcement Learning with Multi-Phase Curriculum for Humanoid Locomotion

Peng, T; Bao, L; Zhou, C; (2025) Gait-Conditioned Reinforcement Learning with Multi-Phase Curriculum for Humanoid Locomotion. In: 2025 IEEE-RAS 24th International Conference on Humanoid Robots (Humanoids). (pp. pp. 1087-1093). IEEE: Seoul, Korea, Republic of. Green open access

[thumbnail of 2025_Humanoid_Conference_Paper-13.pdf]
Preview
PDF
2025_Humanoid_Conference_Paper-13.pdf - Accepted Version

Download (3MB) | Preview

Abstract

We present a unified gait-conditioned reinforcement learning framework that enables humanoid robots to perform standing, walking, running, and smooth transitions within a single recurrent policy. A compact reward routing mechanism dynamically activates gait-specific objectives based on a one-hot gait ID, mitigating reward interference and supporting stable multi-gait learning. Human-inspired reward terms promote biomechanically natural motions, such as straight-knee stance and coordinated arm-leg swing, without requiring motion capture data. A structured curriculum progressively introduces gait complexity and expands command space over multiple phases. In simulation, the policy successfully achieves robust standing, walking, running, and gait transitions. On the real Unitree G1 humanoid, we validate standing, walking, and walk-to-stand transitions, demonstrating stable and coordinated locomotion. This work provides a scalable, reference-free solution toward versatile and naturalistic humanoid control across diverse modes and environments.

Type: Proceedings paper
Title: Gait-Conditioned Reinforcement Learning with Multi-Phase Curriculum for Humanoid Locomotion
Event: 2025 IEEE-RAS 24th International Conference on Humanoid Robots (Humanoids)
Dates: 30 Sep 2025 - 2 Oct 2025
Open access status: An open access version is available from UCL Discovery
DOI: 10.1109/Humanoids65713.2025.11203058
Publisher version: https://doi.org/10.1109/humanoids65713.2025.112030...
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords: Legged locomotion, Biomechanics, Humanoid robots, Reinforcement learning, Interference, Aerospace electronics, Routing, Motion capture, Complexity theory
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI: https://discovery.ucl.ac.uk/id/eprint/10219246
Downloads since deposit
0Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item