UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

H-GAP: Humanoid Control with a Generalist Planner

Jiang, Z; Xu, Y; Wagener, N; Luo, Y; Janner, M; Grefenstette, E; Rocktäschel, T; (2024) H-GAP: Humanoid Control with a Generalist Planner. In: 12th International Conference on Learning Representations ICLR 2024. ICLR Green open access

[thumbnail of 7263_H_GAP_Humanoid_Control_wi.pdf]
Preview
Text
7263_H_GAP_Humanoid_Control_wi.pdf - Accepted Version

Download (1MB) | Preview

Abstract

Humanoid control is an important research challenge offering avenues for integration into human-centric infrastructures and enabling physics-driven humanoid animations. The daunting challenges in this field stem from the difficulty of optimizing in high-dimensional action spaces and the instability introduced by the bipedal morphology of humanoids. However, the extensive collection of human motion-captured data and the derived datasets of humanoid trajectories, such as MoCapAct, paves the way to tackle these challenges. In this context, we present Humanoid Generalist Autoencoding Planner (H-GAP), a state-action trajectory generative model trained on humanoid trajectories derived from human motion-captured data, capable of adeptly handling downstream control tasks with Model Predictive Control (MPC). For 56 degrees of freedom humanoid, we empirically demonstrate that H-GAP learns to represent and generate a wide range of motor behaviours. Further, without any learning from online interactions, it can also flexibly transfer these behaviors to solve novel downstream control tasks via planning. Notably, H-GAP excels established MPC baselines that have access to the ground truth dynamics model, and is superior or comparable to offline RL methods trained for individual tasks. Finally, we do a series of empirical studies on the scaling properties of H-GAP, showing the potential for performance gains via additional data but not computing. Code and videos are available at https://ycxuyingchen.github.io/hgap/.

Type: Proceedings paper
Title: H-GAP: Humanoid Control with a Generalist Planner
Event: ICLR 2024
Open access status: An open access version is available from UCL Discovery
Publisher version: https://openreview.net/forum?id=LKT9Jq5Xrz
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI: https://discovery.ucl.ac.uk/id/eprint/10216726
Downloads since deposit
1Download
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item