UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Efficient Planning in a Compact Latent Action Space

Jiang, Z; Zhang, T; Janner, M; Li, Y; Rocktäschel, T; Grefenstette, E; Tian, Y; (2023) Efficient Planning in a Compact Latent Action Space. In: 11th International Conference on Learning Representations ICLR 2023. ICLR Green open access

[thumbnail of 2208.10291v3.pdf]
Preview
Text
2208.10291v3.pdf - Accepted Version

Download (5MB) | Preview

Abstract

Planning-based reinforcement learning has shown strong performance in tasks in discrete and low-dimensional continuous action spaces. However, planning usually brings significant computational overhead for decision-making, and scaling such methods to high-dimensional action spaces remains challenging. To advance efficient planning for high-dimensional continuous control, we propose Trajectory Autoencoding Planner (TAP), which learns low-dimensional latent action codes with a state-conditional VQ-VAE. The decoder of the VQ-VAE thus serves as a novel dynamics model that takes latent actions and current state as input and reconstructs long-horizon trajectories. During inference time, given a starting state, TAP searches over discrete latent actions to find trajectories that have both high probability under the training distribution and high predicted cumulative reward. Empirical evaluation in the offline RL setting demonstrates low decision latency which is indifferent to the growing raw action dimensionality. For Adroit robotic hand manipulation tasks with high-dimensional continuous action space, TAP surpasses existing model-based methods by a large margin and also beats strong model-free actor-critic baselines.

Type: Proceedings paper
Title: Efficient Planning in a Compact Latent Action Space
Event: ICLR 2023
Open access status: An open access version is available from UCL Discovery
Publisher version: https://openreview.net/forum?id=NF-t9agoSr
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI: https://discovery.ucl.ac.uk/id/eprint/10216725
Downloads since deposit
1Download
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item