UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

RoboTAP: Tracking Arbitrary Points for Few-Shot Visual Imitation

Vecerik, Mel; Doersch, Carl; Yang, Yi; Davchev, Todor; Aytar, Yusuf; Zhou, Guangyao; Hadsell, Raia; ... Scholz, Jon; + view all (2024) RoboTAP: Tracking Arbitrary Points for Few-Shot Visual Imitation. In: Proceedings - IEEE International Conference on Robotics and Automation. (pp. pp. 5397-5403). IEEE: Yokohama, Japan. Green open access

[thumbnail of robotapicra24.pdf]
Preview
PDF
robotapicra24.pdf - Published Version

Download (19MB) | Preview

Abstract

For robots to be useful outside labs and specialized factories we need a way to teach them new useful behaviors quickly. Current approaches lack either the generality to onboard new tasks without task-specific engineering, or else lack the data-efficiency to do so in an amount of time that enables practical use. In this work we explore dense tracking as a representational vehicle to allow faster and more general learning from demonstration. Our approach utilizes Track-Any-Point (TAP) models to isolate the relevant motion in a demonstration, and parameterize a low-level controller to reproduce this motion across changes in the scene configuration. We show this results in robust robot policies that can solve complex object-arrangement tasks such as shape-matching, stacking, and even full path-following tasks such as applying glue and sticking objects together, all from demonstrations that can be collected in minutes.

Type: Proceedings paper
Title: RoboTAP: Tracking Arbitrary Points for Few-Shot Visual Imitation
Event: 2024 IEEE International Conference on Robotics and Automation (ICRA)
Dates: 13 May 2024 - 17 May 2024
Open access status: An open access version is available from UCL Discovery
DOI: 10.1109/ICRA57147.2024.10611409
Publisher version: https://doi.org/10.1109/ICRA57147.2024.10611409
Language: English
Additional information: This version is the author-accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords: Training, Visualization, Tracking, Stacking, Production facilities, Planning, Task analysis
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI: https://discovery.ucl.ac.uk/id/eprint/10196975
Downloads since deposit
10Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item