UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Reinforcement Learning-based Grasping via One-Shot Affordance Localization and Zero-Shot Contrastive Language–Image Learning

Long, Xiang; Beddow, Luke; Hadjivelichkov, Denis; Delfaki, Andromachi Maria; Wurdemann, Helge; Kanoulas, Dimitrios; (2024) Reinforcement Learning-based Grasping via One-Shot Affordance Localization and Zero-Shot Contrastive Language–Image Learning. In: Proceedings of the 2024 IEEE/SICE International Symposium on System Integration (SII). IEEE: Ha Long, Vietnam. Green open access

[thumbnail of sii_2024_xiang.pdf]
Preview
Text
sii_2024_xiang.pdf - Other

Download (16MB) | Preview

Abstract

We present a novel robotic grasping system using a caging-style gripper, that combines one-shot affordance localization and zero-shot object identification. We demonstrate an integrated system requiring minimal prior knowledge, focusing on flexible few-shot object agnostic approaches. For grasping a novel target object, we use as input the color and depth of the scene, an image of an object affordance similar to the target object, and an up to three-word text prompt describing the target object. We demonstrate the system using real-world grasping of objects from the YCB benchmark set, with four distractor objects cluttering the scene. Overall, our pipeline has a success rate of the affordance localization of 96%, object identification of 62.5%, and grasping of 72%. Videos are on the project website: https://sites.google.com/view/ rl-affcorrs-grasp.

Type: Proceedings paper
Title: Reinforcement Learning-based Grasping via One-Shot Affordance Localization and Zero-Shot Contrastive Language–Image Learning
Event: The 2024 16th IEEE/SICE International Symposium on System Integration
Location: Ha Long, Vietnam
Dates: 8 Jan 2024 - 11 Jan 2024
Open access status: An open access version is available from UCL Discovery
DOI: 10.1109/SII58957.2024.10417178
Publisher version: https://doi.org/10.1109/SII58957.2024.10417178
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords: Location awareness, Affordances, Pipelines, Grasping, System integration, Robots, Videos
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI: https://discovery.ucl.ac.uk/id/eprint/10178561
Downloads since deposit
41Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item