UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Slim DensePose: Thrifty Learning from Sparse Annotations and Motion Cues

Neverova, N; Thewlis, J; Güler, RA; Kokkinos, I; Vedaldi, A; (2019) Slim DensePose: Thrifty Learning from Sparse Annotations and Motion Cues. In: (Proceedings) CVPR 2019 - IEEE Conference on Computer Vision and Pattern Recognition. Computer Vision Foundation / IEEE: Long Beach, CA, USA. (In press). Green open access

[thumbnail of Kokkinos SlimDP.pdf]
Preview
Text
Kokkinos SlimDP.pdf - Accepted Version

Download (6MB) | Preview

Abstract

DensePose supersedes traditional landmark detectors by densely mapping image pixels to body surface coordinates. This power, however, comes at a greatly increased annotation time, as supervising the model requires to manually label hundreds of points per pose instance. In this work, we thus seek methods to significantly slim down the DensePose annotations, proposing more efficient data collection strategies. In particular, we demonstrate that if annotations are collected in video frames, their efficacy can be multiplied for free by using motion cues. To explore this idea, we introduce DensePose-Track, a dataset of videos where selected frames are annotated in the traditional DensePose manner. Then, building on geometric properties of the DensePose mapping, we use the video dynamic to propagate ground-truth annotations in time as well as to learn from Siamese equivariance constraints. Having performed exhaustive empirical evaluation of various data annotation and learning strategies, we demonstrate that doing so can deliver significantly improved pose estimation results over strong baselines. However, despite what is suggested by some recent works, we show that merely synthesizing motion patterns by applying geometric transformations to isolated frames is significantly less effective, and that motion cues help much more when they are extracted from videos.

Type: Proceedings paper
Title: Slim DensePose: Thrifty Learning from Sparse Annotations and Motion Cues
Event: CVPR 2019 - IEEE Conference on Computer Vision and Pattern Recognition
Open access status: An open access version is available from UCL Discovery
Publisher version: http://openaccess.thecvf.com/CVPR2019.py
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI: https://discovery.ucl.ac.uk/id/eprint/10088672
Downloads since deposit
0Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item