UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Learning monocular 3D reconstruction of articulated categories from motion

Kokkinos, F; Kokkinos, I; (2021) Learning monocular 3D reconstruction of articulated categories from motion. In: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). (pp. pp. 1737-1746). IEEE: Nashville, TN, USA. Green open access

[thumbnail of paper.pdf]
Preview
PDF
paper.pdf - Accepted Version

Download (9MB) | Preview

Abstract

Monocular 3D reconstruction of articulated object categories is challenging due to the lack of training data and the inherent ill-posedness of the problem. In this work we use video self-supervision, forcing the consistency of consecutive 3D reconstructions by a motion-based cycle loss. This largely improves both optimization-based and learning-based 3D mesh reconstruction. We further introduce an interpretable model of 3D template deformations that controls a 3D surface through the displacement of a small number of local, learnable handles. We formulate this operation as a structured layer relying on mesh-laplacian regularization and show that it can be trained in an end-to-end manner. We finally introduce a per-sample numerical optimisation approach that jointly optimises over mesh displacements and cameras within a video, boosting accuracy both for training and also as test time post-processing. While relying exclusively on a small set of videos collected per category for supervision, we obtain state-of-the-art reconstructions with diverse shapes, viewpoints and textures for multiple articulated object categories. Supplementary materials, code, and videos are provided on the project page: https://fkokkinos.github.io/video_3d_reconstruction/.

Type: Proceedings paper
Title: Learning monocular 3D reconstruction of articulated categories from motion
Event: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Location: ELECTR NETWORK
Dates: 19 Jun 2021 - 25 Jun 2021
ISBN-13: 9781665445092
Open access status: An open access version is available from UCL Discovery
DOI: 10.1109/CVPR46437.2021.00178
Publisher version: https://doi.org/10.1109/CVPR46437.2021.00178
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords: Science & Technology, Technology, Computer Science, Artificial Intelligence, Imaging Science & Photographic Technology, Computer Science
UCL classification: UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL
URI: https://discovery.ucl.ac.uk/id/eprint/10149390
Downloads since deposit
33Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item