The Temporal Opportunist: Self-Supervised Multi-Frame Monocular Depth

Advanced search
Browse by:

Department | Year

UCL Theses | Latest

Deposit your research

The Temporal Opportunist: Self-Supervised Multi-Frame Monocular Depth

Watson, Jamie; Mac Aodha, Oisin; Prisacariu, Victor; Brostow, Gabriel; Firman, Michael; (2021) The Temporal Opportunist: Self-Supervised Multi-Frame Monocular Depth. In: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). (pp. pp. 1164-1174). IEEE Green open access

[thumbnail of Brostow_The_Temporal_Opportunist_WATSON_DOA28022021_AFV (2).pdf]

Preview

Text
Brostow_The_Temporal_Opportunist_WATSON_DOA28022021_AFV (2).pdf
Download (4MB) | Preview

Abstract

Self-supervised monocular depth estimation networks are trained to predict scene depth using nearby frames as a supervision signal during training. However, for many applications, sequence information in the form of video frames is also available at test time. The vast majority of monocular networks do not make use of this extra signal, thus ignoring valuable information that could be used to improve the predicted depth. Those that do, either use computationally expensive test-time refinement techniques or off-the-shelf recurrent networks, which only indirectly make use of the geometric information that is inherently available. We propose ManyDepth, an adaptive approach to dense depth estimation that can make use of sequence information at test time, when it is available. Taking inspiration from multi-view stereo, we propose a deep end-to-end cost volume based approach that is trained using self-supervision only. We present a novel consistency loss that encourages the network to ignore the cost volume when it is deemed unreliable, e.g. in the case of moving objects, and an augmentation scheme to cope with static cameras. Our detailed experiments on both KITTI and Cityscapes show that we outperform all published self-supervised baselines, including those that use single or multiple frames at test time.

Type:	Proceedings paper
Title:	The Temporal Opportunist: Self-Supervised Multi-Frame Monocular Depth
Event:	IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2021)
Location:	ELECTR NETWORK
Dates:	19 Jun 2021 - 25 Jun 2021
Open access status:	An open access version is available from UCL Discovery
DOI:	10.1109/CVPR46437.2021.00122
Publisher version:	https://doi.org/10.1109/CVPR46437.2021.00122
Language:	English
Additional information:	This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords:	Science & Technology, Technology, Computer Science, Artificial Intelligence, Imaging Science & Photographic Technology, Computer Science
UCL classification:	UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science UCL > Provost and Vice Provost Offices > UCL BEAMS UCL
URI:	https://discovery.ucl.ac.uk/id/eprint/10150735

Downloads since deposit

0Downloads

Download activity - last month

Download activity - last 12 months

Downloads by country - last 12 months

Archive Staff Only

View Item