UCL Discovery

Perceptual Real-Time 2D-to-3D Conversion Using Cue Fusion

Leimkuehler, T; Kellnhofer, P; Ritschel, T; Myszkowski, K; Seidel, H-P (2018) Perceptual Real-Time 2D-to-3D Conversion Using Cue Fusion. IEEE Transactions on Visualization and Computer Graphics, 24 (6) pp. 2037-2050. 10.1109/TVCG.2017.2703612. Green open access.

Ritschel_StereoCueFusionPaper_TVCG.pdf - Accepted Version

Abstract

We propose a system to infer binocular disparity from a monocular video stream in real time. Unlike the classic reconstruction of physical depth in computer vision, we compute perceptually plausible disparity, which is numerically inaccurate but yields a very similar overall depth impression, with plausible overall layout, sharp edges, fine details, and agreement between luminance and disparity. We use several simple monocular cues to estimate disparity maps and confidence maps of low spatial and temporal resolution in real time. These are complemented by spatially varying, appearance-dependent, and class-specific disparity prior maps, learned from example stereo images. Scene classification selects the prior at runtime. The prior and cues are fused by robust MAP inference on a dense spatio-temporal conditional random field with high spatial and temporal resolution. Using normal distributions allows this inference to be done with constant-time, parallel per-pixel work. We compare our approach to previous 2D-to-3D conversion systems in terms of different metrics as well as a user study, and we validate our notion of perceptually plausible disparity.
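
The remark about normal distributions can be illustrated with a small sketch. The product of independent Gaussian estimates is again Gaussian, and its mode (the MAP value) is the precision-weighted mean, which costs a fixed amount of work per pixel. The code below is a simplified illustration and not the authors' method: it omits the pairwise terms of the conditional random field and the robust inference, and the function names and the reading of confidence as inverse variance are assumptions made here.

```python
def fuse_gaussians(estimates):
    """Fuse independent Gaussian estimates of one pixel's disparity.

    estimates: list of (mean, variance) pairs, e.g. one prior and several cues.
    Returns (fused_mean, fused_variance) of the product of the Gaussians.
    Assumption: a cue's confidence is modelled as its precision (1 / variance).
    """
    if not estimates:
        raise ValueError("need at least one estimate")
    precision_sum = 0.0
    weighted_sum = 0.0
    for mean, variance in estimates:
        if variance <= 0.0:
            raise ValueError("variance must be positive")
        precision = 1.0 / variance
        precision_sum += precision
        weighted_sum += precision * mean
    # The MAP estimate of a Gaussian product is the precision-weighted mean.
    return weighted_sum / precision_sum, 1.0 / precision_sum


def fuse_maps(layers):
    """Fuse per-pixel maps independently (hypothetical helper).

    layers: list of (mean_map, variance_map); each map is a list of rows
    with identical dimensions. Returns the fused disparity map.
    """
    height = len(layers[0][0])
    width = len(layers[0][0][0])
    fused = []
    for y in range(height):
        row = []
        for x in range(width):
            mean, _ = fuse_gaussians([(m[y][x], v[y][x]) for m, v in layers])
            row.append(mean)
        fused.append(row)
    return fused


# Usage: a prior map and one cue map on a 1x2 image.
prior = ([[0.0, 0.0]], [[1.0, 1.0]])
cue = ([[2.0, 4.0]], [[1.0, 0.25]])
fused_disparity = fuse_maps([prior, cue])
```

Each pixel is handled independently in this sketch, so the loop body could run in parallel, and a more confident cue (smaller variance) pulls the fused value further toward its own estimate.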

Type: Article
Title: Perceptual Real-Time 2D-to-3D Conversion Using Cue Fusion
Open access status: An open access version is available from UCL Discovery
DOI: 10.1109/TVCG.2017.2703612
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords: Depth cues, stereo, image-based rendering, perceptual reasoning, video analysis, viewing algorithms, pixel classification, real-time systems
UCL classification: UCL
UCL > Provost and Vice Provost Offices
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI: https://discovery.ucl.ac.uk/id/eprint/10061781
Downloads since deposit: 149