RenderDiffusion: Image Diffusion for 3D Reconstruction, Inpainting and Generation

Advanced search
Browse by:

Department | Year

UCL Theses | Latest

Deposit your research

RenderDiffusion: Image Diffusion for 3D Reconstruction, Inpainting and Generation

Anciukevičius, Titas; Xu, Zexiang; Fisher, Matthew; Henderson, Paul; Bilen, Hakan; Mitra, Niloy J; Guerrero, Paul; (2023) RenderDiffusion: Image Diffusion for 3D Reconstruction, Inpainting and Generation. In: O'Conner, Lisa, (ed.) 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). (pp. pp. 12608-12618). IEEE: Vancouver, BC, Canada. Green open access

Preview

Text
Anciukevicius_RenderDiffusion_Image_Diffusion_for_3D_Reconstruction_Inpainting_and_Generation_CVPR_2023_paper.pdf - Accepted Version
Download (10MB) | Preview

Abstract

Diffusion models currently achieve state-of-the-art performance for both conditional and unconditional image generation. However, so far, image diffusion models do not support tasks required for 3D understanding, such as view-consistent 3D generation or single-view object reconstruction. In this paper, we present RenderDiffusion, the first diffusion model for 3D generation and inference, trained using only monocular 2D supervision. Central to our method is a novel image denoising architecture that generates and renders an intermediate three-dimensional representation of a scene in each denoising step. This enforces a strong inductive structure within the diffusion process, providing a 3D consistent representation while only requiring 2D supervision. The resulting 3D representation can be rendered from any view. We evaluate RenderDiffusion on FFHQ, AFHQ, ShapeNet and CLEVR datasets, showing competitive performance for generation of 3D scenes and inference of 3D scenes from 2D images. Additionally, our diffusion-based approach allows us to use 2D inpainting to edit 3D scenes.

Type:	Proceedings paper
Title:	RenderDiffusion: Image Diffusion for 3D Reconstruction, Inpainting and Generation
Event:	2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Dates:	17 Jun 2023 - 24 Jun 2023
ISBN-13:	979-8-3503-0129-8
Open access status:	An open access version is available from UCL Discovery
DOI:	10.1109/CVPR52729.2023.01213
Publisher version:	https://doi.org/10.1109/CVPR52729.2023.01213
Language:	English
Additional information:	This version is the author accepted manuscript. For information on re-use, please refer to the publisher's terms and conditions.
Keywords:	Training; Solid modeling; Three-dimensional displays; Image synthesis; Noise reduction; Pose estimation; Rendering (computer graphics)
UCL classification:	UCL UCL > Provost and Vice Provost Offices > UCL BEAMS UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI:	https://discovery.ucl.ac.uk/id/eprint/10179949

Downloads since deposit

25Downloads

Download activity - last month

Download activity - last 12 months

Downloads by country - last 12 months

Archive Staff Only

View Item