LIM: Large Interpolator Model for Dynamic Reconstruction

Advanced search
Browse by:

Department | Year

UCL Theses | Latest

Deposit your research

LIM: Large Interpolator Model for Dynamic Reconstruction

Sabathier, R; Mitra, NJ; Novotny, D; (2025) LIM: Large Interpolator Model for Dynamic Reconstruction. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. (pp. pp. 6154-6164). IEEE Green open access

Preview

Text
Sabathier_LIM_Large_Interpolator_Model_for_Dynamic_Reconstruction_CVPR_2025_paper.pdf - Accepted Version
Download (6MB) | Preview

Abstract

Reconstructing dynamic assets from video data is central to many in computer vision and graphics tasks. Existing 4D reconstruction approaches are limited by category-specific models or slow optimization-based methods. Inspired by the recent Large Reconstruction Model (LRM) [15], we present the Large Interpolation Model (LIM), a transformer-based feed-forward solution, guided by a novel causal consistency loss, for interpolating implicit 3D representations across time. Given implicit 3D representations at times t<inf>0</inf> and t<inf>1</inf>, LIM produces a deformed shape at any continuous time t ∈ [t<inf>0</inf>,t<inf>1</inf>] delivering high-quality interpolations in seconds (per frame). Furthermore, LIM allows explicit mesh tracking across time, producing a consistently uv-textured mesh sequence ready for integration into existing production pipelines. We also use LIM, in conjunction with a diffusion-based multi-view generator, to produce dynamic 4D reconstructions from monocular videos. We evaluate LIM on various dynamic datasets, benchmarking against image-space interpolation methods (e.g., FiLM [41]) and direct triplane linear interpolation, and demonstrate clear advantages. In summary, LIM is the first feed-forward model capable of high-speed tracked 4D asset reconstruction across diverse categories.

Type:	Proceedings paper
Title:	LIM: Large Interpolator Model for Dynamic Reconstruction
Event:	2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Location:	Nashville, TN, USA
Dates:	10th-17th June 2025
ISBN-13:	979-8-3315-4364-8
Open access status:	An open access version is available from UCL Discovery
DOI:	10.1109/CVPR52734.2025.00577
Publisher version:	https://doi.org/10.1109/cvpr52734.2025.00577
Language:	English
Additional information:	This version is the author accepted manuscript. For information on re-use, please refer to the publisher's terms and conditions.
UCL classification:	UCL UCL > Provost and Vice Provost Offices > UCL BEAMS UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI:	https://discovery.ucl.ac.uk/id/eprint/10215198

Downloads since deposit

5Downloads

Download activity - last month

Download activity - last 12 months

Downloads by country - last 12 months

Archive Staff Only

View Item