UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

LIM: Large Interpolator Model for Dynamic Reconstruction

Sabathier, R; Mitra, NJ; Novotny, D; (2025) LIM: Large Interpolator Model for Dynamic Reconstruction. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. (pp. pp. 6154-6164). IEEE Green open access

[thumbnail of Sabathier_LIM_Large_Interpolator_Model_for_Dynamic_Reconstruction_CVPR_2025_paper.pdf]
Preview
Text
Sabathier_LIM_Large_Interpolator_Model_for_Dynamic_Reconstruction_CVPR_2025_paper.pdf - Accepted Version

Download (6MB) | Preview

Abstract

Reconstructing dynamic assets from video data is central to many in computer vision and graphics tasks. Existing 4D reconstruction approaches are limited by category-specific models or slow optimization-based methods. Inspired by the recent Large Reconstruction Model (LRM) [15], we present the Large Interpolation Model (LIM), a transformer-based feed-forward solution, guided by a novel causal consistency loss, for interpolating implicit 3D representations across time. Given implicit 3D representations at times t<inf>0</inf> and t<inf>1</inf>, LIM produces a deformed shape at any continuous time t ∈ [t<inf>0</inf>,t<inf>1</inf>] delivering high-quality interpolations in seconds (per frame). Furthermore, LIM allows explicit mesh tracking across time, producing a consistently uv-textured mesh sequence ready for integration into existing production pipelines. We also use LIM, in conjunction with a diffusion-based multi-view generator, to produce dynamic 4D reconstructions from monocular videos. We evaluate LIM on various dynamic datasets, benchmarking against image-space interpolation methods (e.g., FiLM [41]) and direct triplane linear interpolation, and demonstrate clear advantages. In summary, LIM is the first feed-forward model capable of high-speed tracked 4D asset reconstruction across diverse categories.

Type: Proceedings paper
Title: LIM: Large Interpolator Model for Dynamic Reconstruction
Event: 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Location: Nashville, TN, USA
Dates: 10th-17th June 2025
ISBN-13: 979-8-3315-4364-8
Open access status: An open access version is available from UCL Discovery
DOI: 10.1109/CVPR52734.2025.00577
Publisher version: https://doi.org/10.1109/cvpr52734.2025.00577
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher's terms and conditions.
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI: https://discovery.ucl.ac.uk/id/eprint/10215198
Downloads since deposit
1Download
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item