Endo-FASt3r: Endoscopic Foundation Model Adaptation for Structure from Motion

Advanced search
Browse by:

Department | Year

UCL Theses | Latest

Deposit your research

Endo-FASt3r: Endoscopic Foundation Model Adaptation for Structure from Motion

Sheikh Zeinoddin, M; Hoque, MI; Tandogdu, Z; Shaw, GL; Clarkson, MJ; Mazomenos, EB; Stoyanov, D; (2026) Endo-FASt3r: Endoscopic Foundation Model Adaptation for Structure from Motion. In: Gee, James C and Alexander, Daniel C and Hong, Jaesung and Iglesias, Juan Eugenio and Sudre, Carole H and Venkataraman, Archana and Golland, Polina and Kim, Jong Hyo and Park, Jinah, (eds.) Medical Image Computing and Computer Assisted Intervention – MICCAI 2025. (pp. pp. 117-126). Springer: Cham, Switzerland.

Text
Endo_FASt3r_Arxiv.pdf - Accepted Version
Access restricted to UCL open access staff until 21 September 2026.
Download (1MB)

Abstract

Accurate depth and camera pose estimation is essential for achieving high-quality 3D visualisations in robotic-assisted surgery. Despite recent advancements in foundation model adaptation to monocular depth estimation of endoscopic scenes via self-supervised learning (SSL), no prior work has explored their use for pose estimation. These methods rely on low rank-based adaptation approaches, which constrain model updates to a low-rank space. We propose Endo-FASt3r, the first monocular SSL depth and pose estimation framework that uses foundation models for both tasks. We extend the Reloc3r relative pose estimation foundation model by designing Reloc3rX, introducing modifications necessary for convergence in SSL. We also present DoMoRA, a novel adaptation technique that enables higher-rank updates and faster convergence. Experiments on the SCARED dataset show that Endo-FASt3r achieves a substantial 10% improvement in pose estimation and a 2% improvement in depth estimation over prior work. Similar performance gains on the Hamlyn and StereoMIS datasets reinforce the generalisability of Endo-FASt3r across different datasets. Our code is available at: https://github.com/Mona-ShZeinoddin/Endo_FASt3r.git.

Type:	Proceedings paper
Title:	Endo-FASt3r: Endoscopic Foundation Model Adaptation for Structure from Motion
Event:	MICCAI 2025 28th International Conference
ISBN-13:	9783032051400
DOI:	10.1007/978-3-032-05141-7_12
Publisher version:	https://doi.org/10.1007/978-3-032-05141-7_12
Language:	English
Additional information:	This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords:	Foundation model Adaptation, Depth and Pose estimation
UCL classification:	UCL UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences UCL > Provost and Vice Provost Offices > UCL BEAMS UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Medical Sciences UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Medical Sciences > Div of Surgery and Interventional Sci UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Med Phys and Biomedical Eng UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Medical Sciences > Div of Surgery and Interventional Sci > Department of Targeted Intervention
URI:	https://discovery.ucl.ac.uk/id/eprint/10216021

Downloads since deposit

1Download

Download activity - last month

Download activity - last 12 months

Downloads by country - last 12 months

Archive Staff Only

View Item