UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

A Temporal Learning Approach to Inpainting Endoscopic Specularities and Its Effect on Image Correspondence

Daher, R; Vasconcelos, F; Stoyanov, D; (2023) A Temporal Learning Approach to Inpainting Endoscopic Specularities and Its Effect on Image Correspondence. Medical Image Analysis , 90 , Article 102994. 10.1016/j.media.2023.102994. Green open access

[thumbnail of 1-s2.0-S1361841523002542-main.pdf]
Preview
Text
1-s2.0-S1361841523002542-main.pdf - Published Version

Download (3MB) | Preview

Abstract

Video streams are utilised to guide minimally-invasive surgery and diagnosis in a wide range of procedures, and many computer-assisted techniques have been developed to automatically analyse them. These approaches can provide additional information to the surgeon such as lesion detection, instrument navigation, or anatomy 3D shape modelling. However, the necessary image features to recognise these patterns are not always reliably detected due to the presence of irregular light patterns such as specular highlight reflections. In this paper, we aim at removing specular highlights from endoscopic videos using machine learning. We propose using a temporal generative adversarial network (GAN) to inpaint the hidden anatomy under specularities, inferring its appearance spatially and from neighbouring frames, where they are not present in the same location. This is achieved using in-vivo data from gastric endoscopy (Hyper Kvasir) in a fully unsupervised manner that relies on the automatic detection of specular highlights. System evaluations show significant improvements to other methods through direct comparison and ablation studies that depict the importance of the network's temporal and transfer learning components. The generalisability of our system to different surgical setups and procedures was also evaluated qualitatively on in-vivo data of gastric endoscopy and ex-vivo porcine data (SERV-CT, SCARED). We also assess the effect of our method in comparison to other methods on computer vision tasks that underpin 3D reconstruction and camera motion estimation, namely stereo disparity, optical flow, and sparse point feature matching. These are evaluated quantitatively and qualitatively and results show a positive effect of our specular inpainting method on these tasks in a novel comprehensive analysis. Our code and dataset are made available at https://github.com/endomapper/Endo-STTN.

Type: Article
Title: A Temporal Learning Approach to Inpainting Endoscopic Specularities and Its Effect on Image Correspondence
Location: Netherlands
Open access status: An open access version is available from UCL Discovery
DOI: 10.1016/j.media.2023.102994
Publisher version: https://doi.org/10.1016/j.media.2023.102994
Language: English
Additional information: Copyright © 2023 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
Keywords: Endoscopy, Inpainting, Specular highlights, Surgical AI, Surgical Data Science, Temporal GANs, Transformers
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI: https://discovery.ucl.ac.uk/id/eprint/10179890
Downloads since deposit
57Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item