UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

DivGI: delve into digestive endoscopy image classification

He, Q; Bano, S; Stoyanov, D; Zuo, S; (2025) DivGI: delve into digestive endoscopy image classification. International Journal of Computer Assisted Radiology and Surgery , 20 (7) pp. 1513-1520. 10.1007/s11548-025-03441-x.

[thumbnail of IPCAI2025___DivGI__Delve_into_Digestive_Endoscopy_Image_Classification 1.pdf] Text
IPCAI2025___DivGI__Delve_into_Digestive_Endoscopy_Image_Classification 1.pdf - Accepted Version
Access restricted to UCL open access staff until 7 June 2026.

Download (1MB)

Abstract

Purpose: Gastrointestinal (GI) endoscopic imaging involves capturing routine anatomical landmarks and suspected lesions during endoscopic procedures for the clinical diagnosis of GI diseases. These images present three key challenges compared to typical scene images: significant class imbalance, a lack of distinctive features, and high similarity between some categories. While existing research has addressed the issue of image quantity imbalance, the challenges posed by indistinct features and inter-category similarity remain unresolved. This study proposes a unified image classification framework designed to tackle all three of these challenges comprehensively. Methods: We present a novel network architecture, DivGI, which integrates three essential strategies—balanced sampling, fine-grained classification, and multi-label classification—within a single framework. The balanced sampling strategy is implemented via resampling and mix-up techniques, fine-grained classification is enabled through multi-granularity feature learning, and multi-label classification is achieved using hierarchical label joint learning. The performance of our method is validated using three publicly available datasets. Results: Extensive experimental results demonstrate that DivGI significantly improves classification accuracy compared to existing approaches, with Matthews correlation coefficients (MCC) of 91.31% on the HyperKvasir dataset, 86.72% on the Upper GI dataset, and 82.88% on the GastroVision dataset. These results highlight that DivGI is more effective and efficient compared to existing methods. Conclusion: The proposed GI classification network, which incorporates multiple strategies, effectively classifies both routine landmark and suspected lesion images, aiming to facilitate better clinical diagnostics in gastrointestinal endoscopy. The code and data are publicly available at https://github.com/howardchina/DivGI

Type: Article
Title: DivGI: delve into digestive endoscopy image classification
Location: Germany
DOI: 10.1007/s11548-025-03441-x
Publisher version: https://doi.org/10.1007/s11548-025-03441-x
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords: Science & Technology, Technology, Life Sciences & Biomedicine, Engineering, Biomedical, Radiology, Nuclear Medicine & Medical Imaging, Surgery, Engineering, Gastrointestinal endoscopy, Image classification, Imbalanced learning, Fine-grained visual recognition, Hierarchical label
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI: https://discovery.ucl.ac.uk/id/eprint/10211493
Downloads since deposit
1Download
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item