He, Q;
              
      
            
                Bano, S;
              
      
            
                Stoyanov, D;
              
      
            
                Zuo, S;
              
      
        
        
  
(2025)
  DivGI: delve into digestive endoscopy image classification.
International Journal of Computer Assisted Radiology and Surgery
, 20
       (7)
    
     pp. 1513-1520.
    
         10.1007/s11548-025-03441-x.
  
  
| ![[thumbnail of IPCAI2025___DivGI__Delve_into_Digestive_Endoscopy_Image_Classification 1.pdf]](https://discovery.ucl.ac.uk/style/images/fileicons/text.png) | Text IPCAI2025___DivGI__Delve_into_Digestive_Endoscopy_Image_Classification 1.pdf - Accepted Version Access restricted to UCL open access staff until 7 June 2026. Download (1MB) | 
Abstract
Purpose: Gastrointestinal (GI) endoscopic imaging involves capturing routine anatomical landmarks and suspected lesions during endoscopic procedures for the clinical diagnosis of GI diseases. These images present three key challenges compared to typical scene images: significant class imbalance, a lack of distinctive features, and high similarity between some categories. While existing research has addressed the issue of image quantity imbalance, the challenges posed by indistinct features and inter-category similarity remain unresolved. This study proposes a unified image classification framework designed to tackle all three of these challenges comprehensively. Methods: We present a novel network architecture, DivGI, which integrates three essential strategies—balanced sampling, fine-grained classification, and multi-label classification—within a single framework. The balanced sampling strategy is implemented via resampling and mix-up techniques, fine-grained classification is enabled through multi-granularity feature learning, and multi-label classification is achieved using hierarchical label joint learning. The performance of our method is validated using three publicly available datasets. Results: Extensive experimental results demonstrate that DivGI significantly improves classification accuracy compared to existing approaches, with Matthews correlation coefficients (MCC) of 91.31% on the HyperKvasir dataset, 86.72% on the Upper GI dataset, and 82.88% on the GastroVision dataset. These results highlight that DivGI is more effective and efficient compared to existing methods. Conclusion: The proposed GI classification network, which incorporates multiple strategies, effectively classifies both routine landmark and suspected lesion images, aiming to facilitate better clinical diagnostics in gastrointestinal endoscopy. The code and data are publicly available at https://github.com/howardchina/DivGI
| Type: | Article | 
|---|---|
| Title: | DivGI: delve into digestive endoscopy image classification | 
| Location: | Germany | 
| DOI: | 10.1007/s11548-025-03441-x | 
| Publisher version: | https://doi.org/10.1007/s11548-025-03441-x | 
| Language: | English | 
| Additional information: | This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions. | 
| Keywords: | Science & Technology, Technology, Life Sciences & Biomedicine, Engineering, Biomedical, Radiology, Nuclear Medicine & Medical Imaging, Surgery, Engineering, Gastrointestinal endoscopy, Image classification, Imbalanced learning, Fine-grained visual recognition, Hierarchical label | 
| UCL classification: | UCL UCL > Provost and Vice Provost Offices > UCL BEAMS UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science | 
| URI: | https://discovery.ucl.ac.uk/id/eprint/10211493 | 
Archive Staff Only
|  | View Item | 
 
                      
