UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Delving Into Out-of-Distribution Detection with Medical Vision-Language Models

Ju, Lie; Zhou, Sijin; Zhou, Yukun; Lu, Huimin; Zhu, Zhuoting; Keane, Pearse A; Ge, Zongyuan; (2025) Delving Into Out-of-Distribution Detection with Medical Vision-Language Models. In: Gee, James C and Alexander, Daniel C and Hong, Jaesung and Iglesias, Juan Eugenio and Sudre, Carole H and Venkataraman, Archana and Golland, Polina and Kim, Jong Hyo and Park, Jinah, (eds.) Lecture Notes in Computer Science. (pp. pp. 133-143). Springer Nature Switzerland (In press).

[thumbnail of VLM_OOD.pdf] Text
VLM_OOD.pdf - Accepted Version
Access restricted to UCL open access staff until 21 September 2026.

Download (1MB)

Abstract

Recent advances in medical vision-language models (VLMs) demonstrate impressive performance in image classification tasks, driven by their strong zero-shot generalization capabilities. However, given the high variability and complexity inherent in medical imaging data, the ability of these models to detect out-of-distribution (OOD) data in this domain remains underexplored. In this work, we conduct the first systematic investigation into the OOD detection potential of medical VLMs. We evaluate state-of-the-art VLM-based OOD detection methods across a diverse set of medical VLMs, including both general and domain-specific purposes. To accurately reflect real-world challenges, we introduce a cross-modality evaluation pipeline for benchmarking full-spectrum OOD detection, rigorously assessing model robustness against both semantic shifts and covariate shifts. Furthermore, we propose a novel hierarchical prompt-based method that significantly enhances OOD detection performance. Extensive experiments are conducted to validate the effectiveness of our approach. The codes are available at https://github.com/PyJulie/Medical-VLMs-OOD-Detection

Type: Proceedings paper
Title: Delving Into Out-of-Distribution Detection with Medical Vision-Language Models
Event: Medical Image Computing and Computer Assisted Intervention – MICCAI 2025
ISBN-13: 9783032049704
DOI: 10.1007/978-3-032-04971-1_13
Publisher version: https://doi.org/10.1007/978-3-032-04971-1_13
Language: English
Additional information: This version is the author-accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords: Vision Language Models, Out-of-Distribution Detection
UCL classification: UCL
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences > Institute of Ophthalmology
URI: https://discovery.ucl.ac.uk/id/eprint/10216934
Downloads since deposit
1Download
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item