UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Multimodal Healthcare AI: Identifying and Designing Clinically Relevant Vision-Language Applications for Radiology

Yildirim, Nur; Richardson, Hannah; Wetscherek, Maria Teodora; Bajwa, Junaid; Jacob, Joseph; Pinnock, Mark Ames; Harris, Stephen; ... Thieme, Anja; + view all (2024) Multimodal Healthcare AI: Identifying and Designing Clinically Relevant Vision-Language Applications for Radiology. In: Proceedings of the CHI Conference on Human Factors in Computing Systems. (pp. pp. 1-22). ACM (Association for Computing Machinery) Green open access

[thumbnail of Harris_Multimodal Healthcare AI- Identifying and Designing Clinically Relevant Vision-Language Applications for Radiology_VoR.pdf]
Preview
Text
Harris_Multimodal Healthcare AI- Identifying and Designing Clinically Relevant Vision-Language Applications for Radiology_VoR.pdf

Download (9MB) | Preview

Abstract

Recent advances in AI combine large language models (LLMs) with vision encoders that bring forward unprecedented technical capabilities to leverage for a wide range of healthcare applications. Focusing on the domain of radiology, vision-language models (VLMs) achieve good performance results for tasks such as generating radiology findings based on a patient’s medical image, or answering visual questions (e.g., “Where are the nodules in this chest X-ray?”). However, the clinical utility of potential applications of these capabilities is currently underexplored. We engaged in an iterative, multidisciplinary design process to envision clinically relevant VLM interactions, and co-designed four VLM use concepts: Draft Report Generation, Augmented Report Review, Visual Search and Querying, and Patient Imaging History Highlights. We studied these concepts with 13 radiologists and clinicians who assessed the VLM concepts as valuable, yet articulated many design considerations. Reflecting on our findings, we discuss implications for integrating VLM capabilities in radiology, and for healthcare AI more generally.

Type: Proceedings paper
Title: Multimodal Healthcare AI: Identifying and Designing Clinically Relevant Vision-Language Applications for Radiology
Event: CHI '24: CHI Conference on Human Factors in Computing Systems
Open access status: An open access version is available from UCL Discovery
DOI: 10.1145/3613904.3642013
Publisher version: http://dx.doi.org/10.1145/3613904.3642013
Language: English
Additional information: © 2024 Copyright held by the owner/author(s). This work is licensed under a Creative Commons Attribution-NoDerivs International 4.0 License (https://creativecommons.org/licenses/by-nd/4.0/).
UCL classification: UCL
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Medical Sciences
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Medical Sciences > Div of Medicine
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Population Health Sciences > Institute of Health Informatics
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Medical Sciences > Div of Medicine > Respiratory Medicine
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Population Health Sciences > Institute of Health Informatics > Clinical Epidemiology
URI: https://discovery.ucl.ac.uk/id/eprint/10192536
Downloads since deposit
15Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item