UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Trustworthy Data and AI Environments for Clinical Prediction: Application to Crisis-Risk in People With Depression

Msosa, YJ; Grauslys, A; Zhou, Y; Wang, T; Buchan, I; Langan, P; Foster, S; ... Kehoe, D; + view all (2023) Trustworthy Data and AI Environments for Clinical Prediction: Application to Crisis-Risk in People With Depression. IEEE Journal of Biomedical and Health Informatics , 27 (11) pp. 5588-5598. 10.1109/JBHI.2023.3312011. Green open access

[thumbnail of A_Trustworthy_AI_Environment_for_Predicting_Crisis_Risk_of_Depression.pdf]
Preview
Text
A_Trustworthy_AI_Environment_for_Predicting_Crisis_Risk_of_Depression.pdf - Accepted Version

Download (1MB) | Preview

Abstract

Depression is a common mental health condition that often occurs in association with other chronic illnesses, and varies considerably in severity. Electronic Health Records (EHRs) contain rich information about a patient's medical history and can be used to train, test and maintain predictive models to support and improve patient care. This work evaluated the feasibility of implementing an environment for predicting mental health crisis among people living with depression based on both structured and unstructured EHRs. A large EHR from a mental health provider, Mersey Care, was pseudonymised and ingested into the Natural Language Processing (NLP) platform CogStack, allowing text content in binary clinical notes to be extracted. All unstructured clinical notes and summaries were semantically annotated by MedCAT and BioYODIE NLP services. Cases of crisis in patients with depression were then identified. Random forest models, gradient boosting trees, and Long Short-Term Memory (LSTM) networks, with varying feature arrangement, were trained to predict the occurrence of crisis. The results showed that all the prediction models can use a combination of structured and unstructured EHR information to predict crisis in patients with depression with good and useful accuracy. The LSTM network that was trained on a modified dataset with only 1000 most-important features from the random forest model with temporality showed the best performance with a mean AUC of 0.901 and a standard deviation of 0.006 using a training dataset and a mean AUC of 0.810 and 0.01 using a hold-out test dataset. Comparing the results from the technical evaluation with the views of psychiatrists shows that there are now opportunities to refine and integrate such prediction models into pragmatic point-of-care clinical decision support tools for supporting mental healthcare delivery.

Type: Article
Title: Trustworthy Data and AI Environments for Clinical Prediction: Application to Crisis-Risk in People With Depression
Location: United States
Open access status: An open access version is available from UCL Discovery
DOI: 10.1109/JBHI.2023.3312011
Publisher version: https://doi.org/10.1109/JBHI.2023.3312011
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords: Humans, Depression, Electronic Health Records, Natural Language Processing, Mental Health, Mental Disorders
UCL classification: UCL
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Population Health Sciences > Institute of Health Informatics
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Population Health Sciences > Institute of Health Informatics > Clinical Epidemiology
URI: https://discovery.ucl.ac.uk/id/eprint/10183194
Downloads since deposit
103Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item