Filipow, Nicole Jane;
(2022)
Development of a Composite Health Index in Children with Cystic Fibrosis: A Pipeline for Data Processing, Machine Learning, and Model Implementation using Electronic Health Records.
Doctoral thesis (Ph.D), UCL (University College London).
Preview |
Text
Filipow_10152878_thesis_redacted.pdf Download (14MB) | Preview |
Abstract
Cystic Fibrosis (CF) is a heterogeneous multi-faceted genetic condition that primarily affects the lungs and digestive system. For children and young people living with CF, timely management is necessary to prevent the establishment of severe disease. Modern data capture through electronic health records (EHR) have created an opportunity to use machine learning algorithms to classify subgroups of disease to understand health status and prognosis. The overall aim of this thesis was to develop a composite health index in children with CF. An iterative approach to unsupervised cluster analysis was developed to identify homogeneous clusters of children with CF in a pre-existing encounter-based CF database from Toronto Canada. An external validation of the model was carried out in a historical CF dataset from Great Ormond Street Hospital (GOSH) in London UK. The clusters were also re-created and validated using EHR data from GOSH when it first became accessible in 2021. The interpretability and sensitivity of the GOSH EHR model was explored. Lastly, a scoping review was carried out to investigate common barriers to implementation of prognostic machine learning algorithms in paediatric respiratory care. A cluster model was identified that detailed four clusters associated with time to future hospitalisation, pulmonary exacerbation, and lung function. The clusters were also associated with different disease related variables such as comorbidities, anthropometrics, microbiology infections, and treatment history. An app was developed to display individualised cluster assignment, which will be a useful way to interpret the cluster model clinically. The review of prognostic machine learning algorithms identified a lack of reproducibility and validations as the major limitation to model reporting that impair clinical translation. EHR systems facilitate point-of-care access of individualised data and integrated machine learning models. However, there is a gap in translation to clinical implementation of machine learning models. With appropriate regulatory frameworks the health index developed for children with CF could be implemented in CF care.
Type: | Thesis (Doctoral) |
---|---|
Qualification: | Ph.D |
Title: | Development of a Composite Health Index in Children with Cystic Fibrosis: A Pipeline for Data Processing, Machine Learning, and Model Implementation using Electronic Health Records |
Open access status: | An open access version is available from UCL Discovery |
Language: | English |
Additional information: | Copyright © The Author 2022. Original content in this thesis is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0) Licence (https://creativecommons.org/licenses/by-nc/4.0/). Any third-party copyright material present remains the property of its respective owner(s) and is licensed under its existing terms. Access may initially be restricted at the author’s request. - Some third party copyright material has been removed from this e-thesis. |
UCL classification: | UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Population Health Sciences UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Population Health Sciences > UCL GOS Institute of Child Health UCL |
URI: | https://discovery.ucl.ac.uk/id/eprint/10152878 |
Archive Staff Only
View Item |