UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

From prediction to practice: mitigating bias and data shift in machine-learning models for chemotherapy-induced organ dysfunction across unseen cancers

Watson, Matthew; Chambers, Pinkie; Steventon, Luke; Harmsworth King, James; Ercia, Angelo; Shaw, Heather; Al Moubayed, Noura; (2024) From prediction to practice: mitigating bias and data shift in machine-learning models for chemotherapy-induced organ dysfunction across unseen cancers. BMJ Oncology , 3 (1) , Article e000430. 10.1136/ bmjonc-2024-000430. Green open access

[thumbnail of Chambers_e000430.full.pdf]
Preview
Text
Chambers_e000430.full.pdf

Download (1MB) | Preview

Abstract

Objectives: Routine monitoring of renal and hepatic function during chemotherapy ensures that treatment-related organ damage has not occurred and clearance of subsequent treatment is not hindered; however, frequency and timing are not optimal. Model bias and data heterogeneity concerns have hampered the ability of machine learning (ML) to be deployed into clinical practice. This study aims to develop models that could support individualised decisions on the timing of renal and hepatic monitoring while exploring the effect of data shift on model performance. // Methods and analysis: We used retrospective data from three UK hospitals to develop and validate ML models predicting unacceptable rises in creatinine/bilirubin post cycle 3 for patients undergoing treatment for the following cancers: breast, colorectal, lung, ovarian and diffuse large B-cell lymphoma. // Results: We extracted 3614 patients with no missing blood test data across cycles 1–6 of chemotherapy treatment. We improved on previous work by including predictions post cycle 3. Optimised for sensitivity, we achieve F2 scores of 0.7773 (bilirubin) and 0.6893 (creatinine) on unseen data. Performance is consistent on tumour types unseen during training (F2 bilirubin: 0.7423, F2 creatinine: 0.6820). // Conclusion: Our technique highlights the effectiveness of ML in clinical settings, demonstrating the potential to improve the delivery of care. Notably, our ML models can generalise to unseen tumour types. We propose gold-standard bias mitigation steps for ML models: evaluation on multisite data, thorough patient population analysis, and both formalised bias measures and model performance comparisons on patient subgroups. We demonstrate that data aggregation techniques have unintended consequences on model bias.

Type: Article
Title: From prediction to practice: mitigating bias and data shift in machine-learning models for chemotherapy-induced organ dysfunction across unseen cancers
Open access status: An open access version is available from UCL Discovery
DOI: 10.1136/ bmjonc-2024-000430
Publisher version: https://doi.org/10.1136/ bmjonc-2024-000430
Language: English
Additional information: This is an open access article distributed under the terms of the Creative Commons CC BY 4.0 license, https://creativecommons.org/licenses/by/4.0/, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
UCL classification: UCL
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences > UCL School of Pharmacy
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences > UCL School of Pharmacy > Practice and Policy
URI: https://discovery.ucl.ac.uk/id/eprint/10198411
Downloads since deposit
7Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item