UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Understanding Inequalities in Cancer Survival Using Bayesian Machine Learning

Basak, Piyali; Maringe, Camille; Rubio, F Javier; Linero, Antonio R; (2025) Understanding Inequalities in Cancer Survival Using Bayesian Machine Learning. Journal of the American Statistical Association 10.1080/01621459.2025.2547968. (In press).

[thumbnail of Understanding Inequalities in Cancer Survival Using Bayesian Machine Learning (1).pdf] Text
Understanding Inequalities in Cancer Survival Using Bayesian Machine Learning (1).pdf - Accepted Version
Access restricted to UCL open access staff until 21 August 2026.

Download (1MB)

Abstract

Most cancer patients are diagnosed after the age of 60, often with existing chronic health conditions (comorbidities), that can delay diagnosis and complicate treatment, prognosis, and monitoring. These comorbidities may exacerbate existing sociodemographic inequalities in cancer survival. While much research has focused on how comorbidities affect overall survival, national and international institutions typically prefer the relative survival framework for population-based studies. This framework decomposes an individual’s overall hazard into a known population hazard and an excess hazard attributable to cancer. Estimands derived from the excess hazard, such as net survival, are widely used to assess interventions and inform policy. In this paper, we use a Bayesian machine learning approach to estimate the excess hazard and identify vulnerable subgroups with a higher excess hazard, using Bayesian additive regression trees (BART). We develop a proportional hazards version of BART for the relative survival context and extend it to accommodate non-proportional hazards. We also provide tools for model interpretation and posterior summarization. This is applied to colon cancer data from England to provide insights when paired with state-of-the-art data linkage methods. We then identify drivers of inequalities in cancer survival through variable importance quantification.

Type: Article
Title: Understanding Inequalities in Cancer Survival Using Bayesian Machine Learning
DOI: 10.1080/01621459.2025.2547968
Publisher version: https://doi.org/10.1080/01621459.2025.2547968
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords: Bayesian nonparametrics, cancer epidemiology, decision trees, excess hazard, survival analysis
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences > Dept of Statistical Science
URI: https://discovery.ucl.ac.uk/id/eprint/10212561
Downloads since deposit
1Download
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item