UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Data-driven identification of ageing-related diseases from electronic health records

Kuan, V; Fraser, HC; Hingorani, M; Denaxas, S; Gonzalez-Izquierdo, A; Direk, K; Nitsch, D; ... Hingorani, AD; + view all (2021) Data-driven identification of ageing-related diseases from electronic health records. Scientific Reports , 11 , Article 2938. 10.1038/s41598-021-82459-y. Green open access

[thumbnail of Article]
Preview
Text (Article)
Data-driven identification of ageing-related diseases from electronic health records.pdf - Published Version

Download (5MB) | Preview
[thumbnail of Supplementary Information]
Preview
Text (Supplementary Information)
Partridge_41598_2021_82459_MOESM1_ESM.pdf - Published Version

Download (1MB) | Preview

Abstract

Reducing the burden of late-life morbidity requires an understanding of the mechanisms of ageing-related diseases (ARDs), defined as diseases that accumulate with increasing age. This has been hampered by the lack of formal criteria to identify ARDs. Here, we present a framework to identify ARDs using two complementary methods consisting of unsupervised machine learning and actuarial techniques, which we applied to electronic health records (EHRs) from 3,009,048 individuals in England using primary care data from the Clinical Practice Research Datalink (CPRD) linked to the Hospital Episode Statistics admitted patient care dataset between 1 April 2010 and 31 March 2015 (mean age 49.7 years (s.d. 18.6), 51% female, 70% white ethnicity). We grouped 278 high-burden diseases into nine main clusters according to their patterns of disease onset, using a hierarchical agglomerative clustering algorithm. Four of these clusters, encompassing 207 diseases spanning diverse organ systems and clinical specialties, had rates of disease onset that clearly increased with chronological age. However, the ages of onset for these four clusters were strikingly different, with median age of onset 82 years (IQR 82–83) for Cluster 1, 77 years (IQR 75–77) for Cluster 2, 69 years (IQR 66–71) for Cluster 3 and 57 years (IQR 54–59) for Cluster 4. Fitting to ageing-related actuarial models confirmed that the vast majority of these 207 diseases had a high probability of being ageing-related. Cardiovascular diseases and cancers were highly represented, while benign neoplastic, skin and psychiatric conditions were largely absent from the four ageing-related clusters. Our framework identifies and clusters ARDs and can form the basis for fundamental and translational research into ageing pathways.

Type: Article
Title: Data-driven identification of ageing-related diseases from electronic health records
Location: England
Open access status: An open access version is available from UCL Discovery
DOI: 10.1038/s41598-021-82459-y
Publisher version: https://doi.org/10.1038/s41598-021-82459-y
Language: English
Additional information: This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
Keywords: Cancer, Cardiovascular diseases, Endocrine system and metabolic diseases, Epidemiology, Eye diseases, Gastrointestinal diseases, Geriatrics, Haematological diseases, Immunological disorders, Infectious diseases, Kidney diseases, Metabolic disorders, Neurological disorders, Nutrition disorders, Psychiatric disorders, Public health, Reproductive disorders, Respiratory tract diseases, Rheumatic diseases, Skin diseases, Urogenital diseases
UCL classification: UCL
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences > Div of Biosciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences > Div of Biosciences > Genetics, Evolution and Environment
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences > UCL School of Pharmacy
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences > UCL School of Pharmacy > Practice and Policy
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Population Health Sciences > Institute of Cardiovascular Science
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Population Health Sciences > Institute of Cardiovascular Science > Population Science and Experimental Medicine
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Population Health Sciences > Institute of Cardiovascular Science > Population Science and Experimental Medicine > MRC Unit for Lifelong Hlth and Ageing
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Population Health Sciences > Institute of Health Informatics
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Population Health Sciences > Institute of Health Informatics > Clinical Epidemiology
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Population Health Sciences > Institute of Health Informatics > Infectious Disease Informatics
URI: https://discovery.ucl.ac.uk/id/eprint/10121506
Downloads since deposit
87Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item