eprintid: 10204223 rev_number: 7 eprint_status: archive userid: 699 dir: disk0/10/20/42/23 datestamp: 2025-02-03 13:37:11 lastmod: 2025-02-03 13:37:11 status_changed: 2025-02-03 13:37:11 type: article metadata_visibility: show sword_depositor: 699 creators_name: Thygesen, Johan H creators_name: Zhang, Huayu creators_name: Issa, Hanane creators_name: Wu, Jinge creators_name: Hama, Tuankasfee creators_name: Phiho-Gomes, Ana-Caterina creators_name: Groza, Tudor creators_name: Khalid, Sara creators_name: Lumbers, Thomas R creators_name: Hocaoglu, Mevhibe creators_name: Khunti, Kamlesh creators_name: Priedon, Rouven creators_name: Banerjee, Amitava creators_name: Pontikos, Nikolas creators_name: Tomlinson, Chris creators_name: Torralbo, Ana creators_name: Taylor, Paul creators_name: Sudlow, Cathie creators_name: Denaxas, Spiros creators_name: Hemingway, Harry creators_name: Wu, Honghan title: Prevalence and demographics of 331 rare diseases and associated COVID-19-related mortality among 58 million individuals: a nationwide retrospective observational study ispublished: inpress divisions: UCL divisions: B02 divisions: DD4 note: Copyright © 2025 The Author(s). Published by Elsevier Ltd. This is an Open Access article under the CC BY 4.0 license. abstract: Background The Global Burden of Disease Study has provided key evidence to inform clinicians, researchers, and policy makers across common diseases, but no similar effort with a single-study design exists for hundreds of rare diseases. Consequently, for many rare conditions there is little population-level evidence, including prevalence and clinical vulnerability, resulting in an absence of evidence-based care that was prominent during the COVID-19 pandemic. We aimed to inform rare disease care by providing key descriptors from national data and explore the impact of rare diseases during the COVID-19 pandemic. Methods In this nationwide retrospective observational cohort study, we used the electronic health records (EHRs) of more than 58 million people in England, linking nine National Health Service datasets spanning health-care settings for people who were alive on Jan 23, 2020. Starting with all rare diseases listed in Orphanet (an extensive online resource for rare diseases), we quality assured and filtered down to analyse 331 conditions mapped to ICD-10 or Systemized Nomenclature of Medicine–Clinical Terms that were clinically validated in our dataset. For all 331 rare diseases, we calculated population prevalences, analysed patients’ clinical and demographic details, and investigated mortality with SARS-CoV-2. We assessed COVID-19-related mortality by comparing cohorts of patients for each rare disease and rare disease category with controls matched for age group, sex, ethnicity, and vaccination status, at a ratio of two controls per individual with a rare disease. Findings Of 58 162316 individuals, we identified 894 396 with at least one rare disease and assessed COVID-19-related mortality between Sept 1, 2020, and Nov 30, 2021. We calculated reproducible estimates, adjusted for age and sex, for all 331 rare diseases, including for 186 (56·2%) conditions without existing prevalence estimates in Orphanet. 49 rare diseases were significantly more frequent in female individuals than in male individuals, and 62 were significantly more frequent in male individuals than in female individuals; 47 were significantly more frequent in Asian or British Asian individuals than in White individuals; and 22 were significantly more frequent in Black or Black British individuals than in White individuals. 37 rare diseases were significantly more frequent in the White population compared with either the Black or Asian population. 7965 (0·9%) of 894 396 patients with a rare disease died from COVID-19, compared with 141 287 (0·2%) of 58 162316 in the full study population. In fully vaccinated individuals, the risk of COVID-19-related mortality was significantly higher for eight rare diseases, with patients with bullous pemphigoid (hazard ratio 8·07, 95% CI 3·01–21·62) being at highest risk. Interpretation Our study highlights that national-scale EHRs provide a unique resource to estimate detailed prevalence, clinical, and demographic data for rare diseases. Using COVID-19-related mortality analysis, we showed the power of large-scale EHRs in providing insights to inform public health decision making for these often neglected patient populations. Funding British Heart Foundation Data Science Centre, led by Health Data Research UK. date: 2025-02 date_type: published publisher: Elsevier BV official_url: https://doi.org/10.1016/s2589-7500(24)00253-x oa_status: green full_text_type: pub language: eng primo: open primo_central: open_green verified: verified_manual elements_id: 2357295 doi: 10.1016/s2589-7500(24)00253-x lyricists_name: Tomlinson, Christopher lyricists_id: CTOML04 actors_name: Tomlinson, Christopher actors_id: CTOML04 actors_role: owner full_text_status: public publication: The Lancet Digital Health volume: 7 number: 2 pagerange: e145-e156 issn: 2589-7500 citation: Thygesen, Johan H; Zhang, Huayu; Issa, Hanane; Wu, Jinge; Hama, Tuankasfee; Phiho-Gomes, Ana-Caterina; Groza, Tudor; ... Wu, Honghan; + view all <#> Thygesen, Johan H; Zhang, Huayu; Issa, Hanane; Wu, Jinge; Hama, Tuankasfee; Phiho-Gomes, Ana-Caterina; Groza, Tudor; Khalid, Sara; Lumbers, Thomas R; Hocaoglu, Mevhibe; Khunti, Kamlesh; Priedon, Rouven; Banerjee, Amitava; Pontikos, Nikolas; Tomlinson, Chris; Torralbo, Ana; Taylor, Paul; Sudlow, Cathie; Denaxas, Spiros; Hemingway, Harry; Wu, Honghan; - view fewer <#> (2025) Prevalence and demographics of 331 rare diseases and associated COVID-19-related mortality among 58 million individuals: a nationwide retrospective observational study. The Lancet Digital Health , 7 (2) e145-e156. 10.1016/s2589-7500(24)00253-x <https://doi.org/10.1016/s2589-7500%2824%2900253-x>. (In press). Green open access document_url: https://discovery.ucl.ac.uk/id/eprint/10204223/1/PIIS258975002400253X.pdf