Blake, Helen A;
Sharples, Linda D;
Harron, Katie;
van der Meulen, Jan H;
Walker, Kate;
(2022)
Linkage of multiple electronic health record datasets using a 'spine linkage' approach compared with all 'pairwise linkages'.
International Journal of Epidemiology
10.1093/ije/dyac130.
Abstract
BACKGROUND: Methods for linking records between two datasets are well established. However, guidance is needed for linking more than two datasets. Using all 'pairwise linkages' (linking each dataset to every other dataset) is the most inclusive, but resource-intensive, approach. The 'spine' approach links each dataset to a designated 'spine dataset', reducing the number of linkages, but potentially reducing linkage quality.

METHODS: We compared the pairwise and spine linkage approaches using real-world data on patients undergoing emergency bowel cancer surgery between 31 October 2013 and 30 April 2018. We linked an administrative hospital dataset (Hospital Episode Statistics; HES) capturing patients admitted to hospitals in England, and two clinical datasets comprising patients diagnosed with bowel cancer and patients undergoing emergency bowel surgery.

RESULTS: The spine linkage approach, with HES as the spine dataset, created an analysis cohort of 15 826 patients, equating to 98.3% of the 16 100 patients identified using the pairwise linkage approach. There were no systematic differences in patient characteristics between these analysis cohorts. Associations of patient and tumour characteristics with mortality, complications and length of stay were not sensitive to the linkage approach. When eligibility criteria were applied before linkage, spine linkage included 14 509 patients (90.0% compared with pairwise linkage).

CONCLUSION: Spine linkage can be used as an efficient alternative to pairwise linkage if case ascertainment in the spine dataset and data quality of linkage variables are high. These aspects should be systematically evaluated in the nominated spine dataset before spine linkage is used to create the analysis cohort.
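The difference between the two approaches described in the abstract can be illustrated with a small toy sketch. It is not the authors' method. It assumes exact matching on a single made-up patient identifier, whereas real linkage typically relies on several identifying variables. The dataset names (`admin`, `clinical_a`, `clinical_b`), the function names and the rule that a patient enters the cohort when both clinical records can be connected are all hypothetical choices made for illustration.

```python
from itertools import combinations


def pairwise_links(names):
    """Link every dataset to every other dataset: n * (n - 1) / 2 linkages."""
    return list(combinations(names, 2))


def spine_links(names, spine):
    """Link every other dataset to the spine only: n - 1 linkages."""
    return [(spine, name) for name in names if name != spine]


def linked_cohort(datasets, links, required):
    """Return the IDs whose records in all `required` datasets can be
    connected using only the linkages listed in `links`."""
    cohort = set()
    for pid in set().union(*datasets.values()):
        # A linkage is usable for this patient only if both datasets hold them.
        usable = [(a, b) for a, b in links
                  if pid in datasets[a] and pid in datasets[b]]
        # Start from the first required dataset and follow usable linkages.
        reached = {required[0]} if pid in datasets[required[0]] else set()
        changed = True
        while changed:
            changed = False
            for a, b in usable:
                if (a in reached) != (b in reached):
                    reached |= {a, b}
                    changed = True
        if set(required) <= reached:
            cohort.add(pid)
    return cohort


# Toy data (entirely made up): sets of patient identifiers per dataset.
datasets = {
    "admin": {1, 2, 3, 5},        # hypothetical spine dataset
    "clinical_a": {1, 2, 3, 4},
    "clinical_b": {1, 2, 4, 6},
}
names = list(datasets)
required = ("clinical_a", "clinical_b")

pairwise_cohort = linked_cohort(datasets, pairwise_links(names), required)
spine_cohort = linked_cohort(datasets, spine_links(names, "admin"), required)

print(len(pairwise_links(names)), sorted(pairwise_cohort))    # 3 [1, 2, 4]
print(len(spine_links(names, "admin")), sorted(spine_cohort))  # 2 [1, 2]
```

In this toy data, patient 4 appears in both clinical datasets but not in the spine, so spine linkage cannot connect the two clinical records and the patient is lost. This mirrors the abstract's point that spine linkage depends on high case ascertainment in the spine dataset.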
Type: | Article |
---|---|
Title: | Linkage of multiple electronic health record datasets using a 'spine linkage' approach compared with all 'pairwise linkages' |
Location: | England |
Open access status: | An open access version is available from UCL Discovery |
DOI: | 10.1093/ije/dyac130 |
Publisher version: | https://doi.org/10.1093/ije/dyac130 |
Language: | English |
Additional information: | This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs licence (https://creativecommons.org/licenses/by-nc-nd/4.0/), which permits non-commercial reproduction and distribution of the work, in any medium, provided the original work is not altered or transformed in any way, and that the work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
Keywords: | Record linkage, electronic health records, pairwise linkage, spine linkage approach |
UCL classification: | UCL; UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Population Health Sciences > UCL GOS Institute of Child Health > Population, Policy and Practice Dept; UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Population Health Sciences > UCL GOS Institute of Child Health; UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences |
URI: | https://discovery.ucl.ac.uk/id/eprint/10150953 |