UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Participation bias in the UK Biobank distorts genetic associations and downstream analyses

Schoeler, Tabea; Speed, Doug; Porcu, Eleonora; Pirastu, Nicola; Pingault, Jean-Baptiste; Kutalik, Zoltán; (2023) Participation bias in the UK Biobank distorts genetic associations and downstream analyses. Nature Human Behaviour 10.1038/s41562-023-01579-9. (In press). Green open access

[thumbnail of s41562-023-01579-9.pdf]
Preview
Text
s41562-023-01579-9.pdf - Published Version

Download (1MB) | Preview

Abstract

While volunteer-based studies such as the UK Biobank have become the cornerstone of genetic epidemiology, the participating individuals are rarely representative of their target population. To evaluate the impact of selective participation, here we derived UK Biobank participation probabilities on the basis of 14 variables harmonized across the UK Biobank and a representative sample. We then conducted weighted genome-wide association analyses on 19 traits. Comparing the output from weighted genome-wide association analyses (neffective = 94,643 to 102,215) with that from standard genome-wide association analyses (n = 263,464 to 283,749), we found that increasing representativeness led to changes in SNP effect sizes and identified novel SNP associations for 12 traits. While heritability estimates were less impacted by weighting (maximum change in h2, 5%), we found substantial discrepancies for genetic correlations (maximum change in rg, 0.31) and Mendelian randomization estimates (maximum change in βSTD, 0.15) for socio-behavioural traits. We urge the field to increase representativeness in biobank samples, especially when studying genetic correlates of behaviour, lifestyles and social outcomes.

Type: Article
Title: Participation bias in the UK Biobank distorts genetic associations and downstream analyses
Location: England
Open access status: An open access version is available from UCL Discovery
DOI: 10.1038/s41562-023-01579-9
Publisher version: https://doi.org/10.1038/s41562-023-01579-9
Language: English
Additional information: © 2023 Springer Nature Limited. This article is licensed under a Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/).
Keywords: Behavioural genetics, Genome-wide association studies, Population genetics, Quantitative trait
UCL classification: UCL
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences > Div of Psychology and Lang Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences > Div of Psychology and Lang Sciences > Clinical, Edu and Hlth Psychology
URI: https://discovery.ucl.ac.uk/id/eprint/10169318
Downloads since deposit
43Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item