UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Polygenic risk prediction: why and when out-of-sample prediction R2 can exceed SNP-based heritability

Wang, X; Walker, A; Revez, JA; Ni, G; Adams, MJ; McIntosh, AM; Wray, NR; ... Bøcker Pedersen, C; + view all (2023) Polygenic risk prediction: why and when out-of-sample prediction R2 can exceed SNP-based heritability. American Journal of Human Genetics , 110 (7) pp. 1207-1215. 10.1016/j.ajhg.2023.06.006. Green open access

[thumbnail of Lewis_Wang submitted 2023.pdf]
Preview
Text
Lewis_Wang submitted 2023.pdf

Download (370kB) | Preview

Abstract

In polygenic score (PGS) analysis, the coefficient of determination (R2) is a key statistic to evaluate efficacy. R2 is the proportion of phenotypic variance explained by the PGS, calculated in a cohort that is independent of the genome-wide association study (GWAS) that provided estimates of allelic effect sizes. The SNP-based heritability (hSNP2, the proportion of total phenotypic variances attributable to all common SNPs) is the theoretical upper limit of the out-of-sample prediction R2. However, in real data analyses R2 has been reported to exceed hSNP2, which occurs in parallel with the observation that hSNP2 estimates tend to decline as the number of cohorts being meta-analyzed increases. Here, we quantify why and when these observations are expected. Using theory and simulation, we show that if heterogeneities in cohort-specific hSNP2 exist, or if genetic correlations between cohorts are less than one, hSNP2 estimates can decrease as the number of cohorts being meta-analyzed increases. We derive conditions when the out-of-sample prediction R2 will be greater than hSNP2 and show the validity of our derivations with real data from a binary trait (major depression) and a continuous trait (educational attainment). Our research calls for a better approach to integrating information from multiple cohorts to address issues of between-cohort heterogeneity.

Type: Article
Title: Polygenic risk prediction: why and when out-of-sample prediction R2 can exceed SNP-based heritability
Location: United States
Open access status: An open access version is available from UCL Discovery
DOI: 10.1016/j.ajhg.2023.06.006
Publisher version: https://doi.org/10.1016/j.ajhg.2023.06.006
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords: Polygenic risk prediction, out-of-sample prediction R2, meta-analysis, SNP-based heritability
UCL classification: UCL
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences > Division of Psychiatry
URI: https://discovery.ucl.ac.uk/id/eprint/10176972
Downloads since deposit
4Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item