UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Selecting SNPs informative for African, American Indian and European Ancestry: application to the Family Investigation of Nephropathy and Diabetes (FIND)

Williams, RC; Elston, RC; Kumar, P; Knowler, WC; Abboud, HE; Adler, S; Bowden, DW; ... FIND Research Group, .; + view all (2016) Selecting SNPs informative for African, American Indian and European Ancestry: application to the Family Investigation of Nephropathy and Diabetes (FIND). BMC Genomics , 17 , Article 325. 10.1186/s12864-016-2654-x. Green open access

[thumbnail of Guo_s12864-016-2654-x.pdf]
Preview
Text
Guo_s12864-016-2654-x.pdf - Published Version

Download (1MB) | Preview

Abstract

BACKGROUND: The presence of population structure in a sample may confound the search for important genetic loci associated with disease. Our four samples in the Family Investigation of Nephropathy and Diabetes (FIND), European Americans, Mexican Americans, African Americans, and American Indians are part of a genome- wide association study in which population structure might be particularly important. We therefore decided to study in detail one component of this, individual genetic ancestry (IGA). From SNPs present on the Affymetrix 6.0 Human SNP array, we identified 3 sets of ancestry informative markers (AIMs), each maximized for the information in one the three contrasts among ancestral populations: Europeans (HAPMAP, CEU), Africans (HAPMAP, YRI and LWK), and Native Americans (full heritage Pima Indians). We estimate IGA and present an algorithm for their standard errors, compare IGA to principal components, emphasize the importance of balancing information in the ancestry informative markers (AIMs), and test the association of IGA with diabetic nephropathy in the combined sample. RESULTS: A fixed parental allele maximum likelihood algorithm was applied to the FIND to estimate IGA in four samples: 869 American Indians; 1385 African Americans; 1451 Mexican Americans; and 826 European Americans. When the information in the AIMs is unbalanced, the estimates are incorrect with large error. Individual genetic admixture is highly correlated with principle components for capturing population structure. It takes ~700 SNPs to reduce the average standard error of individual admixture below 0.01. When the samples are combined, the resulting population structure creates associations between IGA and diabetic nephropathy. CONCLUSIONS: The identified set of AIMs, which include American Indian parental allele frequencies, may be particularly useful for estimating genetic admixture in populations from the Americas. Failure to balance information in maximum likelihood, poly-ancestry models creates biased estimates of individual admixture with large error. This also occurs when estimating IGA using the Bayesian clustering method as implemented in the program STRUCTURE. Odds ratios for the associations of IGA with disease are consistent with what is known about the incidence and prevalence of diabetic nephropathy in these populations.

Type: Article
Title: Selecting SNPs informative for African, American Indian and European Ancestry: application to the Family Investigation of Nephropathy and Diabetes (FIND)
Location: England
Open access status: An open access version is available from UCL Discovery
DOI: 10.1186/s12864-016-2654-x
Publisher version: http://doi.org/10.1186/s12864-016-2654-x
Language: English
Additional information: © 2016 Williams et al. This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
Keywords: Diabetic nephropathy, Individual genetic ancestry, Population structure, SNP, African Americans, Algorithms, Chromosome Mapping, Diabetic Nephropathies, European Continental Ancestry Group, Genetic Markers, Genetic Predisposition to Disease, Genome-Wide Association Study, Humans, Indians, North American, Likelihood Functions, Mexican Americans, Models, Genetic, Oligonucleotide Array Sequence Analysis, Polymorphism, Single Nucleotide, Principal Component Analysis, United States
UCL classification: UCL
UCL > Provost and Vice Provost Offices > School of Education
UCL > Provost and Vice Provost Offices > School of Education > UCL Institute of Education
UCL > Provost and Vice Provost Offices > School of Education > UCL Institute of Education > Centre for Languages and Intl Educatn
URI: https://discovery.ucl.ac.uk/id/eprint/10036856
Downloads since deposit
77Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item