Lawson, DJ; Van Dorp, L; Falush, D (2018) A tutorial on how not to over-interpret STRUCTURE and ADMIXTURE bar plots. Nature Communications, 9, Article 3258. 10.1038/s41467-018-05257-7.
Abstract
Genetic clustering algorithms, implemented in programs such as STRUCTURE and ADMIXTURE, have been used extensively in the characterisation of individuals and populations based on genetic data. A successful example is the reconstruction of the genetic history of African Americans as a product of recent admixture between highly differentiated populations. Histories can also be reconstructed using the same procedure for groups that do not have admixture in their recent history, where recent genetic drift is strong or that deviate in other ways from the underlying inference model. Unfortunately, such histories can be misleading. We have implemented an approach, badMIXTURE, to assess the goodness of fit of the model using the ancestry “palettes” estimated by CHROMOPAINTER and apply it to both simulated data and real case studies. Combining these complementary analyses with additional methods that are designed to test specific hypotheses allows a richer and more robust analysis of recent demographic history.
Type: | Article |
---|---|
Title: | A tutorial on how not to over-interpret STRUCTURE and ADMIXTURE bar plots |
Open access status: | An open access version is available from UCL Discovery |
DOI: | 10.1038/s41467-018-05257-7 |
Publisher version: | http://doi.org/10.1038/s41467-018-05257-7 |
Language: | English |
Additional information: | This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
Keywords: | Science & Technology, Multidisciplinary Sciences, Science & Technology - Other Topics, Multilocus Genotype Data, Population-Structure, Genetic-Structure, Program Structure, History, Inference, Clusters, Individuals, Simulation, Sequence |
UCL classification: | UCL; UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences; UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences; UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences > Div of Biosciences; UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences > Div of Biosciences > Genetics, Evolution and Environment |
URI: | https://discovery.ucl.ac.uk/id/eprint/10054913 |