UCL Discovery

A tutorial on how not to over-interpret STRUCTURE and ADMIXTURE bar plots

Lawson, DJ; Van Dorp, L; Falush, D; (2018) A tutorial on how not to over-interpret STRUCTURE and ADMIXTURE bar plots. Nature Communications, 9, Article 3258. 10.1038/s41467-018-05257-7. Green open access

s41467-018-05257-7.pdf - Published Version


Abstract

Genetic clustering algorithms, implemented in programs such as STRUCTURE and ADMIXTURE, have been used extensively in the characterisation of individuals and populations based on genetic data. A successful example is the reconstruction of the genetic history of African Americans as a product of recent admixture between highly differentiated populations. Histories can also be reconstructed using the same procedure for groups that do not have admixture in their recent history, where recent genetic drift is strong or that deviate in other ways from the underlying inference model. Unfortunately, such histories can be misleading. We have implemented an approach, badMIXTURE, to assess the goodness of fit of the model using the ancestry “palettes” estimated by CHROMOPAINTER and apply it to both simulated data and real case studies. Combining these complementary analyses with additional methods that are designed to test specific hypotheses allows a richer and more robust analysis of recent demographic history.

Type: Article
Title: A tutorial on how not to over-interpret STRUCTURE and ADMIXTURE bar plots
Open access status: An open access version is available from UCL Discovery
DOI: 10.1038/s41467-018-05257-7
Publisher version: http://doi.org/10.1038/s41467-018-05257-7
Language: English
Additional information: This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
Keywords: Science & Technology, Multidisciplinary Sciences, Science & Technology - Other Topics, Multilocus Genotype Data, Population-Structure, Genetic-Structure, Program Structure, History, Inference, Clusters, Individuals, Simulation, Sequence
UCL classification: UCL
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences > Div of Biosciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences > Div of Biosciences > Genetics, Evolution and Environment
URI: https://discovery.ucl.ac.uk/id/eprint/10054913