Dalquen, D;
Zhu, T;
Yang, Z;
(2017)
Maximum likelihood implementation of an isolation-with-migration model for three species.
Systematic Biology
, 66
(3)
pp. 379-398.
10.1093/sysbio/syw063.
Preview |
Text
USYB-2016-042.Dalquen.pdf - Accepted Version Download (2MB) | Preview |
Abstract
We develop a maximum likelihood (ML) method for estimating migration rates between species using genomic sequence data. A species tree is used to accommodate the phylogenetic relationships among three species, allowing for migration between the two sister species, while the third species is used as an outgroup. A Markov chain characterization of the genealogical process of coalescence and migration is used to integrate out the migration histories at each locus analytically, while Gaussian quadrature is used to integrate over the coalescent times on each genealogical tree numerically. This is an extension of our early implementation of the symmetrical isolation-with-migration model for three species to accommodate arbitrary loci with two or three sequences per locus and to allow asymmetrical migration rates. Our implementation can accommodate tens of thousands of loci, making it feasible to analyze genome-scale datasets to test for gene flow. We calculate the posterior probabilities of gene trees at individual loci to identify genomic regions that are likely to have been transferred between species due to gene flow. We conduct a simulation study to examine the statistical properties of the likelihood ratio test for gene flow between the two ingroup species and of the maximum likelihood estimates of model parameters such as the migration rate. Inclusion of data from a third outgroup species is found to increase dramatically the power of the test and the precision of parameter estimation. We compiled and analyzed several genomic datasets from the Drosophila fruit flies. Our analyses suggest no migration from D. melanogaster to D. simulans, and a significant amount of gene flow from D. simulans to D. melanogaster, at the rate of ~0.02 migrant individuals per generation. We discuss the utility of the multispecies coalescent model for species tree estimation, accounting for incomplete lineage sorting and migration.
Type: | Article |
---|---|
Title: | Maximum likelihood implementation of an isolation-with-migration model for three species |
Open access status: | An open access version is available from UCL Discovery |
DOI: | 10.1093/sysbio/syw063 |
Publisher version: | http://dx.doi.org/10.1093/sysbio/syw063 |
Language: | English |
Additional information: | Copyright © The Author(s) 2016. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved. This is a pre-copyedited, author-produced PDF of an article accepted for publication in Systematic Biology following peer review. The version of record [insert complete citation information here] is available online at: http://dx.doi.org/10.1093/sysbio/syw063 |
Keywords: | Multispecies coalescent; maximum likelihood; speciation; IM model; migration |
UCL classification: | UCL UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences > Div of Biosciences UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences > Div of Biosciences > Genetics, Evolution and Environment |
URI: | https://discovery.ucl.ac.uk/id/eprint/1503515 |
Archive Staff Only
View Item |