UCL Discovery

Maximum likelihood implementation of an isolation-with-migration model for three species

Dalquen, D; Zhu, T; Yang, Z; (2017) Maximum likelihood implementation of an isolation-with-migration model for three species. Systematic Biology, 66 (3) pp. 379-398. DOI: 10.1093/sysbio/syw063. Green open access

USYB-2016-042.Dalquen.pdf - Accepted Version

Abstract

We develop a maximum likelihood (ML) method for estimating migration rates between species using genomic sequence data. A species tree is used to accommodate the phylogenetic relationships among three species, allowing for migration between the two sister species, while the third species is used as an outgroup. A Markov chain characterization of the genealogical process of coalescence and migration is used to integrate out the migration histories at each locus analytically, while Gaussian quadrature is used to integrate over the coalescent times on each genealogical tree numerically. This extends our earlier implementation of the symmetrical isolation-with-migration model for three species to accommodate arbitrary loci with two or three sequences per locus and to allow asymmetrical migration rates. Our implementation can accommodate tens of thousands of loci, making it feasible to analyze genome-scale datasets to test for gene flow. We calculate the posterior probabilities of gene trees at individual loci to identify genomic regions that are likely to have been transferred between species through gene flow. We conduct a simulation study to examine the statistical properties of the likelihood ratio test for gene flow between the two ingroup species and of the maximum likelihood estimates of model parameters such as the migration rate. Including data from a third, outgroup species is found to increase dramatically both the power of the test and the precision of parameter estimation. We compiled and analyzed several genomic datasets from Drosophila fruit flies. Our analyses suggest no migration from D. melanogaster to D. simulans, and a significant amount of gene flow from D. simulans to D. melanogaster, at a rate of ~0.02 migrant individuals per generation. We discuss the utility of the multispecies coalescent model for species tree estimation, accounting for incomplete lineage sorting and migration.
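
The numerical step mentioned in the abstract, integrating over coalescent times by Gaussian quadrature, can be illustrated with a much simpler toy problem. The sketch below is not the authors' implementation: it assumes a single population with no migration, two sequences, the JC69 substitution model, and a Gauss-Laguerre rule (the abstract does not say which quadrature rule is used). The function names and the parameter value theta = 0.01 are hypothetical, chosen for illustration only. Under these assumptions the coalescent time t, measured in expected substitutions per site, is exponential with mean theta/2, and the integral has a closed form that the quadrature result can be checked against.

```python
import numpy as np


def expected_site_difference(theta, n_points=16):
    """Approximate E[p(t)] over the coalescent time t by Gauss-Laguerre quadrature.

    Toy assumptions (not taken from the paper): one population, two sequences,
    JC69 substitution model, t ~ Exponential with mean theta / 2.
    """
    # Nodes x and weights w approximate the integral of exp(-x) * f(x) on [0, inf).
    x, w = np.polynomial.laguerre.laggauss(n_points)
    # Change of variable x = 2 t / theta maps the exponential density onto exp(-x).
    t = theta * x / 2.0
    # JC69 probability that two sequences differ at a site; their distance is 2 t.
    p_diff = 0.75 * (1.0 - np.exp(-8.0 * t / 3.0))
    return float(np.sum(w * p_diff))


def expected_site_difference_exact(theta):
    """Closed form of the same integral, used only to check the quadrature."""
    return theta / (1.0 + 4.0 * theta / 3.0)


theta = 0.01  # hypothetical value for illustration
approx = expected_site_difference(theta)
exact = expected_site_difference_exact(theta)
print(abs(approx - exact))
```

In the full model described above, the integrand for each locus would be the likelihood of the sequence alignment given the gene tree and its coalescent times, and the density of those times would come from the Markov chain for coalescence and migration rather than from a simple exponential.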

Type: Article
Title: Maximum likelihood implementation of an isolation-with-migration model for three species
Open access status: An open access version is available from UCL Discovery
DOI: 10.1093/sysbio/syw063
Publisher version: http://dx.doi.org/10.1093/sysbio/syw063
Language: English
Additional information: Copyright © The Author(s) 2016. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved. This is a pre-copyedited, author-produced PDF of an article accepted for publication in Systematic Biology following peer review. The version of record (Dalquen, D; Zhu, T; Yang, Z (2017) Systematic Biology, 66 (3) pp. 379-398) is available online at: http://dx.doi.org/10.1093/sysbio/syw063
Keywords: Multispecies coalescent; maximum likelihood; speciation; IM model; migration
UCL classification: UCL
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences > Div of Biosciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences > Div of Biosciences > Genetics, Evolution and Environment
URI: https://discovery.ucl.ac.uk/id/eprint/1503515