UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Bayesian statistical modelling of genetic sequence evolution

Angelis, K; (2016) Bayesian statistical modelling of genetic sequence evolution. Doctoral thesis , UCL (University College London). Green open access

[thumbnail of Kostas_Thesis.pdf]
Preview
Text
Kostas_Thesis.pdf - Accepted Version

Download (5MB) | Preview

Abstract

Bayesian statistics has been at the heart of phylogenetic inference over the last decade, particularly after the development of powerful programs that implement efficient Markov chain Monte Carlo algorithms, allowing inference from multi-parametric problems in realistic time frames. In this thesis we develop and test Bayesian methods to analyse molecular sequence data to address important biological questions. First, we review some fundamental aspects of Bayesian inference and highlight current Bayesian applications in molecular evolution with particular focus in studying natural selection and estimating species divergence times. Then, we develop a new Bayesian method to estimate the nonsynonymous/synonymous rate ratio and evolutionary distance for pairwise sequence comparisons. The new method addresses weaknesses of previous counting and maximum-likelihood methods. It is also computationally efficient and thus suitable for genome-scale screening. Then, we explore the performance of existing Bayesian algorithms in estimating species divergence times. In particular, we study the impact of ancestral population size and incomplete lineage sorting on Bayesian estimates of species divergence times under the molecular clock, when those factors of molecular evolution are ignored by the inference model. The estimates can be highly biased, especially in the case of shallow phylogenies with large ancestral population sizes. Then, using computer simulations and real data analyses we study the effect of five commonly used partitioning strategies for divergence times estimation and show that the choice of the partitioning scheme is important in case of serious clock-violation with incorrect prior assumptions. Finally, a Bayesian molecular clock dating study is performed to estimate the timeline of animal evolution. The results indicate that the time estimates are highly variable, precluding the inference of a precise timescale of animal evolution based on the current data and methods.

Type: Thesis (Doctoral)
Title: Bayesian statistical modelling of genetic sequence evolution
Event: University College London
Open access status: An open access version is available from UCL Discovery
Language: English
UCL classification: UCL > Provost and Vice Provost Offices
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences > Div of Biosciences
URI: https://discovery.ucl.ac.uk/id/eprint/1471622
Downloads since deposit
417Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item