UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Incorporating alignment uncertainty into Felsenstein’s phylogenetic bootstrap to improve its reliability

Chang, J-M; Floden, EW; Herrero, J; Gascuel, O; Di Tommaso, P; Notredame, C; (2019) Incorporating alignment uncertainty into Felsenstein’s phylogenetic bootstrap to improve its reliability. Bioinformatics 10.1093/bioinformatics/btz082. (In press). Green open access

[img]
Preview
Text
Herrero_Incorporating alignment uncertainty into Felsenstein's phylogenetic bootstrap to improve its reliability_Proof.pdf - Published version

Download (727kB) | Preview

Abstract

MOTIVATION: Most evolutionary analyses are based on pre-estimated multiple sequence alignment. Wong et al. established the existence of an uncertainty induced by multiple sequence alignment when reconstructing phylogenies. They were able to show that in many cases different aligners produce different phylogenies, with no simple objective criterion sufficient to distinguish among these alternatives. RESULTS: We demonstrate that incorporating MSA induced uncertainty into bootstrap sampling can significantly increase correlation between clade correctness and its corresponding bootstrap value. Our procedure involves concatenating several alternative multiple sequence alignments of the same sequences, produced using different commonly used aligners. We then draw bootstrap replicates while favoring columns of the more unique aligner among the concatenated aligners. We named this concatenation and bootstrapping method, Weighted Partial Super Bootstrap (wpSBOOT). We show on three simulated datasets of 16, 32 and 64 tips that our method improves the predictive power of bootstrap values. We also used as a benchmark an empirical collection of 853 1-to-1 orthologous genes from seven yeast species and found wpSBOOT to significantly improve discrimination capacity between topologically correct and incorrect trees. Bootstrap values of wpSBOOT are comparable to similar readouts estimated using a single method. However, for reduced trees by 50% and 95% bootstrap thresholds, wpSBOOT comes out the lowest Type I error (less FP). AVAILABILITY: The automated generation of replicates has been implemented in the T-Coffee package, which is available as open source freeware available from www.tcoffee.org.

Type: Article
Title: Incorporating alignment uncertainty into Felsenstein’s phylogenetic bootstrap to improve its reliability
Location: England
Open access status: An open access version is available from UCL Discovery
DOI: 10.1093/bioinformatics/btz082
Publisher version: https://doi.org/10.1093/bioinformatics/btz082
Language: English
Additional information: © The Author(s) 2019. Published by Oxford University Press. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/).
UCL classification: UCL > Provost and Vice Provost Offices
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Medical Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Medical Sciences > Cancer Institute
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Medical Sciences > Cancer Institute > Research Department of Cancer Bio
URI: https://discovery.ucl.ac.uk/id/eprint/10068234
Downloads since deposit
94Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item