Glover, N;
Sheppard, S;
Dessimoz, C;
(2021)
Homoeolog inference methods requiring bidirectional best hits or synteny miss many pairs.
Genome Biology and Evolution
10.1093/gbe/evab077.
(In press).
Preview |
Text
evab077-2.pdf - Accepted Version Download (808kB) | Preview |
Abstract
Homoeologs are pairs of genes or chromosomes in the same species that originated by speciation and were brought back together in the same genome by allopolyploidization. Bioinformatic methods for accurate homoeology inference are crucial for studying the evolutionary consequences of polyploidization, and homoeology is typically inferred on the basis of bidirectional best hit (BBH) and/or positional conservation (synteny). However, these methods neglect the fact that genes can duplicate and move, both prior to and after the allopolyploidization event. These duplications and movements can result in many-to-many and/or nonsyntenic homoeologs-which thus remain undetected and unstudied. Here, using the allotetraploid upland cotton (Gossypium hirsutum) as a case study, we show that conventional approaches indeed miss a substantial proportion of homoeologs. Additionally, we found that many of the missed pairs of homoeologs are broadly and highly expressed. A Gene Ontology (GO) analysis revealed a high proportion of the nonsyntenic and non-BBH homoeologs to be involved in protein translation and are likely to contribute to the functional repertoire of cotton. Thus, from an evolutionary and functional genomics standpoint, choosing a homoeolog inference method which does not solely rely on 1:1 relationship cardinality or synteny is crucial for not missing these potentially important homoeolog pairs.
Type: | Article |
---|---|
Title: | Homoeolog inference methods requiring bidirectional best hits or synteny miss many pairs |
Location: | England |
Open access status: | An open access version is available from UCL Discovery |
DOI: | 10.1093/gbe/evab077 |
Publisher version: | https://doi.org/10.1093/gbe/evab077 |
Language: | English |
Additional information: | © The Author(s) 2021. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/). |
Keywords: | Gossypium hirsutum, best bidirectional hit, comparative genomics, cotton, homoeolog, synteny |
UCL classification: | UCL UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences > Div of Biosciences UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences > Div of Biosciences > Genetics, Evolution and Environment |
URI: | https://discovery.ucl.ac.uk/id/eprint/10128747 |
Archive Staff Only
View Item |