UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

KGCAK: a K-mer based database for genome-wide phylogeny and complexity evaluation

Wang, D; Xu, J; Yu, J; (2015) KGCAK: a K-mer based database for genome-wide phylogeny and complexity evaluation. Biol Direct , 10 , Article 53. 10.1186/s13062-015-0083-4. Green open access

[thumbnail of KGCAK: a K-mer based database for genome-wide phylogeny and complexity evaluation.pdf]
Preview
Text
KGCAK: a K-mer based database for genome-wide phylogeny and complexity evaluation.pdf

Download (1MB) | Preview

Abstract

BACKGROUND: The K-mer approach, treating genomic sequences as simple characters and counting the relative abundance of each string upon a fixed K, has been extensively applied to phylogeny inference for genome assembly, annotation, and comparison. RESULTS: To meet increasing demands for comparing large genome sequences and to promote the use of the K-mer approach, we develop a versatile database, KGCAK ( http://kgcak.big.ac.cn/KGCAK/ ), containing ~8,000 genomes that include genome sequences of diverse life forms (viruses, prokaryotes, protists, animals, and plants) and cellular organelles of eukaryotic lineages. It builds phylogeny based on genomic elements in an alignment-free fashion and provides in-depth data processing enabling users to compare the complexity of genome sequences based on K-mer distribution. CONCLUSION: We hope that KGCAK becomes a powerful tool for exploring relationship within and among groups of species in a tree of life based on genomic data.

Type: Article
Title: KGCAK: a K-mer based database for genome-wide phylogeny and complexity evaluation
Location: England
Open access status: An open access version is available from UCL Discovery
DOI: 10.1186/s13062-015-0083-4
Publisher version: http://dx.doi.org/10.1186/s13062-015-0083-4
Additional information: © 2015 Wang et al. Open Access. This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
UCL classification: UCL
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Medical Sciences
URI: https://discovery.ucl.ac.uk/id/eprint/1473312
Downloads since deposit
119Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item