UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

cath-resolve-hits: a new tool that resolves domain matches suspiciously quickly

Lewis, TE; Sillitoe, I; Lees, JG; (2018) cath-resolve-hits: a new tool that resolves domain matches suspiciously quickly. Bioinformatics , 35 (10) pp. 1766-1767. 10.1093/bioinformatics/bty863. Green open access

[thumbnail of Sillitoe_Cath-resolve-hits. a new tool that resolves domain matches suspiciously quickly_VoR.pdf]
Preview
Text
Sillitoe_Cath-resolve-hits. a new tool that resolves domain matches suspiciously quickly_VoR.pdf - Published Version

Download (154kB) | Preview

Abstract

MOTIVATION: Many bioinformatics areas require us to assign domain matches onto stretches of a query protein. Starting with a set of candidate matches, we want to identify the optimal subset that has limited/no overlap between matches. This may be further complicated by discontinuous domains in the input data. Existing tools are increasingly facing very large data-sets for which they require prohibitive amounts of CPU-time and memory. RESULTS: We present cath-resolve-hits (CRH), a new tool that uses a dynamic-programming algorithm implemented in open-source C ++ to handle large data-sets quickly (up to ∼1 million hits/second) and in reasonable amounts of memory. It accepts multiple input formats and provides its output in plain text, JSON or graphical HTML. We describe a benchmark against an existing algorithm, which shows CRH delivers very similar or slightly improved results and very much improved CPU/memory performance on large data-sets. AVAILABILITY AND IMPLEMENTATION: CRH is available at https://github.com/UCLOrengoGroup/cath-tools; documentation is available at http://cath-tools.readthedocs.io.

Type: Article
Title: cath-resolve-hits: a new tool that resolves domain matches suspiciously quickly
Location: England
Open access status: An open access version is available from UCL Discovery
DOI: 10.1093/bioinformatics/bty863
Publisher version: https://doi.org/10.1093/bioinformatics/bty863
Language: English
Additional information: © The Author(s) 2018. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/).
UCL classification: UCL
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences > Div of Biosciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences > Div of Biosciences > Structural and Molecular Biology
URI: https://discovery.ucl.ac.uk/id/eprint/10059632
Downloads since deposit
85Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item