UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Towards a comprehensive structural coverage of completed genomes: a structural genomics viewpoint

Marsden, RL; Lewis, TA; Orengo, CA; (2007) Towards a comprehensive structural coverage of completed genomes: a structural genomics viewpoint. BMC BIOINFORMATICS , 8 , Article 86. 10.1186/1471-2105-8-86. Green open access

[thumbnail of 1471-2105-8-86.pdf]
Preview
PDF
1471-2105-8-86.pdf

Download (726kB)

Abstract

Background: Structural genomics initiatives were established with the aim of solving protein structures on a large-scale. For many initiatives, such as the Protein Structure Initiative ( PSI), the primary aim of target selection is focussed towards structurally characterising protein families which, so far, lack a structural representative. It is therefore of considerable interest to gain insights into the number and distribution of these families, and what efforts may be required to achieve a comprehensive structural coverage across all protein families.Results: In this analysis we have derived a comprehensive domain annotation of the genomes using CATH, Pfam-A and Newfam domain families. We consider what proportions of structurally uncharacterised families are accessible to high-throughput structural genomics pipelines, specifically those targeting families containing multiple prokaryotic orthologues. In measuring the domain coverage of the genomes, we show the benefits of selecting targets from both structurally uncharacterised domain families, whilst in addition, pursuing additional targets from large structurally characterised protein superfamilies.Conclusion: This work suggests that such a combined approach to target selection is essential if structural genomics is to achieve a comprehensive structural coverage of the genomes, leading to greater insights into structure and the mechanisms that underlie protein evolution.

Type: Article
Title: Towards a comprehensive structural coverage of completed genomes: a structural genomics viewpoint
Open access status: An open access version is available from UCL Discovery
DOI: 10.1186/1471-2105-8-86
Publisher version: http://dx.doi.org/10.1186/1471-2105-8-86
Language: English
Additional information: © 2007 Marsden et al; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Keywords: HIDDEN MARKOV-MODELS, TARGET SELECTION, MICROBIAL GENOMES, PROTEIN FAMILIES, SEQUENCE, DATABASE, CATH, SUPERFAMILIES, EFFICIENT, SPACE
UCL classification: UCL
UCL > Provost and Vice Provost Offices
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences > Div of Biosciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences > Div of Biosciences > Structural and Molecular Biology
URI: https://discovery.ucl.ac.uk/id/eprint/129455
Downloads since deposit
119Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item