UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

AlphaFold2 reveals commonalities and novelties in protein structure space for 21 model organisms

Bordin, Nicola; Sillitoe, Ian; Nallapareddy, Vamsi; Rauer, Clemens; Lam, Su Datt; Waman, Vaishali P; Sen, Neeladri; ... Orengo, Christine; + view all (2023) AlphaFold2 reveals commonalities and novelties in protein structure space for 21 model organisms. Communications Biology , 6 (1) , Article 160. 10.1038/s42003-023-04488-9. Green open access

[thumbnail of AlphaFold2 reveals commonalities and novelties in protein structure space for 21 model organisms.pdf]
Preview
PDF
AlphaFold2 reveals commonalities and novelties in protein structure space for 21 model organisms.pdf - Published Version

Download (3MB) | Preview

Abstract

Deep-learning (DL) methods like DeepMind's AlphaFold2 (AF2) have led to substantial improvements in protein structure prediction. We analyse confident AF2 models from 21 model organisms using a new classification protocol (CATH-Assign) which exploits novel DL methods for structural comparison and classification. Of ~370,000 confident models, 92% can be assigned to 3253 superfamilies in our CATH domain superfamily classification. The remaining cluster into 2367 putative novel superfamilies. Detailed manual analysis on 618 of these, having at least one human relative, reveal extremely remote homologies and further unusual features. Only 25 novel superfamilies could be confirmed. Although most models map to existing superfamilies, AF2 domains expand CATH by 67% and increases the number of unique 'global' folds by 36% and will provide valuable insights on structure function relationships. CATH-Assign will harness the huge expansion in structural data provided by DeepMind to rationalise evolutionary changes driving functional divergence.

Type: Article
Title: AlphaFold2 reveals commonalities and novelties in protein structure space for 21 model organisms
Location: England
Open access status: An open access version is available from UCL Discovery
DOI: 10.1038/s42003-023-04488-9
Publisher version: https://doi.org/10.1038/s42003-023-04488-9
Language: English
Additional information: © 2023 Springer Nature Limited. This article is licensed under a Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/).
Keywords: Humans, Furylfuramide, Databases, Protein, Proteins
UCL classification: UCL
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences > Div of Biosciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences > Div of Biosciences > Structural and Molecular Biology
URI: https://discovery.ucl.ac.uk/id/eprint/10164833
Downloads since deposit
52Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item