UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Characterizing and explaining the impact of disease-associated mutations in proteins without known structures or structural homologs

Sen, Neeladri; Anishchenko, Ivan; Bordin, Nicola; Sillitoe, Ian; Velankar, Sameer; Baker, David; Orengo, Christine; (2022) Characterizing and explaining the impact of disease-associated mutations in proteins without known structures or structural homologs. Briefings in Bioinformatics 10.1093/bib/bbac187. (In press). Green open access

[thumbnail of bbac187.pdf]
Preview
Text
bbac187.pdf - Published Version

Download (1MB) | Preview

Abstract

Mutations in human proteins lead to diseases. The structure of these proteins can help understand the mechanism of such diseases and develop therapeutics against them. With improved deep learning techniques, such as RoseTTAFold and AlphaFold, we can predict the structure of proteins even in the absence of structural homologs. We modeled and extracted the domains from 553 disease-associated human proteins without known protein structures or close homologs in the Protein Databank. We noticed that the model quality was higher and the Root mean square deviation (RMSD) lower between AlphaFold and RoseTTAFold models for domains that could be assigned to CATH families as compared to those which could only be assigned to Pfam families of unknown structure or could not be assigned to either. We predicted ligand-binding sites, protein–protein interfaces and conserved residues in these predicted structures. We then explored whether the disease-associated missense mutations were in the proximity of these predicted functional sites, whether they destabilized the protein structure based on ddG calculations or whether they were predicted to be pathogenic. We could explain 80% of these disease-associated mutations based on proximity to functional sites, structural destabilization or pathogenicity. When compared to polymorphisms, a larger percentage of disease-associated missense mutations were buried, closer to predicted functional sites, predicted as destabilizing and pathogenic. Usage of models from the two state-of-the-art techniques provide better confidence in our predictions, and we explain 93 additional mutations based on RoseTTAFold models which could not be explained based solely on AlphaFold models.

Type: Article
Title: Characterizing and explaining the impact of disease-associated mutations in proteins without known structures or structural homologs
Location: England
Open access status: An open access version is available from UCL Discovery
DOI: 10.1093/bib/bbac187
Publisher version: https://doi.org/10.1093/bib/bbac187
Language: English
Additional information: Copyright © The Author(s) 2022. Published by Oxford University Press. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
Keywords: protein structure modeling, mutation, AlphaFold, RoseTTAFold, disease-associated, functional site
UCL classification: UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences > Div of Biosciences > Structural and Molecular Biology
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences > Div of Biosciences
URI: https://discovery.ucl.ac.uk/id/eprint/10149940
Downloads since deposit
40Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item