UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

InterPro: the protein sequence classification resource in 2025

Blum, Matthias; Andreeva, Antonina; Florentino, Laise Cavalcanti; Chuguransky, Sara Rocio; Grego, Tiago; Hobbs, Emma; Pinto, Beatriz Lazaro; ... Bateman, Alex; + view all (2024) InterPro: the protein sequence classification resource in 2025. Nucleic Acids Research , Article gkae1082. 10.1093/nar/gkae1082. (In press). Green open access

[thumbnail of InterPro.pdf]
Preview
Text
InterPro.pdf - Published Version

Download (2MB) | Preview

Abstract

InterPro (https://www.ebi.ac.uk/interpro) is a freely accessible resource for the classification of protein sequences into families. It integrates predictive models, known as signatures, from multiple member databases to classify sequences into families and predict the presence of domains and significant sites. The InterPro database provides annotations for over 200 million sequences, ensuring extensive coverage of UniProtKB, the standard repository of protein sequences, and includes mappings to several other major resources, such as Gene Ontology (GO), Protein Data Bank in Europe (PDBe) and the AlphaFold Protein Structure Database. In this publication, we report on the status of InterPro (version 101.0), detailing new developments in the database, associated web interface and software. Notable updates include the increased integration of structures predicted by AlphaFold and the enhanced description of protein families using artificial intelligence. Over the past two years, more than 5000 new InterPro entries have been created. The InterPro website now offers access to 85 000 protein families and domains from its member databases and serves as a long-term archive for retired databases. InterPro data, software and tools are freely available.

Type: Article
Title: InterPro: the protein sequence classification resource in 2025
Location: England
Open access status: An open access version is available from UCL Discovery
DOI: 10.1093/nar/gkae1082
Publisher version: https://doi.org/10.1093/nar/gkae1082
Language: English
Additional information: © The Author(s) 2024. Published by Oxford University Press on behalf of Nucleic Acids Research. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
UCL classification: UCL
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences > Div of Biosciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences > Div of Biosciences > Structural and Molecular Biology
URI: https://discovery.ucl.ac.uk/id/eprint/10200737
Downloads since deposit
39Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item