UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Protein-ligand data at scale to support machine learning

Edwards, Aled M; Owen, Dafydd R; Zhang, Leili; Young, Damian W; Willson, Timothy M; Wellnitz, James; Wang, Yanli; ... Ackloo, Suzanne; + view all (2025) Protein-ligand data at scale to support machine learning. Nature Reviews Chemistry , 9 pp. 634-645. 10.1038/s41570-025-00737-z.

[thumbnail of Burgess-Brown_Protein-ligand data at scale .pdf] Text
Burgess-Brown_Protein-ligand data at scale .pdf
Access restricted to UCL open access staff until 24 January 2026.

Download (1MB)

Abstract

Target 2035 is a global initiative that aims to develop a potent and selective pharmacological modulator, such as a chemical probe, for every human protein by 2035. Here, we describe the Target 2035 roadmap to develop computational methods to improve small-molecule hit discovery, which is a key bottleneck in the discovery of chemical probes. Large, publicly available datasets of high-quality protein–small-molecule binding data will be created using affinity-selection mass spectrometry and DNA-encoded chemical library screening. Positive and negative data will be made openly available, and the machine learning community will be challenged to use these data to build models and predict new, diverse small-molecule binders. Iterative cycles of prediction and testing will lead to improved models and more successful predictions. By 2030, Target 2035 will have identified experimentally verified hits for thousands of human proteins and advanced the development of open-access algorithms capable of predicting hits for proteins for which there are not yet any experimental data. (Figure presented.)

Type: Article
Title: Protein-ligand data at scale to support machine learning
Location: England
DOI: 10.1038/s41570-025-00737-z
Publisher version: https://doi.org/10.1038/s41570-025-00737-z
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords: Chemistry, Chemistry, Multidisciplinary, DRUG DISCOVERY, IDENTIFICATION, Physical Sciences, Science & Technology
UCL classification: UCL
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences > Div of Biosciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences > UCL School of Pharmacy
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences > Div of Biosciences > Structural and Molecular Biology
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences > UCL School of Pharmacy > Pharma and Bio Chemistry
URI: https://discovery.ucl.ac.uk/id/eprint/10218676
Downloads since deposit
1Download
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item