UCL Discovery

A community effort to optimize sequence-based deep learning models of gene regulation

Rafi, AM; Nogina, D; Penzar, D; Lee, D; Lee, D; Kim, N; Kim, S; ... Random Promoter DREAM Challenge Consortium (2024) A community effort to optimize sequence-based deep learning models of gene regulation. Nature Biotechnology 10.1038/s41587-024-02414-w. (In press). Green open access

s41587-024-02414-w.pdf - Published Version


Abstract

A systematic evaluation of how model architectures and training strategies impact genomics model performance is needed. To address this gap, we held a DREAM Challenge where competitors trained models on a dataset of millions of random promoter DNA sequences and corresponding expression levels, experimentally determined in yeast. For a robust evaluation of the models, we designed a comprehensive suite of benchmarks encompassing various sequence types. All top-performing models used neural networks but diverged in architectures and training strategies. To dissect how architectural and training choices impact performance, we developed the Prix Fixe framework to divide models into modular building blocks. We tested all possible combinations for the top three models, further improving their performance. The DREAM Challenge models not only achieved state-of-the-art results on our comprehensive yeast dataset but also consistently surpassed existing benchmarks on Drosophila and human genomic datasets, demonstrating the progress that can be driven by gold-standard genomics datasets.

Type: Article
Title: A community effort to optimize sequence-based deep learning models of gene regulation
Location: United States
Open access status: An open access version is available from UCL Discovery
DOI: 10.1038/s41587-024-02414-w
Publisher version: http://dx.doi.org/10.1038/s41587-024-02414-w
Language: English
Additional information: This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI: https://discovery.ucl.ac.uk/id/eprint/10199730