UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

CLN3 transcript complexity revealed by long-read RNA sequencing analysis

Zhang, Hao-Yu; Minnis, Christopher; Gustavsson, Emil; Ryten, Mina; Mole, Sara E; (2024) CLN3 transcript complexity revealed by long-read RNA sequencing analysis. BMC Medical Genomics , 17 , Article 244. 10.1186/s12920-024-02017-z. Green open access

[thumbnail of CLN3 transcript complexity revealed by long-read RNA sequencing analysis.pdf]
Preview
PDF
CLN3 transcript complexity revealed by long-read RNA sequencing analysis.pdf - Published Version

Download (2MB) | Preview

Abstract

Background: Batten disease is a group of rare inherited neurodegenerative diseases. Juvenile CLN3 disease is the most prevalent type, and the most common pathogenic variant shared by most patients is the “1-kb” deletion which removes two internal coding exons (7 and 8) in CLN3. Previously, we identified two transcripts in patient fibroblasts homozygous for the 1-kb deletion: the ‘major’ and ‘minor’ transcripts. To understand the full variety of disease transcripts and their role in disease pathogenesis, it is necessary to first investigate CLN3 transcription in “healthy” samples without juvenile CLN3 disease. Methods: We leveraged PacBio long-read RNA sequencing datasets from ENCODE to investigate the full range of CLN3 transcripts across various tissues and cell types in human control samples. Then we sought to validate their existence using data from different sources. Results: We found that a readthrough gene affects the quantification and annotation of CLN3. After taking this into account, we detected over 100 novel CLN3 transcripts, with no dominantly expressed CLN3 transcript. The most abundant transcript has median usage of 42.9%. Surprisingly, the known disease-associated ‘major’ transcripts are detected. Together, they have median usage of 1.5% across 22 samples. Furthermore, we identified 48 CLN3 ORFs, of which 26 are novel. The predominant ORF that encodes the canonical CLN3 protein isoform has median usage of 66.7%, meaning around one-third of CLN3 transcripts encode protein isoforms with different stretches of amino acids. The same ORFs could be found with alternative UTRs. Moreover, we were able to validate the translational potential of certain transcripts using public mass spectrometry data. Conclusion: Overall, these findings provide valuable insights into the complexity of CLN3 transcription, highlighting the importance of studying both canonical and non-canonical CLN3 protein isoforms as well as the regulatory role of UTRs to fully comprehend the regulation and function(s) of CLN3. This knowledge is essential for investigating the impact of the 1-kb deletion and rare pathogenic variants on CLN3 transcription and disease pathogenesis.

Type: Article
Title: CLN3 transcript complexity revealed by long-read RNA sequencing analysis
Location: England
Open access status: An open access version is available from UCL Discovery
DOI: 10.1186/s12920-024-02017-z
Publisher version: https://doi.org/10.1186/s12920-024-02017-z
Language: English
Additional information: This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
Keywords: Juvenile CLN3 disease, Batten disease, Neuronal ceroid lipofuscinoses, CLN3, Transcription, Readthrough gene, Alternative splicing, Long-read RNA sequencing
UCL classification: UCL
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Population Health Sciences > UCL GOS Institute of Child Health
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Population Health Sciences > UCL GOS Institute of Child Health > Genetics and Genomic Medicine Dept
URI: https://discovery.ucl.ac.uk/id/eprint/10210092
Downloads since deposit
12Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item