UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Assessing splicing accuracy and its determinants across human tissues using RNA-sequencing

García Ruiz, Sonia; (2023) Assessing splicing accuracy and its determinants across human tissues using RNA-sequencing. Doctoral thesis (Ph.D), UCL (University College London). Green open access

[thumbnail of Sonia_Garcia_Ruiz_PhD_Thesis.pdf]
Preview
Text
Sonia_Garcia_Ruiz_PhD_Thesis.pdf - Accepted Version

Download (8MB) | Preview

Abstract

Alternative splicing (AS) is a feature of most multi-exonic human genes and its dysregulation is known to contribute to ageing and disease. However, the genome-wide accuracy of AS has received less attention. This is despite the fact that high-depth RNA-sequencing data of non-diseased human samples commonly contains partially unannotated reads detected at low frequency across samples and within an individual, implying the existence of splicing inaccuracies. In this thesis, I investigated the characteristics and drivers of inaccurate splicing, namely mis-splicing activity, which can be detected using split reads partially mapping to known transcripts in the reference annotation. Using short-read RNA-sequencing data derived from ∼14K samples and 42 human tissues provided by the Genotype-Tissue Expression Consortium v8, I developed IntroVerse, a relational database on the splicing of >300K annotated introns and a linked set of >4.5m novel junctions covering ∼32K genes. By using a subset of the data stored on IntroVerse, I found that mis-splicing has a distribution pattern, and is generated at different rates across introns and tissues. Using linear regression models to predict mis-splicing, I found that invariable intronic properties such as inter-species sequence conservation neighbouring the 5’/3’ splice sites, had the highest variability in predictive value for mis-splicing across tissues and that these tissue differences were unlikely to be driven by germline or somatic mutations. Using RNA-sequencing data after in vitro knockdowns of multiple RNA-binding proteins, I demonstrated significant changes in the distribution of mis-splicing, showing that accurate splice site selection is affected by RBP expression levels. I also found that mis-splicing tends to increase with age in most tissues, and particularly affects genes implicated in neurodegenerative diseases. I anticipate that this in-depth characterization of mis-splicing will help improve our understanding of the role of mis-splicing in human disease and age-related disorders.

Type: Thesis (Doctoral)
Qualification: Ph.D
Title: Assessing splicing accuracy and its determinants across human tissues using RNA-sequencing
Open access status: An open access version is available from UCL Discovery
Language: English
Additional information: Copyright © The Author 2023. Original content in this thesis is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0) Licence (https://creativecommons.org/licenses/by-nc/4.0/). Any third-party copyright material present remains the property of its respective owner(s) and is licensed under its existing terms. Access may initially be restricted at the author’s request.
UCL classification: UCL
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Population Health Sciences > UCL GOS Institute of Child Health
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Population Health Sciences > UCL GOS Institute of Child Health > Genetics and Genomic Medicine Dept
URI: https://discovery.ucl.ac.uk/id/eprint/10179390
Downloads since deposit
87Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item