UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Genomic loci susceptible to systematic sequencing bias in clinical whole genomes

Freeman, TM; Wang, D; Harris, J; Ambrose, JC; Arumugam, P; Baple, EL; Bleda, M; ... Zarowiecki, M; + view all (2020) Genomic loci susceptible to systematic sequencing bias in clinical whole genomes. Genome Research , 30 (3) pp. 415-426. 10.1101/gr.255349.119. Green open access

[thumbnail of Genome Res.-2020-Freeman-415-26.pdf]
Preview
Text
Genome Res.-2020-Freeman-415-26.pdf - Published Version

Download (3MB) | Preview

Abstract

Accurate massively parallel sequencing (MPS) of genetic variants is key to many areas of science and medicine, such as cataloging population genetic variation and diagnosing genetic diseases. Certain genomic positions can be prone to higher rates of systematic sequencing and alignment bias that limit accuracy, resulting in false positive variant calls. Current standard practices to differentiate between loci that can and cannot be sequenced with high confidence utilize consensus between different sequencing methods as a proxy for sequencing confidence. These practices have significant limitations, and alternative methods are required to overcome them. We have developed a novel statistical method based on summarizing sequenced reads from whole-genome clinical samples and cataloging them in “Incremental Databases” that maintain individual confidentiality. Allele statistics were cataloged for each genomic position that consistently showed systematic biases with the corresponding MPS sequencing pipeline. We found systematic biases present at ∼1%–3% of the human autosomal genome across five patient cohorts. We identified which genomic regions were more or less prone to systematic biases, including large homopolymer flanks (odds ratio = 23.29–33.69) and the NIST high confidence genomic regions (odds ratio = 0.154–0.191). We confirmed our predictions on a gold-standard reference genome and showed that these systematic biases can lead to suspect variant calls within clinical panels. Our results recommend increased caution to address systematic biases in whole-genome sequencing and alignment. This study provides the implementation of a simple statistical approach to enhance quality control of clinically sequenced samples by flagging variants at suspect loci for further analysis or exclusion.

Type: Article
Title: Genomic loci susceptible to systematic sequencing bias in clinical whole genomes
Open access status: An open access version is available from UCL Discovery
DOI: 10.1101/gr.255349.119
Publisher version: https://doi.org/10.1101/gr.255349.119
Language: English
Additional information: Copyright © 2020 Freeman et al.; Published by Cold Spring Harbor Laboratory Press. This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.
UCL classification: UCL
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Population Health Sciences > UCL GOS Institute of Child Health
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Population Health Sciences > UCL GOS Institute of Child Health > Genetics and Genomic Medicine Dept
URI: https://discovery.ucl.ac.uk/id/eprint/10094957
Downloads since deposit
58Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item