UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Identification of single nucleotide variants using position-specific error estimation in deep sequencing data

Kleftogiannis, D; Punta, M; Jayaram, A; Sandhu, S; Wong, SQ; Gasi Tandefelt, D; Conteduca, V; ... Lise, S; + view all (2019) Identification of single nucleotide variants using position-specific error estimation in deep sequencing data. BMC Med Genomics , 12 (1) , Article 115. 10.1186/s12920-019-0557-9. Green open access

[thumbnail of Attard_Identification of single nucleotide variants using position-specific error estimation in deep sequencing data_VoR.pdf]
Preview
Text
Attard_Identification of single nucleotide variants using position-specific error estimation in deep sequencing data_VoR.pdf - Published Version

Download (1MB) | Preview

Abstract

BACKGROUND: Targeted deep sequencing is a highly effective technology to identify known and novel single nucleotide variants (SNVs) with many applications in translational medicine, disease monitoring and cancer profiling. However, identification of SNVs using deep sequencing data is a challenging computational problem as different sequencing artifacts limit the analytical sensitivity of SNV detection, especially at low variant allele frequencies (VAFs). METHODS: To address the problem of relatively high noise levels in amplicon-based deep sequencing data (e.g. with the Ion AmpliSeq technology) in the context of SNV calling, we have developed a new bioinformatics tool called AmpliSolve. AmpliSolve uses a set of normal samples to model position-specific, strand-specific and nucleotide-specific background artifacts (noise), and deploys a Poisson model-based statistical framework for SNV detection. RESULTS: Our tests on both synthetic and real data indicate that AmpliSolve achieves a good trade-off between precision and sensitivity, even at VAF below 5% and as low as 1%. We further validate AmpliSolve by applying it to the detection of SNVs in 96 circulating tumor DNA samples at three clinically relevant genomic positions and compare the results to digital droplet PCR experiments. CONCLUSIONS: AmpliSolve is a new tool for in-silico estimation of background noise and for detection of low frequency SNVs in targeted deep sequencing data. Although AmpliSolve has been specifically designed for and tested on amplicon-based libraries sequenced with the Ion Torrent platform it can, in principle, be applied to other sequencing platforms as well. AmpliSolve is freely available at https://github.com/dkleftogi/AmpliSolve .

Type: Article
Title: Identification of single nucleotide variants using position-specific error estimation in deep sequencing data
Location: England
Open access status: An open access version is available from UCL Discovery
DOI: 10.1186/s12920-019-0557-9
Publisher version: https://doi.org/10.1186/s12920-019-0557-9
Language: English
Additional information: © The Author(s). 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/)
Keywords: Cancer genomics, Deep sequencing, Error correction, Ion torrent, Liquid biopsies, Next generation sequencing (NGS), Targeted sequencing, Variant calling
UCL classification: UCL
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Medical Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Medical Sciences > Cancer Institute
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Medical Sciences > Cancer Institute > Research Department of Oncology
URI: https://discovery.ucl.ac.uk/id/eprint/10079959
Downloads since deposit
68Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item