UCL Discovery

DNAscan2: a versatile, scalable, and user-friendly analysis pipeline for human next-generation sequencing data

Marriott, H; Kabiljo, R; Al Khleifat, A; Dobson, RJ; Al-Chalabi, A; Iacoangeli, A (2023) DNAscan2: a versatile, scalable, and user-friendly analysis pipeline for human next-generation sequencing data. Bioinformatics, 39 (4), Article btad152. 10.1093/bioinformatics/btad152. Green open access.

DNAscan2 a versatile, scalable, and user-friendly analysis pipeline for human next-generation sequencing data.pdf - Published Version

Abstract

SUMMARY: The widespread adoption of next-generation sequencing (NGS) across basic research and clinical genetics means that users with highly variable informatics skills, computing facilities, and application purposes need to process, analyse, and interpret NGS data. In this landscape, versatility, scalability, and user-friendliness are key characteristics of NGS analysis software. We developed DNAscan2, a highly flexible, end-to-end pipeline for the analysis of NGS data, which (i) can be used for the detection of multiple variant types, including SNVs, small indels, transposable elements, short tandem repeats, and other large structural variants; (ii) covers all standard steps of NGS analysis, from quality control of raw data and genome alignment to variant calling, annotation, and generation of reports for the interpretation and prioritization of results; (iii) is highly adaptable, as it can be deployed and run via either a graphical user interface for non-bioinformaticians or a command-line tool for personal computer usage; (iv) is scalable, as it can be executed in parallel as a Snakemake workflow; and (v) is computationally efficient, minimizing RAM and CPU time requirements. AVAILABILITY AND IMPLEMENTATION: DNAscan2 is implemented in Python3 and is available at https://github.com/KHP-Informatics/DNAscanv2.

Type: Article
Title: DNAscan2: a versatile, scalable, and user-friendly analysis pipeline for human next-generation sequencing data
Location: England
Open access status: An open access version is available from UCL Discovery
DOI: 10.1093/bioinformatics/btad152
Publisher version: https://doi.org/10.1093/bioinformatics/btad152
Language: English
Additional information: © The Author(s) 2023. Published by Oxford University Press. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
Keywords: Humans, Software, High-Throughput Nucleotide Sequencing, INDEL Mutation, Quality Control, Workflow
UCL classification: UCL
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Population Health Sciences > Institute of Health Informatics
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Population Health Sciences > Institute of Health Informatics > Clinical Epidemiology
URI: https://discovery.ucl.ac.uk/id/eprint/10169680