Marriott, H;
Kabiljo, R;
Al Khleifat, A;
Dobson, RJ;
Al-Chalabi, A;
Iacoangeli, A;
(2023)
DNAscan2: a versatile, scalable, and user-friendly analysis pipeline for human next-generation sequencing data.
Bioinformatics
, 39
(4)
, Article btad152. 10.1093/bioinformatics/btad152.
Preview |
Text
DNAscan2 a versatile, scalable, and user-friendly analysis pipeline for human next-generation sequencing data.pdf - Published Version Download (332kB) | Preview |
Abstract
SUMMARY: The current widespread adoption of next-generation sequencing (NGS) in all branches of basic research and clinical genetics fields means that users with highly variable informatics skills, computing facilities and application purposes need to process, analyse, and interpret NGS data. In this landscape, versatility, scalability, and user-friendliness are key characteristics for an NGS analysis software. We developed DNAscan2, a highly flexible, end-to-end pipeline for the analysis of NGS data, which (i) can be used for the detection of multiple variant types, including SNVs, small indels, transposable elements, short tandem repeats, and other large structural variants; (ii) covers all standard steps of NGS analysis, from quality control of raw data and genome alignment to variant calling, annotation, and generation of reports for the interpretation and prioritization of results; (iii) is highly adaptable as it can be deployed and run via either a graphic user interface for non-bioinformaticians and a command line tool for personal computer usage; (iv) is scalable as it can be executed in parallel as a Snakemake workflow, and; (v) is computationally efficient by minimizing RAM and CPU time requirements. AVAILABILITY AND IMPLEMENTATION: DNAscan2 is implemented in Python3 and is available at https://github.com/KHP-Informatics/DNAscanv2.
Type: | Article |
---|---|
Title: | DNAscan2: a versatile, scalable, and user-friendly analysis pipeline for human next-generation sequencing data |
Location: | England |
Open access status: | An open access version is available from UCL Discovery |
DOI: | 10.1093/bioinformatics/btad152 |
Publisher version: | https://doi.org/10.1093/bioinformatics/btad152 |
Language: | English |
Additional information: | © The Author(s) 2023. Published by Oxford University Press. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
Keywords: | Humans, Software, High-Throughput Nucleotide Sequencing, INDEL Mutation, Quality Control, Workflow |
UCL classification: | UCL UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Population Health Sciences > Institute of Health Informatics UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Population Health Sciences > Institute of Health Informatics > Clinical Epidemiology |
URI: | https://discovery.ucl.ac.uk/id/eprint/10169680 |
Archive Staff Only
View Item |