UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Spectrum: fast density-aware spectral clustering for single and multi-omic data

John, CR; Watson, D; Barnes, MR; Pitzalis, C; Lewis, MJ; (2020) Spectrum: fast density-aware spectral clustering for single and multi-omic data. Bioinformatics , 36 (4) pp. 1159-1166. 10.1093/bioinformatics/btz704. Green open access

[thumbnail of John et al. - Spectrum.pdf]
Preview
Text
John et al. - Spectrum.pdf - Published Version

Download (1MB) | Preview

Abstract

MOTIVATION: Clustering patient omic data is integral to developing precision medicine because it allows the identification of disease subtypes. A current major challenge is the integration multi-omic data to identify a shared structure and reduce noise. Cluster analysis is also increasingly applied on single-omic data, for example, in single cell RNA-seq analysis for clustering the transcriptomes of individual cells. This technology has clinical implications. Our motivation was therefore to develop a flexible and effective spectral clustering tool for both single and multi-omic data. RESULTS: We present Spectrum, a new spectral clustering method for complex omic data. Spectrum uses a self-tuning density-aware kernel we developed that enhances the similarity between points that share common nearest neighbours. It uses a tensor product graph data integration and diffusion procedure to reduce noise and reveal underlying structures. Spectrum contains a new method for finding the optimal number of clusters (K) involving eigenvector distribution analysis. Spectrum can automatically find K for both Gaussian and non-Gaussian structures. We demonstrate across 21 real expression datasets that Spectrum gives improved runtimes and better clustering results relative to other methods. AVAILABILITY AND IMPLEMENTATION: Spectrum is available as an R software package from CRAN https://cran.r-project.org/web/packages/Spectrum/index.html. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Type: Article
Title: Spectrum: fast density-aware spectral clustering for single and multi-omic data
Location: England
Open access status: An open access version is available from UCL Discovery
DOI: 10.1093/bioinformatics/btz704
Publisher version: https://doi.org/10.1093/bioinformatics/btz704
Language: English
Additional information: Copyright © The Author(s) 2019. Published by Oxford University Press. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
Keywords: Cluster Analysis, Humans, Precision Medicine, Single-Cell Analysis, Software, Transcriptome
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences > Dept of Statistical Science
URI: https://discovery.ucl.ac.uk/id/eprint/10118989
Downloads since deposit
56Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item