Login / Signup

Determination of essential phenotypic elements of clusters in high-dimensional entities-DEPECHE.

Axel TheorellYenan Troi BrycesonJakob Theorell
Published in: PloS one (2019)
Technological advances have facilitated an exponential increase in the amount of information that can be derived from single cells, necessitating new computational tools that can make such highly complex data interpretable. Here, we introduce DEPECHE, a rapid, parameter free, sparse k-means-based algorithm for clustering of multi- and megavariate single-cell data. In a number of computational benchmarks aimed at evaluating the capacity to form biologically relevant clusters, including flow/mass-cytometry and single cell RNA sequencing data sets with manually curated gold standard solutions, DEPECHE clusters as well or better than the currently available best performing clustering algorithms. However, the main advantage of DEPECHE, compared to the state-of-the-art, is its unique ability to enhance interpretability of the formed clusters, in that it only retains variables relevant for cluster separation, thereby facilitating computational efficient analyses as well as understanding of complex datasets. DEPECHE is implemented in the open source R package DepecheR currently available at github.com/Theorell/DepecheR.
Keyphrases
  • single cell
  • rna seq
  • high throughput
  • electronic health record
  • machine learning
  • big data
  • induced apoptosis
  • cell proliferation
  • cell cycle arrest
  • healthcare
  • health information
  • liquid chromatography