Login / Signup

scDEED: a statistical method for detecting dubious 2D single-cell embeddings.

Lucy XiaChristy LeeJingyi Jessica Li
Published in: bioRxiv : the preprint server for biology (2023)
Two-dimensional (2D) embedding methods are crucial for single-cell data visualization. Popular methods such as t-SNE and UMAP are commonly used for visualizing cell clusters; however, it is well known that t-SNE and UMAP's 2D embedding might not reliably inform the similarities among cell clusters. Motivated by this challenge, we developed a statistical method, scDEED, for detecting dubious cell embeddings output by any 2D embedding method. By calculating a reliability score for every cell embedding, scDEED identifies the cell embeddings with low reliability scores as dubious and those with high reliability scores as trustworthy. Moreover, by minimizing the number of dubious cell embeddings, scDEED provides intuitive guidance for optimizing the hyperparameters of an embedding method. Applied to multiple scRNA-seq datasets, scDEED demonstrates its effectiveness for detecting dubious cell embeddings and optimizing the hyperparameters of t-SNE and UMAP.
Keyphrases
  • single cell
  • rna seq
  • cell therapy
  • stem cells
  • systematic review
  • randomized controlled trial
  • bone marrow
  • machine learning
  • artificial intelligence
  • deep learning
  • living cells