f-scLVM: scalable and versatile factor analysis for single-cell RNA-seq.

Florian Buettner, Naruemon Pratanwanich, Davis J McCarthy, John C Marioni, Oliver Stegle
Published in: Genome Biology (2017)
Single-cell RNA-sequencing (scRNA-seq) makes it possible to study heterogeneity in gene expression across large cell populations. Such heterogeneity can arise from technical or biological factors, which makes it difficult to decompose the sources of variation. Here we describe f-scLVM (factorial single-cell latent variable model), a method based on factor analysis that uses pathway annotations to guide the inference of interpretable factors underpinning the heterogeneity. Our model jointly estimates the relevance of individual factors, refines gene set annotations, and infers factors without annotation. In applications to multiple scRNA-seq datasets, we find that f-scLVM robustly decomposes them into interpretable components, thereby facilitating the identification of novel subpopulations.
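To make the idea of annotation-guided factor analysis concrete, the following is a minimal sketch on simulated data. It is not the f-scLVM implementation and does not use its inference scheme: as a simplified stand-in, it fits a factor model by alternating ridge-regularised least squares, in which each gene is only allowed non-zero weights on the factors (pathways) it is annotated to. The annotation mask, the simulated data, the function name `fit_masked_factors`, and the variance-based relevance score are all assumptions made for illustration. The sketch does not refine annotations or infer unannotated factors.

```python
import numpy as np

rng = np.random.default_rng(0)
n_cells, n_genes, n_factors = 200, 60, 3

# Hypothetical gene-set annotation mask (genes x factors): True where a gene
# is annotated to a pathway. The three overlapping blocks are made up.
mask = np.zeros((n_genes, n_factors), dtype=bool)
mask[0:30, 0] = True
mask[20:50, 1] = True
mask[40:60, 2] = True

# Simulated expression matrix: Y = X_true @ W_true.T + noise
X_true = rng.normal(size=(n_cells, n_factors))
W_true = rng.normal(size=(n_genes, n_factors)) * mask
Y = X_true @ W_true.T + 0.1 * rng.normal(size=(n_cells, n_genes))


def fit_masked_factors(Y, mask, n_iter=50, ridge=1e-3, seed=1):
    """Alternating ridge least squares with weights restricted to the mask.

    A simplified stand-in for annotation-guided factor analysis,
    not the inference procedure used by f-scLVM.
    """
    init_rng = np.random.default_rng(seed)
    n_genes, n_factors = mask.shape
    W = init_rng.normal(size=(n_genes, n_factors)) * mask
    eye = np.eye(n_factors)
    for _ in range(n_iter):
        # Update factor activations per cell, given the current weights.
        X = Y @ W @ np.linalg.inv(W.T @ W + ridge * eye)
        # Update each gene's weights, using only its annotated factors.
        for g in range(n_genes):
            idx = np.flatnonzero(mask[g])
            if idx.size == 0:
                continue
            Xs = X[:, idx]
            A = Xs.T @ Xs + ridge * np.eye(idx.size)
            W[g, idx] = np.linalg.solve(A, Xs.T @ Y[:, g])
    return X, W


X_hat, W_hat = fit_masked_factors(Y, mask)

# Fraction of total squared signal left unexplained by the fitted model.
rel_err = np.sum((Y - X_hat @ W_hat.T) ** 2) / np.sum(Y ** 2)

# Crude per-factor relevance: share of squared signal carried by each
# rank-one component x_k w_k^T (an illustrative score only).
relevance = (X_hat ** 2).sum(axis=0) * (W_hat ** 2).sum(axis=0) / np.sum(Y ** 2)

print("relative reconstruction error:", rel_err)
print("per-factor relevance:", relevance)
```

In this toy setting the mask is fixed and treated as correct. The method described above instead treats annotations as a guide that can be refined during inference.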
Keyphrases
  • single cell
  • rna seq
  • high throughput
  • gene expression
  • dna methylation
  • stem cells
  • drinking water
  • bone marrow