Login / Signup

Unbiased integration of single cell transcriptome replicates.

Martin LozaShunsuke TeraguchiDaron M StandleyDiego Diez
Published in: NAR genomics and bioinformatics (2022)
Single cell transcriptomic approaches are becoming mainstream, with replicate experiments commonly performed with the same single cell technology. Methods that enable integration of these datasets by removing batch effects while preserving biological information are required for unbiased data interpretation. Here, we introduce Canek for this purpose. Canek leverages information from mutual nearest neighbor to combine local linear corrections with cell-specific non-linear corrections within a fuzzy logic framework. Using a combination of real and synthetic datasets, we show that Canek corrects batch effects while introducing the least amount of bias compared with competing methods. Canek is computationally efficient and can easily integrate thousands of single-cell transcriptomes from replicated experiments.
Keyphrases
  • single cell
  • rna seq
  • high throughput
  • healthcare
  • health information
  • stem cells
  • electronic health record
  • anaerobic digestion
  • dna methylation
  • mesenchymal stem cells
  • deep learning