Login / Signup

scButterfly: a versatile single-cell cross-modality translation method via dual-aligned variational autoencoders.

Yichuan CaoXiamiao ZhaoSongming TangQun JiangSijie LiSiyu LiShengquan Chen
Published in: Nature communications (2024)
Recent advancements for simultaneously profiling multi-omics modalities within individual cells have enabled the interrogation of cellular heterogeneity and molecular hierarchy. However, technical limitations lead to highly noisy multi-modal data and substantial costs. Although computational methods have been proposed to translate single-cell data across modalities, broad applications of the methods still remain impeded by formidable challenges. Here, we propose scButterfly, a versatile single-cell cross-modality translation method based on dual-aligned variational autoencoders and data augmentation schemes. With comprehensive experiments on multiple datasets, we provide compelling evidence of scButterfly's superiority over baseline methods in preserving cellular heterogeneity while translating datasets of various contexts and in revealing cell type-specific biological insights. Besides, we demonstrate the extensive applications of scButterfly for integrative multi-omics analysis of single-modality data, data enhancement of poor-quality single-cell multi-omics, and automatic cell type annotation of scATAC-seq data. Moreover, scButterfly can be generalized to unpaired data training, perturbation-response analysis, and consecutive translation.
Keyphrases
  • single cell
  • rna seq
  • electronic health record
  • big data
  • high throughput
  • gene expression
  • machine learning
  • deep learning
  • dna methylation
  • network analysis