Data integration and inference of gene regulation using single-cell temporal multimodal data with scTIE.
Yingxin LinTung-Yu WuXi ChenSheng WanBrian ChaoJingxue XinJean Yee Hwa YangWing H WongY X Rachel WangPublished in: Genome research (2023)
Single-cell technologies offer unprecedented opportunities to dissect gene regulatory mechanisms in context-specific ways. Although there are computational methods for extracting gene regulatory relationships from scRNA-seq and scATAC-seq data, the data integration problem, essential for accurate cell type identification, has been mostly treated as a standalone challenge. Here we present scTIE, a unified method that integrates temporal multimodal data and infers regulatory relationships predictive of cellular state changes. scTIE uses an autoencoder to embed cells from all time points into a common space using iterative optimal transport, followed by extracting interpretable information to predict cell trajectories. Using a variety of synthetic and real temporal multimodal datasets, we demonstrate scTIE achieves effective data integration while preserving more biological signals than existing methods, particularly in the presence of batch effects and noise. Furthermore, on the exemplar multiome dataset we generated from differentiating mouse embryonic stem cells over time, we demonstrate scTIE captures regulatory elements highly predictive of cell transition probabilities, providing new avenues to understand the regulatory landscape driving developmental processes.
Keyphrases
- single cell
- rna seq
- electronic health record
- big data
- high throughput
- stem cells
- data analysis
- magnetic resonance imaging
- transcription factor
- healthcare
- depressive symptoms
- cell therapy
- computed tomography
- pain management
- air pollution
- artificial intelligence
- mesenchymal stem cells
- gene expression
- social media
- dna methylation
- health information
- bioinformatics analysis