GRouNdGAN: GRN-guided simulation of single-cell RNA-seq data using causal generative adversarial networks.
Yazdan ZinatiAbdulrahman TakiddeenAmin EmadPublished in: Nature communications (2024)
We introduce GRouNdGAN, a gene regulatory network (GRN)-guided reference-based causal implicit generative model for simulating single-cell RNA-seq data, in silico perturbation experiments, and benchmarking GRN inference methods. Through the imposition of a user-defined GRN in its architecture, GRouNdGAN simulates steady-state and transient-state single-cell datasets where genes are causally expressed under the control of their regulating transcription factors (TFs). Training on six experimental reference datasets, we show that our model captures non-linear TF-gene dependencies and preserves gene identities, cell trajectories, pseudo-time ordering, and technical and biological noise, with no user manipulation and only implicit parameterization. GRouNdGAN can synthesize cells under new conditions to perform in silico TF knockout experiments. Benchmarking various GRN inference algorithms reveals that GRouNdGAN effectively bridges the existing gap between simulated and biological data benchmarks of GRN inference algorithms, providing gold standard ground truth GRNs and realistic cells corresponding to the biological system of interest.
Keyphrases
- single cell
- rna seq
- induced apoptosis
- high throughput
- electronic health record
- machine learning
- genome wide identification
- genome wide
- cell cycle arrest
- big data
- transcription factor
- copy number
- molecular docking
- deep learning
- depressive symptoms
- signaling pathway
- endoplasmic reticulum stress
- stem cells
- mesenchymal stem cells
- data analysis
- blood brain barrier
- bone marrow
- brain injury
- dna methylation