Advanced methods for gene network identification and noise decomposition from single-cell data.
Zhou FangAnkit GuptaSant KumarMustafa KhammashPublished in: Nature communications (2024)
Central to analyzing noisy gene expression systems is solving the Chemical Master Equation (CME), which characterizes the probability evolution of the reacting species' copy numbers. Solving CMEs for high-dimensional systems suffers from the curse of dimensionality. Here, we propose a computational method for improved scalability through a divide-and-conquer strategy that optimally decomposes the whole system into a leader system and several conditionally independent follower subsystems. The CME is solved by combining Monte Carlo estimation for the leader system with stochastic filtering procedures for the follower subsystems. We demonstrate this method with high-dimensional numerical examples and apply it to identify a yeast transcription system at the single-cell resolution, leveraging mRNA time-course experimental data. The identification results enable an accurate examination of the heterogeneity in rate parameters among isogenic cells. To validate this result, we develop a noise decomposition technique exploiting time-course data but requiring no supplementary components, e.g., dual-reporters.
Keyphrases
- single cell
- gene expression
- electronic health record
- rna seq
- big data
- monte carlo
- induced apoptosis
- air pollution
- dna methylation
- genome wide
- transcription factor
- oxidative stress
- machine learning
- cell cycle arrest
- deep learning
- data analysis
- binding protein
- signaling pathway
- high resolution
- endoplasmic reticulum stress
- mass spectrometry
- artificial intelligence
- pi k akt
- saccharomyces cerevisiae
- cell wall