Gene expression model inference from snapshot RNA data using Bayesian non-parametrics.
Zeliha Kilic, Max Schweiger, Camille Moyer, Douglas P Shepherd, Steve Pressé
Published in: Nature Computational Science (2023)
Gene expression models, which are key to understanding cellular regulatory response, underlie observations of single-cell transcriptional dynamics. Although RNA expression data encode information on gene expression models, existing computational frameworks do not perform simultaneous Bayesian inference of gene expression models and parameters from such data. Rather, gene expression models (composed of gene states, their connectivities and associated parameters) are currently deduced by pre-specifying the number of gene states and their connectivity before learning the associated rate parameters. Here we propose a method to learn full distributions over gene states, state connectivities and associated rate parameters, simultaneously and self-consistently, from single-molecule RNA counts. We propagate noise from fluctuating RNA counts over models by treating the models themselves as random variables. We achieve this within a Bayesian non-parametric paradigm. We demonstrate our method on the Escherichia coli lacZ pathway and the Saccharomyces cerevisiae STL1 pathway, and verify its robustness on synthetic data.
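To make the terms "gene states", "rate parameters" and "snapshot RNA counts" concrete, the following is a minimal sketch of the forward problem only. It is not the authors' inference method. It assumes a hypothetical two-state (on/off) gene model simulated with the standard Gillespie algorithm. The function names and the rate values (`k_on`, `k_off`, `beta`, `gamma`) are illustrative assumptions and are not taken from the paper. Each simulated cell contributes a single RNA count at the measurement time, which is the kind of data from which the abstract proposes to infer the model structure and rates.

```python
import random

def simulate_snapshot(k_on, k_off, beta, gamma, t_end, rng):
    """Gillespie simulation of one cell; returns its RNA count at time t_end.

    Hypothetical two-state model (an assumption for illustration):
    the gene switches off -> on at rate k_on and on -> off at rate k_off,
    RNA is produced at rate beta while the gene is on,
    and each RNA molecule degrades at rate gamma.
    """
    t, gene_on, rna = 0.0, False, 0
    while True:
        # Propensities of the reactions available in the current state.
        a_switch = k_off if gene_on else k_on
        a_prod = beta if gene_on else 0.0
        a_deg = gamma * rna
        a_total = a_switch + a_prod + a_deg
        # The waiting time to the next reaction is exponential with rate a_total.
        t += rng.expovariate(a_total)
        if t > t_end:
            return rna  # the snapshot: only the count at t_end is observed
        # Choose which reaction fires, with probability proportional to its propensity.
        u = rng.random() * a_total
        if u < a_switch:
            gene_on = not gene_on
        elif u < a_switch + a_prod:
            rna += 1
        else:
            rna -= 1

def snapshot_counts(n_cells, seed=0, **params):
    """Simulate independent cells and collect one RNA count per cell."""
    rng = random.Random(seed)
    return [simulate_snapshot(rng=rng, **params) for _ in range(n_cells)]

# Illustrative parameter values only; time is in arbitrary units.
counts = snapshot_counts(500, k_on=0.5, k_off=0.5, beta=10.0, gamma=1.0, t_end=20.0)
mean_count = sum(counts) / len(counts)
```

For this assumed model the steady-state mean RNA count is `beta * k_on / ((k_on + k_off) * gamma)`, so the sample mean should lie near 5 for the values above. The inference problem described in the abstract runs in the opposite direction: given only such counts, it recovers distributions over the number of gene states, their connectivity and the rates.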
Keyphrases
- gene expression
- dna methylation
- single cell
- single molecule
- electronic health record
- escherichia coli
- genome wide
- big data
- saccharomyces cerevisiae
- copy number
- rna seq
- transcription factor
- peripheral blood
- machine learning
- health information
- pseudomonas aeruginosa
- artificial intelligence
- binding protein
- genome wide analysis
- multidrug resistant
- heat stress