Stochastic simulation and statistical inference platform for visualization and estimation of transcriptional kinetics.
Gennady GorinMengyu WangIdo GoldingHeng XuPublished in: PloS one (2020)
Recent advances in single-molecule fluorescent imaging have enabled quantitative measurements of transcription at a single gene copy, yet an accurate understanding of transcriptional kinetics is still lacking due to the difficulty of solving detailed biophysical models. Here we introduce a stochastic simulation and statistical inference platform for modeling detailed transcriptional kinetics in prokaryotic systems, which has not been solved analytically. The model includes stochastic two-state gene activation, mRNA synthesis initiation and stepwise elongation, release to the cytoplasm, and stepwise co-transcriptional degradation. Using the Gillespie algorithm, the platform simulates nascent and mature mRNA kinetics of a single gene copy and predicts fluorescent signals measurable by time-lapse single-cell mRNA imaging, for different experimental conditions. To approach the inverse problem of estimating the kinetic parameters of the model from experimental data, we develop a heuristic optimization method based on the genetic algorithm and the empirical distribution of mRNA generated by simulation. As a demonstration, we show that the optimization algorithm can successfully recover the transcriptional kinetics of simulated and experimental gene expression data. The platform is available as a MATLAB software package at https://data.caltech.edu/records/1287.
Keyphrases
- gene expression
- single cell
- transcription factor
- single molecule
- high throughput
- high resolution
- genome wide
- machine learning
- copy number
- living cells
- electronic health record
- genome wide identification
- dna methylation
- big data
- binding protein
- deep learning
- heat shock
- quantum dots
- rna seq
- aqueous solution
- data analysis
- neural network
- virtual reality
- mass spectrometry
- heat shock protein
- human serum albumin