Peak-agnostic high-resolution cis-regulatory circuitry mapping using single cell multiome data.
Zidong ZhangFrederique Ruf-ZamojskiMichel ZamojskiDaniel J BernardXi ChenOlga G TroyanskayaStuart C SealfonPublished in: Nucleic acids research (2023)
Single same cell RNAseq/ATACseq multiome data provide unparalleled potential to develop high resolution maps of the cell-type specific transcriptional regulatory circuitry underlying gene expression. We present CREMA, a framework that recovers the full cis-regulatory circuitry by modeling gene expression and chromatin activity in individual cells without peak-calling or cell type labeling constraints. We demonstrate that CREMA overcomes the limitations of existing methods that fail to identify about half of functional regulatory elements which are outside the called chromatin 'peaks'. These circuit sites outside called peaks are shown to be important cell type specific functional regulatory loci, sufficient to distinguish individual cell types. Analysis of mouse pituitary data identifies a Gata2-circuit for the gonadotrope-enriched disease-associated Pcsk1 gene, which is experimentally validated by reduced gonadotrope expression in a gonadotrope conditional Gata2-knockout model. We present a web accessible human immune cell regulatory circuit resource, and provide CREMA as an R package.
Keyphrases
- transcription factor
- gene expression
- high resolution
- single cell
- genome wide
- dna methylation
- electronic health record
- poor prognosis
- genome wide identification
- rna seq
- endothelial cells
- mass spectrometry
- induced apoptosis
- dna damage
- stem cells
- cell proliferation
- cell therapy
- machine learning
- signaling pathway
- climate change
- high throughput
- oxidative stress
- long non coding rna
- cell cycle arrest
- deep learning
- induced pluripotent stem cells
- heat stress
- high speed