Deciphering the intricate relationships between transcription factors (TFs), enhancers, and genes through the inference of enhancer-driven gene regulatory networks (eGRNs) is crucial in understanding gene regulatory programs in a complex biological system. This study introduces STREAM, a novel method that leverages a Steiner forest problem model, a hybrid biclustering pipeline, and submodular optimization to infer eGRNs from jointly profiled single-cell transcriptome and chromatin accessibility data. Compared to existing methods, STREAM demonstrates enhanced performance in terms of TF recovery, TF-enhancer linkage prediction, and enhancer-gene relation discovery. Application of STREAM to an Alzheimer's disease dataset and a diffuse small lymphocytic lymphoma dataset reveals its ability to identify TF-enhancer-gene relations associated with pseudotime, as well as key TF-enhancer-gene relations and TF cooperation underlying tumor cells.
Keyphrases
- single cell
- rna seq
- transcription factor
- genome wide identification
- genome wide
- binding protein
- high throughput
- copy number
- electronic health record
- dna binding
- climate change
- machine learning
- gene expression
- high grade
- big data
- low grade
- small molecule
- hiv infected
- cognitive decline
- oxidative stress
- artificial intelligence
- mild cognitive impairment
- hepatitis c virus
- deep learning