Single-cell multi-omics analysis reveals cooperative transcription factors for gene regulation in oligodendrocytes.
Jerome J ChoiJohn SvarenDaifeng WangPublished in: bioRxiv : the preprint server for biology (2024)
Oligodendrocytes are the myelinating cells within the central nervous system. Many oligodendrocyte genes have been associated with brain disorders. However, how transcription factors (TFs) cooperate for gene regulation in oligodendrocytes remains largely uncharacterized. To address this, we integrated scRNA-seq and scATAC-seq data to identify the cooperative TFs that co-regulate the target gene (TG) expression in oligodendrocytes. First, we identified co- binding TF pairs whose binding sites overlapped in oligodendrocyte-specific regulatory regions. Second, we trained a deep learning model to predict the expression level of each TG using the expression levels of co-binding TFs. Third, using the trained models, we computed the TF importance and TF-TF interaction scores for predicting TG expression by the Shapley interaction scores. We found that the co-binding TF pairs involving known important TF pairs for oligodendrocyte differentiation, such as SOX10-TCF12, SOX10-MYRF, and SOX10-OLIG2, exhibited significantly higher Shapley scores than others (t-test, p-value < 1e-4). Furthermore, we identified 153 oligodendrocyte-associated eQTLs that reside in oligodendrocyte-specific enhancers or promoters where their eGenes (TGs) are regulated by cooperative TFs, suggesting potential novel regulatory roles from genetic variants. We also experimentally validated some identified TF pairs such as SOX10-OLIG2 and SOX10-NKX2.2 by co-enrichment analysis, using ChIP-seq data from rat peripheral nerve.
Keyphrases
- transcription factor
- single cell
- poor prognosis
- dna binding
- genome wide
- rna seq
- genome wide identification
- binding protein
- stem cells
- deep learning
- peripheral nerve
- induced apoptosis
- electronic health record
- dna methylation
- long non coding rna
- oxidative stress
- big data
- gene expression
- risk assessment
- computed tomography
- climate change
- machine learning
- brain injury
- magnetic resonance
- cell death
- circulating tumor cells
- data analysis
- functional connectivity
- endoplasmic reticulum stress
- resting state
- cerebral ischemia
- body composition
- contrast enhanced
- bioinformatics analysis