A gene-centered C. elegans protein-DNA interaction network provides a framework for functional predictions.
Juan I Fuxman BassCarles PonsLucie KozlowskiJohn S Reece-HoyesShaleen ShresthaAmy D HoldorfAkihiro MoriChad L MyersAlbertha J M WalhoutPublished in: Molecular systems biology (2016)
Transcription factors (TFs) play a central role in controlling spatiotemporal gene expression and the response to environmental cues. A comprehensive understanding of gene regulation requires integrating physical protein-DNA interactions (PDIs) with TF regulatory activity, expression patterns, and phenotypic data. Although great progress has been made in mapping PDIs using chromatin immunoprecipitation, these studies have only characterized ~10% of TFs in any metazoan species. The nematode C. elegans has been widely used to study gene regulation due to its compact genome with short regulatory sequences. Here, we delineated the largest gene-centered metazoan PDI network to date by examining interactions between 90% of C. elegans TFs and 15% of gene promoters. We used this network as a backbone to predict TF binding sites for 77 TFs, two-thirds of which are novel, as well as integrate gene expression, protein-protein interaction, and phenotypic data to predict regulatory and biological functions for multiple genes and TFs.
Keyphrases
- gene expression
- transcription factor
- protein protein
- genome wide identification
- genome wide
- dna methylation
- small molecule
- copy number
- circulating tumor
- binding protein
- poor prognosis
- high resolution
- big data
- cell free
- dna binding
- single molecule
- genome wide analysis
- mental health
- amino acid
- oxidative stress
- machine learning
- nucleic acid
- climate change
- risk assessment
- genetic diversity