Login / Signup

Limited conservation in cross-species comparison of GLK transcription factor binding suggested wide-spread cistrome divergence.

Xiaoyu TuSibo RenWei ShenJianjian LiYuxiang LiChuanshun LiYangmeihui LiZhanxiang ZongWeibo XieDonald GriersonZhangjun FeiJames J GiovannoniPinghua LiSilin Zhong
Published in: Nature communications (2022)
Non-coding cis-regulatory variants in animal genomes are an important driving force in the evolution of transcription regulation and phenotype diversity. However, cistrome dynamics in plants remain largely underexplored. Here, we compare the binding of GOLDEN2-LIKE (GLK) transcription factors in tomato, tobacco, Arabidopsis, maize and rice. Although the function of GLKs is conserved, most of their binding sites are species-specific. Conserved binding sites are often found near photosynthetic genes dependent on GLK for expression, but sites near non-differentially expressed genes in the glk mutant are nevertheless under purifying selection. The binding sites' regulatory potential can be predicted by machine learning model using quantitative genome features and TF co-binding information. Our study show that genome cis-variation caused wide-spread TF binding divergence, and most of the TF binding sites are genetically redundant. This poses a major challenge for interpreting the effect of individual sites and highlights the importance of quantitatively measuring TF occupancy.
Keyphrases
  • transcription factor
  • dna binding
  • genome wide identification
  • genome wide
  • machine learning
  • binding protein
  • poor prognosis
  • healthcare
  • gene expression
  • single molecule
  • bioinformatics analysis
  • human health