The molecular grammar of protein disorder guiding genome-binding locations.

Felix Jonas, Miri Carmi, Beniamin Krupkin, Joseph Steinberger, Sagie Brodsky, Tamar Jana, Naama Barkai
Published in: Nucleic Acids Research (2023)
Intrinsically disordered regions (IDRs) direct transcription factors (TFs) towards selected genomic occurrences of their binding motif, as exemplified by budding yeast's Msn2. However, the sequence basis of IDR-directed TF binding selectivity remains unknown. To reveal this sequence grammar, we analyze the genomic localizations of >100 designed IDR mutants, each carrying up to 122 mutations within this 567-AA region. Our data point to multivalent interactions, carried by hydrophobic (mostly aliphatic) residues dispersed within a disordered environment and independent of linear sequence motifs, as the key determinants of Msn2 genomic localization. The implications of our results for the mechanistic basis of IDR-based TF binding preferences are discussed.
Keyphrases
  • DNA binding
  • binding protein
  • copy number
  • transcription factor
  • amino acid
  • genome wide
  • machine learning
  • deep learning