The 3D folding of a mammalian gene can be studied by a polymer model, where the chromatin fiber is represented by a semiflexible polymer which interacts with multivalent proteins, representing complexes of DNA-binding transcription factors and RNA polymerases. This physical model leads to the natural emergence of clusters of proteins and binding sites, accompanied by the folding of chromatin into a set of topologies, each associated with a different network of loops. Here, we combine numerics and analytics to first classify these networks and then find their relative importance or statistical weight, when the properties of the underlying polymer are those relevant to chromatin. Unlike polymer networks previously studied, our chromatin networks have finite average distances between successive binding sites, and this leads to giant differences between the weights of topologies with the same number of edges and nodes but different wiring. These weights strongly favor rosettelike structures with a local cloud of loops with respect to more complicated nonlocal topologies. Our results suggest that genes should overwhelmingly fold into a small fraction of all possible 3D topologies, which can be robustly characterized by the framework we propose here.
Keyphrases
- transcription factor
- dna binding
- genome wide
- genome wide identification
- dna damage
- gene expression
- dna methylation
- single molecule
- body mass index
- squamous cell carcinoma
- molecular dynamics simulations
- high resolution
- oxidative stress
- mental health
- big data
- radiation therapy
- early stage
- weight gain
- lymph node
- neoadjuvant chemotherapy
- locally advanced
- sentinel lymph node