3D reconstruction of genomic regions from sparse interaction data.
Julen Mendieta-EstebanMarco Di StefanoDavid CastilloIrene FarabellaMarc A Marti-RenomPublished in: NAR genomics and bioinformatics (2021)
Chromosome conformation capture (3C) technologies measure the interaction frequency between pairs of chromatin regions within the nucleus in a cell or a population of cells. Some of these 3C technologies retrieve interactions involving non-contiguous sets of loci, resulting in sparse interaction matrices. One of such 3C technologies is Promoter Capture Hi-C (pcHi-C) that is tailored to probe only interactions involving gene promoters. As such, pcHi-C provides sparse interaction matrices that are suitable to characterize short- and long-range enhancer-promoter interactions. Here, we introduce a new method to reconstruct the chromatin structural (3D) organization from sparse 3C-based datasets such as pcHi-C. Our method allows for data normalization, detection of significant interactions and reconstruction of the full 3D organization of the genomic region despite of the data sparseness. Specifically, it builds, with as low as the 2-3% of the data from the matrix, reliable 3D models of similar accuracy of those based on dense interaction matrices. Furthermore, the method is sensitive enough to detect cell-type-specific 3D organizational features such as the formation of different networks of active gene communities.
Keyphrases
- genome wide
- copy number
- transcription factor
- electronic health record
- gene expression
- dna methylation
- big data
- dna damage
- induced apoptosis
- stem cells
- quantum dots
- machine learning
- bone marrow
- cell cycle arrest
- cell therapy
- deep learning
- oxidative stress
- signaling pathway
- rna seq
- smoking cessation
- mesenchymal stem cells