Enhancing Hi-C data resolution with deep convolutional neural network HiCPlus.
Yan ZhangLin AnJie XuBo ZhangW Jim ZhengMing HuJijun TangFeng YuePublished in: Nature communications (2018)
Although Hi-C technology is one of the most popular tools for studying 3D genome organization, due to sequencing cost, the resolution of most Hi-C datasets are coarse and cannot be used to link distal regulatory elements to their target genes. Here we develop HiCPlus, a computational approach based on deep convolutional neural network, to infer high-resolution Hi-C interaction matrices from low-resolution Hi-C data. We demonstrate that HiCPlus can impute interaction matrices highly similar to the original ones, while only using 1/16 of the original sequencing reads. We show that the models learned from one cell type can be applied to make predictions in other cell or tissue types. Our work not only provides a computational framework to enhance Hi-C data resolution but also reveals features underlying the formation of 3D chromatin interactions.
Keyphrases
- convolutional neural network
- deep learning
- single cell
- electronic health record
- single molecule
- high resolution
- genome wide
- big data
- transcription factor
- gene expression
- dna damage
- rna seq
- stem cells
- machine learning
- minimally invasive
- cell therapy
- dna methylation
- artificial intelligence
- oxidative stress
- bone marrow
- molecular dynamics simulations
- mass spectrometry
- tandem mass spectrometry
- high speed