HiConfidence: a novel approach uncovering the biological signal in Hi-C data affected by technical biases.
Victoria A KobetsSergey V UlianovAleksandra A GalitsynaSemen A DoroninElena A MikhalevaMikhail S GelfandYuri Y ShevelyovSergey V RazinEkaterina E KhrameevaPublished in: Briefings in bioinformatics (2023)
The chromatin interaction assays, particularly Hi-C, enable detailed studies of genome architecture in multiple organisms and model systems, resulting in a deeper understanding of gene expression regulation mechanisms mediated by epigenetics. However, the analysis and interpretation of Hi-C data remain challenging due to technical biases, limiting direct comparisons of datasets obtained in different experiments and laboratories. As a result, removing biases from Hi-C-generated chromatin contact matrices is a critical data analysis step. Our novel approach, HiConfidence, eliminates biases from the Hi-C data by weighing chromatin contacts according to their consistency between replicates so that low-quality replicates do not substantially influence the result. The algorithm is effective for the analysis of global changes in chromatin structures such as compartments and topologically associating domains. We apply the HiConfidence approach to several Hi-C datasets with significant technical biases, that could not be analyzed effectively using existing methods, and obtain meaningful biological conclusions. In particular, HiConfidence aids in the study of how changes in histone acetylation pattern affect chromatin organization in Drosophila melanogaster S2 cells. The method is freely available at GitHub: https://github.com/victorykobets/HiConfidence.
Keyphrases
- gene expression
- data analysis
- genome wide
- dna damage
- transcription factor
- dna methylation
- electronic health record
- drosophila melanogaster
- big data
- induced apoptosis
- machine learning
- high throughput
- oxidative stress
- high resolution
- deep learning
- cell death
- antiretroviral therapy
- quality improvement
- multidrug resistant
- single cell
- mass spectrometry
- artificial intelligence
- gram negative