Sequence-based 5-mers highly correlated to epigenetic modifications in genes interactions.
Dariush SalimiAli MoeiniAli Masoudi-NejadPublished in: Genes & genomics (2018)
One of the main concerns in biology is extracting sophisticated features from DNA sequence for gene interaction determination, receiving a great deal of researchers' attention. The epigenetic modifications along with their patterns have been intensely recognized as dominant features affecting on gene expression. However, studying sequenced-based features highly correlated to this key element has remained limited. The main objective in this research was to propose a new feature highly correlated to epigenetic modifications capable of classification of genes. In this paper, classification of 34 genes in PPAR signaling pathway associated with muscle fat tissue in human was performed. Using different statistical outlier detection methods, we proposed that 5-mers highly correlated to epigenetic modifications can correctly categorize the genes involved in the same biological pathway or process. Thirty-four genes in PPAR signaling pathway were classified via applying a proposed feature, 5-mers strongly associated to 17 different epigenetic modifications. For this, diverse statistical outlier detection methods were applied to specify the group of thoroughly correlated genes. The results indicated that these 5-mers can appropriately identify correlated genes. In addition, our results corresponded to GeneMania interaction information, leading to support the suggested method. The appealing findings imply that not only epigenetic modifications but also their highly correlated 5-mers can be applied for reconstructing gene regulatory networks as supplementary data as well as other applications like physical interaction, genes prioritization, indicating some sort of data fusion in this analysis.
Keyphrases
- gene expression
- genome wide
- dna methylation
- genome wide identification
- sars cov
- signaling pathway
- bioinformatics analysis
- respiratory syndrome coronavirus
- machine learning
- deep learning
- endothelial cells
- skeletal muscle
- healthcare
- epithelial mesenchymal transition
- pi k akt
- physical activity
- copy number
- mental health
- oxidative stress
- social media
- high resolution
- cell proliferation
- endoplasmic reticulum stress
- artificial intelligence
- quantum dots
- induced apoptosis