Tissue-Specific Methylation Biosignatures for Monitoring Diseases: An In Silico Approach.
Makrina KaraglaniMaria PanagopoulouIsmini BaltsaviaParaskevi ApalakiTheodosis TheodosiouIoannis IliopoulosIoannis TsamardinosEkaterini ChatzakiPublished in: International journal of molecular sciences (2022)
Tissue-specific gene methylation events are key to the pathogenesis of several diseases and can be utilized for diagnosis and monitoring. Here, we established an in silico pipeline to analyze high-throughput methylome datasets to identify specific methylation fingerprints in three pathological entities of major burden, i.e., breast cancer (BrCa), osteoarthritis (OA) and diabetes mellitus (DM). Differential methylation analysis was conducted to compare tissues/cells related to the pathology and different types of healthy tissues, revealing Differentially Methylated Genes (DMGs). Highly performing and low feature number biosignatures were built with automated machine learning, including: (1) a five-gene biosignature discriminating BrCa tissue from healthy tissues (AUC 0.987 and precision 0.987), (2) three equivalent OA cartilage-specific biosignatures containing four genes each (AUC 0.978 and precision 0.986) and (3) a four-gene pancreatic β-cell-specific biosignature (AUC 0.984 and precision 0.995). Next, the BrCa biosignature was validated using an independent ccfDNA dataset showing an AUC and precision of 1.000, verifying the biosignature's applicability in liquid biopsy. Functional and protein interaction prediction analysis revealed that most DMGs identified are involved in pathways known to be related to the studied diseases or pointed to new ones. Overall, our data-driven approach contributes to the maximum exploitation of high-throughput methylome readings, helping to establish specific disease profiles to be applied in clinical practice and to understand human pathology.
Keyphrases
- genome wide
- high throughput
- machine learning
- dna methylation
- genome wide identification
- single cell
- copy number
- gene expression
- clinical practice
- induced apoptosis
- knee osteoarthritis
- endothelial cells
- rna seq
- artificial intelligence
- molecular docking
- rheumatoid arthritis
- cell therapy
- ionic liquid
- metabolic syndrome
- breast cancer risk
- stem cells
- mesenchymal stem cells
- skeletal muscle
- signaling pathway
- bone marrow
- transcription factor
- glycemic control
- cell proliferation
- drug induced
- protein protein