Towards In Silico Identification of Genes Contributing to Similarity of Patients' Multi-Omics Profiles: A Case Study of Acute Myeloid Leukemia.
Declan J BattenJonathan J CroftsNadia ChuzhanovaPublished in: Genes (2023)
We propose a computational framework for selecting biologically plausible genes identified by clustering of multi-omics data that reveal patients' similarity, thus giving researchers a more comprehensive view on any given disease. We employ spectral clustering of a similarity network created by fusion of three similarity networks, based on mRNA expression of immune genes, miRNA expression and DNA methylation data, using SNF_v2.1 software. For each cluster, we rank multi-omics features, ensuring the best separation between clusters, and select the top-ranked features that preserve clustering. To find genes targeted by DNA methylation and miRNAs found in the top-ranked features, we use chromosome-conformation capture data and miRNet2.0 software, respectively. To identify informative genes, these combined sets of target genes are analyzed in terms of their enrichment in somatic/germline mutations, GO biological processes/pathways terms and known sets of genes considered to be important in relation to a given disease, as recorded in the Molecular Signature Database from GSEA. The protein-protein interaction (PPI) networks were analyzed to identify genes that are hubs of PPI networks. We used data recorded in The Cancer Genome Atlas for patients with acute myeloid leukemia to demonstrate our approach, and discuss our findings in the context of results in the literature.
Keyphrases
- genome wide
- dna methylation
- bioinformatics analysis
- single cell
- acute myeloid leukemia
- genome wide identification
- end stage renal disease
- protein protein
- gene expression
- ejection fraction
- chronic kidney disease
- big data
- small molecule
- systematic review
- squamous cell carcinoma
- emergency department
- magnetic resonance
- prognostic factors
- peritoneal dialysis
- optical coherence tomography
- long non coding rna
- acute lymphoblastic leukemia
- mass spectrometry
- liquid chromatography