Machine Learning Identifies Robust Matrisome Markers and Regulatory Mechanisms in Cancer.
Anni KääriäinenVilma PesolaAnnalena DittmannJuho KontioJarkko KoivunenTaina PihlajaniemiValerio IzziPublished in: International journal of molecular sciences (2020)
The expression and regulation of matrisome genes-the ensemble of extracellular matrix, ECM, ECM-associated proteins and regulators as well as cytokines, chemokines and growth factors-is of paramount importance for many biological processes and signals within the tumor microenvironment. The availability of large and diverse multi-omics data enables mapping and understanding of the regulatory circuitry governing the tumor matrisome to an unprecedented level, though such a volume of information requires robust approaches to data analysis and integration. In this study, we show that combining Pan-Cancer expression data from The Cancer Genome Atlas (TCGA) with genomics, epigenomics and microenvironmental features from TCGA and other sources enables the identification of "landmark" matrisome genes and machine learning-based reconstruction of their regulatory networks in 74 clinical and molecular subtypes of human cancers and approx. 6700 patients. These results, enriched for prognostic genes and cross-validated markers at the protein level, unravel the role of genetic and epigenetic programs in governing the tumor matrisome and allow the prioritization of tumor-specific matrisome genes (and their regulators) for the development of novel therapeutic approaches.
Keyphrases
- genome wide
- extracellular matrix
- machine learning
- data analysis
- papillary thyroid
- transcription factor
- bioinformatics analysis
- dna methylation
- poor prognosis
- squamous cell
- single cell
- big data
- genome wide identification
- end stage renal disease
- endothelial cells
- newly diagnosed
- high resolution
- public health
- chronic kidney disease
- binding protein
- electronic health record
- gene expression
- ejection fraction
- childhood cancer
- genome wide analysis
- mass spectrometry
- small molecule
- deep learning
- prognostic factors
- induced pluripotent stem cells
- pluripotent stem cells