Machine Learning-Assisted Network Inference Approach to Identify a New Class of Genes that Coordinate the Functionality of Cancer Networks.
Mehrab Ghanat BariChoong Yong UngCheng ZhangShizhen ZhuHu LiPublished in: Scientific reports (2017)
Emerging evidence indicates the existence of a new class of cancer genes that act as "signal linkers" coordinating oncogenic signals between mutated and differentially expressed genes. While frequently mutated oncogenes and differentially expressed genes, which we term Class I cancer genes, are readily detected by most analytical tools, the new class of cancer-related genes, i.e., Class II, escape detection because they are neither mutated nor differentially expressed. Given this hypothesis, we developed a Machine Learning-Assisted Network Inference (MALANI) algorithm, which assesses all genes regardless of expression or mutational status in the context of cancer etiology. We used 8807 expression arrays, corresponding to 9 cancer types, to build more than 2 × 108 Support Vector Machine (SVM) models for reconstructing a cancer network. We found that ~3% of ~19,000 not differentially expressed genes are Class II cancer gene candidates. Some Class II genes that we found, such as SLC19A1 and ATAD3B, have been recently reported to associate with cancer outcomes. To our knowledge, this is the first study that utilizes both machine learning and network biology approaches to uncover Class II cancer genes in coordinating functionality in cancer networks and will illuminate our understanding of how genes are modulated in a tissue-specific network contribute to tumorigenesis and therapy development.
Keyphrases
- papillary thyroid
- machine learning
- squamous cell
- genome wide
- poor prognosis
- stem cells
- squamous cell carcinoma
- lymph node metastasis
- healthcare
- type diabetes
- dna methylation
- childhood cancer
- gene expression
- mesenchymal stem cells
- bioinformatics analysis
- genome wide analysis
- long non coding rna
- skeletal muscle
- big data
- mass spectrometry
- artificial intelligence
- copy number
- single cell
- cell therapy
- label free
- gestational age
- loop mediated isothermal amplification