Integration of lncRNAs, Protein-Coding Genes and Pathology Images for Detecting Metastatic Melanoma.
Shuai LiuYusi FanKewei LiHaotian ZhangXi WangRuofei JuLan HuangMeiyu DuanFengfeng ZhouPublished in: Genes (2022)
Melanoma is a lethal skin disease that develops from moles. This study aimed to integrate multimodal data to predict metastatic melanoma, which is highly aggressive and difficult to treat. The proposed EnsembleSKCM method evaluated the prediction performances of long noncoding RNAs (lncRNAs), protein-coding messenger genes (mRNAs) and pathology images (images) for metastatic melanoma. Feature selection was used to screen for metastatic biomarkers in the lncRNA and mRNA datasets. The integrated EnsembleSKCM model was built based on the weighted results of the lncRNA-, mRNA- and image-based models. EnsembleSKCM achieved 0.9444 in the prediction accuracy of metastatic melanoma and outperformed the single-modal prediction models based on the lncRNA, mRNA and image data. The experimental data suggest the importance of integrating the complementary information from the three data modalities. WGCNA was used to analyze the relationship of molecular-level features and image features, and the results show connections between them. Another cohort was used to validate our prediction.
Keyphrases
- deep learning
- electronic health record
- convolutional neural network
- big data
- binding protein
- long non coding rna
- artificial intelligence
- optical coherence tomography
- genome wide
- genome wide analysis
- squamous cell carcinoma
- genome wide identification
- small cell lung cancer
- long noncoding rna
- network analysis
- high throughput
- protein protein
- gene expression
- health information
- computed tomography
- single molecule
- chronic pain
- single cell
- rna seq
- amino acid
- pain management
- bioinformatics analysis
- neural network