Graph Neural Network Model for Prediction of Non-Small Cell Lung Cancer Lymph Node Metastasis Using Protein-Protein Interaction Network and 18 F-FDG PET/CT Radiomics.
Hyemin JuKangsan KimByung Il KimSang-Keun WooPublished in: International journal of molecular sciences (2024)
The image texture features obtained from 18 F-fluorodeoxyglucose positron emission tomography/computed tomography ( 18 F-FDG PET/CT) images of non-small cell lung cancer (NSCLC) have revealed tumor heterogeneity. A combination of genomic data and radiomics may improve the prediction of tumor prognosis. This study aimed to predict NSCLC metastasis using a graph neural network (GNN) obtained by combining a protein-protein interaction (PPI) network based on gene expression data and image texture features. 18 F-FDG PET/CT images and RNA sequencing data of 93 patients with NSCLC were acquired from The Cancer Imaging Archive. Image texture features were extracted from 18 F-FDG PET/CT images and area under the curve receiver operating characteristic curve (AUC) of each image feature was calculated. Weighted gene co-expression network analysis (WGCNA) was used to construct gene modules, followed by functional enrichment analysis and identification of differentially expressed genes. The PPI of each gene module and genes belonging to metastasis-related processes were converted via a graph attention network. Images and genomic features were concatenated. The GNN model using PPI modules from WGCNA and metastasis-related functions combined with image texture features was evaluated quantitatively. Fifty-five image texture features were extracted from 18 F-FDG PET/CT, and radiomic features were selected based on AUC (n = 10). Eighty-six gene modules were clustered by WGCNA. Genes (n = 19) enriched in the metastasis-related pathways were filtered using DEG analysis. The accuracy of the PPI network, derived from WGCNA modules and metastasis-related genes, improved from 0.4795 to 0.5830 ( p < 2.75 × 10 -12 ). Integrating PPI of four metastasis-related genes with 18 F-FDG PET/CT image features in a GNN model elevated its accuracy over a without image feature model to 0.8545 (95% CI = 0.8401-0.8689, p -value < 0.02). This model demonstrated significant enhancement compared to the model using PPI and 18 F-FDG PET/CT derived from WGCNA ( p -value < 0.02), underscoring the critical role of metastasis-related genes in prediction model. The enhanced predictive capability of the lymph node metastasis prediction GNN model for NSCLC, achieved through the integration of comprehensive image features with genomic data, demonstrates promise for clinical implementation.
Keyphrases
- deep learning
- protein protein
- lymph node metastasis
- network analysis
- positron emission tomography
- computed tomography
- neural network
- convolutional neural network
- small molecule
- genome wide
- contrast enhanced
- copy number
- small cell lung cancer
- papillary thyroid
- gene expression
- artificial intelligence
- machine learning
- big data
- genome wide identification
- squamous cell carcinoma
- healthcare
- magnetic resonance imaging
- advanced non small cell lung cancer
- high resolution
- magnetic resonance
- optical coherence tomography
- photodynamic therapy
- pet imaging
- tyrosine kinase
- long non coding rna
- genome wide analysis