Estimation of an Image Biomarker for Distant Recurrence Prediction in NSCLC Using Proliferation-Related Genes.
Hye Min JuByung-Chul KimIlhan LimByung Hyun ByunSang-Keun WooPublished in: International journal of molecular sciences (2023)
This study aimed to identify a distant-recurrence image biomarker in NSCLC by investigating correlations between heterogeneity functional gene expression and fluorine-18-2-fluoro-2-deoxy-D-glucose positron emission tomography ( 18 F-FDG PET) image features of NSCLC patients. RNA-sequencing data and 18 F-FDG PET images of 53 patients with NSCLC (19 with distant recurrence and 34 without recurrence) from The Cancer Imaging Archive and The Cancer Genome Atlas Program databases were used in a combined analysis. Weighted correlation network analysis was performed to identify gene groups related to distant recurrence. Genes were selected for functions related to distant recurrence. In total, 47 image features were extracted from PET images as radiomics. The relationship between gene expression and image features was estimated using a hypergeometric distribution test with the Pearson correlation method. The distant recurrence prediction model was validated by a random forest (RF) algorithm using image texture features and related gene expression. In total, 37 gene modules were identified by gene-expression pattern with weighted gene co-expression network analysis. The gene modules with the highest significance were selected ( p -value < 0.05). Nine genes with high protein-protein interaction and area under the curve (AUC) were identified as hub genes involved in the proliferation function, which plays an important role in distant recurrence of cancer. Four image features (GLRLM_SRHGE, GLRLM_HGRE, SUVmean, and GLZLM_GLNU) and six genes were identified to be correlated ( p -value < 0.1). AUCs (accuracy: 0.59, AUC: 0.729) from the 47 image texture features and AUCs (accuracy: 0.767, AUC: 0.808) from hub genes were calculated using the RF algorithm. AUCs (accuracy: 0.783, AUC: 0.912) from the four image texture features and six correlated genes and AUCs (accuracy: 0.738, AUC: 0.779) from only the four image texture features were calculated using the RF algorithm. The four image texture features validated by heterogeneity group gene expression were found to be related to cancer heterogeneity. The identification of these image texture features demonstrated that advanced prediction of NSCLC distant recurrence is possible using the image biomarker.
Keyphrases
- deep learning
- gene expression
- positron emission tomography
- network analysis
- genome wide
- computed tomography
- small cell lung cancer
- dna methylation
- lymph node
- free survival
- contrast enhanced
- pet ct
- genome wide identification
- pet imaging
- papillary thyroid
- single cell
- copy number
- convolutional neural network
- machine learning
- squamous cell carcinoma
- bioinformatics analysis
- high resolution
- magnetic resonance
- magnetic resonance imaging
- metabolic syndrome
- advanced non small cell lung cancer
- small molecule
- poor prognosis
- signaling pathway
- blood glucose
- optical coherence tomography
- type diabetes
- genome wide analysis
- lymph node metastasis
- quality improvement
- chronic kidney disease
- young adults
- photodynamic therapy
- transcription factor
- mass spectrometry
- end stage renal disease
- drug induced
- patient reported outcomes
- neural network