Estimation of an Image Biomarker for Distant Recurrence Prediction in NSCLC Using Proliferation-Related Genes.
Hye Min JuByung-Chul KimIlhan LimByung Hyun ByunSang-Keun WooPublished in: International journal of molecular sciences (2023)
This study aimed to identify a distant-recurrence image biomarker in NSCLC by investigating correlations between heterogeneity functional gene expression and fluorine-18-2-fluoro-2-deoxy-D-glucose positron emission tomography ( 18 F-FDG PET) image features of NSCLC patients. RNA-sequencing data and 18 F-FDG PET images of 53 patients with NSCLC (19 with distant recurrence and 34 without recurrence) from The Cancer Imaging Archive and The Cancer Genome Atlas Program databases were used in a combined analysis. Weighted correlation network analysis was performed to identify gene groups related to distant recurrence. Genes were selected for functions related to distant recurrence. In total, 47 image features were extracted from PET images as radiomics. The relationship between gene expression and image features was estimated using a hypergeometric distribution test with the Pearson correlation method. The distant recurrence prediction model was validated by a random forest (RF) algorithm using image texture features and related gene expression. In total, 37 gene modules were identified by gene-expression pattern with weighted gene co-expression network analysis. The gene modules with the highest significance were selected ( p -value < 0.05). Nine genes with high protein-protein interaction and area under the curve (AUC) were identified as hub genes involved in the proliferation function, which plays an important role in distant recurrence of cancer. Four image features (GLRLM_SRHGE, GLRLM_HGRE, SUVmean, and GLZLM_GLNU) and six genes were identified to be correlated ( p -value < 0.1). AUCs (accuracy: 0.59, AUC: 0.729) from the 47 image texture features and AUCs (accuracy: 0.767, AUC: 0.808) from hub genes were calculated using the RF algorithm. AUCs (accuracy: 0.783, AUC: 0.912) from the four image texture features and six correlated genes and AUCs (accuracy: 0.738, AUC: 0.779) from only the four image texture features were calculated using the RF algorithm. The four image texture features validated by heterogeneity group gene expression were found to be related to cancer heterogeneity. The identification of these image texture features demonstrated that advanced prediction of NSCLC distant recurrence is possible using the image biomarker.
Keyphrases
- deep learning
- gene expression
- positron emission tomography
- network analysis
- genome wide
- small cell lung cancer
- lymph node
- computed tomography
- dna methylation
- genome wide identification
- free survival
- pet ct
- pet imaging
- contrast enhanced
- convolutional neural network
- artificial intelligence
- papillary thyroid
- bioinformatics analysis
- machine learning
- type diabetes
- magnetic resonance imaging
- copy number
- advanced non small cell lung cancer
- small molecule
- poor prognosis
- squamous cell
- chronic kidney disease
- end stage renal disease
- squamous cell carcinoma
- adipose tissue
- high resolution
- protein protein
- magnetic resonance
- climate change
- patient reported outcomes
- skeletal muscle
- blood pressure
- mass spectrometry
- childhood cancer
- glycemic control