Identification of Genomic Signatures for Colorectal Cancer Survival Using Exploratory Data Mining.
Justin J Hummel, Danlu Liu, Erin M Tallon, John Snyder, Wesley Warren, Chi-Ren Shyu, Jonathan Mitchem, Rene Cortese
Published in: International Journal of Molecular Sciences (2024)
Clinicopathological presentation is critical for establishing a postoperative treatment regimen in colorectal cancer (CRC), although its prognostic value is low in Stage 2 CRC. We implemented a novel exploratory algorithm based on explainable artificial intelligence (XAI) that integrates mutational and clinical features to identify genomic signatures by repurposing the FoundationOne Companion Diagnostic (F1CDx) assay. The training dataset (n = 378) consisted of subjects with recurrent and non-recurrent Stage 2 or 3 CRC retrieved from The Cancer Genome Atlas (TCGA). Genomic signatures were built to identify subgroups of Stage 2 and 3 CRC patients according to recurrence, using genomic parameters and their further associations with clinical presentation. Summarizing the top-performing genomic signatures yielded a 32-gene signature that could predict tumor recurrence in Stage 2 CRC patients with high precision. The signature was further validated on an independent dataset (n = 149), resulting in high-precision prognosis (AUC = 0.952; PPV = 0.974; NPV = 0.923). We anticipate that our genomic signatures, used alongside NCCN guidelines, will improve recurrence prediction in CRC molecular stratification.
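For readers less familiar with the validation metrics reported above, the following minimal Python sketch shows how PPV, NPV, and AUC are conventionally defined. It is not the authors' pipeline: the function names `ppv_npv` and `auc` are illustrative, and all counts and scores are hypothetical values chosen only to make the arithmetic easy to follow.

```python
def ppv_npv(tp, fp, tn, fn):
    """Return (PPV, NPV) from confusion-matrix counts.

    PPV = TP / (TP + FP): share of predicted recurrences that recurred.
    NPV = TN / (TN + FN): share of predicted non-recurrences that did not recur.
    """
    ppv = tp / (tp + fp) if (tp + fp) else float("nan")
    npv = tn / (tn + fn) if (tn + fn) else float("nan")
    return ppv, npv


def auc(scores_pos, scores_neg):
    """AUC as the probability that a recurrent case scores higher than a
    non-recurrent case (ties count as one half)."""
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in scores_pos
        for n in scores_neg
    )
    return wins / (len(scores_pos) * len(scores_neg))


# Hypothetical counts and scores, NOT taken from the study.
ppv, npv = ppv_npv(tp=8, fp=2, tn=18, fn=2)
print(f"PPV={ppv:.3f} NPV={npv:.3f}")   # PPV=0.800 NPV=0.900
print(f"AUC={auc([0.9, 0.8, 0.4], [0.5, 0.3]):.3f}")  # AUC=0.833
```

The pairwise AUC definition is quadratic in the number of subjects, which is acceptable for cohorts of a few hundred; a rank-based implementation would be preferable for larger datasets.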
Keyphrases
- artificial intelligence
- copy number
- big data
- genome wide
- machine learning
- deep learning
- end stage renal disease
- chronic kidney disease
- free survival
- patients undergoing
- gene expression
- dna methylation
- ejection fraction
- electronic health record
- high throughput
- transcription factor
- clinical practice
- single cell
- virtual reality
- genome wide identification