KRASFormer: a fully vision transformer-based framework for predicting KRAS gene mutations in histopathological images of colorectal cancer.
Vivek Kumar SinghYasmine MakhloufMd Mostafa Kamal SarkerStephanie CraigJuvenal BaenaChristine GreeneLee MasonJacqueline JamesManuel Salto-TellezPaul O'ReillyManuel Salto-TellezPublished in: Biomedical physics & engineering express (2024)
Detecting the Kirsten Rat Sarcoma Virus (KRAS) gene mutation is significant for colorectal cancer (CRC) patients. The KRAS gene encodes a protein involved in the epidermal growth factor receptor (EGFR) signaling pathway, and mutations in this gene can negatively impact the use of monoclonal antibodies in antiEGFR therapy and affect treatment decisions. Currently, commonly used 
 methods like next-generation sequencing (NGS) identify KRAS mutations but are expensive, time-consuming, and may not be suitable for every cancer patient sample. To address these challenges, we have developed KRASFormer, a novel framework that predicts
KRAS gene mutations from Haematoxylin and Eosin (H&E) stained WSIs that are widely available for most CRC patients. KRASFormer consists of two stages: the first stage filters out non-tumor regions and selects only tumour cells using a quality screening mechanism, and the second stage predicts the KRAS gene either 'wildtype' or 'mutant' using a Vision Transformer-based XCiT method. The XCiT employs cross-covariance attention to capture clinically meaningful long-range representations of textural patterns in tumour tissue and KRAS mutant cells. We evaluated the performance of the first stage using an independent CRC-5000 dataset, and the second stage included both The Cancer Genome Atlas colon and rectal cancer (TCGA-CRCDX) and in-house cohorts. The results of our experiments showed that the XCiT outperformed existing state-of-the-art methods, achieving AUCs for ROC curves of 0.691 and 0.653 on TCGA-CRC-DX and in-house datasets, respectively. Our findings emphasize three key consequences: the potential of using H&E-stained 
 tissue slide images for predicting KRAS gene mutations as a cost-effective and time-efficient means for guiding treatment choice with CRC patients; the increase in performance metrics of a Transformer-based model; and the value of the collaboration between pathologists and data scientists in deriving a morphologically meaningful model.
Keyphrases
- wild type
- end stage renal disease
- epidermal growth factor receptor
- ejection fraction
- signaling pathway
- chronic kidney disease
- newly diagnosed
- induced apoptosis
- rectal cancer
- copy number
- small cell lung cancer
- genome wide
- mesenchymal stem cells
- gene expression
- deep learning
- oxidative stress
- papillary thyroid
- epithelial mesenchymal transition
- rna seq
- decision making
- machine learning
- replacement therapy
- single cell
- smoking cessation
- squamous cell