AUTOSurv: Interpretable Deep Learning Framework for Cancer Survival Analysis Incorporating Clinical and Multi-omics Data.
Lindong JiangChao XuYuntong BaiAnqi LiuYun GongYu-Ping WangHong-Wen DengPublished in: Research square (2023)
Accurate prognosis for cancer patients can provide critical information for optimizing treatment plans and improving life quality. Combining omics data and demographic/clinical information can offer a more comprehensive view of cancer prognosis than using omics or clinical data alone and can reveal the underlying disease mechanisms at the molecular level. In this study, we developed a novel deep learning framework to extract information from high-dimensional gene expression and miRNA expression data and conduct prognosis prediction for breast cancer and ovarian cancer patients. Our model achieved significantly better prognosis prediction than the conventional Cox Proportional Hazard model and other competitive deep learning approaches in various settings. Moreover, an interpretation approach was applied to tackle the "black-box" nature of deep neural networks and we identified features (i.e., genes, miRNA, demographic/clinical variables) that made important contributions to distinguishing predicted high- and low-risk patients. The identified associations were partially supported by previous studies.
Keyphrases
- deep learning
- gene expression
- electronic health record
- big data
- single cell
- papillary thyroid
- end stage renal disease
- healthcare
- genome wide
- health information
- artificial intelligence
- dna methylation
- squamous cell
- newly diagnosed
- machine learning
- convolutional neural network
- peritoneal dialysis
- squamous cell carcinoma
- health insurance
- binding protein
- ejection fraction
- mass spectrometry
- prognostic factors
- young adults
- childhood cancer
- free survival