Accurate Tumor Subtype Detection with Raman Spectroscopy via Variational Autoencoder and Machine Learning.
Chang HeShuo ZhuXiaorong WuJiale ZhouYonghui ChenXiaohua QianJian YePublished in: ACS omega (2022)
Accurate diagnosis of cancer subtypes is a great guide for the development of surgical plans and prognosis in the clinic. Raman spectroscopy, combined with the machine learning algorithm, has been demonstrated to be a powerful tool for tumor identification. However, the analysis and classification of Raman spectra for biological samples with complex compositions are still challenges. In addition, the signal-to-noise ratio of the spectra also influences the accuracy of the classification. Herein, we applied the variational autoencoder (VAE) to Raman spectra for downscaling and noise reduction simultaneously. We validated the performance of the VAE algorithm at the cellular and tissue levels. VAE successfully downscaled high-dimensional Raman spectral data to two-dimensional (2D) data for three subtypes of non-small cell lung cancer cells and two subtypes of kidney cancer tissues. Gaussian naïve bayes was applied to subtype discrimination with the 2D data after VAE encoding at both the cellular and tissue levels, significantly outperforming the discrimination results using original spectra. Therefore, the analysis of Raman spectroscopy based on VAE and machine learning has great potential for rapid diagnosis of tumor subtypes.
Keyphrases
- raman spectroscopy
- machine learning
- big data
- artificial intelligence
- deep learning
- papillary thyroid
- density functional theory
- electronic health record
- squamous cell
- air pollution
- primary care
- gene expression
- loop mediated isothermal amplification
- squamous cell carcinoma
- stem cells
- data analysis
- single cell
- magnetic resonance imaging
- label free