Tutorial: multivariate classification for vibrational spectroscopy in biological samples.
Camilo de Lelis Medeiros de MoraisKassio Michell Gomes de LimaManeesh SinghFrancis Luke MartinPublished in: Nature protocols (2020)
Vibrational spectroscopy techniques, such as Fourier-transform infrared (FTIR) and Raman spectroscopy, have been successful methods for studying the interaction of light with biological materials and facilitating novel cell biology analysis. Spectrochemical analysis is very attractive in disease screening and diagnosis, microbiological studies and forensic and environmental investigations because of its low cost, minimal sample preparation, non-destructive nature and substantially accurate results. However, there is now an urgent need for multivariate classification protocols allowing one to analyze biologically derived spectrochemical data to obtain accurate and reliable results. Multivariate classification comprises discriminant analysis and class-modeling techniques where multiple spectral variables are analyzed in conjunction to distinguish and assign unknown samples to pre-defined groups. The requirement for such protocols is demonstrated by the fact that applications of deep-learning algorithms of complex datasets are being increasingly recognized as critical for extracting important information and visualizing it in a readily interpretable form. Hereby, we have provided a tutorial for multivariate classification analysis of vibrational spectroscopy data (FTIR, Raman and near-IR) highlighting a series of critical steps, such as preprocessing, data selection, feature extraction, classification and model validation. This is an essential aspect toward the construction of a practical spectrochemical analysis model for biological analysis in real-world applications, where fast, accurate and reliable classification models are fundamental.
Keyphrases
- deep learning
- machine learning
- raman spectroscopy
- high resolution
- stem cells
- data analysis
- big data
- low cost
- magnetic resonance imaging
- molecular dynamics simulations
- single molecule
- risk assessment
- mass spectrometry
- cell therapy
- optical coherence tomography
- mesenchymal stem cells
- climate change
- living cells
- molecular dynamics
- liquid chromatography
- fluorescent probe