Detecting Respiratory Viruses Using a Portable NIR Spectrometer-A Preliminary Exploration with a Data Driven Approach.
Jian-Dong HuangHui WangUltan F PowerJames A D McLaughlinChristopher NugentEnayetur RahmanJudit BarabasPaul MaguirePublished in: Sensors (Basel, Switzerland) (2024)
Respiratory viruses' detection is vitally important in coping with pandemics such as COVID-19. Conventional methods typically require laboratory-based, high-cost equipment. An emerging alternative method is Near-Infrared (NIR) spectroscopy, especially a portable one of the type that has the benefits of low cost, portability, rapidity, ease of use, and mass deployability in both clinical and field settings. One obstacle to its effective application lies in its common limitations, which include relatively low specificity and general quality. Characteristically, the spectra curves show an interweaving feature for the virus-present and virus-absent samples. This then provokes the idea of using machine learning methods to overcome the difficulty. While a subsequent obstacle coincides with the fact that a direct deployment of the machine learning approaches leads to inadequate accuracy of the modelling results. This paper presents a data-driven study on the detection of two common respiratory viruses, the respiratory syncytial virus (RSV) and the Sendai virus (SEV), using a portable NIR spectrometer supported by a machine learning solution enhanced by an algorithm of variable selection via the Variable Importance in Projection (VIP) scores and its Quantile value, along with variable truncation processing, to overcome the obstacles to a certain extent. We conducted extensive experiments with the aid of the specifically developed algorithm of variable selection, using a total of four datasets, achieving classification accuracy of: (1) 0.88, 0.94, and 0.93 for RSV, SEV, and RSV + SEV, respectively, averaged over multiple runs, for the neural network modelling of taking in turn 3 sessions of data for training and the remaining one session of an 'unknown' dataset for testing. (2) the average accuracy of 0.94 (RSV), 0.97 (SEV), and 0.97 (RSV + SEV) for model validation and 0.90 (RSV), 0.93 (SEV), and 0.91 (RSV + SEV) for model testing, using two of the datasets for model training, one for model validation and the other for model testing. These results demonstrate the feasibility of using portable NIR spectroscopy coupled with machine learning to detect respiratory viruses with good accuracy, and the approach could be a viable solution for population screening.
Keyphrases
- respiratory syncytial virus
- machine learning
- respiratory tract
- low cost
- deep learning
- neural network
- artificial intelligence
- high resolution
- big data
- photodynamic therapy
- fluorescent probe
- coronavirus disease
- fluorescence imaging
- sars cov
- social support
- single molecule
- solid state
- rna seq
- mass spectrometry
- quality improvement
- living cells
- drug delivery
- single cell
- genetic diversity
- molecular dynamics
- quantum dots
- data analysis
- sensitive detection