Development and validation of a quick, automated, and reproducible ATR FT-IR spectroscopy machine-learning model for Klebsiella pneumoniae typing.
Ângela NovaisAna Beatriz GonçalvesTeresa Gonçalves RibeiroAna R FreitasGema MéndezLuis ManceraAntónia ReadValquíria AlvesLorena López-CereroJesús Rodríguez-BañoÁlvaro PascualLuísa PeixePublished in: Journal of clinical microbiology (2024)
The reliability of Fourier-transform infrared (FT-IR) spectroscopy for Klebsiella pneumoniae typing and outbreak control has been previously assessed, but issues remain in standardization and reproducibility. We developed and validated a reproducible FT-IR with attenuated total reflectance (ATR) workflow for the identification of K. pneumoniae lineages. We used 293 isolates representing multidrug-resistant K. pneumoniae lineages causing outbreaks worldwide (2002-2021) to train a random forest classification (RF) model based on capsular (KL)-type discrimination. This model was validated with 280 contemporaneous isolates (2021-2022), using wzi sequencing and whole-genome sequencing as references. Repeatability and reproducibility were tested in different culture media and instruments throughout time. Our RF model allowed the classification of 33 capsular (KL)-types and up to 36 clinically relevant K. pneumoniae lineages based on the discrimination of specific KL- and O-type combinations. We obtained high rates of accuracy (89%), sensitivity (88%), and specificity (92%), including from cultures obtained directly from the clinical sample, allowing to obtain typing information the same day bacteria are identified. The workflow was reproducible in different instruments throughout time (>98% correct predictions). Direct colony application, spectral acquisition, and automated KL prediction through Clover MS Data analysis software allow a short time-to-result (5 min/isolate). We demonstrated that FT-IR ATR spectroscopy provides meaningful, reproducible, and accurate information at a very early stage (as soon as bacterial identification) to support infection control and public health surveillance. The high robustness together with automated and flexible workflows for data analysis provide opportunities to consolidate real-time applications at a global level. IMPORTANCE We created and validated an automated and simple workflow for the identification of clinically relevant Klebsiella pneumoniae lineages by FT-IR spectroscopy and machine-learning, a method that can be extremely useful to provide quick and reliable typing information to support real-time decisions of outbreak management and infection control. This method and workflow is of interest to support clinical microbiology diagnostics and to aid public health surveillance.
Keyphrases
- klebsiella pneumoniae
- machine learning
- multidrug resistant
- public health
- data analysis
- deep learning
- escherichia coli
- high resolution
- genetic diversity
- early stage
- gram negative
- drug resistant
- acinetobacter baumannii
- artificial intelligence
- big data
- single molecule
- solid state
- multiple sclerosis
- health information
- high throughput
- dna damage response
- computed tomography
- optical coherence tomography
- mass spectrometry
- pseudomonas aeruginosa
- radiation therapy
- infectious diseases
- dna damage
- magnetic resonance
- rectal cancer