Identification of Mycobacterium abscessus Subspecies by MALDI-TOF Mass Spectrometry and Machine Learning.
David Rodríguez-TemporalLaura HerreraFernando AlcaideDiego DomingoGenevieve Héry-ArnaudJakko van IngenAn Van den BosscheAndré IngebretsenClémence BeauruelleEva TerschlüsenSamira BoarbiNeus VilaManuel J ArroyoGema MéndezPatricia MuñozLuis ManceraMaría Jesús Ruiz-SerranoBelén Rodríguez-SánchezPublished in: Journal of clinical microbiology (2023)
Mycobacterium abscessus is one of the most common and pathogenic nontuberculous mycobacteria (NTM) isolated in clinical laboratories. It consists of three subspecies: M. abscessus subsp. abscessus , M. abscessus subsp. bolletii , and M. abscessus subsp. massiliense . Due to their different antibiotic susceptibility pattern, a rapid and accurate identification method is necessary for their differentiation. Although matrix-assisted laser desorption/ionization-time of flight mass spectrometry (MALDI-TOF MS) has proven useful for NTM identification, the differentiation of M. abscessus subspecies is challenging. In this study, a collection of 325 clinical isolates of M. abscessus was used for MALDI-TOF MS analysis and for the development of machine learning predictive models based on MALDI-TOF MS protein spectra. Overall, using a random forest model with several confidence criteria (samples by triplicate and similarity values >60%), a total of 96.5% of isolates were correctly identified at the subspecies level. Moreover, an improved model with Spanish isolates was able to identify 88.9% of strains collected in other countries. In addition, differences in culture media, colony morphology, and geographic origin of the strains were evaluated, showing that the latter had an impact on the protein spectra. Finally, after studying all protein peaks previously reported for this species, two novel peaks with potential for subspecies differentiation were found. Therefore, machine learning methodology has proven to be a promising approach for rapid and accurate identification of subspecies of M. abscessus using MALDI-TOF MS.
Keyphrases
- mass spectrometry
- machine learning
- liquid chromatography
- high resolution
- escherichia coli
- gas chromatography
- high performance liquid chromatography
- mycobacterium tuberculosis
- capillary electrophoresis
- big data
- protein protein
- bioinformatics analysis
- climate change
- molecular dynamics
- ms ms
- genetic diversity
- small molecule
- density functional theory
- human health