Open access repository-scale propagated nearest neighbor suspect spectral library for untargeted metabolomics.
Wout BittremieuxNicole E AvalonSydney P ThomasSarvar A KakhkhorovAlexander A AksenovPaulo Wender Portal GomesChristine M AcevesAndrés Mauricio Caraballo-RodríguezJulia M GauglitzWilliam H GerwickTao HuanAlan K JarmuschRima F Kaddurah-DaoukKyo-Bin KangHyun Woo KimTodor KondićHelena M RussoMichael J MeehanAlexey V MelnikLouis-Felix NothiasClaire O'DonovanMorgan PanitchpakdiDaniel PetrasRobin SchmidEmma L SchymanskiJustin Johan Jozias van der HooftKelly C WeldonHeejung YangShipei XingJasmine ZemlinMingxun WangPieter C DorresteinPublished in: Nature communications (2023)
Despite the increasing availability of tandem mass spectrometry (MS/MS) community spectral libraries for untargeted metabolomics over the past decade, the majority of acquired MS/MS spectra remain uninterpreted. To further aid in interpreting unannotated spectra, we created a nearest neighbor suspect spectral library, consisting of 87,916 annotated MS/MS spectra derived from hundreds of millions of MS/MS spectra originating from published untargeted metabolomics experiments. Entries in this library, or "suspects," were derived from unannotated spectra that could be linked in a molecular network to an annotated spectrum. Annotations were propagated to unknowns based on structural relationships to reference molecules using MS/MS-based spectrum alignment. We demonstrate the broad relevance of the nearest neighbor suspect spectral library through representative examples of propagation-based annotation of acylcarnitines, bacterial and plant natural products, and drug metabolism. Our results also highlight how the library can help to better understand an Alzheimer's brain phenotype. The nearest neighbor suspect spectral library is openly available for download or for data analysis through the GNPS platform to help investigators hypothesize candidate structures for unknown MS/MS spectra in untargeted metabolomics data.
Keyphrases
- ms ms
- mass spectrometry
- liquid chromatography
- high performance liquid chromatography
- ultra high performance liquid chromatography
- tandem mass spectrometry
- optical coherence tomography
- liquid chromatography tandem mass spectrometry
- data analysis
- density functional theory
- high resolution mass spectrometry
- gas chromatography
- high resolution
- simultaneous determination
- dual energy
- gas chromatography mass spectrometry
- solid phase extraction
- mental health
- healthcare
- magnetic resonance imaging
- mild cognitive impairment
- randomized controlled trial
- minimally invasive
- big data
- molecular dynamics
- white matter
- computed tomography
- artificial intelligence
- rna seq
- machine learning
- magnetic resonance
- drug induced
- deep learning
- cell wall