PeakDecoder enables machine learning-based metabolite annotation and accurate profiling in multidimensional mass spectrometry measurements.
Aivett BilbaoNathalie Munoz MunozJoonhoon KimDaniel J OrtonYuqian GaoKunal PooreyKyle R PomraningKarl WeitzMeagan BurnetCarrie D NicoraRosemarie WiltonShuang DengZiyu DaiEthan OksenAaron GeeRick A FasaniAnya TsalenkoDeepti TanjoreJames GardnerRichard D SmithJoshua K MichenerJohn M GladdenErin S BakerChristopher J PetzoldYoung-Mo KimAlex ApffelJohn M GladdenKristin E Burnum-JohnsonPublished in: Nature communications (2023)
Multidimensional measurements using state-of-the-art separations and mass spectrometry provide advantages in untargeted metabolomics analyses for studying biological and environmental bio-chemical processes. However, the lack of rapid analytical methods and robust algorithms for these heterogeneous data has limited its application. Here, we develop and evaluate a sensitive and high-throughput analytical and computational workflow to enable accurate metabolite profiling. Our workflow combines liquid chromatography, ion mobility spectrometry and data-independent acquisition mass spectrometry with PeakDecoder, a machine learning-based algorithm that learns to distinguish true co-elution and co-mobility from raw data and calculates metabolite identification error rates. We apply PeakDecoder for metabolite profiling of various engineered strains of Aspergillus pseudoterreus, Aspergillus niger, Pseudomonas putida and Rhodosporidium toruloides. Results, validated manually and against selected reaction monitoring and gas-chromatography platforms, show that 2683 features could be confidently annotated and quantified across 116 microbial sample runs using a library built from 64 standards.
Keyphrases
- mass spectrometry
- liquid chromatography
- gas chromatography
- machine learning
- tandem mass spectrometry
- high resolution mass spectrometry
- big data
- electronic health record
- high resolution
- capillary electrophoresis
- single cell
- high performance liquid chromatography
- solid phase extraction
- artificial intelligence
- high throughput
- simultaneous determination
- gas chromatography mass spectrometry
- deep learning
- rna seq
- escherichia coli