A Portable and Reusable Database Infrastructure for Mass Spectrometry, and Its Associated Toolkit (The DIMSpec Project).
Jared M RaglandBenjamin J PlacePublished in: Journal of the American Society for Mass Spectrometry (2024)
Nontargeted analysis (NTA) is a rapidly growing field of techniques that includes the identification of unknown chemical analytes in complex mixtures such as environmental, biological, and food matrices. The use of reference mass spectral databases is a key component of most NTA workflows, providing a high level of confidence for chemical identification when analytical standards are not available, yet effective interlaboratory sharing of research grade spectra remains challenging. The Database Infrastructure for Mass Spectrometry (DIMSpec) project focused on the creation of an open-source toolkit supporting storage and sharing of high-resolution mass spectra with attached sample and methodological metadata. As a demonstration of its utility, the DIMSpec toolkit was used to create a database of curated mass spectra for per- and polyfluoroalkyl substances (PFAS) generated from various sources. While the underlying toolkit is agnostic to analytical targets, this initial release (along with the database schema, mass spectral data, and database tools) should enable PFAS researchers to use these data for their own studies, including the identification of novel PFAS in the environment.
Keyphrases
- mass spectrometry
- high resolution
- liquid chromatography
- adverse drug
- electronic health record
- big data
- optical coherence tomography
- quality improvement
- density functional theory
- drinking water
- gas chromatography
- high resolution mass spectrometry
- bioinformatics analysis
- emergency department
- high performance liquid chromatography
- healthcare
- capillary electrophoresis
- tandem mass spectrometry
- computed tomography
- machine learning
- climate change
- artificial intelligence
- deep learning