Mind the Gap: Mapping Mass Spectral Databases in Genome-Scale Metabolic Networks Reveals Poorly Covered Areas.
Clement FrainayEmma L SchymanskiSteffen NeumannBenjamin MerletReza M SalekFabien JourdanÓscar YanesPublished in: Metabolites (2018)
The use of mass spectrometry-based metabolomics to study human, plant and microbial biochemistry and their interactions with the environment largely depends on the ability to annotate metabolite structures by matching mass spectral features of the measured metabolites to curated spectra of reference standards. While reference databases for metabolomics now provide information for hundreds of thousands of compounds, barely 5% of these known small molecules have experimental data from pure standards. Remarkably, it is still unknown how well existing mass spectral libraries cover the biochemical landscape of prokaryotic and eukaryotic organisms. To address this issue, we have investigated the coverage of 38 genome-scale metabolic networks by public and commercial mass spectral databases, and found that on average only 40% of nodes in metabolic networks could be mapped by mass spectral information from standards. Next, we deciphered computationally which parts of the human metabolic network are poorly covered by mass spectral libraries, revealing gaps in the eicosanoids, vitamins and bile acid metabolism. Finally, our network topology analysis based on the betweenness centrality of metabolites revealed the top 20 most important metabolites that, if added to MS databases, may facilitate human metabolome characterization in the future.
Keyphrases
- mass spectrometry
- optical coherence tomography
- endothelial cells
- ms ms
- big data
- high resolution
- induced pluripotent stem cells
- healthcare
- liquid chromatography
- single cell
- squamous cell carcinoma
- magnetic resonance imaging
- gene expression
- radiation therapy
- dna methylation
- genome wide
- magnetic resonance
- machine learning
- mental health
- molecular dynamics
- lymph node
- electronic health record
- multidrug resistant
- tandem mass spectrometry
- gas chromatography
- cell wall