Login / Signup

Detection and Exclusion of False-Positive Molecular Formula Assignments via Mass Error Distributions in UHR Mass Spectra of Natural Organic Matter.

Shuxian GaoElaine K JenningsLimei HanBoris P KochPeter HerzsprungOliver Jens Lechtenfeld
Published in: Analytical chemistry (2024)
Ultrahigh resolution mass spectrometry (UHRMS) routinely detects and identifies thousands of mass peaks in complex mixtures, such as natural organic matter (NOM) and petroleum. The assignment of several chemically plausible molecular formulas (MFs) for a single accurate mass still poses a major problem for the reliable interpretation of NOM composition in a biogeochemical context. Applying sensible chemical rules for MF validation is often insufficient to eliminate multiple assignments (MultiAs)─especially for mass peaks with low abundance or if ample heteroatoms or isotopes are included - and requires manual inspection or expert judgment. Here, we present a new approach based on mass error distributions for the identification of true and false assignments among MultiAs. To this end, we used the mass error in millidalton (mDa), which was superior to the commonly used relative mass error in ppm. We developed an automatic workflow to group MultiAs based on their shared formula units and Kendrick mass defect values and to evaluate the mass error distribution. In this way, the number of valid assignments of chlorinated disinfection byproducts was increased by 8-fold as compared to only applying 37 Cl/ 35 Cl isotope ratio filters. Likewise, phosphorus-containing MFs can be differentiated against chlorine-containing MFs with high confidence. Further, false assignments of highly aromatic sulfur-containing MFs ("black sulfur") to sodium adducts in negative ionization mode can be excluded by applying our approach. Overall, MFs for mass peaks that are close to the detection limit or where naturally occurring isotopes are rare (e.g., 15 N) or absent (e.g., P and F) can now be validated, substantially increasing the reliability of MF assignments and broadening the applicability of UHRMS analysis to even more complex samples and processes.
Keyphrases