ConCISE: Consensus Annotation Propagation of Ion Features in Untargeted Tandem Mass Spectrometry Combining Molecular Networking and In Silico Metabolite Structure Prediction.
Zachary A QuinlanIrina KoesterAllegra T AronDaniel PetrasLihini I AluwiharePieter C DorresteinCraig E NelsonLinda Wegley KellyPublished in: Metabolites (2022)
Recent developments in molecular networking have expanded our ability to characterize the metabolome of diverse samples that contain a significant proportion of ion features with no mass spectral match to known compounds. Manual and tool-assisted natural annotation propagation is readily used to classify molecular networks; however, currently no annotation propagation tools leverage consensus confidence strategies enabled by hierarchical chemical ontologies or enable the use of new in silico tools without significant modification. Herein we present ConCISE (Consensus Classifications of In Silico Elucidations) which is the first tool to fuse molecular networking, spectral library matching and in silico class predictions to establish accurate putative classifications for entire subnetworks. By limiting annotation propagation to only structural classes which are identical for the majority of ion features within a subnetwork, ConCISE maintains a true positive rate greater than 95% across all levels of the ChemOnt hierarchical ontology used by the ClassyFire annotation software (superclass, class, subclass). The ConCISE framework expanded the proportion of reliable and consistent ion feature annotation up to 76%, allowing for improved assessment of the chemo-diversity of dissolved organic matter pools from three complex marine metabolomics datasets comprising dominant reef primary producers, five species of the diatom genus Pseudo-nitzchia, and stromatolite sediment samples.
Keyphrases
- rna seq
- tandem mass spectrometry
- molecular docking
- liquid chromatography
- mass spectrometry
- single cell
- high performance liquid chromatography
- high resolution
- ultra high performance liquid chromatography
- clinical practice
- optical coherence tomography
- machine learning
- simultaneous determination
- photodynamic therapy
- magnetic resonance
- deep learning
- magnetic resonance imaging
- cancer therapy
- risk assessment
- solid phase extraction
- drug delivery
- rectal cancer
- locally advanced
- neural network
- data analysis
- genetic diversity