Extending the breadth of saliva metabolome fingerprinting by smart template strategies and effective pattern realignment on comprehensive two-dimensional gas chromatographic data.
Simone SquaraFriederike ManigThomas HenleMichael HellwigAndrea CarattiCarlo BicchiStephen E ReichenbachQingping TaoMassimo CollinoChiara E CorderoPublished in: Analytical and bioanalytical chemistry (2023)
Comprehensive two-dimensional gas chromatography with time-of-flight mass spectrometry (GC × GC-TOFMS) is one the most powerful analytical platforms for chemical investigations of complex biological samples. It produces large datasets that are rich in information, but highly complex, and its consistency may be affected by random systemic fluctuations and/or changes in the experimental parameters. This study details the optimization of a data processing strategy that compensates for severe 2D pattern misalignments and detector response fluctuations for saliva samples analyzed across 2 years. The strategy was trained on two batches: one with samples from healthy subjects who had undergone dietary intervention with high/low-Maillard reaction products (dataset A), and the second from healthy/unhealthy obese individuals (dataset B). The combined untargeted and targeted pattern recognition algorithm (i.e., UT fingerprinting) was tuned for key process parameters, the signal-to-noise ratio (S/N), and MS spectrum similarity thresholds, and then tested for the best transform function (global or local, affine or low-degree polynomial) for pattern realignment in the temporal domain. Reliable peak detection achieved its best performance, computed as % of false negative/positive matches, with a S/N threshold of 50 and spectral similarity direct match factor (DMF) of 700. Cross-alignment of bi-dimensional (2D) peaks in the temporal domain was fully effective with a supervised operation including multiple centroids (reference peaks) and a match-and-transform strategy using affine functions. Regarding the performance-derived response fluctuations, the most promising strategy for cross-comparative analysis and data fusion included the mass spectral total useful signal (MSTUS) approach followed by Z-score normalization on the resulting matrix.
Keyphrases
- gas chromatography
- mass spectrometry
- electronic health record
- machine learning
- liquid chromatography
- high resolution mass spectrometry
- optical coherence tomography
- randomized controlled trial
- gas chromatography mass spectrometry
- healthcare
- metabolic syndrome
- multiple sclerosis
- type diabetes
- weight loss
- magnetic resonance imaging
- ms ms
- computed tomography
- cancer therapy
- social media
- rna seq
- drug delivery
- bariatric surgery
- solid phase extraction
- diffusion weighted imaging
- obese patients
- high resolution
- contrast enhanced
- single cell
- room temperature
- molecularly imprinted