Markedly Enhanced Analysis of Mass Spectrometry Images Using Weakly Supervised Machine Learning.
Wil GardnerDavid A WinklerSarah E BamfordBenjamin W MuirPaul J PigramPublished in: Small methods (2024)
Supervised and unsupervised machine learning algorithms are routinely applied to time-of-flight secondary ion mass spectrometry (ToF-SIMS) imaging data and, more broadly, to mass spectrometry imaging (MSI). These algorithms have accelerated large-scale, single-pixel analysis, classification, and regression. However, there is relatively little research on methods suited for so-called weakly supervised problems, where ground-truth class labels exist at the image level, but not at the individual pixel level. Unsupervised learning methods are usually applied to these problems. However, these methods cannot make use of available labels. Here a novel method specifically designed for weakly supervised MSI data is presented. A dual-stream multiple instance learning (MIL) approach is adapted from computational pathology that reveals the spatial-spectral characteristics distinguishing different classes of MSI images. The method uses an information entropy-regularized attention mechanism to identify characteristic class pixels that are then used to extract characteristic mass spectra. This work provides a proof-of-concept exemplification using printed ink samples imaged by ToF-SIMS. A second application-oriented study is also presented, focusing on the analysis of a mixed powder sample type. Results demonstrate the potential of the MIL method for broader application in MSI, with implications for understanding subtle spatial-spectral characteristics in various applications and contexts.
Keyphrases
- machine learning
- mass spectrometry
- deep learning
- big data
- high resolution
- liquid chromatography
- optical coherence tomography
- artificial intelligence
- gas chromatography
- high performance liquid chromatography
- capillary electrophoresis
- convolutional neural network
- mental health
- solid phase extraction
- ms ms
- metal organic framework
- oxidative stress
- simultaneous determination
- healthcare
- data analysis
- working memory
- computed tomography
- dual energy
- fluorescence imaging
- risk assessment
- magnetic resonance imaging