Login / Signup

Two-Phase and Graph-Based Clustering Methods for Accurate and Efficient Segmentation of Large Mass Spectrometry Images.

Alex DexterAlan M RaceRory T StevenJennifer R BarnesHeather HulmeRichard J A GoodwinIain B StylesJosephine Bunch
Published in: Analytical chemistry (2017)
Clustering is widely used in MSI to segment anatomical features and differentiate tissue types, but existing approaches are both CPU and memory-intensive, limiting their application to small, single data sets. We propose a new approach that uses a graph-based algorithm with a two-phase sampling method that overcomes this limitation. We demonstrate the algorithm on a range of sample types and show that it can segment anatomical features that are not identified using commonly employed algorithms in MSI, and we validate our results on synthetic MSI data. We show that the algorithm is robust to fluctuations in data quality by successfully clustering data with a designed-in variance using data acquired with varying laser fluence. Finally, we show that this method is capable of generating accurate segmentations of large MSI data sets acquired on the newest generation of MSI instruments and evaluate these results by comparison with histopathology.
Keyphrases
  • electronic health record
  • deep learning
  • machine learning
  • big data
  • mass spectrometry
  • convolutional neural network
  • high resolution
  • neural network
  • rna seq
  • data analysis
  • liquid chromatography