Speeding Up Percolator.
John T HalloranHantian ZhangKaan KaraCédric RenggliMatthew TheCe ZhangDavid M RockeLukas KällWilliam Stafford NoblePublished in: Journal of proteome research (2019)
The processing of peptide tandem mass spectrometry data involves matching observed spectra against a sequence database. The ranking and calibration of these peptide-spectrum matches can be improved substantially using a machine learning postprocessor. Here, we describe our efforts to speed up one widely used postprocessor, Percolator. The improved software is dramatically faster than the previous version of Percolator, even when using relatively few processors. We tested the new version of Percolator on a data set containing over 215 million spectra and recorded an overall reduction to 23% of the running time as compared to the unoptimized code. We also show that the memory footprint required by these speedups is modest relative to that of the original version of Percolator.
Keyphrases
- tandem mass spectrometry
- psychometric properties
- machine learning
- ultra high performance liquid chromatography
- big data
- high performance liquid chromatography
- electronic health record
- liquid chromatography
- simultaneous determination
- gas chromatography
- density functional theory
- high resolution
- data analysis
- solid phase extraction
- artificial intelligence
- working memory
- emergency department
- high intensity
- high resolution mass spectrometry
- amino acid