Param-Medic: A Tool for Improving MS/MS Database Search Yield by Optimizing Parameter Settings.
Damon H MayKaipo TamuraWilliam Stafford NoblePublished in: Journal of proteome research (2017)
In shotgun proteomics analysis, user-specified parameters are critical to database search performance and therefore to the yield of confident peptide-spectrum matches (PSMs). Two of the most important parameters are related to the accuracy of the mass spectrometer. Precursor mass tolerance defines the peptide candidates considered for each spectrum. Fragment mass tolerance or bin size determines how close observed and theoretical fragments must be to be considered a match. For either of these two parameters, too wide a setting yields randomly high-scoring false PSMs, whereas too narrow a setting erroneously excludes true PSMs, in both cases, lowering the yield of peptides detected at a given false discovery rate. We describe a strategy for inferring optimal search parameters by assembling and analyzing pairs of spectra that are likely to have been generated by the same peptide ion to infer precursor and fragment mass error. This strategy does not rely on a database search, making it usable in a wide variety of settings. In our experiments on data from a variety of instruments including Orbitrap and Q-TOF acquisitions, this strategy yields more high-confidence PSMs than using settings based on instrument defaults or determined by experts. Param-Medic is open-source and cross-platform. It is available as a standalone tool ( http://noble.gs.washington.edu/proj/param-medic/ ) and has been integrated into the Crux proteomics toolkit ( http://crux.ms ), providing automatic parameter selection for the Comet and Tide search engines.
Keyphrases
- mass spectrometry
- ms ms
- high resolution
- adverse drug
- high throughput
- small molecule
- multiple sclerosis
- machine learning
- high performance liquid chromatography
- emergency department
- big data
- liquid chromatography tandem mass spectrometry
- electronic health record
- ultra high performance liquid chromatography
- gas chromatography
- tandem mass spectrometry
- molecular dynamics
- deep learning
- single cell
- density functional theory