Can We Boost N -Glycopeptide Identification Confidence? Smart Collision Energy Choice Taking into Account Structure and Search Engine.
Helga HevérAndrea XueKinga NagyKinga KomkaKároly VékeyLaszlo DrahosÁgnes RévészPublished in: Journal of the American Society for Mass Spectrometry (2024)
High confidence and reproducibility are still challenges in bottom-up mass spectrometric N -glycopeptide identification. The collision energy used in the MS/MS measurements and the database search engine used to identify the species are perhaps the two most decisive factors. We investigated how the structural features of N -glycopeptides and the choice of the search engine influence the optimal collision energy, delivering the highest identification confidence. We carried out LC-MS/MS measurements using a series of collision energies on a large set of N -glycopeptides with both the glycan and peptide part varied and studied the behavior of Byonic, pGlyco, and GlycoQuest scores. We found that search engines show a range of behavior between peptide-centric and glycan-centric, which manifests itself already in the dependence of optimal collision energy on m / z . Using classical statistical and machine learning methods, we revealed that peptide hydrophobicity, glycan and peptide masses, and the number of mobile protons also have significant and search-engine-dependent influence, as opposed to a series of other parameters we probed. We envisioned an MS/MS workflow making a smart collision energy choice based on online available features such as the hydrophobicity (described by retention time) and glycan mass (potentially available from a scout MS/MS). Our assessment suggests that this workflow can lead to a significant gain (up to 100%) in the identification confidence, particularly for low-scoring hits close to the filtering limit, which has the potential to enhance reproducibility of N -glycopeptide analyses. Data are available via MassIVE (MSV000093110).
Keyphrases
- ms ms
- machine learning
- bioinformatics analysis
- electronic health record
- cell surface
- liquid chromatography tandem mass spectrometry
- healthcare
- emergency department
- magnetic resonance
- computed tomography
- health information
- social media
- high performance liquid chromatography
- fine needle aspiration
- data analysis
- molecular dynamics
- tandem mass spectrometry
- solid state
- ultra high performance liquid chromatography