A mass-tolerant database search identifies a large proportion of unassigned spectra in shotgun proteomics as modified peptides.
Joel M ChickDeepak KolippakkamDavid P NusinowBo ZhaiRamin RadEdward L HuttlinSteven P GygiPublished in: Nature biotechnology (2015)
Fewer than half of all tandem mass spectrometry (MS/MS) spectra acquired in shotgun proteomics experiments are typically matched to a peptide with high confidence. Here we determine the identity of unassigned peptides using an ultra-tolerant Sequest database search that allows peptide matching even with modifications of unknown masses up to ± 500 Da. In a proteome-wide data set on HEK293 cells (9,513 proteins and 396,736 peptides), this approach matched an additional 184,000 modified peptides, which were linked to biological and chemical modifications representing 523 distinct mass bins, including phosphorylation, glycosylation and methylation. We localized all unknown modification masses to specific regions within a peptide. Known modifications were assigned to the correct amino acids with frequencies >90%. We conclude that at least one-third of unassigned spectra arise from peptides with substoichiometric modifications.
Keyphrases
- amino acid
- tandem mass spectrometry
- ms ms
- high performance liquid chromatography
- mass spectrometry
- ultra high performance liquid chromatography
- liquid chromatography
- genome wide
- simultaneous determination
- high resolution
- density functional theory
- induced apoptosis
- dna methylation
- gas chromatography
- cell death
- big data
- computed tomography
- liquid chromatography tandem mass spectrometry
- cell cycle arrest
- molecular dynamics