Login / Signup

Sequence-Specific Model for Predicting Peptide Collision Cross Section Values in Proteomic Ion Mobility Spectrometry.

Chih-Hsiang ChangDarien YeungVictor SpicerKosuke OgataOleg V KrokhinYasushi Ishihama
Published in: Journal of proteome research (2021)
The contribution of peptide amino acid sequence to collision cross section values (CCS) has been investigated using a dataset of ∼134 000 peptides of four different charge states (1+ to 4+). The migration data were acquired using a two-dimensional liquid chromatography (LC)/trapped ion mobility spectrometry/quadrupole/time-of-flight mass spectrometry (MS) analysis of HeLa cell digests created using seven different proteases and was converted to CCS values. Following the previously reported modeling approaches using intrinsic size parameters (ISP), we extended this methodology to encode the position of individual residues within a peptide sequence. A generalized prediction model was built by dividing the dataset into eight groups (four charges for both tryptic/nontryptic peptides). Position-dependent ISPs were independently optimized for the eight subsets of peptides, resulting in prediction accuracy of ∼0.981 for the entire population of peptides. We find that ion mobility is strongly affected by the peptide's ability to solvate the positively charged sites. Internal positioning of polar residues and proline leads to decreased CCS values as they improve charge solvation; conversely, this ability decreases with increasing peptide charge due to electrostatic repulsion. Furthermore, higher helical propensity and peptide hydrophobicity result in a preferential formation of extended structures with higher than predicted CCS values. Finally, acidic/basic residues exhibit position-dependent ISP behavior consistent with electrostatic interaction with the peptide macrodipole, which affects the peptide helicity. The MS raw data files have been deposited with the ProteomeXchange Consortium via the jPOST partner repository (http://jpostdb.org) with the dataset identifiers PXD021440/JPST000959, PXD022800/JPST001017, and PXD026087/ JPST001176.
Keyphrases