Neoepitope peptides are newly formed antigens presented by major histocompatibility complex class I (MHC-I) on cell surfaces. The cells presenting neoepitope peptides are recognized and subsequently killed by cytotoxic T-cells. Immunopeptidomic approaches aim to characterize the peptide repertoire (including neoepitope) associated with the MHC-I molecules on the surface of tumor cells using proteomic technologies, providing critical information for designing effective immunotherapy strategies. We developed a novel constrained de novo sequencing algorithm to identify neo-epitope peptides from tandem mass spectra acquired in immunopeptidomic analyses. Our method incorporates prior probabilities to putative peptides according to position specific scoring matrices (PSSMs) representing the sequence preferences recognized by MHC-I molecules. We implemented a dynamic programming algorithm to determine the peptide sequences with an optimal posterior matching score for each given MS/MS spectrum. Similar to the de novo peptide sequencing, the dynamic programming algorithm allows an efficient searching in the entire peptide sequence space. On an LC-MS/MS dataset, we demonstrated the performance of our algorithm in detecting the neoepitope peptides bound by the HLA-C*0501 molecules that were superior to database search approaches and existing general purpose de novo peptide sequencing algorithms.
Keyphrases
- machine learning
- single cell
- deep learning
- amino acid
- tandem mass spectrometry
- ms ms
- induced apoptosis
- liquid chromatography
- stem cells
- ultra high performance liquid chromatography
- dendritic cells
- immune response
- bone marrow
- escherichia coli
- oxidative stress
- cell proliferation
- emergency department
- mesenchymal stem cells
- cell death
- cell cycle arrest
- signaling pathway
- solid phase extraction
- density functional theory
- electronic health record
- biofilm formation
- endoplasmic reticulum stress
- health information
- anti inflammatory
- decision making
- pi k akt