Login / Signup

A predictive model for vertebrate bone identification from collagen using proteomic mass spectrometry.

Heyi YangErin R ButlerSamantha A MonierJennifer TeublDavid FenyoBeatrix UeberheideDonald Siegel
Published in: Scientific reports (2021)
Proteogenomics is an increasingly common method for species identification as it allows for rapid and inexpensive interrogation of an unknown organism's proteome-even when the proteome is partially degraded. The proteomic method typically uses tandem mass spectrometry to survey all peptides detectable in a sample that frequently contains hundreds or thousands of proteins. Species identification is based on detection of a small numbers of species-specific peptides. Genetic analysis of proteins by mass spectrometry, however, is a developing field, and the bone proteome, typically consisting of only two proteins, pushes the limits of this technology. Nearly 20% of highly confident spectra from modern human bone samples identify non-human species when searched against a vertebrate database-as would be necessary with a fragment of unknown bone. These non-human peptides are often the result of current limitations in mass spectrometry or algorithm interpretation errors. Consequently, it is difficult to know if a "species-specific" peptide used to identify a sample is actually present in that sample. Here we evaluate the causes of peptide sequence errors and propose an unbiased, probabilistic approach to determine the likelihood that a species is correctly identified from bone without relying on species-specific peptides.
Keyphrases