Hidden in plain sight: challenges in proteomics detection of small ORF-encoded polypeptides.
Igor FijalkowskiPatrick WillemsVeronique JonckheereLaure SimoensPetra Van DammePublished in: microLife (2022)
Genomic studies of bacteria have long pointed toward widespread prevalence of small open reading frames (sORFs) encoding for short proteins, <100 amino acids in length. Despite the mounting genomic evidence of their robust expression, relatively little progress has been made in their mass spectrometry-based detection and various blanket statements have been used to explain this observed discrepancy. In this study, we provide a large-scale riboproteogenomics investigation of the challenging nature of proteomic detection of such small proteins as informed by conditional translation data. A panel of physiochemical properties alongside recently developed mass spectrometry detectability metrics was interrogated to provide a comprehensive evidence-based assessment of sORF-encoded polypeptide (SEP) detectability. Moreover, a large-scale proteomics and translatomics compendium of proteins produced by Salmonella Typhimurium ( S . Typhimurium), a model human pathogen, across a panel of growth conditions is presented and used in support of our in silico SEP detectability analysis. This integrative approach is used to provide a data-driven census of small proteins expressed by S . Typhimurium across growth phases and infection-relevant conditions. Taken together, our study pinpoints current limitations in proteomics-based detection of novel small proteins currently missing from bacterial genome annotations.
Keyphrases
- mass spectrometry
- label free
- loop mediated isothermal amplification
- real time pcr
- listeria monocytogenes
- liquid chromatography
- poor prognosis
- high resolution
- amino acid
- escherichia coli
- minimally invasive
- gas chromatography
- risk factors
- capillary electrophoresis
- working memory
- high performance liquid chromatography
- molecular docking
- long non coding rna
- candida albicans
- big data
- network analysis
- data analysis