Peptide clustering enhances large-scale analyses and reveals proteolytic signatures in mass spectrometry data.
Erik HartmanFredrik ForsbergSven KjellströmJitka PetrlovaCongyu LuoAaron M ScottManoj PuthiaJohan MalmstömArtur SchmidtchenPublished in: Nature communications (2024)
Recent advances in mass spectrometry-based peptidomics have catalyzed the identification and quantification of thousands of endogenous peptides across diverse biological systems. However, the vast peptidomic landscape generated by proteolytic processing poses several challenges for downstream analyses and limits the comparability of clinical samples. Here, we present an algorithm that aggregates peptides into peptide clusters, reducing the dimensionality of peptidomics data, improving the definition of protease cut sites, enhancing inter-sample comparability, and enabling the implementation of large-scale data analysis methods akin to those employed in other omics fields. We showcase the algorithm by performing large-scale quantitative analysis of wound fluid peptidomes of highly defined porcine wound infections and human clinical non-healing wounds. This revealed signature phenotype-specific peptide regions and proteolytic activity at the earliest stages of bacterial colonization. We validated the method on the urinary peptidome of type 1 diabetics which revealed potential subgroups and improved classification accuracy.
Keyphrases
- data analysis
- single cell
- mass spectrometry
- machine learning
- deep learning
- liquid chromatography
- high resolution
- endothelial cells
- rna seq
- electronic health record
- big data
- healthcare
- gas chromatography
- capillary electrophoresis
- high performance liquid chromatography
- amino acid
- type diabetes
- room temperature
- artificial intelligence
- insulin resistance
- metabolic syndrome
- glycemic control