DeepDetect: Deep Learning of Peptide Detectability Enhanced by Peptide Digestibility and Its Application to DIA Library Reduction.
Jinghan YangZhiyuan ChengFuzhou GongYan FuPublished in: Analytical chemistry (2023)
In tandem mass spectrometry-based proteomics, proteins are digested into peptides by specific protease(s), but generally only a fraction of peptides can be detected. To characterize detectable proteotypic peptides, we have developed a series of methods to predict peptide digestibility and detectability. Here, we propose a bidirectional long short-term memory (BiLSTM)-based algorithm, named DeepDetect, for the prediction of peptide detectability enhanced by peptide digestibility. Compared with existing algorithms, DeepDetect is featured by its improved prediction accuracy for a wide range of commonly used proteases, covering trypsin, ArgC, chymotrypsin, GluC, LysC, AspN, LysN, and LysargiNase. On 11 test data sets from E. coli , yeast, mouse, and human samples, DeepDetect achieved higher prediction accuracies than PepFormer, a state-of-the-art deep-learning-based peptide detectability prediction algorithm. The results further demonstrated that peptide digestibility can substantially enhance the performance of peptide detectability predictors. As an application, DeepDetect was used to reduce the in silico predicted spectral libraries in data-independent acquisition mass spectrometry data analysis. Experiments using DIA-NN software showed that DeepDetect can significantly accelerate the library search without loss of peptide and protein identification sensitivity.
Keyphrases
- deep learning
- mass spectrometry
- data analysis
- machine learning
- tandem mass spectrometry
- endothelial cells
- escherichia coli
- artificial intelligence
- magnetic resonance
- magnetic resonance imaging
- high performance liquid chromatography
- gas chromatography
- amino acid
- risk assessment
- binding protein
- ms ms
- ultra high performance liquid chromatography
- molecular docking
- heavy metals
- anaerobic digestion