Deep Learning Powers Protein Identification from Precursor MS Information.
Yameng DaiYi YangEnhui WuChengpin ShenLiang QiaoPublished in: Journal of proteome research (2024)
Proteome analysis currently heavily relies on tandem mass spectrometry (MS/MS), which does not fully utilize MS1 features, as many precursors remain unselected for MS/MS fragmentation, especially in the cases of low abundance samples and wide abundance dynamic range samples. Therefore, leveraging MS1 features as a complement to MS/MS has become an attractive option to improve the coverage of feature identification. Herein, we propose MonoMS1, an approach combining deep learning-based retention time, ion mobility, detectability prediction, and logistic regression-based scoring for MS1 feature identification. The approach achieved a significant increase in MS1 feature identification based on an E. coli data set. Application of MonoMS1 to data sets with wide dynamic range, such as human serum proteome samples, and with low sample abundance, such as single-cell proteome samples, enabled substantial complementation of MS/MS-based peptide and protein identification. This method opens a new avenue for proteomic analysis and can boost proteomic research on complex samples.
Keyphrases
- ms ms
- deep learning
- ultra high performance liquid chromatography
- mass spectrometry
- liquid chromatography tandem mass spectrometry
- tandem mass spectrometry
- multiple sclerosis
- machine learning
- bioinformatics analysis
- single cell
- liquid chromatography
- convolutional neural network
- electronic health record
- high resolution
- antibiotic resistance genes
- binding protein
- high throughput
- protein protein
- health information
- microbial community
- data analysis