IQMMA: Efficient MS1 Intensity Extraction Pipeline Using Multiple Feature Detection Algorithms for DDA Proteomics.
Valeriy I PostoenkoLeyla A GaribovaLev I LevitskyJulia A BubisMikhail V GorshkovMikhail V GorshkovPublished in: Journal of proteome research (2023)
One of the key steps in data dependent acquisition (DDA) proteomics is detection of peptide isotopic clusters, also called "features", in MS1 spectra and matching them to MS/MS-based peptide identifications. A number of peptide feature detection tools became available in recent years, each relying on its own matching algorithm. Here, we provide an integrated solution, the intensity-based Quantitative Mix and Match Approach (IQMMA), which integrates a number of untargeted peptide feature detection algorithms and returns the most probable intensity values for the MS/MS-based identifications. IQMMA was tested using available proteomic data acquired for both well-characterized (ground truth) and real-world biological samples, including a mix of Yeast and E. coli digests spiked at different concentrations into the Human K562 digest used as a background, and a set of glioblastoma cell lines. Three open-source feature detection algorithms were integrated: Dinosaur, biosaur2, and OpenMS FeatureFinder. None of them was found optimal when applied individually to all the data sets employed in this work; however, their combined use in IQMMA improved efficiency of subsequent protein quantitation. The software implementing IQMMA is freely available at https://github.com/PostoenkoVI/IQMMA under Apache 2.0 license.
Keyphrases
- machine learning
- ms ms
- label free
- deep learning
- mass spectrometry
- loop mediated isothermal amplification
- big data
- real time pcr
- artificial intelligence
- liquid chromatography tandem mass spectrometry
- high intensity
- endothelial cells
- escherichia coli
- high resolution
- liquid chromatography
- high performance liquid chromatography
- data analysis
- neural network
- small molecule
- quantum dots
- ultra high performance liquid chromatography
- high resolution mass spectrometry