Semiautomated glycoproteomics data analysis workflow for maximized glycopeptide identification and reliable quantification.
Steffen LippoldArnoud H de RuJan NoutaPeter A van VeelenMagnus PalmbladDana L E VergoossenNoortje de HaanPublished in: Beilstein journal of organic chemistry (2020)
Glycoproteomic data are often very complex, reflecting the high structural diversity of peptide and glycan portions. The use of glycopeptide-centered glycoproteomics by mass spectrometry is rapidly evolving in many research areas, leading to a demand in reliable data analysis tools. In recent years, several bioinformatic tools were developed to facilitate and improve both the identification and quantification of glycopeptides. Here, a selection of these tools was combined and evaluated with the aim of establishing a robust glycopeptide detection and quantification workflow targeting enriched glycoproteins. For this purpose, a tryptic digest from affinity-purified immunoglobulins G and A was analyzed on a nano-reversed-phase liquid chromatography-tandem mass spectrometry platform with a high-resolution mass analyzer and higher-energy collisional dissociation fragmentation. Initial glycopeptide identification based on MS/MS data was aided by the Byonic software. Additional MS1-based glycopeptide identification relying on accurate mass and retention time differences using GlycopeptideGraphMS considerably expanded the set of confidently annotated glycopeptides. For glycopeptide quantification, the performance of LaCyTools was compared to Skyline, and GlycopeptideGraphMS. All quantification packages resulted in comparable glycosylation profiles but featured differences in terms of robustness and data quality control. Partial cysteine oxidation was identified as an unexpectedly abundant peptide modification and impaired the automated processing of several IgA glycopeptides. Finally, this study presents a semiautomated workflow for reliable glycoproteomic data analysis by the combination of software packages for MS/MS- and MS1-based glycopeptide identification as well as the integration of analyte quality control and quantification.
Keyphrases
- data analysis
- ms ms
- quality control
- liquid chromatography tandem mass spectrometry
- mass spectrometry
- high resolution
- electronic health record
- bioinformatics analysis
- machine learning
- simultaneous determination
- multiple sclerosis
- liquid chromatography
- nitric oxide
- big data
- high performance liquid chromatography
- artificial intelligence
- sensitive detection
- deep learning
- gas chromatography