An XIC-Centric Strategy for Improved Identification and Quantification in Proteomic Data Analyses.
Guanghui WangZheng ZhangYi LiuMeghan C BurkeSergey L SheetlinStephen E SteinPublished in: Journal of proteome research (2024)
Reproducibility is a "proteomic dream" yet to be fully realized. A typical data analysis workflow utilizing extracted ion chromatograms (XICs) often treats the information path from identification to quantification as a one-way street. Here, we propose an XIC-centric approach in which the data flow is bidirectional: identifications are used to derive XICs whose information is in turn applied to validate the identifications. In this study, we employed liquid chromatography-mass spectrometry data from glycoprotein and human hair samples to illustrate the XIC-centric concept. At the core of this approach was XIC-based monoisotope repicking. Taking advantage of the intensity information for all detected isotopes across the whole range of an XIC peak significantly improved the accuracy and uncovered misidentifications originating from monoisotope assignment mistakes. It could also rescue non-top-ranked glycopeptide hits. Identification of glycopeptides is particularly susceptible to precursor mass errors for their low abundances, large masses, and glycans differing by 1 or 2 Da easily confused as isotopes. In addition, the XIC-centric strategy significantly reduced the problem of one XIC peak associated with multiple unique identifications, a source of quantitative irreproducibility. Taken together, the proposed approach can lead to improved identification and quantification accuracy and, ultimately, enhanced reproducibility in proteomic data analyses.
Keyphrases
- data analysis
- electronic health record
- mass spectrometry
- liquid chromatography
- big data
- bioinformatics analysis
- endothelial cells
- health information
- magnetic resonance
- healthcare
- emergency department
- patient safety
- high resolution mass spectrometry
- tandem mass spectrometry
- artificial intelligence
- quantum dots
- gas chromatography
- simultaneous determination
- induced pluripotent stem cells
- capillary electrophoresis