SPECTRUM - A MATLAB Toolbox for Proteoform Identification from Top-Down Proteomics Data.
Abdul Rehman BasharatKanzal ImanMuhammad Farhan KhalidZohra AnwarRashid HussainHumnah Gohar KabirMaria TahreemAnam ShahidMaheen HumayunHira Azmat HayatMuhammad MustafaMuhammad Ali ShoaibZakir UllahShamshad ZarinaSameer AhmedEmad UddinSadia HameraFayyaz AhmadSafee Ullah ChaudharyPublished in: Scientific reports (2019)
Top-Down Proteomics (TDP) is an emerging proteomics protocol that involves identification, characterization, and quantitation of intact proteins using high-resolution mass spectrometry. TDP has an edge over other proteomics protocols in that it allows for: (i) accurate measurement of intact protein mass, (ii) high sequence coverage, and (iii) enhanced identification of post-translational modifications (PTMs). However, the complexity of TDP spectra poses a significant impediment to protein search and PTM characterization. Furthermore, limited software support is currently available in the form of search algorithms and pipelines. To address this need, we propose 'SPECTRUM', an open-architecture and open-source toolbox for TDP data analysis. Its salient features include: (i) MS2-based intact protein mass tuning, (ii) de novo peptide sequence tag analysis, (iii) propensity-driven PTM characterization, (iv) blind PTM search, (v) spectral comparison, (vi) identification of truncated proteins, (vii) multifactorial coefficient-weighted scoring, and (viii) intuitive graphical user interfaces to access the aforementioned functionalities and visualization of results. We have validated SPECTRUM using published datasets and benchmarked it against salient TDP tools. SPECTRUM provides significantly enhanced protein identification rates (91% to 177%) over its contemporaries. SPECTRUM has been implemented in MATLAB, and is freely available along with its source code and documentation at https://github.com/BIRL/SPECTRUM/.
Keyphrases
- mass spectrometry
- data analysis
- amyotrophic lateral sclerosis
- liquid chromatography
- amino acid
- bioinformatics analysis
- high resolution mass spectrometry
- protein protein
- ms ms
- magnetic resonance
- label free
- binding protein
- randomized controlled trial
- optical coherence tomography
- healthcare
- magnetic resonance imaging
- small molecule
- high performance liquid chromatography
- big data
- density functional theory
- diffusion weighted imaging
- rna seq