VIPpred: a novel model for predicting variant impact on phosphorylation events driving carcinogenesis.
Xiaofeng XuYing LiTaoyu ChenChao HouLiang YangPeiyu ZhuYi ZhangTingting LiPublished in: Briefings in bioinformatics (2024)
Disrupted protein phosphorylation due to genetic variation is a widespread phenomenon that triggers oncogenic transformation of healthy cells. However, few relevant phosphorylation disruption events have been verified due to limited biological experimental methods. Because of the lack of reliable benchmark datasets, current bioinformatics methods primarily use sequence-based traits to study variant impact on phosphorylation (VIP). Here, we increased the number of experimentally supported VIP events from less than 30 to 740 by manually curating and reanalyzing multi-omics data from 916 patients provided by the Clinical Proteomic Tumor Analysis Consortium. To predict VIP events in cancer cells, we developed VIPpred, a machine learning method characterized by multidimensional features that exhibits robust performance across different cancer types. Our method provided a pan-cancer landscape of VIP events, which are enriched in cancer-related pathways and cancer driver genes. We found that variant-induced increases in phosphorylation events tend to inhibit the protein degradation of oncogenes and promote tumor suppressor protein degradation. Our work provides new insights into phosphorylation-related cancer biology as well as novel avenues for precision therapy.
Keyphrases
- papillary thyroid
- machine learning
- protein kinase
- squamous cell
- single cell
- end stage renal disease
- newly diagnosed
- ejection fraction
- chronic kidney disease
- genome wide
- prognostic factors
- squamous cell carcinoma
- childhood cancer
- big data
- lymph node metastasis
- drug induced
- artificial intelligence
- young adults
- oxidative stress
- functional connectivity
- resting state
- smoking cessation