VIPpred: a novel model for predicting variant impact on phosphorylation events driving carcinogenesis.
Xiaofeng XuYing LiTaoyu ChenChao HouLiang YangPeiyu ZhuYi ZhangTingting LiPublished in: Briefings in bioinformatics (2024)
Disrupted protein phosphorylation due to genetic variation is a widespread phenomenon that triggers oncogenic transformation of healthy cells. However, few relevant phosphorylation disruption events have been verified due to limited biological experimental methods. Because of the lack of reliable benchmark datasets, current bioinformatics methods primarily use sequence-based traits to study variant impact on phosphorylation (VIP). Here, we increased the number of experimentally supported VIP events from less than 30 to 740 by manually curating and reanalyzing multi-omics data from 916 patients provided by the Clinical Proteomic Tumor Analysis Consortium. To predict VIP events in cancer cells, we developed VIPpred, a machine learning method characterized by multidimensional features that exhibits robust performance across different cancer types. Our method provided a pan-cancer landscape of VIP events, which are enriched in cancer-related pathways and cancer driver genes. We found that variant-induced increases in phosphorylation events tend to inhibit the protein degradation of oncogenes and promote tumor suppressor protein degradation. Our work provides new insights into phosphorylation-related cancer biology as well as novel avenues for precision therapy.
Keyphrases
- papillary thyroid
- machine learning
- squamous cell
- protein kinase
- induced apoptosis
- amino acid
- stem cells
- oxidative stress
- genome wide
- protein protein
- big data
- bone marrow
- small molecule
- newly diagnosed
- young adults
- mesenchymal stem cells
- prognostic factors
- endoplasmic reticulum stress
- single cell
- functional connectivity
- resting state
- drug induced
- patient reported outcomes
- smoking cessation
- stress induced
- chemotherapy induced