A machine learning-based KNIME workflow to predict VEGFR-2 inhibitors.
Nancy TripathiNivedita BhardwajSanjay KumarShreyans K JainPublished in: Chemical biology & drug design (2023)
Vascular endothelial growth factors (VEGFs) are specific cytokines involved in angiogenesis and do so via binding to vascular endothelial growth factor receptors (VEGFRs), a type of receptor tyrosine kinase. VEGFs are reported to facilitate angiogenesis in physiological (embryogenesis) and pathological (tumor) conditions. The overexpression of VEGFs and consequently VEGFRs is reported in tumorigenic conditions. Several VEGFR inhibitors currently used as anticancer drugs to prevent angiogenesis are sunitinib, sorafenib, etc. To identify new potential candidates as VEGFR inhibitors, a classification study using a large and diverse dataset of VEGFR inhibitors from the BindingDB database has been conducted. The KNIME platform was used to calculate molecular and fingerprint-based descriptors and several classification algorithms viz. linear regression (LR), k-nearest neighbor (kNN), decision tree (DT), random forest (RF), and gradient boosted tree (GBT) were employed to build the classification model. The model performance was evaluated by accuracy, precision, recall, and F1 score of the test set. The best LR, kNN, DT, RF, and GBT classifiers had the F1 score of 0.81, 0.87, 0.82, 0.87, and 0.87, respectively. The assorted 5120 VEGFR inhibitors were clustered into 10 subsets, and the structural features of each subset were assessed along with the identification of significant fragments in active and inactive compounds. The automated classifier model developed using the KNIME platform could serve as an important platform for screening and designing molecules as VEGFR inhibitors.