HToPred: A Tool for Human Topoisomerase II Inhibitor Prediction.
Neha TripathiNaeem ShaikhPrasad V BharatamPrabha GargPublished in: Molecular informatics (2018)
The enzyme human topoisomerase IIα (hTopoIIα) is an important anticancer drug target. Due to the availability of multiple inhibitor-binding sites in this enzyme, the anti-hTopoII agents possess high chemical diversity. Chemoinformatics methods can be used to identify lead compounds from large databases for hTopoII inhibitory activity and classify them. In this work, we report the use of machine learning methods to develop classification models for the identification of possible anti-hTopoIIα agents and to classify them as catalytic inhibitors vs. poisons. Initially, an extensive dataset of small molecules which are reported to be evaluated towards hTopoIIα inhibition was collected from ChEMBL database and literature. Using this dataset, predictive models for classifying small molecules into hTopoIIα inhibitors and non-inhibitors were developed. Additionally, the model development was taken up for the prediction of the type of hTopoIIα inactivation. Several molecular fingerprints and physicochemical descriptors of the molecules in the dataset were calculated using the chemoinformatics tool RDKit. Various classifiers were evaluated to establish suitable protocol. Further, ensemble models were developed by bagging of homogenous classifier and selective fusion of heterogeneous classifiers. The models were thoroughly validated with 5-fold cross validation and external validation. The best performing models were incorporated into a tool christened as Human Topoisomerase IIα Inhibitor Prediction (HToPred, http://14.139.57.41/HToPred). A molecular docking based validation for the successful application of HToPred in predicting the mode of enzyme inhibition was performed, which further established the acceptability of this tool. This tool can serve as an important platform to prescreen compounds for anti-hTopoIIα potential.