Multiple machine learning methods aided virtual screening of Na V 1.5 inhibitors.
Weikaixin KongWeiran HuangChao PengBowen ZhangGuifang DuanWeining MaZhuo HuangPublished in: Journal of cellular and molecular medicine (2022)
Na v 1.5 sodium channels contribute to the generation of the rapid upstroke of the myocardial action potential and thereby play a central role in the excitability of myocardial cells. At present, the patch clamp method is the gold standard for ion channel inhibitor screening. However, this method has disadvantages such as high technical difficulty, high cost and low speed. In this study, novel machine learning models to screen chemical blockers were developed to overcome the above shortage. The data from the ChEMBL Database were employed to establish the machine learning models. Firstly, six molecular fingerprints together with five machine learning algorithms were used to develop 30 classification models to predict effective inhibitors. A validation and a test set were used to evaluate the performance of the models. Subsequently, the privileged substructures tightly associated with the inhibition of the Na v 1.5 ion channel were extracted using the bioalerts Python package. In the validation set, the RF-Graph model performed best. Similarly, RF-Graph produced the best result in the test set in which the Prediction Accuracy (Q) was 0.9309 and Matthew's correlation coefficient was 0.8627, further indicating the model had high classification ability. The results of the privileged substructures indicated Sulfa structures and fragments with large Steric hindrance tend to block Na v 1.5. In the unsupervised learning task of identifying sulfa drugs, MACCS and Graph fingerprints had good results. In summary, effective machine learning models have been constructed which help to screen potential inhibitors of the Na v 1.5 ion channel and key privileged substructures with high affinity were also extracted.
Keyphrases
- machine learning
- big data
- artificial intelligence
- deep learning
- convolutional neural network
- high throughput
- heart failure
- computed tomography
- oxidative stress
- high resolution
- wastewater treatment
- cell proliferation
- signaling pathway
- diffusion weighted imaging
- electronic health record
- magnetic resonance imaging
- mass spectrometry
- quantum dots
- single cell
- data analysis
- human health
- single molecule
- endoplasmic reticulum stress
- adverse drug
- climate change
- angiotensin converting enzyme