First fully-automated AI/ML virtual screening cascade implemented at a drug discovery centre in Africa.
Gemma TuronJason HlozekJohn G WoodlandAnkur KumarKelly ChibaleMiquel Duran-FrigolaPublished in: Nature communications (2023)
Streamlined data-driven drug discovery remains challenging, especially in resource-limited settings. We present ZairaChem, an artificial intelligence (AI)- and machine learning (ML)-based tool for quantitative structure-activity/property relationship (QSAR/QSPR) modelling. ZairaChem is fully automated, requires low computational resources and works across a broad spectrum of datasets. We describe an end-to-end implementation at the H3D Centre, the leading integrated drug discovery unit in Africa, at which no prior AI/ML capabilities were available. By leveraging in-house data collected over a decade, we have developed a virtual screening cascade for malaria and tuberculosis drug discovery comprising 15 models for key decision-making assays ranging from whole-cell phenotypic screening and cytotoxicity to aqueous solubility, permeability, microsomal metabolic stability, cytochrome inhibition, and cardiotoxicity. We show how computational profiling of compounds, prior to synthesis and testing, can inform progression of frontrunner compounds at H3D. This project is a first-of-its-kind deployment at scale of AI/ML tools in a research centre operating in a low-resource setting.
Keyphrases
- drug discovery
- artificial intelligence
- machine learning
- big data
- deep learning
- single cell
- decision making
- high throughput
- quality improvement
- healthcare
- primary care
- mycobacterium tuberculosis
- rna seq
- molecular docking
- emergency department
- hiv aids
- electronic health record
- stem cells
- molecular dynamics
- bone marrow
- plasmodium falciparum
- mesenchymal stem cells
- drug induced