Discovery of a structural class of antibiotics with explainable deep learning.
Felix WongErica J ZhengJacqueline A ValeriNina M DonghiaMelis N AnahtarSatotaka OmoriAlicia LiAndres Cubillos-RuizAarti KrishnanWengong JinAbigail L MansonJens FriedrichsRalf HelbigBehnoush HajianDawid K FiejtekFlorence F WagnerHolly H SoutterAshlee M EarlJonathan M StokesLars David RennerJames J CollinsPublished in: Nature (2023)
The discovery of novel structural classes of antibiotics is urgently needed to address the ongoing antibiotic resistance crisis 1-9 . Deep learning approaches have aided in exploring chemical spaces 1,10-15 ; these typically use black box models and do not provide chemical insights. Here we reasoned that the chemical substructures associated with antibiotic activity learned by neural network models can be identified and used to predict structural classes of antibiotics. We tested this hypothesis by developing an explainable, substructure-based approach for the efficient, deep learning-guided exploration of chemical spaces. We determined the antibiotic activities and human cell cytotoxicity profiles of 39,312 compounds and applied ensembles of graph neural networks to predict antibiotic activity and cytotoxicity for 12,076,365 compounds. Using explainable graph algorithms, we identified substructure-based rationales for compounds with high predicted antibiotic activity and low predicted cytotoxicity. We empirically tested 283 compounds and found that compounds exhibiting antibiotic activity against Staphylococcus aureus were enriched in putative structural classes arising from rationales. Of these structural classes of compounds, one is selective against methicillin-resistant S. aureus (MRSA) and vancomycin-resistant enterococci, evades substantial resistance, and reduces bacterial titres in mouse models of MRSA skin and systemic thigh infection. Our approach enables the deep learning-guided discovery of structural classes of antibiotics and demonstrates that machine learning models in drug discovery can be explainable, providing insights into the chemical substructures that underlie selective antibiotic activity.
Keyphrases
- deep learning
- neural network
- staphylococcus aureus
- machine learning
- methicillin resistant staphylococcus aureus
- convolutional neural network
- artificial intelligence
- small molecule
- drug discovery
- high throughput
- endothelial cells
- mouse model
- stem cells
- public health
- mesenchymal stem cells
- escherichia coli
- biofilm formation
- cystic fibrosis
- single cell
- bone marrow