Extensive exploration of structure activity relationships for the SARS-CoV-2 macrodomain from shape-based fragment merging and active learning.
Galen J CorreyMoira M RachmanTakaya TogoStefan GahbauerYagmur Umay DorukMaisie G V StevensPriyadarshini JaishankarBrian KelleyBrian GoldmanMolly SchmidtTrevor KramerAlan AshworthPatrick F RileyBrian K ShoichetAdam R RensloW Patrick WaltersBrian K ShoichetPublished in: bioRxiv : the preprint server for biology (2024)
The macrodomain contained in the SARS-CoV-2 non-structural protein 3 (NSP3) is required for viral pathogenesis and lethality. Inhibitors that block the macrodomain could be a new therapeutic strategy for viral suppression. We previously performed a large-scale X-ray crystallography-based fragment screen and discovered a sub-micromolar inhibitor by fragment linking. However, this carboxylic acid-containing lead had poor membrane permeability and other liabilities that made optimization difficult. Here, we developed a shape- based virtual screening pipeline - FrankenROCS - to identify new macrodomain inhibitors using fragment X-ray crystal structures. We used FrankenROCS to exhaustively screen the Enamine high-throughput screening (HTS) collection of 2.1 million compounds and selected 39 compounds for testing, with the most potent compound having an IC 50 value equal to 130 μM. We then paired FrankenROCS with an active learning algorithm (Thompson sampling) to efficiently search the Enamine REAL database of 22 billion molecules, testing 32 compounds with the most potent having an IC 50 equal to 220 μM. Further optimization led to analogs with IC 50 values better than 10 μM, with X-ray crystal structures revealing diverse binding modes despite conserved chemical features. These analogs represent a new lead series with improved membrane permeability that is poised for optimization. In addition, the collection of 137 X-ray crystal structures with associated binding data will serve as a resource for the development of structure-based drug discovery methods. FrankenROCS may be a scalable method for fragment linking to exploit ever-growing synthesis-on- demand libraries.
Keyphrases
- sars cov
- high resolution
- dual energy
- drug discovery
- respiratory syndrome coronavirus
- electron microscopy
- high throughput
- computed tomography
- endothelial cells
- machine learning
- molecular docking
- binding protein
- transcription factor
- magnetic resonance imaging
- big data
- dna binding
- anti inflammatory
- electronic health record
- protein protein
- molecular dynamics simulations