Enhancing the Small-Scale Screenable Biological Space beyond Known Chemogenomics Libraries with Gray Chemical Matter─Compounds with Novel Mechanisms from High-Throughput Screening Profiles.
Jason R ThomasClaude SheltonJason MurphyScott BrittainMark-Anthony BrayPeter AspesiJohn ConcannonFrederick J KingRobert J IhryDaniel J HoMartin HenaultAndrea HadjikyriacouMarilisa NeriFrederic D SigoillotHelen T PhamMatthew ShumLouise BarysMichael D JonesEric J MartinAnke BlechschmidtSébastien RieffelThomas J TroxlerFelipa A MapaJeremy L JenkinsRishi K JainPeter S KutchukianMarkus SchirleSteffen RennerPublished in: ACS chemical biology (2024)
Phenotypic assays have become an established approach to drug discovery. Greater disease relevance is often achieved through cellular models with increased complexity and more detailed readouts, such as gene expression or advanced imaging. However, the intricate nature and cost of these assays impose limitations on their screening capacity, often restricting screens to well-characterized small compound sets such as chemogenomics libraries. Here, we outline a cheminformatics approach to identify a small set of compounds with likely novel mechanisms of action (MoAs), expanding the MoA search space for throughput limited phenotypic assays. Our approach is based on mining existing large-scale, phenotypic high-throughput screening (HTS) data. It enables the identification of chemotypes that exhibit selectivity across multiple cell-based assays, which are characterized by persistent and broad structure activity relationships (SAR). We validate the effectiveness of our approach in broad cellular profiling assays (Cell Painting, DRUG-seq, and Promotor Signature Profiling) and chemical proteomics experiments. These experiments revealed that the compounds behave similarly to known chemogenetic libraries, but with a notable bias toward novel protein targets. To foster collaboration and advance research in this area, we have curated a public set of such compounds based on the PubChem BioAssay dataset and made it available for use by the scientific community.
Keyphrases
- single cell
- high throughput
- rna seq
- gene expression
- drug discovery
- healthcare
- randomized controlled trial
- mental health
- high resolution
- dna methylation
- genome wide
- cell therapy
- systematic review
- mass spectrometry
- stem cells
- emergency department
- big data
- machine learning
- photodynamic therapy
- artificial intelligence
- bone marrow
- mesenchymal stem cells
- drug induced