Machine Learned Classification of Ligand Intrinsic Activities at Human μ -Opioid Receptor.
Myong In OhMaximilian ShenRuibin LiuLidiya StavitskayaJana ShenPublished in: bioRxiv : the preprint server for biology (2024)
Opioids are small-molecule agonists of μ -opioid receptor ( μ OR), while reversal agents such as naloxone are antagonists of μ OR. Here we developed machine learning (ML) models to classify the intrinsic activities of ligands at the human μ OR based on the SMILE strings and two-dimensional molecular descriptors. We first manually curated a database of 983 small molecules with measured E max values at the human μ OR. Analysis of the chemical space allowed identification of dominant scaffolds and structurally similar agonists and antagonists. Decision tree models and directed message passing neural networks (MPNNs) were then trained to classify agonistic and antagonistic ligands. The hold-out test AUCs (areas under the receiver operator curves) of the extra-tree (ET) and MPNN models are 91.5±3.9% and 91.8± 4.4%, respectively. To overcome the challenge of small dataset, a student-teacher learning method called tri-training with disagreement was tested using an unlabeled dataset comprised of 15,816 ligands of human, mouse, or rat μ OR, κ OR, or δ OR. We found that the tri-training scheme was able to increase the hold-out AUC of MPNN to as high as 95.7%. Our work demonstrates the feasibility of developing ML models to accurately predict the intrinsic activities of μ OR ligands, even with limited data. We envisage potential applications of these models in evaluating uncharacterized substances for public safety risks and discovering new therapeutic agents to counteract opioid overdoses.
Keyphrases
- endothelial cells
- machine learning
- chronic pain
- small molecule
- pain management
- induced pluripotent stem cells
- pluripotent stem cells
- healthcare
- deep learning
- oxidative stress
- mental health
- electronic health record
- risk assessment
- drinking water
- resistance training
- human health
- climate change
- data analysis
- binding protein