Login / Signup

Synergy conformal prediction applied to large-scale bioactivity datasets and in federated learning.

Ulf NorinderOla SpjuthFredrik Svensson
Published in: Journal of cheminformatics (2021)
Confidence predictors can deliver predictions with the associated confidence required for decision making and can play an important role in drug discovery and toxicity predictions. In this work we investigate a recently introduced version of conformal prediction, synergy conformal prediction, focusing on the predictive performance when applied to bioactivity data. We compare the performance to other variants of conformal predictors for multiple partitioned datasets and demonstrate the utility of synergy conformal predictors for federated learning where data cannot be pooled in one location. Our results show that synergy conformal predictors based on training data randomly sampled with replacement can compete with other conformal setups, while using completely separate training sets often results in worse performance. However, in a federated setup where no method has access to all the data, synergy conformal prediction is shown to give promising results. Based on our study, we conclude that synergy conformal predictors are a valuable addition to the conformal prediction toolbox.
Keyphrases
  • drug discovery
  • decision making
  • big data
  • oxidative stress
  • machine learning
  • copy number
  • single cell
  • open label
  • oxide nanoparticles