Login / Signup

Evaluation of different virtual screening strategies on the basis of compound sets with characteristic core distributions and dissimilarity relationships.

Tomoyuki MiyaoSwarit JasialJürgen BajorathKimito Funatsu
Published in: Journal of computer-aided molecular design (2019)
In this work, computational compound screening strategies on the basis of two- and three-dimensional (2D and 3D) molecular representations were investigated including similarity searching and support vector machine (SVM) ranking. Calculations based on topological fingerprints and molecular shape queries and features were compared. A unique aspect of the analysis setting apart from previous comparisons of 2D and 3D virtual screening approaches has been the design of compound reference, training, and test data sets with controlled incremental increases in intra-set structural diversity and different categories of structural relationships between reference/training and test sets. The use of these data sets made it possible to assess the relative performance of 2D and 3D screening strategies under increasingly challenging conditions ultimately leading to the use of training and test sets with essentially unrelated structures. The results showed that 3D similarity searching had little advantage over 2D searching in identifying active compounds with remote structural relationships. However, 3D SVM models trained on the basis of shape features were superior to other approaches (including 2D SVM) when the detection of structure-activity relationships became increasingly challenging. Such 3D SVM methods has thus far only been little investigated in virtual screening, proving a wealth of opportunities for further analyses.
Keyphrases
  • virtual reality
  • electronic health record
  • big data
  • atomic force microscopy
  • molecular dynamics
  • high resolution
  • machine learning
  • molecular dynamics simulations
  • real time pcr
  • resistance training
  • body composition