Cross-Classified Multilevel Modelling of the Effectiveness of Similarity-Based Virtual Screening.
Lucyantie Mazalan, Andrew J. D. Bell, Laura Sbaffi, Peter Willett
Published in: ChemMedChem (2017)
The screening effectiveness of a chemical similarity search depends on a range of factors, including the bioactivity of interest, the types of similarity coefficient and fingerprint that comprise the similarity measure, and the nature of the reference structure that is being searched against a database. This study introduces the use of cross-classified multilevel modelling as a way to investigate the relative importance of these four factors when carrying out similarity searches on the ChEMBL database. Two principal conclusions can be drawn from the analyses: that the fingerprint plays a more important role than the similarity coefficient in determining the effectiveness of a similarity search, and that comparative studies of similarity measures should involve many more reference structures than has been the case in much previous work.
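As a rough illustration of what a similarity measure means here (a coefficient applied to a pair of fingerprints), the following minimal sketch ranks a toy database against one reference structure. It assumes the Tanimoto coefficient and fingerprints stored as sets of on-bit positions. The abstract does not say which coefficients or fingerprints the study used, so these choices, the function names, and the toy molecules are all hypothetical.

```python
from typing import Dict, List, Set, Tuple


def tanimoto(a: Set[int], b: Set[int]) -> float:
    """Tanimoto coefficient for two fingerprints given as sets of on-bit positions."""
    if not a and not b:
        return 0.0  # convention chosen for this sketch: two empty fingerprints score 0
    common = len(a & b)
    return common / (len(a) + len(b) - common)


def rank_database(reference: Set[int],
                  database: Dict[str, Set[int]]) -> List[Tuple[str, float]]:
    """Score every database molecule against the reference, most similar first."""
    scored = [(name, tanimoto(reference, fp)) for name, fp in database.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Toy data (hypothetical): bit positions stand in for structural features.
reference = {1, 4, 7, 9}
database = {
    "mol_a": {1, 4, 7, 9},
    "mol_b": {1, 4, 8},
    "mol_c": {2, 3, 5},
}

ranking = rank_database(reference, database)
for name, score in ranking:
    print(f"{name}: {score:.2f}")
```

Swapping `tanimoto` for another coefficient, or changing how the bit sets are generated, changes the similarity measure while leaving the ranking procedure the same. The bioactivity and the reference structure are the other two factors the study varies.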