Privileged Structural Motif Detection and Analysis Using Generative Topographic Maps.
Shilva KayasthaDragos HorvathErik GilbergMichael GütschowJürgen BajorathAlexander VarnekPublished in: Journal of chemical information and modeling (2017)
Identification of "privileged structural motifs" associated with specific target families is of particular importance for designing novel bioactive compounds. Here, we demonstrate that they can be extracted from a data distribution represented on a two-dimensional map obtained by Generative Topographic Mapping (GTM). In GTM, structurally related molecules are grouped together on the map. Zones of the map preferentially populated by target-specific compounds were delineated, which helped to capture common substructures on the basis of which these compounds were grouped together by GTM. Such privileged structural motifs were identified across three major target superfamilies including proteases, kinases, and G protein coupled receptors. Traditionally, the search for privileged structural motifs focused on scaffolds, whereas motifs were detected here without prior knowledge of compound classification in GTMs. This alternative way of navigating medicinal chemistry space further extends the classical, scaffold-centric approach. Importantly, detected motifs might also comprise fuzzy sets of similar scaffolds, pharmacophore-like patterns, or, by contrast, well-defined scaffolds with specific substituent patterns.