SMILES to Smell: Decoding the Structure-Odor Relationship of Chemical Compounds Using the Deep Neural Network Approach.
Anju SharmaRajnish KumarShabnam RanjtaPritish Kumar VaradwajPublished in: Journal of chemical information and modeling (2021)
Finding the relationship between the structure of an odorant molecule and its associated smell has always been an extremely challenging task. The major limitation in establishing the structure-odor relation is the vague and ambiguous nature of the descriptor-labeling, especially when the sources of odorant molecules are different. With the advent of deep networks, data-driven approaches have been substantiated to achieve more accurate linkages between the chemical structure and its smell. In this study, the deep neural network (DNN) with physiochemical properties and molecular fingerprints (PPMF) and the convolution neural network (CNN) with chemical-structure images (IMG) are developed to predict the smells of chemicals using their SMILES notations. A data set of 5185 chemical compounds with 104 smell percepts was used to develop the multilabel prediction models. The accuracies of smell prediction from DNN + PPMF and CNN + IMG (Xception based) were found to be 97.3 and 98.3%, respectively, when applied on an independent test set of chemicals. The deep learning architecture combining both DNN + PPMF and CNN + IMG prediction models is proposed, which classifies smells and may help understand the generic mechanism underlying the relationship between chemical structure and smell perception.