In silico prediction of boiling point, octanol-water partition coefficient, and retention time index of polycyclic aromatic hydrocarbons through machine learning.
Linkang SunMin ZhangLiangxu XieQian GaoXiaojun XuLei XuPublished in: Chemical biology & drug design (2022)
Polycyclic aromatic hydrocarbons (PAHs), a special class of persistent organic pollutants (POPs) with two or more aromatic rings, have received extensive attention owing to their carcinogenic, mutagenic, and teratogenic effects. Quantitative structure-property relationship (QSPR) is powerful chemometric method to correlate structural descriptors of PAHs with their physicochemical properties. In this manuscript, a QSPR study of PAHs was performed to predict their boiling point (bp), octanol-water partition coefficient (LogK ow ), and retention time index (RI). In addition to traditional molecular descriptors, structural fingerprints play an important role in the correlation of the above properties. Three regression methods, partial least squares (PLS), multiple linear regression (MLR), and genetic function approximation (GFA), were used to establish QSPR models for each property of PAHs. The correlation coefficient (R 2 test ) and root mean square error (RMSE) of best model were 0.980 and 24.39% (PLS), 0.979 and 35.80% (GFA), 0.926 and 22.90% (MLR) for bp, LogK ow, and RI, respectively. The model proposed here can be used to estimate physicochemical properties and inform toxicity prediction of environmental chemicals.
Keyphrases
- polycyclic aromatic hydrocarbons
- machine learning
- diffusion weighted imaging
- oxidative stress
- working memory
- molecular docking
- risk assessment
- gene expression
- amino acid
- magnetic resonance
- computed tomography
- climate change
- heavy metals
- drinking water
- mass spectrometry
- molecular dynamics simulations
- contrast enhanced