Explainable Prediction of Hydrophilic/Hydrophobic Property of Polymer Brush Surfaces by Chemical Modeling and Machine Learning.
Shiwei SuTsukuru MasudaMadoka TakaiPublished in: The journal of physical chemistry. B (2024)
Polymer informatics has attracted increasing attention as a specialized branch of material informatics. Hydrophilicity/hydrophobicity is one of the most important properties of interfaces involved in antifouling, self-cleaning, antifogging, oil/water separation, protein adsorption, and bioseparation. Establishing a quantitative structure-property relationship for the hydrophilicity/hydrophobicity of polymeric interfaces could significantly benefit from machine learning modeling. In this study, we aimed to construct machine learning models that could predict the static water contact angle (CA) as an indicator of hydrophilicity/hydrophobicity based on a data set of polymer brushes. The features of the polymer brush surfaces were numerically described using their grafted structures (thickness) and molecular descriptors derived from their chemical structures. We achieved accurate prediction and understanding of important parameters by employing appropriate molecular descriptors considering the Pearson correlation and machine learning models trained with nested cross-validation. The model interpretation by Shapley additive extension analysis indicated that the amount of partial polar/nonpolar structure in the molecule as well as the averaged hydrophobicity represented by MolLogP plays an important role in determining the CA. Moreover, the model can predict the CAs of polymer brushes composed of chemical structures that are not present in existing databases. The CA values of the hypothetical polymer brushes are predicted.
Keyphrases
- machine learning
- big data
- high resolution
- artificial intelligence
- electronic health record
- liquid chromatography
- escherichia coli
- crispr cas
- staphylococcus aureus
- mass spectrometry
- biofilm formation
- protein kinase
- ionic liquid
- palliative care
- optical coherence tomography
- fatty acid
- single molecule
- data analysis
- solid phase extraction