Accurate Protein p K a Prediction with Physical Organic Chemistry Guided 3D Protein Representation.
Siyuan LiuQi YangLong ZhangSanzhong LuoPublished in: Journal of chemical information and modeling (2024)
Protein p K a is a fundamental physicochemical parameter that dictates protein structure and function. However, accurately determining protein site-p K a values remains a substantial challenge, both experimentally and theoretically. In this study, we introduce a physical organic approach, leveraging a protein structural and physical-organic-parameter-based representation (P-SPOC), to develop a rapid and intuitive model for protein p K a prediction. Our P-SPOC model achieves state-of-the-art predictive accuracy, with a mean absolute error (MAE) of 0.33 p K a units. Furthermore, we have incorporated advanced protein structure prediction models, like AlphaFold2, to approximate structures for proteins lacking three-dimensional representations, which enhances the applicability of our model in the context of structure-undetermined protein research. To promote broader accessibility within the research community, an online prediction interface was also established at isyn.luoszgroup.com.