Predicting protein conformational motions using energetic frustration analysis and AlphaFold2.
Xingyue GuanQian-Yuan TangWeitong RenMingchen ChenWei WangPeter G WolynesWenfei LiPublished in: Proceedings of the National Academy of Sciences of the United States of America (2024)
Proteins perform their biological functions through motion. Although high throughput prediction of the three-dimensional static structures of proteins has proved feasible using deep-learning-based methods, predicting the conformational motions remains a challenge. Purely data-driven machine learning methods encounter difficulty for addressing such motions because available laboratory data on conformational motions are still limited. In this work, we develop a method for generating protein allosteric motions by integrating physical energy landscape information into deep-learning-based methods. We show that local energetic frustration, which represents a quantification of the local features of the energy landscape governing protein allosteric dynamics, can be utilized to empower AlphaFold2 (AF2) to predict protein conformational motions. Starting from ground state static structures, this integrative method generates alternative structures as well as pathways of protein conformational motions, using a progressive enhancement of the energetic frustration features in the input multiple sequence alignment sequences. For a model protein adenylate kinase, we show that the generated conformational motions are consistent with available experimental and molecular dynamics simulation data. Applying the method to another two proteins KaiB and ribose-binding protein, which involve large-amplitude conformational changes, can also successfully generate the alternative conformations. We also show how to extract overall features of the AF2 energy landscape topography, which has been considered by many to be black box. Incorporating physical knowledge into deep-learning-based structure prediction algorithms provides a useful strategy to address the challenges of dynamic structure prediction of allosteric proteins.
Keyphrases
- molecular dynamics simulations
- deep learning
- binding protein
- machine learning
- molecular dynamics
- single molecule
- high throughput
- protein protein
- amino acid
- small molecule
- molecular docking
- high resolution
- atrial fibrillation
- single cell
- big data
- oxidative stress
- physical activity
- healthcare
- multiple sclerosis
- social media
- protein kinase