Sampling Conformational Ensembles of Highly Dynamic Proteins via Generative Deep Learning.
Talant Ruzmetov, Ta I Hung, Saisri Padmaja Jonnalagedda, Si-Han Chen, Parisa Fasihianifard, Zhefeng Guo, Bir Bhanu, Chia-En A Chang
Published in: Research square (2024)
Proteins are inherently dynamic, and their conformational ensembles are functionally important in biology. Large-scale motions may govern protein structure-function relationships, and numerous transient but stable conformations of intrinsically disordered proteins (IDPs) can play a crucial role in biological function. Investigating conformational ensembles to understand the regulation and disease-related aggregation of IDPs is challenging both experimentally and computationally. In this paper, we first develop an unsupervised deep learning model, termed Internal Coordinate Net (ICoN), that learns the physical principles of conformational changes from molecular dynamics (MD) simulation data. Second, we select interpolated data points in the learned latent space to rapidly identify novel synthetic conformations with sophisticated, large-scale sidechain and backbone arrangements. Third, applied to the highly dynamic amyloid-β 1-42 (Aβ42) monomer, the model provided a comprehensive sampling of Aβ42's conformational landscape. Analysis of these synthetic conformations revealed conformational clusters that can be used to rationalize experimental findings. Additionally, the method can identify novel conformations, absent from the training data, that show important interactions in atomistic detail. The new synthetic conformations showed distinct sidechain rearrangements, which were probed by our EPR and amino acid substitution studies. The proposed approach is highly transferable and can be trained on any available data. The work also demonstrates the ability of deep learning to use learned natural atomistic motions in protein conformation sampling.
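The latent-space interpolation step mentioned in the abstract can be illustrated with a minimal sketch. It assumes that each MD frame has already been encoded into a low-dimensional latent vector, and it uses plain linear interpolation between two such vectors. The function name `interpolate_latent`, the 3-dimensional latent codes, and the choice of linear interpolation are all assumptions made for illustration. The abstract does not specify the ICoN architecture or its interpolation scheme, and the encoder and decoder are not shown here.

```python
import numpy as np


def interpolate_latent(z_start, z_end, n_points):
    """Return n_points latent vectors evenly spaced on the straight
    segment from z_start to z_end (both endpoints included).

    Hypothetical helper: linear interpolation is an assumption, not
    necessarily the scheme used by the authors.
    """
    z_start = np.asarray(z_start, dtype=float)
    z_end = np.asarray(z_end, dtype=float)
    if z_start.ndim != 1 or z_start.shape != z_end.shape:
        raise ValueError("z_start and z_end must be 1-D vectors of equal length")
    if n_points < 2:
        raise ValueError("n_points must be at least 2 to include both endpoints")
    # Column of mixing weights in [0, 1], broadcast across latent dimensions.
    t = np.linspace(0.0, 1.0, n_points)[:, None]
    return (1.0 - t) * z_start + t * z_end


# Hypothetical latent codes of two encoded MD frames (made-up values).
z_a = np.array([0.0, 0.0, 0.0])
z_b = np.array([1.0, 2.0, -1.0])

# Five points along the segment; in a full pipeline each row would be
# passed to a trained decoder to obtain a synthetic conformation.
path = interpolate_latent(z_a, z_b, 5)
print(path)
```

In a complete workflow, each interpolated vector would be decoded back into internal coordinates and then into an atomistic structure; that decoding step depends on the trained model and is deliberately left out of this sketch.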
Keyphrases
- molecular dynamics
- deep learning
- molecular dynamics simulations
- density functional theory
- electronic health record
- amino acid
- big data
- single molecule
- artificial intelligence
- machine learning
- convolutional neural network
- single cell
- mass spectrometry
- virtual reality
- small molecule
- protein protein
- brain injury
- blood brain barrier