Sampling Conformational Ensembles of Highly Dynamic Proteins via Generative Deep Learning.
Chia-En A Chang, Talant Ruzmetov, Ta I Hung, Saisri Padmaja Jonnalagedda, Si-Han Chen, Parisa Fasihianifard, Zhefeng Guo, Bir Bhanu
Published in: Research Square (2024)
Proteins are inherently dynamic, and their conformational ensembles are functionally important in biology. Large-scale motions may govern protein structure-function relationships, and numerous transient but stable conformations of intrinsically disordered proteins (IDPs) can play a crucial role in biological function. Investigating conformational ensembles to understand the regulation and disease-related aggregation of IDPs is challenging both experimentally and computationally. In this paper, first, an unsupervised deep learning-based model, termed Internal Coordinate Net (ICoN), is developed that learns the physical principles of conformational changes from molecular dynamics (MD) simulation data. Second, interpolated data points in the learned latent space are selected to rapidly identify novel synthetic conformations with sophisticated, large-scale sidechain and backbone arrangements. Third, applied to the highly dynamic amyloid-β1-42 (Aβ42) monomer, the deep learning model provided a comprehensive sampling of Aβ42's conformational landscape. Analysis of these synthetic conformations revealed conformational clusters that can be used to rationalize experimental findings. Additionally, the method can identify novel conformations, not included in the training data, that show important interactions in atomistic detail. New synthetic conformations showed distinct sidechain rearrangements that are probed by our EPR and amino acid substitution studies. The proposed approach is highly transferable and can use any available data for training. The work also demonstrated the ability of deep learning to utilize learned natural atomistic motions in protein conformation sampling.
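The latent-space interpolation step mentioned in the abstract can be illustrated with a minimal sketch. It assumes simple linear interpolation between two latent vectors; the abstract does not state the interpolation scheme, the latent dimension, or how points are selected, so the function name `interpolate_latent`, the 3-dimensional vectors, and the coordinate values below are all hypothetical. Decoding the interpolated points back into atomistic structures would require the trained model and is not shown.

```python
from typing import List, Sequence


def interpolate_latent(
    z_start: Sequence[float],
    z_end: Sequence[float],
    n_points: int,
) -> List[List[float]]:
    """Return n_points evenly spaced vectors strictly between z_start and z_end.

    Assumption: plain linear interpolation; the source does not specify the scheme.
    """
    if len(z_start) != len(z_end):
        raise ValueError("latent vectors must have the same dimension")
    if n_points < 1:
        raise ValueError("n_points must be at least 1")

    points = []
    for k in range(1, n_points + 1):
        # Fraction of the way from z_start to z_end (endpoints excluded).
        t = k / (n_points + 1)
        points.append([(1 - t) * a + t * b for a, b in zip(z_start, z_end)])
    return points


# Hypothetical latent coordinates of two MD frames (illustrative values only).
z_a = [0.0, 1.0, -2.0]
z_b = [4.0, 1.0, 2.0]

synthetic_latents = interpolate_latent(z_a, z_b, 3)
for z in synthetic_latents:
    print(z)
```

In a workflow of this kind, each interpolated latent vector would be passed through the model's decoder to obtain a candidate conformation, which could then be checked against the training set for novelty.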
Keyphrases
- molecular dynamics
- deep learning
- molecular dynamics simulations
- density functional theory
- electronic health record
- amino acid
- machine learning
- big data
- artificial intelligence
- single molecule
- convolutional neural network
- physical activity
- mental health
- virtual reality
- mass spectrometry
- single cell
- brain injury
- subarachnoid hemorrhage
- molecularly imprinted
- small molecule
- simultaneous determination
- crystal structure