Login / Signup

Learning Correlations between Internal Coordinates to Improve 3D Cartesian Coordinates for Proteins.

Jie LiOufan ZhangSeokyoung LeeAshley NaminiZi Hao LiuJoão M C TeixeiraJulie Deborah Forman-KayTeresa Head-Gordon
Published in: Journal of chemical theory and computation (2023)
We consider a generic representation problem of internal coordinates (bond lengths, valence angles, and dihedral angles) and their transformation to 3-dimensional Cartesian coordinates of a biomolecule. We show that the internal-to-Cartesian process relies on correctly predicting chemically subtle correlations among the internal coordinates themselves, and learning these correlations increases the fidelity of the Cartesian representation. We developed a machine learning algorithm, Int2Cart, to predict bond lengths and bond angles from backbone torsion angles and residue types of a protein, which allows reconstruction of protein structures better than using fixed bond lengths and bond angles or a static library method that relies on backbone torsion angles and residue types in a local environment. The method is able to be used for structure validation, as we show that the agreement between Int2Cart-predicted bond geometries and those from an AlphaFold 2 model can be used to estimate model quality. Additionally, by using Int2Cart to reconstruct an IDP ensemble, we are able to decrease the clash rate during modeling. The Int2Cart algorithm has been implemented as a publicly accessible python package at https://github.com/THGLab/int2cart.
Keyphrases
  • machine learning
  • transition metal
  • deep learning
  • neural network
  • amino acid
  • big data
  • mass spectrometry
  • small molecule