Login / Signup

Protein structure prediction assisted with sparse NMR data in CASP13.

Davide SalaYuanpeng Janet HuangCasey A ColeDavid A SnyderGaohua LiuYojiro IshidaG V T SwapnaKelly P BrockChris SanderKrzysztof FidelisAndriy KryshtafovychMasayori InouyeRoberto TejeroHomayoun ValafarAntonio RosatoGaetano T Montelione
Published in: Proteins (2020)
CASP13 has investigated the impact of sparse NMR data on the accuracy of protein structure prediction. NOESY and 15 N-1 H residual dipolar coupling data, typical of that obtained for 15 N,13 C-enriched, perdeuterated proteins up to about 40 kDa, were simulated for 11 CASP13 targets ranging in size from 80 to 326 residues. For several targets, two prediction groups generated models that are more accurate than those produced using baseline methods. Real NMR data collected for a de novo designed protein were also provided to predictors, including one data set in which only backbone resonance assignments were available. Some NMR-assisted prediction groups also did very well with these data. CASP13 also assessed whether incorporation of sparse NMR data improves the accuracy of protein structure prediction relative to nonassisted regular methods. In most cases, incorporation of sparse, noisy NMR data results in models with higher accuracy. The best NMR-assisted models were also compared with the best regular predictions of any CASP13 group for the same target. For six of 13 targets, the most accurate model provided by any NMR-assisted prediction group was more accurate than the most accurate model provided by any regular prediction group; however, for the remaining seven targets, one or more regular prediction method provided a more accurate model than even the best NMR-assisted model. These results suggest a novel approach for protein structure determination, in which advanced prediction methods are first used to generate structural models, and sparse NMR data is then used to validate and/or refine these models.
Keyphrases
  • high resolution
  • magnetic resonance
  • electronic health record
  • solid state
  • big data
  • protein protein
  • small molecule
  • data analysis
  • mass spectrometry
  • heat shock protein
  • neural network
  • solid phase extraction