Large-scale discovery of protein interactions at residue resolution using co-evolution calculated from genomic sequences.

Anna G Green, Hadeer Elhabashy, Kelly P Brock, Rohan Maddamsetti, Oliver Kohlbacher, Debora S Marks
Published in: Nature Communications (2021)
Increasing numbers of protein interactions have been identified in high-throughput experiments, but only a small proportion have solved structures. Recently, sequence coevolution-based approaches have led to a breakthrough in predicting monomer protein structures and protein interaction interfaces. Here, we address the challenges of large-scale interaction prediction at residue resolution with a fast alignment concatenation method and a probabilistic score for the interaction of residues. Importantly, this method (EVcomplex2) is able to assess the likelihood of a protein interaction, as we show here applied to large-scale experimental datasets where the pairwise interactions are unknown. We predict 504 interactions de novo in the E. coli membrane proteome, including 243 that are newly discovered. While EVcomplex2 does not require available structures, coevolving residue pairs can be used to produce structural models of protein interactions, as done here for membrane complexes including the Flagellar Hook-Filament Junction and the Tol/Pal complex.
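The abstract mentions an alignment concatenation step without describing it. As a rough illustration of the general idea only, the sketch below joins the aligned sequences of two proteins when they come from the same genome, so that coevolution can later be scored across the two proteins. It is not the EVcomplex2 implementation: the function name `concatenate_alignments`, the dictionary layout, the toy sequences, and the assumption of exactly one sequence per genome (no paralogs) are all hypothetical.

```python
from typing import Dict


def concatenate_alignments(aln_a: Dict[str, str], aln_b: Dict[str, str]) -> Dict[str, str]:
    """Join aligned sequences of protein A and protein B per shared genome.

    Both inputs map a genome identifier to one aligned sequence (hypothetical
    layout; assumes a single sequence per genome, which ignores paralogs).
    """
    # Only genomes present in both alignments can contribute a paired row.
    shared = sorted(set(aln_a) & set(aln_b))
    return {genome: aln_a[genome] + aln_b[genome] for genome in shared}


# Toy alignments, invented for illustration only.
aln_a = {"ecoli": "MKT-A", "salmonella": "MRT-A", "vibrio": "MKTLA"}
aln_b = {"ecoli": "GG-H", "salmonella": "GGSH", "yersinia": "GA-H"}

paired = concatenate_alignments(aln_a, aln_b)
```

Genomes found in only one of the two alignments (`vibrio` and `yersinia` here) are dropped, so every row of the paired alignment has the same length: the length of alignment A plus the length of alignment B.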
Keyphrases
  • amino acid
  • high throughput
  • protein protein
  • binding protein
  • high resolution
  • DNA methylation
  • single cell
  • RNA-seq