Login / Signup

De novo design of protein interactions with learned surface fingerprints.

Pablo GainzaSarah WehrleAlexandra Van Hall-BeauvaisAnthony MarchandAndreas ScheckZander HarteveldStephen BuckleyDongchun NiShuguang TanFreyr SverrissonCasper GoverdePriscilla TurelliCharlène RaclotAlexandra TeslenkoMartin PacesaStéphane RossetSandrine GeorgeonJane MarsdenAaron S PetruzzellaKefang LiuZepeng XuYan ChaiPu HanGeorge Fu GaoElisa OricchioBeat FierzDidier TronoHenning StahlbergMichael BronsteinBruno E Correia
Published in: Nature (2023)
Physical interactions between proteins are essential for most biological processes governing life 1 . However, the molecular determinants of such interactions have been challenging to understand, even as genomic, proteomic and structural data increase. This knowledge gap has been a major obstacle for the comprehensive understanding of cellular protein-protein interaction networks and for the de novo design of protein binders that are crucial for synthetic biology and translational applications 2-9 . Here we use a geometric deep-learning framework operating on protein surfaces that generates fingerprints to describe geometric and chemical features that are critical to drive protein-protein interactions 10 . We hypothesized that these fingerprints capture the key aspects of molecular recognition that represent a new paradigm in the computational design of novel protein interactions. As a proof of principle, we computationally designed several de novo protein binders to engage four protein targets: SARS-CoV-2 spike, PD-1, PD-L1 and CTLA-4. Several designs were experimentally optimized, whereas others were generated purely in silico, reaching nanomolar affinity with structural and mutational characterization showing highly accurate predictions. Overall, our surface-centric approach captures the physical and chemical determinants of molecular recognition, enabling an approach for the de novo design of protein interactions and, more broadly, of artificial proteins with function.
Keyphrases
  • protein protein
  • small molecule
  • sars cov
  • deep learning
  • healthcare
  • amino acid
  • mental health
  • escherichia coli
  • machine learning
  • dna methylation
  • genome wide
  • copy number
  • pseudomonas aeruginosa
  • biofilm formation