Rapid and automated design of two-component protein nanomaterials using ProteinMPNN.
Robbert J de HaasNatalie BrunetteAlexander D GoodsonJustas DauparasSue Y YiErin C YangQuinton DowlingHannah NguyenAlex KangAsim K BeraBanumathi SankaranRenko de VriesJulien S BakerNeil P KingPublished in: Proceedings of the National Academy of Sciences of the United States of America (2024)
The design of protein-protein interfaces using physics-based design methods such as Rosetta requires substantial computational resources and manual refinement by expert structural biologists. Deep learning methods promise to simplify protein-protein interface design and enable its application to a wide variety of problems by researchers from various scientific disciplines. Here, we test the ability of a deep learning method for protein sequence design, ProteinMPNN, to design two-component tetrahedral protein nanomaterials and benchmark its performance against Rosetta. ProteinMPNN had a similar success rate to Rosetta, yielding 13 new experimentally confirmed assemblies, but required orders of magnitude less computation and no manual refinement. The interfaces designed by ProteinMPNN were substantially more polar than those designed by Rosetta, which facilitated in vitro assembly of the designed nanomaterials from independently purified components. Crystal structures of several of the assemblies confirmed the accuracy of the design method at high resolution. Our results showcase the potential of deep learning-based methods to unlock the widespread application of designed protein-protein interfaces and self-assembling protein nanomaterials in biotechnology.