Login / Signup

Machine learning for functional protein design.

Pascal NotinNathan RollinsYarin GalChris SanderDebora S Marks
Published in: Nature biotechnology (2024)
Recent breakthroughs in AI coupled with the rapid accumulation of protein sequence and structure data have radically transformed computational protein design. New methods promise to escape the constraints of natural and laboratory evolution, accelerating the generation of proteins for applications in biotechnology and medicine. To make sense of the exploding diversity of machine learning approaches, we introduce a unifying framework that classifies models on the basis of their use of three core data modalities: sequences, structures and functional labels. We discuss the new capabilities and outstanding challenges for the practical design of enzymes, antibodies, vaccines, nanomachines and more. We then highlight trends shaping the future of this field, from large-scale assays to more robust benchmarks, multimodal foundation models, enhanced sampling strategies and laboratory automation.
Keyphrases
  • machine learning
  • big data
  • artificial intelligence
  • amino acid
  • protein protein
  • binding protein
  • high resolution
  • high throughput
  • deep learning
  • mass spectrometry
  • chronic pain
  • sensitive detection