Generalized biomolecular modeling and design with RoseTTAFold All-Atom.
Rohith Krishna, Jue Wang, Woody Ahern, Pascal Sturmfels, Preetham Venkatesh, Indrek Kalvet, Gyu Rie Lee, Felix S Morey-Burrows, Ivan Anishchenko, Ian R Humphreys, Ryan McHugh, Dionne K Vafeados, Xinting Li, George A Sutherland, Andrew Hitchcock, Christopher Neil Hunter, Alex Kang, Evans Brackenbrough, Asim K Bera, Minkyung Baek, Frank Dimaio, Julien S Baker
Published in: Science (New York, N.Y.) (2024)
Deep learning methods have revolutionized protein structure prediction and design but are currently limited to protein-only systems. We describe RoseTTAFold All-Atom (RFAA), which combines a residue-based representation of amino acids and DNA bases with an atomic representation of all other groups to model assemblies containing proteins, nucleic acids, small molecules, metals, and covalent modifications, given their sequences and chemical structures. By fine-tuning on denoising tasks, we obtain RFdiffusionAA, which builds protein structures around small molecules. Starting from random distributions of amino acid residues surrounding target small molecules, we design proteins that bind the cardiac disease therapeutic digoxigenin, the enzymatic cofactor heme, and the light-harvesting molecule bilin, and we validate them experimentally through crystallography and binding measurements.