Computational design of sequence-specific DNA-binding proteins.
Cameron J GlasscockRobert PecoraroRyan McHughLindsey A DoyleWei ChenOlivier BoivinBeau LonnquistEmily NaYuliya PolitanskaHugh K HaddoxDavid CoxChristoffer H NornBrian CoventryInna GoreshnikDionne K VafeadosGyu Rie LeeRaluca GordânBarry L StoddardFrank DiMaioDavid BakerPublished in: bioRxiv : the preprint server for biology (2023)
Sequence-specific DNA-binding proteins (DBPs) play critical roles in biology and biotechnology, and there has been considerable interest in the engineering of DBPs with new or altered specificities for genome editing and other applications. While there has been some success in reprogramming naturally occurring DBPs using selection methods, the computational design of new DBPs that recognize arbitrary target sites remains an outstanding challenge. We describe a computational method for the design of small DBPs that recognize specific target sequences through interactions with bases in the major groove, and employ this method in conjunction with experimental screening to generate binders for 5 distinct DNA targets. These binders exhibit specificity closely matching the computational models for the target DNA sequences at as many as 6 base positions and affinities as low as 30-100 nM. The crystal structure of a designed DBP-target site complex is in close agreement with the design model, highlighting the accuracy of the design method. The designed DBPs function in both Escherichia coli and mammalian cells to repress and activate transcription of neighboring genes. Our method is a substantial step towards a general route to small and hence readily deliverable sequence-specific DBPs for gene regulation and editing.