Login / Signup

Design of intrinsically disordered protein variants with diverse structural properties.

Francesco PesceAnne BremerGiulio TeseiJesse B HopkinsChristy R R GraceTanja MittagKresten Lindorff-Larsen
Published in: bioRxiv : the preprint server for biology (2023)
Intrinsically disordered proteins (IDPs) perform a wide range of functions in biology, suggesting that the ability to design IDPs could help expand the repertoire of proteins with novel functions. Designing IDPs with specific structural or functional properties has, however, been diffcult, in part because determining accurate conformational ensembles of IDPs generally requires a combination of computational modelling and experiments. Motivated by recent advancements in effcient physics-based models for simulations of IDPs, we have developed a general algorithm for designing IDPs with specific structural properties. We demonstrate the power of the algorithm by generating variants of naturally occurring IDPs with different levels of compaction and that vary more than 100 fold in their propensity to undergo phase separation, even while keeping a fixed amino acid composition. We experimentally tested designs of variants of the low-complexity domain of hnRNPA1 and find high accuracy in our computational predictions, both in terms of single-chain compaction and propensity to undergo phase separation. We analyze the sequence features that determine changes in compaction and propensity to phase separate and find an overall good agreement with previous findings for naturally occurring sequences. Our general, physics-based method enables the design of disordered sequences with specified conformational properties. Our algorithm thus expands the toolbox for protein design to include also the most flexible proteins and will enable the design of proteins whose functions exploit the many properties afforded by protein disorder.
Keyphrases
  • amino acid
  • machine learning
  • copy number
  • molecular dynamics
  • deep learning
  • protein protein
  • single molecule
  • high resolution
  • binding protein
  • small molecule
  • mass spectrometry
  • dna methylation