Molecular basis for the preferential recognition of β1,3-1,4-glucans by the family 11 carbohydrate-binding module from Clostridium thermocellum.
Diana O Ribeiro, Aldino Viegas, Virgínia M R Pires, Joao Medeiros-Silva, Pedro Bule, Wengang Chai, Filipa Marcelo, Carlos M G A Fontes, Eurico J Cabrita, Angelina S Palma, Ana Luísa Carvalho
Published in: The FEBS journal (2019)
Understanding the specific molecular interactions between proteins and β1,3-1,4-mixed-linked D-glucans is fundamental to harnessing the full biological and biotechnological potential of these carbohydrates and of the proteins that specifically recognize them. The family 11 carbohydrate-binding module from Clostridium thermocellum (CtCBM11) is known for its binding preference for β1,3-1,4-mixed-linked over β1,4-linked glucans. Despite the growing industrial interest in this protein for the biotransformation of lignocellulosic biomass, the molecular determinants of its ligand specificity are not well defined. In this report, a combination of methodologies was used to unravel, at the molecular level, ligand recognition by CtCBM11. Analysis of the interaction by carbohydrate microarrays and NMR, together with the crystal structures of CtCBM11 bound to β1,3-1,4-linked glucose oligosaccharides, showed that both the chain length and the position of the β1,3-linkage are important for recognition, and identified the tetrasaccharide sequence Glcβ1,4Glcβ1,4Glcβ1,3Glc as the minimum epitope required for binding. The structural data, along with site-directed mutagenesis and ITC studies, demonstrated the specificity of CtCBM11 for the twisted conformation of β1,3-1,4-mixed-linked glucans. This specificity is mediated by a conformation-selection mechanism of the ligand in the binding cleft through CH-π stacking and a hydrogen-bonding network, and it depends not only on ligand chain length but also on the presence of a β1,3-linkage at the reducing end and at specific positions along the β1,4-linked glucan chain. Understanding the detailed mechanism by which CtCBM11 distinguishes between linear and mixed-linked β-glucans supports its exploitation for the design of new biomolecules with improved capabilities and applications in health and agriculture.
DATABASE: Structural data are available in the Protein Data Bank under the accession codes 6R3M and 6R31.
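The sequence criterion stated in the abstract (the tetrasaccharide Glcβ1,4Glcβ1,4Glcβ1,3Glc as minimum epitope, with a β1,3-linkage at the reducing end) can be illustrated with a small pattern check. This is only a toy sketch, not the authors' method and not a binding predictor: it tests whether a linear glucose oligosaccharide, written in the same notation as the abstract, contains that linkage pattern. The function names and the `b1-4`/`b1-3` labels are hypothetical and introduced here for illustration.

```python
from typing import List, Sequence, Tuple

# Linkages are listed from the non-reducing end to the reducing end.
# This tuple encodes Glcβ1,4Glcβ1,4Glcβ1,3Glc, the minimum epitope named in the abstract.
MIN_EPITOPE: Tuple[str, ...] = ("b1-4", "b1-4", "b1-3")


def parse_linkages(sequence: str) -> List[str]:
    """Split a string such as 'Glcβ1,4Glcβ1,3Glc' into linkage labels.

    Assumes a linear chain made only of 'Glc' residues (illustrative notation).
    """
    parts = sequence.split("Glc")
    return [p.replace("β", "b").replace(",", "-") for p in parts if p]


def contains_min_epitope(linkages: Sequence[str]) -> bool:
    """True if the epitope linkage pattern occurs anywhere along the chain."""
    n = len(MIN_EPITOPE)
    return any(
        tuple(linkages[i:i + n]) == MIN_EPITOPE
        for i in range(len(linkages) - n + 1)
    )


def epitope_at_reducing_end(linkages: Sequence[str]) -> bool:
    """True if the chain ends (reducing end) with the epitope linkage pattern."""
    return tuple(linkages[-len(MIN_EPITOPE):]) == MIN_EPITOPE


if __name__ == "__main__":
    for name in (
        "Glcβ1,4Glcβ1,4Glcβ1,3Glc",   # the minimum epitope itself
        "Glcβ1,4Glcβ1,4Glcβ1,4Glc",   # cellotetraose, β1,4-linked only
        "Glcβ1,4Glcβ1,3Glc",          # too short to carry the epitope
    ):
        links = parse_linkages(name)
        print(name, contains_min_epitope(links), epitope_at_reducing_end(links))
```

The check is purely about sequence. The abstract attributes recognition to the twisted ligand conformation, CH-π stacking and hydrogen bonding, none of which a string comparison captures.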
Keyphrases
- binding protein
- electronic health record
- healthcare
- dna binding
- public health
- magnetic resonance
- big data
- molecular dynamics simulations
- genome wide
- type diabetes
- mental health
- protein protein
- skeletal muscle
- machine learning
- insulin resistance
- heavy metals
- artificial intelligence
- data analysis
- deep learning
- blood glucose
- structural basis
- neural network