Phylogenetic spread of sequence data affects fitness of SOD1 consensus enzymes: Insights from sequence statistics and structural analyses.
Venuka Durani GoyalThomas J MaglieryPublished in: Proteins (2018)
Non-natural protein sequences with native-like structures and functions can be constructed successfully using consensus design. This design strategy is relatively well understood in repeat proteins with simple binding function, however detailed studies are lacking in globular enzymes. The SOD1 family is a good model for such studies due to the availability of large amount of sequence and structure data motivated by involvement of human SOD1 in the fatal motor neuron disease amyotrophic lateral sclerosis (ALS). We constructed two consensus SOD1 enzymes from multiple sequence alignments from all organisms and eukaryotic organisms. A significant difference in their catalytic activities shows that the phylogenetic spread of the sequences used affects the fitness of the construct obtained. A mutation in an electrostatic loop and overall design incompatibilities between bacterial and eukaryotic sequences were implicated in this disparity. Based on this analysis, a bioinformatics approach was used to classify mutations thought to cause familial ALS providing a unique high level view of the physical basis of disease-causing aggregation of human SOD1.
Keyphrases
- amyotrophic lateral sclerosis
- endothelial cells
- physical activity
- amino acid
- induced pluripotent stem cells
- electronic health record
- body composition
- clinical practice
- wastewater treatment
- pluripotent stem cells
- big data
- gram negative
- case control
- mental health
- early onset
- high resolution
- transcription factor
- machine learning
- multidrug resistant
- data analysis
- dna binding