From Protein Sequence to Structure: The Next Frontier in Cross Species Extrapolation for Chemical Safety Evaluations.
Carlie A LaLoneDonovan J BlatzMarissa A JensenSara M F VlietSally MayasichKali Z MattinglyThomas R TransueWilson MelendezAudrey WilkinsonCody W SimmonsCarla A NgChengxin ZhangYang ZhangPublished in: Environmental toxicology and chemistry (2022)
Computational screening for potentially bioactive molecules using advanced molecular modeling approaches including molecular docking and molecular dynamic simulation is mainstream in certain fields like drug discovery. Significant advances in computationally predicting protein structures from sequence information have also expanded the availability of structures for non-model species. Therefore, the objective of this work was to develop an analysis pipeline to harness the power of these bioinformatics approaches for cross-species extrapolation for evaluating chemical safety. The Sequence Alignment to Predict Across Species Susceptibility (SeqAPASS) tool compares protein-sequence similarity across species for conservation of known chemical targets, providing an initial line of evidence for extrapolation of toxicity knowledge. However, with the development of structural models from tools like the Iterative Threading ASSEmbly Refinement (ITASSER), analyses of protein structural conservation can be included to add additional lines of evidence and generate protein models across species. Models generated through such a pipeline could then be used for advanced molecular modeling approaches in the context of species extrapolation. Two case examples illustrating this pipeline from SeqAPASS sequences to ITASSER generated protein structures were created for human liver fatty acid binding protein (LFABP) and androgen receptor (AR). Ninety-nine LFABP and 268 AR protein models representing diverse species, were generated and analyzed for conservation using TM-align. The results from the structural comparisons were in line with the sequence-based SeqAPASS workflow adding further evidence of LFABL and AR conservation across vertebrate species. This analysis lays the foundation for expanding the capabilities of the web-based SeqAPASS tool to include structural comparisons for species extrapolation, facilitating more rapid and efficient toxicological assessments among species with limited or no existing toxicity data. This article is protected by copyright. All rights reserved. Environ Toxicol Chem 2022;00:0-0. © 2022 SETAC.