Protein Structure Validation Derives a Smart Conformational Search in a Physically Relevant Configurational Subspace.
Takunori YasudaRikuri MoritaYasuteru ShigetaRyuhei HaradaPublished in: Journal of chemical information and modeling (2022)
Since proteins perform biological functions through their dynamic properties, molecular dynamics (MD) simulation is a sophisticated strategy for investigating their functions. Analyses of trajectories provide statistical information about a specific protein as a free-energy landscape (FEL). However, the timescale of normal MD is shorter than that of biological functions, resulting in statistically insufficient conformational sampling, finally leading to unreliable FEL calculation. To search for a broad configurational subspace, an external bias is imposed on a target protein as biased sampling. However, its regulation is challenging because the optimal strength of the perturbation is unknown. Furthermore, a physically irrelevant configurational subspace was searched when imposing an inappropriate external bias. To address this issue, we newly proposed an external biased regulation scheme known as the G-factor external bias limiter (GERBIL). In GERBIL, protein configurations generated by external bias are structurally validated by an indicator (G-factor), enabling the search for a physically relevant subspace. In addition to biased sampling, nonbiased sampling might search for a physically irrelevant configurational subspace because repeating multiple MD simulations from several initial structures tends to search for an overly broad configurational subspace. For this issue, the structural qualities of configurations generated by nonbiased sampling have not been investigated. Therefore, we confirmed whether the G-factor screened the collapsed (low-quality) configurations generated by nonbiased sampling. To address this issue, the outlier flooding method (OFLOOD) was adopted in GERBIL as a nonbiased sampling method, which is referred to as OFLOOD-GERBIL. OFLOOD rapidly expands a configurational subspace by resampling the rarely occurring states of a given protein and tends to search an overly broad subspace. Thus, we considered that GERBIL might improve the excessive conformational search of OFLOOD for a physically irrelevant configurational subspace. As a demonstration, OFLOOD and OFLOOD-GERBIL were applied to a globular protein (T4 lysozyme) and their conformational search qualities were assessed. Based on our assessment, normal OFLOOD without the outlier validation frequently sampled low-quality configurations, whereas OFLOOD-GERBIL with the outlier validation intensively sampled high-quality configurations. In conclusion, OFLOOD-GERBIL derives a smart conformational search in a physically relevant configurational subspace, indicating that protein structure validation works in both nonbiased and biased sampling methods.