Maximizing Kinetic Information Gain of Markov State Models for Optimal Design of Spectroscopy Experiments.
Shriyaa MittalDiwakar ShuklaPublished in: The journal of physical chemistry. B (2018)
Spectroscopic techniques such as Trp-Tyr quenching, luminescence resonance energy transfer, and triplet-triplet energy transfer are widely used for understanding the dynamic behavior of proteins. These experiments measure the relaxation of a particular labeled set of residue pairs, and the choice of residue pairs requires careful thought. As a result, experimentalists must pick residue pairs from a large pool of possibilities. In the current work, we show that molecular simulation datasets of protein dynamics can be used to systematically select an optimal set of residue positions to place probes for conducting spectroscopic experiments. The method described in this work, called Optimal Probes, can be used to rank trial sets of residue pairs in terms of their ability to capture the conformational dynamics of the protein. Optimal probes ensures two conditions: residue pairs capture the slow dynamics of the protein and their dynamics is not correlated for maximum information gain to score each trial set. Eventually, the highest scored set can be used for biophysical experiments to study the kinetics of the protein. The scoring methodology is based on kinetic network models of protein dynamics and a variational principle for molecular kinetics to optimize the hyperparameters used for the model. We also discuss that the scoring strategy used by Optimal Probes is the best possible way to ensure the ideal choice of residue pairs for experiments. We predict the best experimental probe positions for proteins λ-repressor, β2-adrenergic receptor, and villin headpiece domain. These proteins have been well-studied and allow for a rigorous comparison of Optimal Probes predictions with already available experiments. Additionally, we also illustrate that our method can be used to predict the best choice for experiments by including any previous experiment choices available from other studies on the same protein. We consistently find that the best choice cannot be based on intuition or structural information such as distance difference between few known stable structures of the protein. Therefore, we show that incorporating protein dynamics could be used to maximize the information gain from experiments.