A kinetic model predicts SpCas9 activity, improves off-target classification, and reveals the physical basis of targeting fidelity.
Behrouz Eslami-MossallamMisha KleinConstantijn V D SmagtKoen V D SandenStephen K JonesJohn A HawkinsIlya J FinkelsteinMartin DepkenPublished in: Nature communications (2022)
The S. pyogenes (Sp) Cas9 endonuclease is an important gene-editing tool. SpCas9 is directed to target sites based on complementarity to a complexed single-guide RNA (sgRNA). However, SpCas9-sgRNA also binds and cleaves genomic off-targets with only partial complementarity. To date, we lack the ability to predict cleavage and binding activity quantitatively, and rely on binary classification schemes to identify strong off-targets. We report a quantitative kinetic model that captures the SpCas9-mediated strand-replacement reaction in free-energy terms. The model predicts binding and cleavage activity as a function of time, target, and experimental conditions. Trained and validated on high-throughput bulk-biochemical data, our model predicts the intermediate R-loop state recently observed in single-molecule experiments, as well as the associated conversion rates. Finally, we show that our quantitative activity predictor can be reduced to a binary off-target classifier that outperforms the established state-of-the-art. Our approach is extensible, and can characterize any CRISPR-Cas nuclease - benchmarking natural and future high-fidelity variants against SpCas9; elucidating determinants of CRISPR fidelity; and revealing pathways to increased specificity and efficiency in engineered systems.
Keyphrases
- crispr cas
- single molecule
- genome editing
- high throughput
- dna binding
- machine learning
- deep learning
- mental health
- transcription factor
- dna repair
- gene expression
- genome wide
- ionic liquid
- drug delivery
- atomic force microscopy
- mass spectrometry
- binding protein
- electronic health record
- living cells
- dna damage
- cancer therapy
- data analysis
- dna methylation
- fluorescent probe
- single cell
- nucleic acid
- high speed