Machine learning methods for predicting guide RNA effects in CRISPR epigenome editing experiments.
Wancen MuTianyou LuoAlejandro BarreraLexi R BoundsTyler S KlannMaria Ter WeeleJulien BryoisGregory E CrawfordPatrick F SullivanCharles A GersbachMichael I LoveYun LiPublished in: bioRxiv : the preprint server for biology (2024)
CRISPR epigenomic editing technologies enable functional interrogation of non-coding elements. However, current computational methods for guide RNA (gRNA) design do not effectively predict the power potential, molecular and cellular impact to optimize for efficient gRNAs, which are crucial for successful applications of these technologies. We present "launch-dCas9" (machine LeArning based UNified CompreHensive framework for CRISPR-dCas9) to predict gRNA impact from multiple perspectives, including cell fitness, wildtype abundance (gauging power potential), and gene expression in single cells. Our launchdCas9, built and evaluated using experiments involving >1 million gRNAs targeted across the human genome, demonstrates relatively high prediction accuracy (AUC up to 0.81) and generalizes across cell lines. Method-prioritized top gRNA(s) are 4.6-fold more likely to exert effects, compared to other gRNAs in the same cis-regulatory region. Furthermore, launchdCas9 identifies the most critical sequence-related features and functional annotations from >40 features considered. Our results establish launch-dCas9 as a promising approach to design gRNAs for CRISPR epigenomic experiments.
Keyphrases
- crispr cas
- genome editing
- genome wide
- machine learning
- dna methylation
- gene expression
- induced apoptosis
- endothelial cells
- physical activity
- artificial intelligence
- single cell
- transcription factor
- cancer therapy
- cell cycle arrest
- oxidative stress
- signaling pathway
- drug delivery
- cell therapy
- climate change
- induced pluripotent stem cells
- bone marrow
- single molecule
- nucleic acid