Login / Signup

Improved prediction of bacterial CRISPRi guide efficiency from depletion screens through mixed-effect machine learning and data integration.

Yanying YuSandra GawlittLisa Barros de Andrade E SousaErinc MerdivanMarie PiraudChase L BeiselLars Barquist
Published in: Genome biology (2024)
CRISPR interference (CRISPRi) is the leading technique to silence gene expression in bacteria; however, design rules remain poorly defined. We develop a best-in-class prediction algorithm for guide silencing efficiency by systematically investigating factors influencing guide depletion in genome-wide essentiality screens, with the surprising discovery that gene-specific features substantially impact prediction. We develop a mixed-effect random forest regression model that provides better estimates of guide efficiency. We further apply methods from explainable AI to extract interpretable design rules from the model. This study provides a blueprint for predictive models for CRISPR technologies where only indirect measurements of guide activity are available.
Keyphrases
  • genome wide
  • dna methylation
  • machine learning
  • gene expression
  • copy number
  • high throughput
  • genome editing
  • artificial intelligence
  • big data
  • crispr cas
  • deep learning
  • oxidative stress
  • climate change
  • single cell