Multicenter integrated analysis of noncoding CRISPRi screens.
David YaoJosh TyckoJin Woo OhLexi R BoundsSager J GosaiLazaros LataniotisAva Mackay SmithBenjamin R DoughtyAditi K NarayananHenri SchmidtTania Guerrero-AltamiranoKeith SiklenkaKatherine GuoAlexander D WhiteIngrid YoungworthKalina AndreevaXingjie RenAlejandro BarreraYunhai LuoGalip Gürkan YardımcıRyan TewheyAnshul KundajeWilliam J GreenleafPardis C SabetiChristina S LeslieYuri PritykinJill E MooreMichael A BeerCharles A GersbachTimothy E ReddyYin ShenJesse M EngreitzMichael C BassikSteven K ReillyPublished in: Nature methods (2024)
The ENCODE Consortium's efforts to annotate noncoding cis-regulatory elements (CREs) have advanced our understanding of gene regulatory landscapes. Pooled, noncoding CRISPR screens offer a systematic approach to investigate cis-regulatory mechanisms. The ENCODE4 Functional Characterization Centers conducted 108 screens in human cell lines, comprising >540,000 perturbations across 24.85 megabases of the genome. Using 332 functionally confirmed CRE-gene links in K562 cells, we established guidelines for screening endogenous noncoding elements with CRISPR interference (CRISPRi), including accurate detection of CREs that exhibit variable, often low, transcriptional effects. Benchmarking five screen analysis tools, we find that CASA produces the most conservative CRE calls and is robust to artifacts of low-specificity single guide RNAs. We uncover a subtle DNA strand bias for CRISPRi in transcribed regions with implications for screen design and analysis. Together, we provide an accessible data resource, predesigned single guide RNAs for targeting 3,275,697 ENCODE SCREEN candidate CREs with CRISPRi and screening guidelines to accelerate functional characterization of the noncoding genome.
Keyphrases
- genome wide
- high throughput
- dna methylation
- copy number
- transcription factor
- crispr cas
- endothelial cells
- gene expression
- genome editing
- induced apoptosis
- high resolution
- randomized controlled trial
- clinical trial
- machine learning
- cell proliferation
- study protocol
- big data
- pluripotent stem cells
- heat shock
- genome wide identification