Identification of active transcriptional regulatory elements from GRO-seq data.
Charles G DankoStephanie L HylandLeighton J CoreAndre L MartinsColin T WatersHyung Won LeeVivian G CheungW Lee KrausJohn T LisAdam SiepelPublished in: Nature methods (2015)
Modifications to the global run-on and sequencing (GRO-seq) protocol that enrich for 5'-capped RNAs can be used to reveal active transcriptional regulatory elements (TREs) with high accuracy. Here, we introduce discriminative regulatory-element detection from GRO-seq (dREG), a sensitive machine learning method that uses support vector regression to identify active TREs from GRO-seq data without requiring cap-based enrichment (https://github.com/Danko-Lab/dREG/). This approach allows TREs to be assayed together with gene expression levels and other transcriptional features in a single experiment. Predicted TREs are more enriched for several marks of transcriptional activation—including expression quantitative trait loci, disease-associated polymorphisms, acetylated histone 3 lysine 27 (H3K27ac) and transcription factor binding—than those identified by alternative functional assays. Using dREG, we surveyed TREs in eight human cell types and provide new insights into global patterns of TRE function.
Keyphrases
- transcription factor
- single cell
- genome wide
- gene expression
- dna methylation
- rna seq
- dna binding
- machine learning
- high throughput
- big data
- electronic health record
- poor prognosis
- randomized controlled trial
- endothelial cells
- genome wide identification
- stem cells
- long non coding rna
- data analysis
- high resolution
- mass spectrometry
- sensitive detection