Modeling transcriptional regulation of model species with deep learning.
Evan M CoferJoão RaimundoAlicja TadychYuji YamazakiAaron K WongChandra L TheesfeldMichael S LevineOlga G TroyanskayaPublished in: Genome research (2021)
To enable large-scale analyses of transcription regulation in model species, we developed DeepArk, a set of deep learning models of the cis-regulatory activities for four widely studied species: Caenorhabditis elegans, Danio rerio, Drosophila melanogaster, and Mus musculus DeepArk accurately predicts the presence of thousands of different context-specific regulatory features, including chromatin states, histone marks, and transcription factors. In vivo studies show that DeepArk can predict the regulatory impact of any genomic variant (including rare or not previously observed) and enables the regulatory annotation of understudied model species.