Dynamic shifts in occupancy by TAL1 are guided by GATA factors and drive large-scale reprogramming of gene expression during hematopoiesis.
Weisheng WuChristapher S MorrisseyCheryl A KellerTejaswini MishraMaxim PimkinGerd A BlobelMitchell J WeissRoss Cameron HardisonPublished in: Genome research (2014)
We used mouse ENCODE data along with complementary data from other laboratories to study the dynamics of occupancy and the role in gene regulation of the transcription factor TAL1, a critical regulator of hematopoiesis, at multiple stages of hematopoietic differentiation. We combined ChIP-seq and RNA-seq data in six mouse cell types representing a progression from multilineage precursors to differentiated erythroblasts and megakaryocytes. We found that sites of occupancy shift dramatically during commitment to the erythroid lineage, vary further during terminal maturation, and are strongly associated with changes in gene expression. In multilineage progenitors, the likely target genes are enriched for hematopoietic growth and functions associated with the mature cells of specific daughter lineages (such as megakaryocytes). In contrast, target genes in erythroblasts are specifically enriched for red cell functions. Furthermore, shifts in TAL1 occupancy during erythroid differentiation are associated with gene repression (dissociation) and induction (co-occupancy with GATA1). Based on both enrichment for transcription factor binding site motifs and co-occupancy determined by ChIP-seq, recruitment by GATA transcription factors appears to be a stronger determinant of TAL1 binding to chromatin than the canonical E-box binding site motif. Studies of additional proteins lead to the model that TAL1 regulates expression after being directed to a distinct subset of genomic binding sites in each cell type via its association with different complexes containing master regulators such as GATA2, ERG, and RUNX1 in multilineage cells and the lineage-specific master regulator GATA1 in erythroblasts.
Keyphrases
- transcription factor
- single cell
- rna seq
- genome wide identification
- gene expression
- high throughput
- genome wide
- induced apoptosis
- dna binding
- electronic health record
- dna methylation
- cell cycle arrest
- big data
- bone marrow
- copy number
- magnetic resonance
- endoplasmic reticulum stress
- stem cells
- poor prognosis
- circulating tumor cells
- genome wide analysis
- cell therapy
- dna damage
- data analysis
- long non coding rna
- oxidative stress
- hematopoietic stem cell
- computed tomography
- contrast enhanced
- mesenchymal stem cells
- bioinformatics analysis
- pi k akt