Cell-type specific and combinatorial usage of diverse transcription factors revealed by genome-wide binding studies in multiple human cells

Genome Res. 2012 Jan;22(1):9-24. doi: 10.1101/gr.127597.111. Epub 2011 Nov 16.

Abstract

Cell-type diversity is governed in part by differential gene expression programs mediated by transcription factor (TF) binding. However, there are few systematic studies of the genomic binding of different types of TFs across a wide range of human cell types, especially in relation to gene expression. In the ENCODE Project, we have identified the genomic binding locations across 11 different human cell types of CTCF, RNA Pol II (RNAPII), and MYC, three TFs with diverse roles. Our data and analysis revealed how these factors bind in relation to genomic features and shape gene expression and cell-type specificity. CTCF bound predominantly in intergenic regions while RNAPII and MYC preferentially bound to core promoter regions. CTCF sites were relatively invariant across diverse cell types, while MYC showed the greatest cell-type specificity. MYC and RNAPII co-localized at many of their binding sites and putative target genes. Cell-type specific binding sites, in particular for MYC and RNAPII, were associated with cell-type specific functions. Patterns of binding in relation to gene features were generally conserved across different cell types. RNAPII occupancy was higher over exons than adjacent introns, likely reflecting a link between transcriptional elongation and splicing. TF binding was positively correlated with the expression levels of their putative target genes, but combinatorial binding, in particular of MYC and RNAPII, was even more strongly associated with higher gene expression. These data illuminate how combinatorial binding of transcription factors in diverse cell types is associated with gene expression and cell-type specific biology.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Gene Expression Regulation / physiology*
  • Genome, Human / physiology*
  • Genome-Wide Association Study / methods
  • HeLa Cells
  • Hep G2 Cells
  • Humans
  • K562 Cells
  • Organ Specificity / physiology
  • RNA Polymerase II / metabolism*
  • RNA Splicing / physiology
  • Response Elements / physiology*
  • Transcription Factors / metabolism*
  • Transcription, Genetic / physiology*

Substances

  • Transcription Factors
  • RNA Polymerase II

Associated data

  • GEO/GSE32883