Evolution of mouse circadian enhancers from transposable elements

Genome Biol. 2021 Jun 29;22(1):193. doi: 10.1186/s13059-021-02409-9.

Abstract

Background: Transposable elements are increasingly recognized as a source of cis-regulatory variation. Previous studies have revealed that transposons are often bound by transcription factors and some have been co-opted into functional enhancers regulating host gene expression. However, the process by which transposons mature into complex regulatory elements, like enhancers, remains poorly understood. To investigate this process, we examined the contribution of transposons to the cis-regulatory network controlling circadian gene expression in the mouse liver, a well-characterized network serving an important physiological function.

Results: ChIP-seq analyses reveal that transposons and other repeats contribute ~ 14% of the binding sites for core circadian regulators (CRs) including BMAL1, CLOCK, PER1/2, and CRY1/2, in the mouse liver. RSINE1, an abundant murine-specific SINE, is the only transposon family enriched for CR binding sites across all datasets. Sequence analyses and reporter assays reveal that the circadian regulatory activity of RSINE1 stems from the presence of imperfect CR binding motifs in the ancestral RSINE1 sequence. These motifs matured into canonical motifs through point mutations after transposition. Furthermore, maturation occurred preferentially within elements inserted in the proximity of ancestral CR binding sites. RSINE1 also acquired motifs that recruit nuclear receptors known to cooperate with CRs to regulate circadian gene expression specifically in the liver.

Conclusions: Our results suggest that the birth of enhancers from transposons is predicated both by the sequence of the transposon and by the cis-regulatory landscape surrounding their genomic integration site.

Keywords: Circadian rhythms; Enhancers; Gene regulation; Regulatory evolution; Transcription; Transposable elements.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • ARNTL Transcription Factors / genetics
  • ARNTL Transcription Factors / metabolism
  • Animals
  • Base Sequence
  • Binding Sites
  • CLOCK Proteins / genetics
  • CLOCK Proteins / metabolism
  • Circadian Clocks / genetics*
  • Cryptochromes / genetics
  • Cryptochromes / metabolism
  • DNA Transposable Elements*
  • Enhancer Elements, Genetic*
  • Evolution, Molecular*
  • Gene Expression Regulation
  • Gene Regulatory Networks*
  • Liver / metabolism
  • Mice
  • Period Circadian Proteins / genetics
  • Period Circadian Proteins / metabolism
  • Point Mutation
  • Protein Binding
  • Sequence Alignment
  • Sequence Homology, Nucleic Acid
  • Short Interspersed Nucleotide Elements
  • Transcription Factors / genetics*
  • Transcription Factors / metabolism

Substances

  • ARNTL Transcription Factors
  • Bmal1 protein, mouse
  • Cry1 protein, mouse
  • Cry2 protein, mouse
  • Cryptochromes
  • DNA Transposable Elements
  • Per1 protein, mouse
  • Per2 protein, mouse
  • Period Circadian Proteins
  • Transcription Factors
  • CLOCK Proteins
  • Clock protein, mouse