5'-end SAGE for the analysis of transcriptional start sites

Nat Biotechnol. 2004 Sep;22(9):1146-9. doi: 10.1038/nbt998. Epub 2004 Aug 8.

Abstract

Identification of the mRNA start site is essential in establishing the full-length cDNA sequence of a gene and analyzing its promoter region, which regulates gene expression. Here we describe the development of a 5'-end serial analysis of gene expression (5' SAGE) that can be used to globally identify transcriptional start sites and the frequency of individual mRNAs. Of the 25,684 5' SAGE tags in the HEK293 human cell library, 19,893 matched to the human genome. Among 15,448 tags in one locus of the genome, 85.8%-96.1% of the 5' SAGE tags were assigned within -500 to +200 nt of mRNA start sites using the RefSeq, UniGene and DBTSS databases. This technique should facilitate 5'-end transcriptome analysis in a variety of cells and tissues.

Publication types

  • Comparative Study
  • Evaluation Study
  • Letter
  • Research Support, Non-U.S. Gov't
  • Validation Study

MeSH terms

  • 5' Flanking Region / genetics*
  • Base Sequence
  • Cell Line
  • Gene Expression Profiling / methods*
  • Humans
  • Kidney / metabolism*
  • Molecular Sequence Data
  • Sequence Alignment / methods*
  • Sequence Analysis, DNA / methods*
  • Transcription Factors / genetics*
  • Transcription Factors / metabolism
  • Transcription Initiation Site*

Substances

  • Transcription Factors