Genome-wide identification and predictive modeling of polyadenylation sites in eukaryotes

Brief Bioinform. 2015 Mar;16(2):304-13. doi: 10.1093/bib/bbu011. Epub 2014 Apr 1.

Abstract

Polyadenylation [poly(A)] is a vital step in post-transcriptional processing of pre-mRNA. Alternative polyadenylation is a widespread mechanism of regulating gene expression in eukaryotes. Defining poly(A) sites contributes to the annotation of transcripts' ends and the study of gene regulatory mechanisms. Here, we survey methods for collecting poly(A) sites using high-throughput sequencing technologies and summarize the general processes for genome-wide poly(A) site identifications. We also compare the performances of various poly(A) site prediction models and discuss the relationship between poly(A) site identification from sequencing projects and predictive modeling. Moreover, we attempt to address some potential problems in current researches and propose future directions related to polyadenylation research.

Keywords: alternative polyadenylation; high-throughput technology; poly(A) signal; poly(A) site; predictive modeling.

Publication types

  • Research Support, Non-U.S. Gov't
  • Review

MeSH terms

  • Algorithms
  • Animals
  • Computational Biology
  • Eukaryota / genetics*
  • Genome-Wide Association Study / statistics & numerical data
  • Genomics / statistics & numerical data
  • High-Throughput Nucleotide Sequencing / statistics & numerical data
  • Humans
  • Models, Genetic
  • Polyadenylation*
  • RNA, Messenger / genetics*

Substances

  • RNA, Messenger