Eukaryotic transcription factor binding sites--modeling and integrative search methods

Bioinformatics. 2008 Jun 1;24(11):1325-31. doi: 10.1093/bioinformatics/btn198. Epub 2008 Apr 21.

Abstract

A comprehensive knowledge of transcription factor binding sites (TFBS) is important for a mechanistic understanding of transcriptional regulation as well as for inferring gene regulatory networks. Because the DNA motif recognized by a transcription factor is typically short and degenerate, computational approaches for identifying binding sites based only on the sequence motif inevitably suffer from high error rates. Current state-of-the-art techniques for improving computational identification of binding sites can be broadly categorized into two classes: (1) approaches that aim to improve binding motif models by extracting maximal sequence information from experimentally determined binding sites and (2) approaches that supplement binding motif models with additional genomic or other attributes (such as evolutionary conservation). In this review we will discuss recent attempts to improve computational identification of TFBS through these two types of approaches and conclude with thoughts on future development.

Publication types

  • Review

MeSH terms

  • Algorithms*
  • Base Sequence
  • Binding Sites
  • Computer Simulation
  • DNA / genetics*
  • Models, Genetic*
  • Molecular Sequence Data
  • Protein Binding
  • Sequence Analysis, DNA / methods*
  • Systems Integration
  • Transcription Factors / genetics*

Substances

  • Transcription Factors
  • DNA