A boosting approach for motif modeling using ChIP-chip data

Bioinformatics. 2005 Jun 1;21(11):2636-43. doi: 10.1093/bioinformatics/bti402. Epub 2005 Apr 7.

Abstract

Motivation: Building an accurate binding model for a transcription factor (TF) is essential to differentiate its true binding targets from those spurious ones. This is an important step toward understanding gene regulation.

Results: This paper describes a boosting approach to modeling TF-DNA binding. Different from the widely used weight matrix model, which predicts TF-DNA binding based on a linear combination of position-specific contributions, our approach builds a TF binding classifier by combining a set of weight matrix based classifiers, thus yielding a non-linear binding decision rule. The proposed approach was applied to the ChIP-chip data of Saccharomyces cerevisiae. When compared with the weight matrix method, our new approach showed significant improvements on the specificity in a majority of cases.

Publication types

  • Comparative Study
  • Evaluation Study
  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Algorithms*
  • Amino Acid Motifs
  • Binding Sites
  • DNA / analysis
  • DNA / chemistry*
  • Databases, Nucleic Acid
  • Oligonucleotide Array Sequence Analysis / methods*
  • Protein Binding
  • Saccharomyces cerevisiae Proteins / analysis
  • Saccharomyces cerevisiae Proteins / chemistry*
  • Saccharomyces cerevisiae Proteins / classification
  • Sequence Alignment / methods*
  • Sequence Analysis, DNA / methods*
  • Sequence Homology, Nucleic Acid
  • Transcription Factors / analysis
  • Transcription Factors / chemistry*
  • Transcription Factors / classification

Substances

  • Saccharomyces cerevisiae Proteins
  • Transcription Factors
  • DNA