Towards an understanding of protein-DNA recognition

Philos Trans R Soc Lond B Biol Sci. 1996 Apr 29;351(1339):501-9. doi: 10.1098/rstb.1996.0048.

Abstract

Understanding how proteins recognize DNA in a sequence-specific manner is central to our understanding of the regulation of transcription and other cellular processes. In this article we review the principles of DNA recognition that have emerged from the large number of high-resolution crystal structures determined over the last 10 years. The DNA-binding domains of transcription factors exhibit surprisingly diverse protein architectures, yet all achieve a precise complementarity of shape facilitating specific chemical recognition of their particular DNA targets. Although general rules for recognition can be derived, the complex nature of the recognition mechanism precludes a simple recognition code. In particular, it has become evident that the structure and flexibility of DNA and contacts mediated by water molecules contribute to the recognition process. Nevertheless, based on known structures it has proven possible to design proteins with novel recognition specificities. Despite this considerable practical success, the thermodynamic and kinetic properties of protein/DNA recognition remain poorly understood.

Publication types

  • Review

MeSH terms

  • Amino Acid Sequence
  • Base Composition
  • Base Sequence
  • Binding Sites
  • Consensus Sequence
  • DNA / chemistry*
  • DNA / metabolism*
  • DNA-Binding Proteins / chemistry*
  • DNA-Binding Proteins / metabolism*
  • Models, Molecular
  • Nucleic Acid Conformation
  • Protein Conformation
  • Protein Structure, Secondary
  • TATA Box
  • TATA-Box Binding Protein
  • Thermodynamics
  • Transcription Factors / chemistry*
  • Transcription Factors / metabolism*
  • Zinc Fingers

Substances

  • DNA-Binding Proteins
  • TATA-Box Binding Protein
  • Transcription Factors
  • DNA