Analysis of chameleon sequences and their implications in biological processes

Proteins. 2007 May 15;67(3):548-58. doi: 10.1002/prot.21285.

Abstract

Chameleon sequences have been implicated in amyloid related diseases. Here we report an analysis of two types of chameleon sequences, chameleon-HS (Helix vs. Strand) and chameleon-HE (Helix vs. Sheet), based on known structures in Protein Data Bank. Our survey shows that the longest chameleon-HS is eight residues while the longest chameleon-HE is seven residues. We have done a detailed analysis on the local and global environment that might contribute to the unique conformation of a chameleon sequence. We found that the existence of chameleon sequences does not present a problem for secondary structure prediction programs, including the first generation prediction programs, such as Chou-Fasman algorithm, and the third generation prediction programs that utilize evolution information. We have also investigated the possible implication of chameleon sequences in structural conservation and functional diversity of alternatively spliced protein isoforms.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • 3-Hydroxyacyl CoA Dehydrogenases / chemistry
  • Algorithms
  • Amino Acid Sequence
  • Computational Biology / methods
  • Databases, Protein
  • Humans
  • Models, Molecular
  • Molecular Sequence Data
  • Oligopeptides / chemistry*
  • Protein Conformation
  • Protein Structure, Secondary
  • Sequence Homology, Amino Acid
  • Software*

Substances

  • Oligopeptides
  • 3-Hydroxyacyl CoA Dehydrogenases
  • HSD17B10 protein, human