Detection of native-like models for amino acid sequences of unknown three-dimensional structure in a data base of known protein conformations

Proteins. 1992 Jul;13(3):258-71. doi: 10.1002/prot.340130308.

Abstract

We present an approach which can be used to identify native-like folds in a data base of protein conformations in the absence of any sequence homology to proteins in the data base. The method is based on a knowledge-based force field derived from a set of known protein conformations. A given sequence is mounted on all conformations in the data base and the associated energies are calculated. Using several conformations and sequences from the globin family we show that the native conformation is identified correctly. In fact the resolution of the force field is high enough to discriminate between a native fold and several closely related conformations. We then apply the procedure to several globins of known sequence but unknown three dimensional structure. The homology of these sequences to globins of known structures in the data base ranges from 49 to 17%. With one exception we find that for all globin sequences one of the known globin folds is identified as the most favorable conformation. These results are obtained using a force field derived from a data base devoid of globins of known structure. We briefly discuss useful applications in protein structural research and future development of our approach.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Amino Acid Sequence*
  • Databases, Factual
  • Globins / chemistry
  • Hemoglobins / chemistry
  • Models, Molecular
  • Molecular Sequence Data
  • Protein Conformation
  • Sequence Alignment

Substances

  • Hemoglobins
  • Globins