Identifying non-crystallographic symmetry in protein electron-density maps: a feature-based approach

Acta Crystallogr D Biol Crystallogr. 2006 Sep;62(Pt 9):1012-21. doi: 10.1107/S0907444906023158. Epub 2006 Aug 19.

Abstract

Non-crystallographic symmetry (NCS) averaging is a well known method for improving the quality of an electron-density map and thus aiding structure determination. Prior methods of NCS-operator determination based on estimated heavy-atom positions are prone to errors arising from inaccuracies in these coordinates or differences in the relative orientations of domains between molecules. In this paper, two real-space methods to determine NCS relationships from initial electron-density maps are presented. A brute-force method identifies matching regions in a map by local density correlation. A feature-based algorithm uses rotation-invariant features to reduce the computational time taken by the brute-force algorithm by filtering out regions that are likely to have dissimilar density patterns. This makes the feature-based algorithm faster and as accurate as the brute-force approach. Neither method requires the positions of heavy atoms or any information regarding the protein sequence. Both methods have been tested on a diverse range of experimentally phased maps and the correct NCS relationships were accurately identified for almost all of the test cases. The NCS operators obtained by the feature-based algorithm were used to perform NCS averaging and an improvement in map correlation was observed for some cases.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Computational Biology / methods*
  • Crystallography, X-Ray
  • Electrons
  • Models, Molecular
  • Models, Statistical
  • Molecular Conformation
  • Mycobacterium tuberculosis / enzymology
  • Pattern Recognition, Automated
  • Phosphoglycerate Dehydrogenase / chemistry
  • Protein Conformation
  • Protein Structure, Secondary
  • Proteins / chemistry
  • Reproducibility of Results

Substances

  • Proteins
  • Phosphoglycerate Dehydrogenase