Architecture, function and prediction of long signal peptides

Brief Bioinform. 2009 Sep;10(5):569-78. doi: 10.1093/bib/bbp030. Epub 2009 Jun 17.

Abstract

Protein targeting in eukaryotic cells is vital for cell survival and development. N-terminal signal peptides guide proteins to the membrane of the endoplasmic reticulum (ER) and initiate translocation into the ER lumen. Here, we review the status of signal peptide architecture and prediction with an emphasis on exceptionally long signal peptides, which often escape the notion of the currently available prediction methods. We benchmark publicly available prediction methods for their ability to correctly identify exceptionally long signal peptides. A set of 136 annotated eukaryotic signals served as reference data. The best prediction tool detected only 63%. A potential reason for the poor performance is the domain architecture of long signal peptides, whose structural peculiarities are insufficiently considered by current prediction algorithms. To overcome this limitation, we motivate a general domain view of long signal peptides, which becomes detectable when both the overall length and secondary structure of long signal peptides are taken into consideration. This concept provides a structural framework for identifying and understanding multiple targeting and post-targeting functions.

Publication types

  • Research Support, Non-U.S. Gov't
  • Review

MeSH terms

  • Amino Acid Sequence*
  • Animals
  • Computational Biology
  • Endoplasmic Reticulum / metabolism
  • Humans
  • Molecular Sequence Data
  • Protein Sorting Signals / genetics*
  • Protein Structure, Secondary*
  • Sequence Analysis, Protein / methods*

Substances

  • Protein Sorting Signals