Unexpected features of the dark proteome

Proc Natl Acad Sci U S A. 2015 Dec 29;112(52):15898-903. doi: 10.1073/pnas.1508380112. Epub 2015 Nov 17.

Abstract

We surveyed the "dark" proteome-that is, regions of proteins never observed by experimental structure determination and inaccessible to homology modeling. For 546,000 Swiss-Prot proteins, we found that 44-54% of the proteome in eukaryotes and viruses was dark, compared with only ∼14% in archaea and bacteria. Surprisingly, most of the dark proteome could not be accounted for by conventional explanations, such as intrinsic disorder or transmembrane regions. Nearly half of the dark proteome comprised dark proteins, in which the entire sequence lacked similarity to any known structure. Dark proteins fulfill a wide variety of functions, but a subset showed distinct and largely unexpected features, such as association with secretion, specific tissues, the endoplasmic reticulum, disulfide bonding, and proteolytic cleavage. Dark proteins also had short sequence length, low evolutionary reuse, and few known interactions with other proteins. These results suggest new research directions in structural and computational biology.

Keywords: protein disorder; secreted proteins; structure prediction; transmembrane proteins; unknown unknowns.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Animals
  • Archaea / genetics
  • Archaea / metabolism
  • Bacteria / genetics
  • Bacteria / metabolism
  • Computational Biology / methods*
  • Databases, Protein*
  • Eukaryota / metabolism
  • Humans
  • Models, Molecular
  • Protein Conformation
  • Proteins / chemistry
  • Proteins / genetics
  • Proteins / metabolism*
  • Proteome / chemistry
  • Proteome / genetics
  • Proteome / metabolism*
  • Viruses / genetics
  • Viruses / metabolism

Substances

  • Proteins
  • Proteome