Sequence, structure and evolutionary analysis of cold shock domain proteins, a member of OB fold family

J Evol Biol. 2018 Dec;31(12):1903-1917. doi: 10.1111/jeb.13382. Epub 2018 Oct 19.

Abstract

The cold shock domain (CSD) belongs to the oligosaccharide/oligonucleotide-binding fold superfamily which is highly conserved from prokaryotes to higher eukaryotes, and appears to function as RNA chaperones. CSD is involved in diverse cellular processes, including adaptation to low temperatures, nutrient stress, cellular growth and developmental processes. Structural Classification of Proteins (SCOP) database broadly classifies OB fold proteins into 18 different superfamilies, including nucleic acid-binding superfamily (NAB). The NAB is further divided into 17 families together with cold shock DNA-binding protein family (CSDB). The CSDB have more than 240 000 sequences in UniProt database consisting of 32 domains including CSD. Among these domains, CSD is the second largest sequence contributor (> 40 398 sequences). Herein, we have systematically analysed the relative abundance and distribution of CSD proteins based on sequences, structures, repeats and gene ontology (GO) molecular functions in all domains of life. Analysis of sequence distribution suggesting that CSDs are largely found in bacteria (83-94%) with single CSD repeat. However, repeat distribution in eukaryota varies from 1 to 5 in combination with other auxiliary domain that makes CSD proteins functionally more diverse compared to the bacterial counterparts. Further, analysis of repeats distributions on evolutionary scale suggest that existence of CSD in multiple repeats is mainly driven through speciation, gene shuffling and gene duplication events.

Keywords: OB fold; cold shock domain; phylogenetic analysis; repeats analysis.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Acclimatization
  • Amino Acid Sequence
  • Animals
  • Binding Sites
  • Biological Evolution*
  • Cold Shock Proteins and Peptides / genetics
  • Cold Shock Proteins and Peptides / metabolism*
  • Databases, Genetic*
  • Models, Molecular
  • Protein Conformation

Substances

  • Cold Shock Proteins and Peptides