Predicting gene expression levels from codon biases in alpha-proteobacterial genomes

Proc Natl Acad Sci U S A. 2003 Jun 10;100(12):7313-8. doi: 10.1073/pnas.1232298100. Epub 2003 May 29.

Abstract

Predicted highly expressed (PHX) genes in five currently available high G+C complete alpha-proteobacterial genomes are analyzed. These include: the nitrogen-fixing plant symbionts Sinorhizobium meliloti (SINME) and Mesorhizobium loti (MESLO), the nonpathogenic aquatic bacterium Caulobacter crescentus (CAUCR), the plant pathogen Agrobacterium tumefaciens (AGRTU), and the mammalian pathogen Brucella melitensis (BRUME). Three of these genomes, SINME, AGRTU, and BRUME, contain multiple chromosomes or megaplasmids (>1 Mb length). PHX genes in these genomes are concentrated mainly in the major (largest) chromosome with few PHX genes found in the secondary chromosomes and megaplasmids. Tricarboxylic acid cycle and aerobic respiration genes are strongly PHX in all five genomes, whereas anaerobic pathways of glycolysis and fermentation are mostly not PHX. Only in MESLO (but not SINME) and BRUME are most glycolysis genes PHX. Many flagellar genes are PHX in MESLO and CAUCR, but mostly are not PHX in SINME and AGRTU. The nonmotile BRUME also carries many flagellar genes but these are generally not PHX and all but one are located in the second chromosome. CAUCR stands out among available prokaryotic genomes with 25 PHX TonB-dependent receptors. These are putatively involved in uptake of iron ions and other nonsoluble compounds.

Publication types

  • Comparative Study
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Agrobacterium tumefaciens / genetics
  • Agrobacterium tumefaciens / metabolism
  • Alphaproteobacteria / genetics*
  • Alphaproteobacteria / metabolism
  • Base Composition
  • Brucella melitensis / genetics
  • Brucella melitensis / metabolism
  • Caulobacter crescentus / genetics
  • Caulobacter crescentus / metabolism
  • Citric Acid Cycle / genetics
  • Codon / genetics*
  • DNA, Bacterial / chemistry
  • DNA, Bacterial / genetics
  • Energy Metabolism / genetics
  • Flagella / genetics
  • Gene Expression
  • Genome, Bacterial*
  • Inactivation, Metabolic / genetics
  • Multigene Family
  • Nitrogen Fixation / genetics
  • Sinorhizobium meliloti / genetics
  • Sinorhizobium meliloti / metabolism
  • Species Specificity

Substances

  • Codon
  • DNA, Bacterial