Mistranslational errors associated with the rare arginine codon CGG in Escherichia coli

Protein Expr Purif. 2003 Feb;27(2):365-74. doi: 10.1016/s1046-5928(02)00610-1.

Abstract

In Escherichia coli, CGG is a rare arginine codon occurring at a frequency of 0.54% in all E. coli mRNAs or 9.8% when an arginine residue is encoded for. When present in high numbers or in clusters in highly expressed recombinant mRNA, rare codons can cause expression problems compromising product yield and translational fidelity. The coding region for an N-terminally polyhistidine tagged p27 protease domain from Herpes Simplex Virus 2 (HSV-2) contains 11 of these rare arginine codons, with 3 occurring in tandem near the C-terminus of the protein. When expressed in E. coli, the majority of the recombinant material produced had an apparent molecular mass of 31 kDa by SDS-PAGE gels or 3 kDa higher than predicted. Detailed biochemical analysis was performed on chemical and enzymatic digests of the protein and peptide fragments were characterized by Edman and MS/MS sequencing approaches. Two major species were isolated comprising +1 frameshift events at both the second and third CGG codons in the triplet cluster. Translation proceeded in the missense frame to the next termination codon. In addition, significant levels of glutamine misincorporating for arginine were discovered, suggesting second base misreading of CGG as CAG. Coexpression of the argX gene, which encodes the cognate tRNA for CGG codons, largely eliminated both the frameshift and misincorporation events, and increased expression levels of authentic product by up to 7-fold. We conclude that supplementation of the rare arginyl tRNA(CGG) levels by coexpression of the argX gene can largely alleviate the CGG codon bias present in E. coli, allowing for efficient and accurate translation of heterologous gene products.

MeSH terms

  • Amino Acid Sequence
  • Arginine / chemistry
  • Arginine / genetics*
  • Base Sequence
  • Chromatography, High Pressure Liquid
  • Cloning, Molecular
  • Codon
  • DNA, Complementary / metabolism
  • Electrophoresis, Polyacrylamide Gel
  • Escherichia coli / genetics*
  • Escherichia coli / metabolism
  • Frameshift Mutation
  • Genetic Vectors
  • Glutamine / chemistry
  • Herpesvirus 2, Human / metabolism
  • Mass Spectrometry
  • Molecular Sequence Data
  • Mutation, Missense
  • Protein Biosynthesis*
  • Protein Structure, Tertiary
  • RNA, Messenger / metabolism
  • RNA, Transfer / metabolism
  • Recombinant Proteins / chemistry
  • Time Factors

Substances

  • Codon
  • DNA, Complementary
  • RNA, Messenger
  • Recombinant Proteins
  • Glutamine
  • RNA, Transfer
  • Arginine