Ancient and recent adaptive evolution of primate non-homologous end joining genes

PLoS Genet. 2010 Oct 21;6(10):e1001169. doi: 10.1371/journal.pgen.1001169.

Abstract

In human cells, DNA double-strand breaks are repaired primarily by the non-homologous end joining (NHEJ) pathway. Given their critical nature, we expected NHEJ proteins to be evolutionarily conserved, with relatively little sequence change over time. Here, we report that while critical domains of these proteins are conserved as expected, the sequence of NHEJ proteins has also been shaped by recurrent positive selection, leading to rapid sequence evolution in other protein domains. In order to characterize the molecular evolution of the human NHEJ pathway, we generated large simian primate sequence datasets for NHEJ genes. Codon-based models of gene evolution yielded statistical support for the recurrent positive selection of five NHEJ genes during primate evolution: XRCC4, NBS1, Artemis, POLλ, and CtIP. Analysis of human polymorphism data using the composite of multiple signals (CMS) test revealed that XRCC4 has also been subjected to positive selection in modern humans. Crystal structures are available for XRCC4, Nbs1, and Polλ; and residues under positive selection fall exclusively on the surfaces of these proteins. Despite the positive selection of such residues, biochemical experiments with variants of one positively selected site in Nbs1 confirm that functions necessary for DNA repair and checkpoint signaling have been conserved. However, many viruses interact with the proteins of the NHEJ pathway as part of their infectious lifecycle. We propose that an ongoing evolutionary arms race between viruses and NHEJ genes may be driving the surprisingly rapid evolution of these critical genes.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adaptation, Physiological / genetics
  • Amino Acid Sequence
  • Animals
  • Binding Sites / genetics
  • Carrier Proteins / chemistry
  • Carrier Proteins / genetics
  • Carrier Proteins / metabolism
  • Cell Cycle Proteins / chemistry
  • Cell Cycle Proteins / genetics
  • Cell Cycle Proteins / metabolism
  • DNA Breaks, Double-Stranded
  • DNA Polymerase beta / chemistry
  • DNA Polymerase beta / genetics
  • DNA Polymerase beta / metabolism
  • DNA Repair / genetics*
  • DNA-Binding Proteins / chemistry
  • DNA-Binding Proteins / genetics
  • DNA-Binding Proteins / metabolism
  • Endodeoxyribonucleases
  • Endonucleases
  • Evolution, Molecular*
  • Humans
  • Models, Molecular
  • Molecular Sequence Data
  • Nuclear Proteins / chemistry
  • Nuclear Proteins / genetics
  • Nuclear Proteins / metabolism
  • Phylogeny
  • Primates / classification
  • Primates / genetics*
  • Protein Binding
  • Protein Structure, Tertiary
  • Recombination, Genetic / genetics*
  • Selection, Genetic
  • Sequence Homology, Amino Acid
  • Signal Transduction

Substances

  • Carrier Proteins
  • Cell Cycle Proteins
  • DNA-Binding Proteins
  • NBN protein, human
  • Nuclear Proteins
  • XRCC4 protein, human
  • DNA polymerase beta2
  • DNA Polymerase beta
  • DCLRE1C protein, human
  • Endodeoxyribonucleases
  • Endonucleases
  • RBBP8 protein, human