Local sequence assembly reveals a high-resolution profile of somatic structural variations in 97 cancer genomes

Nucleic Acids Res. 2015 Sep 30;43(17):8146-56. doi: 10.1093/nar/gkv831. Epub 2015 Aug 17.

Abstract

Genomic structural variations (SVs) are pervasive in many types of cancers. Characterizing their underlying mechanisms and potential molecular consequences is crucial for understanding the basic biology of tumorigenesis. Here, we engineered a local assembly-based algorithm (laSV) that detects SVs with high accuracy from paired-end high-throughput genomic sequencing data and pinpoints their breakpoints at single base-pair resolution. By applying laSV to 97 tumor-normal paired genomic sequencing datasets across six cancer types produced by The Cancer Genome Atlas Research Network, we discovered that non-allelic homologous recombination is the primary mechanism for generating somatic SVs in acute myeloid leukemia. This finding contrasts with results for the other five types of solid tumors, in which non-homologous end joining and microhomology end joining are the predominant mechanisms. We also found that the genes recursively mutated by single nucleotide alterations differed from the genes recursively mutated by SVs, suggesting that these two types of genetic alterations play different roles during cancer progression. We further characterized how the gene structures of the oncogene JAK1 and the tumor suppressors KDM6A and RB1 are affected by somatic SVs and discussed the potential functional implications of intergenic SVs.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Algorithms*
  • Chromosome Breakpoints*
  • DNA End-Joining Repair
  • Genes, Tumor Suppressor
  • Genome
  • Genomic Structural Variation*
  • Genomics
  • High-Throughput Nucleotide Sequencing / methods*
  • Humans
  • Leukemia, Myeloid, Acute / genetics
  • Neoplasms / genetics*
  • Oncogenes / genetics
  • Proteins / genetics
  • Recombinational DNA Repair
  • Regulatory Elements, Transcriptional
  • Sequence Analysis, DNA / methods*

Substances

  • Proteins