SNV-PPILP: refined SNV calling for tumor data using perfect phylogenies and ILP

Bioinformatics. 2015 Apr 1;31(7):1133-5. doi: 10.1093/bioinformatics/btu755. Epub 2014 Nov 13.

Abstract

Motivation: Recent studies sequenced tumor samples from the same progenitor at different development stages and showed that by taking into account the phylogeny of this development, single-nucleotide variant (SNV) calling can be improved. Accurate SNV calls can better reveal early-stage tumors, identify mechanisms of cancer progression or help in drug targeting.

Results: We present SNV-PPILP, a fast and easy to use tool for refining GATK's Unified Genotyper SNV calls, for multiple samples assumed to form a phylogeny. We tested SNV-PPILP on simulated data, with a varying number of samples, SNVs, read coverage and violations of the perfect phylogeny assumption. We always match or improve the accuracy of GATK, with a significant improvement on low read coverage.

Availability and implementation: SNV-PPILP, available at cs.helsinki.fi/gsa/snv-ppilp/, is written in Python and requires the free ILP solver lp_solve.

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Computational Biology
  • Computer Simulation
  • High-Throughput Nucleotide Sequencing / methods*
  • Humans
  • Neoplasms / genetics*
  • Phylogeny*
  • Polymorphism, Single Nucleotide / genetics*
  • Programming, Linear*
  • Sequence Analysis, DNA / methods*
  • Software*