Sniper: improved SNP discovery by multiply mapping deep sequenced reads

Genome Biol. 2011 Jun 20;12(6):R55. doi: 10.1186/gb-2011-12-6-r55.

Abstract

SNP (single nucleotide polymorphism) discovery using next-generation sequencing data remains difficult primarily because of redundant genomic regions, such as interspersed repetitive elements and paralogous genes, present in all eukaryotic genomes. To address this problem, we developed Sniper, a novel multi-locus Bayesian probabilistic model and a computationally efficient algorithm that explicitly incorporates sequence reads that map to multiple genomic loci. Our model fully accounts for sequencing error, template bias, and multi-locus SNP combinations, maintaining high sensitivity and specificity under a broad range of conditions. An implementation of Sniper is freely available at http://kim.bio.upenn.edu/software/sniper.shtml.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Chromosome Mapping
  • Computational Biology
  • Genome, Human
  • Genotyping Techniques
  • High-Throughput Nucleotide Sequencing*
  • Humans
  • Models, Statistical
  • Molecular Structure
  • Polymorphism, Single Nucleotide*
  • Repetitive Sequences, Nucleic Acid
  • Reproducibility of Results
  • Sensitivity and Specificity
  • Sequence Analysis, DNA*
  • Software*