An MDL method for finding haplotype blocks and for estimating the strength of haplotype block boundaries

Pac Symp Biocomput. 2003:502-13. doi: 10.1142/9789812776303_0047.

Abstract

We describe a new method for finding haplotype blocks based on the minimum description length (MDL) principle. We give a rigorous definition of the quality of a segmentation of a genomic region into blocks, and describe a dynamic programming algorithm for finding the optimal segmentation with respect to this measure. We also describe a method for finding the probability of a block boundary for each pair of adjacent markers: this gives a tool for evaluating the significance of each block boundary. We have applied the method to the published data of Daly et al. Our results agree relatively well with the published ones, but also show clear differences in the predicted block boundaries and their strengths. We also give results on the block structure in population isolates.
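
The dynamic programming step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the actual MDL cost, so `block_cost` below is a hypothetical stand-in (bits to list the distinct haplotype patterns in a block, plus an entropy code for which pattern each sample carries); `best_segmentation` and the input format (equal-length strings of 0/1 alleles) are likewise assumptions for illustration, not the authors' implementation.

```python
from collections import Counter
from math import log2


def block_cost(haplotypes, i, j):
    """Hypothetical description length (bits) of markers i..j-1 as one block.

    Assumption: this is NOT the cost from the paper, only a simple
    two-part code used to make the recurrence concrete.
    """
    segments = [h[i:j] for h in haplotypes]
    counts = Counter(segments)
    n = len(segments)
    # Model part: spell out each distinct pattern, 1 bit per biallelic marker.
    model_bits = len(counts) * (j - i)
    # Data part: empirical-entropy code for the pattern carried by each sample.
    data_bits = -sum(c * log2(c / n) for c in counts.values())
    return model_bits + data_bits


def best_segmentation(haplotypes):
    """Return (total cost, blocks) minimizing the summed block costs.

    Blocks are half-open marker ranges (start, end). Recurrence:
    best[j] = min over i < j of best[i] + block_cost(i, j).
    """
    m = len(haplotypes[0])
    best = [0.0] + [float("inf")] * m
    back = [0] * (m + 1)
    for j in range(1, m + 1):
        for i in range(j):
            cost = best[i] + block_cost(haplotypes, i, j)
            if cost < best[j]:
                best[j], back[j] = cost, i
    # Trace the chosen block starts back from the last marker.
    blocks = []
    j = m
    while j > 0:
        blocks.append((back[j], j))
        j = back[j]
    blocks.reverse()
    return best[m], blocks
```

Because the total cost is a sum of per-block costs, the optimum over all segmentations of m markers is found with O(m^2) block-cost evaluations rather than by enumerating all 2^(m-1) segmentations. Any additive block score could be substituted for `block_cost` without changing the recurrence.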

MeSH terms

  • Algorithms*
  • Chromosomes, Human, Pair 1 / genetics
  • Computational Biology
  • Databases, Genetic
  • Finland
  • Genetic Markers
  • Genetics, Population
  • Genome, Human
  • Haplotypes*
  • Humans
  • Models, Genetic
  • Polymorphism, Single Nucleotide

Substances

  • Genetic Markers