Impact of missing genotype data on Monte-Carlo simulation based haplotype analysis

Tim Becker; Michael Knapp

doi:10.1159/000086696

Impact of missing genotype data on Monte-Carlo simulation based haplotype analysis

Hum Hered. 2005;59(4):185-9. doi: 10.1159/000086696. Epub 2005 Jul 7.

Authors

Tim Becker¹, Michael Knapp

Affiliation

¹ Institute for Medical Biometry, Informatics and Epidemiology University of Bonn, Bonn, Germany. becker@imbie.meb.uni-bonn.de

PMID: 16015028
DOI: 10.1159/000086696

Abstract

In the context of haplotype association analysis of unphased genotype data, methods based on Monte-Carlo simulations are often used to compensate for missing or inappropriate asymptotic theory. Moreover, such methods are an indispensable means to deal with multiple testing problems. We want to call attention to a potential trap in this usually useful approach: The simulation approach may lead to strongly inflated type I errors in the presence of different missing rates between cases and controls, depending on the chosen test statistic. Here, we consider four different testing strategies for haplotype analysis of case-control data. We recommend to interpret results for data sets with non-comparable distributions of missing genotypes with special caution, in case the test statistic is based on inferred haplotypes per individual. Moreover, our results are important for the conduction and interpretation of genome-wide association studies.

Publication types

Comparative Study
Research Support, Non-U.S. Gov't

MeSH terms

Case-Control Studies
Computer Simulation
Genotype*
Haplotypes*
Humans
Likelihood Functions
Monte Carlo Method*