The (in)famous GWAS P-value threshold revisited and updated for low-frequency variants

Eur J Hum Genet. 2016 Aug;24(8):1202-5. doi: 10.1038/ejhg.2015.269. Epub 2016 Jan 6.

Abstract

Genome-wide association studies (GWAS) have long relied on proposed statistical significance thresholds to be able to differentiate true positives from false positives. Although the genome-wide significance P-value threshold of 5 × 10(-8) has become a standard for common-variant GWAS, it has not been updated to cope with the lower allele frequency spectrum used in many recent array-based GWAS studies and sequencing studies. Using a whole-genome- and -exome-sequencing data set of 2875 individuals of European ancestry from the Genetics of Type 2 Diabetes (GoT2D) project and a whole-exome-sequencing data set of 13 000 individuals from five ancestries from the GoT2D and T2D-GENES (Type 2 Diabetes Genetic Exploration by Next-generation sequencing in multi-Ethnic Samples) projects, we describe guidelines for genome- and exome-wide association P-value thresholds needed to correct for multiple testing, explaining the impact of linkage disequilibrium thresholds for distinguishing independent variants, minor allele frequency and ancestry characteristics. We emphasize the advantage of studying recent genetic isolate populations when performing rare and low-frequency genetic association analyses, as the multiple testing burden is diminished due to higher genetic homogeneity.

MeSH terms

  • Alleles
  • Diabetes Mellitus, Type 2 / genetics
  • Exome
  • Genetic Heterogeneity
  • Genome-Wide Association Study / methods
  • Genome-Wide Association Study / standards*
  • Humans
  • Linkage Disequilibrium*
  • Mutation Rate*