Two-phase designs for joint quantitative-trait-dependent and genotype-dependent sampling in post-GWAS regional sequencing

Genet Epidemiol. 2018 Feb;42(1):104-116. doi: 10.1002/gepi.22099. Epub 2017 Dec 14.

Abstract

We evaluate two-phase designs to follow-up findings from genome-wide association study (GWAS) when the cost of regional sequencing in the entire cohort is prohibitive. We develop novel expectation-maximization-based inference under a semiparametric maximum likelihood formulation tailored for post-GWAS inference. A GWAS-SNP (where SNP is single nucleotide polymorphism) serves as a surrogate covariate in inferring association between a sequence variant and a normally distributed quantitative trait (QT). We assess test validity and quantify efficiency and power of joint QT-SNP-dependent sampling and analysis under alternative sample allocations by simulations. Joint allocation balanced on SNP genotype and extreme-QT strata yields significant power improvements compared to marginal QT- or SNP-based allocations. We illustrate the proposed method and evaluate the sensitivity of sample allocation to sampling variation using data from a sequencing study of systolic blood pressure.

Keywords: Genetic Analysis Workshop 19; fine-mapping; genetic association studies; joint outcome covariate dependent sampling; outcome-/covariate-dependent sampling.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Blood Pressure / genetics
  • Genome-Wide Association Study*
  • Genotype*
  • Humans
  • Likelihood Functions*
  • Models, Genetic
  • Phenotype
  • Polymorphism, Single Nucleotide / genetics
  • Quantitative Trait, Heritable*
  • Sequence Analysis, DNA*