Modeling the Dependence Structure in Genome Wide Association Studies of Binary Phenotypes in Family Data

Behav Genet. 2020 Nov;50(6):423-439. doi: 10.1007/s10519-020-10010-2. Epub 2020 Aug 17.

Abstract

Genome-wide association studies (GWASs) are a popular tool for detecting associations between genetic variants, such as single nucleotide polymorphisms (SNPs), and complex traits. Family data introduce complexity because family members are not independent. Methods for non-independent data are well established, but when a GWAS contains distinct family types, explicitly modeling between-family-type differences in the dependence structure comes at the cost of a substantially increased computational burden. The situation is exacerbated with binary traits. In this paper, we perform several simulation studies to compare candidate methods for single-SNP association analysis with binary traits. We consider generalized estimating equations (GEEs), generalized linear mixed models (GLMMs), and generalized least squares (GLS) approaches. We study how different working correlation structures for GEE influence the GWAS findings, and how the different analysis methods perform when conducting a GWAS with binary trait data in families. We discuss the merits of each approach with attention to its applicability in a GWAS. We also compare the performance of the methods on the alcoholism data from the Minnesota Center for Twin and Family Research (MCTFR) study.
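As a rough illustration of one of the approaches named above, the following sketch fits a single-SNP logistic GEE with an independence working correlation and a family-clustered sandwich (robust) variance. It is not the authors' code or simulation design. The function names, the simulated data, and all parameter values (family size, allele frequency, effect size) are assumptions made only for this example. For simplicity, genotypes are drawn independently within families, which real relatives would not satisfy.

```python
import math
import random


def simulate_families(n_families=500, family_size=4, beta_snp=0.6, seed=1):
    """Simulate (family_id, genotype, trait) rows; all settings are hypothetical."""
    rng = random.Random(seed)
    rows = []
    for fam in range(n_families):
        family_effect = rng.gauss(0.0, 1.0)  # shared effect induces within-family dependence
        for _ in range(family_size):
            g = sum(rng.random() < 0.3 for _ in range(2))  # additive genotype 0/1/2
            eta = -1.0 + beta_snp * g + family_effect
            p = 1.0 / (1.0 + math.exp(-eta))
            rows.append((fam, g, 1 if rng.random() < p else 0))
    return rows


def inv2(a):
    """Invert a 2x2 matrix given as nested lists."""
    det = a[0][0] * a[1][1] - a[0][1] * a[1][0]
    return [[a[1][1] / det, -a[0][1] / det], [-a[1][0] / det, a[0][0] / det]]


def matmul2(a, b):
    """Multiply two 2x2 matrices."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)] for i in range(2)]


def score_and_info(rows, beta):
    """Per-family score vectors and total information for logit(mu) = b0 + b1 * g."""
    fam_scores = {}
    info = [[0.0, 0.0], [0.0, 0.0]]
    for fam, g, y in rows:
        x = (1.0, float(g))
        mu = 1.0 / (1.0 + math.exp(-(beta[0] + beta[1] * g)))
        s = fam_scores.setdefault(fam, [0.0, 0.0])
        for i in range(2):
            s[i] += x[i] * (y - mu)
            for j in range(2):
                info[i][j] += mu * (1.0 - mu) * x[i] * x[j]
    return fam_scores, info


def fit_gee_independence(rows, n_iter=25):
    """Fisher scoring for the mean model, then naive and cluster-robust SEs for the SNP."""
    beta = [0.0, 0.0]
    for _ in range(n_iter):
        fam_scores, info = score_and_info(rows, beta)
        total = [sum(s[i] for s in fam_scores.values()) for i in range(2)]
        bread = inv2(info)
        beta = [beta[i] + sum(bread[i][j] * total[j] for j in range(2)) for i in range(2)]
    fam_scores, info = score_and_info(rows, beta)
    bread = inv2(info)
    # "Meat" of the sandwich: sum over families of outer products of family scores.
    meat = [[sum(s[i] * s[j] for s in fam_scores.values()) for j in range(2)] for i in range(2)]
    robust = matmul2(matmul2(bread, meat), bread)
    return beta, math.sqrt(bread[1][1]), math.sqrt(robust[1][1])


def wald_test(estimate, se):
    """Two-sided Wald z statistic and normal p-value."""
    z = estimate / se
    return z, math.erfc(abs(z) / math.sqrt(2.0))


if __name__ == "__main__":
    data = simulate_families()
    beta, se_naive, se_robust = fit_gee_independence(data)
    z, p = wald_test(beta[1], se_robust)
    print("SNP log-odds ratio:", beta[1])
    print("naive SE:", se_naive, "robust SE:", se_robust)
    print("Wald z:", z, "p-value:", p)
```

An independence working correlation is the cheapest choice per SNP because the point estimate coincides with ordinary logistic regression and only the variance is adjusted for clustering. Other working structures, such as exchangeable or family-type-specific ones, would require estimating correlation parameters as well and are not shown here.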

Keywords: Family data; Generalized estimating equation; Generalized least squares; Generalized linear mixed effect model; Genome-wide scan; Population-based association analysis.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Computational Biology / methods*
  • Computer Simulation
  • Data Analysis
  • Family
  • Genome-Wide Association Study / statistics & numerical data*
  • Humans
  • Least-Squares Analysis
  • Linear Models
  • Models, Genetic
  • Models, Statistical
  • Multifactorial Inheritance / genetics*
  • Polymorphism, Single Nucleotide / genetics
  • Quantitative Trait Loci / genetics