Human genetic research in the past decade has generated a wealth of data from the genome-wide association scan era, much of which is catalogued and freely available. These data will typically test the relationship between a single nucleotide variant or polymorphism (SNP) and some outcome, disease, or trait. Ongoing investigations will yield a similar wealth of data regarding epigenetic phenomena. These data will typically test the relationship between DNA methylation at a single genomic location/region and some outcome. Most of these findings will be the result of cross-sectional investigations typically using ascertained cases and controls. Consequently, most methodological consideration focuses on methods appropriate for simple case-control comparisons. It is expected that a growing number of investigators with longitudinal experimental prevention or intervention cohorts will also measure genetic and epigenetic indicators as part of their investigations, harvesting the wealth of information generated by the genome-wide association study (GWAS) era to allow for targeted hypothesis testing in the next generation of prevention and intervention trials. Herein, we discuss appropriate quality control and statistical modelling of genetic, polygenic, and epigenetic measures in longitudinal models. We specifically discuss quality control, population stratification, genotype imputation, pathway approaches, and proper modelling of an interaction between a specific genetic variant and an environment variable (GxE interaction).
Keywords: GWAS; Genetic; Methylation; Polygenic risk; Prevention.