In genetic epidemiologic studies, investigators often use generalized linear models to evaluate the relationships between a disease trait and covariates, such as one or more candidate genes or an environmental exposure. Recently, attention has turned to study designs that mandate the inclusion of family members in addition to a proband. Standard models for analysis assume independent observations, which is unlikely to be true for family data, and the usual standard errors for the regression parameter estimates may be too large or too small, depending on the distribution of the covariates within and between families. The consequences of familial correlation on the study efficiency can be measured by a design effect that is equivalent to the relative information in a sample of unrelated individuals compared to a sample of families with the same number of individuals. We examine design effects for studies in association, and illustrate how the design effect is influenced by the intra-familial distribution of covariate values such as would be expected for a candidate gene. Typical design effects for a candidate gene range between 1.1 and 2.4, depending on the size of the family and the amount of unexplained familial correlation. These values correspond to a modest 10% increase in the required sample size up to more than doubling the requirements. Design effect values are useful in study design to compare the efficiency of studies that sample families versus independent individuals and to determine sample size requirements that account for familial correlation.
Copyright 2001 Wiley-Liss, Inc.