Power analyses for longitudinal study designs with missing data

Stat Med. 2007 Jul 10;26(15):2958-81. doi: 10.1002/sim.2773.

Abstract

Existing methods for power analysis for longitudinal study designs are limited in that they do not adequately address random missing data patterns. Although the pattern of missing data can be assessed during data analysis, it is unknown during the design phase of a study. The random nature of the missing data pattern adds another layer of complexity in addressing missing data for power analysis. In this paper, we model the occurrence of missing data with a two-state, first-order Markov process and integrate the modelling information into the power function to account for random missing data patterns. The Markov model is easily specified to accommodate different anticipated missing data processes. We develop this approach for the two most popular longitudinal models: the generalized estimating equations (GEE) and the linear mixed-effects model under the missing completely at random (MCAR) assumption. For GEE, we also limit our consideration to the working independence correlation model. The proposed methodology is illustrated with numerous examples that are motivated by real study designs.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Adolescent
  • Age Factors
  • Behavior Therapy / methods
  • Clinical Trials as Topic / methods
  • Female
  • HIV Infections / prevention & control
  • Humans
  • Longitudinal Studies*
  • Markov Chains*
  • Models, Statistical*
  • Sleep Wake Disorders