Missing covariates in longitudinal data with informative dropouts: bias analysis and inference

Biometrics. 2005 Sep;61(3):837-46. doi: 10.1111/j.1541-0420.2005.00340.x.

Abstract

We consider estimation in generalized linear mixed models (GLMM) for longitudinal data with informative dropouts. At the time a unit drops out, time-varying covariates are often unobserved in addition to the missing outcome. However, existing informative dropout models typically require covariates to be completely observed. This assumption is not realistic in the presence of time-varying covariates. In this article, we first study the asymptotic bias that would result from applying existing methods, where missing time-varying covariates are handled using naive approaches, which include: (1) using only baseline values; (2) carrying forward the last observation; and (3) assuming the missing data are ignorable. Our asymptotic bias analysis shows that these naive approaches yield inconsistent estimators of model parameters. We next propose a selection/transition model that allows covariates to be missing in addition to the outcome variable at the time of dropout. The EM algorithm is used for inference in the proposed model. Data from a longitudinal study of human immunodeficiency virus (HIV)-infected women are used to illustrate the methodology.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Algorithms
  • Bias*
  • CD4 Lymphocyte Count
  • Computer Simulation
  • Female
  • HIV Infections / drug therapy
  • HIV Protease Inhibitors / therapeutic use
  • Hospitalization
  • Humans
  • Linear Models*
  • Longitudinal Studies*
  • Patient Dropouts*

Substances

  • HIV Protease Inhibitors