Model checking in multiple imputation: an overview and case study

Emerg Themes Epidemiol. 2017 Aug 23:14:8. doi: 10.1186/s12982-017-0062-6. eCollection 2017.

Abstract

Background: Multiple imputation has become very popular as a general-purpose method for handling missing data. The validity of multiple-imputation-based analyses relies on the use of an appropriate model to impute the missing values. Despite the widespread use of multiple imputation, there are few guidelines available for checking imputation models.

Analysis: In this paper, we provide an overview of currently available methods for checking imputation models. These include graphical checks and numerical summaries, as well as simulation-based methods such as posterior predictive checking. These model checking techniques are illustrated using an analysis affected by missing data from the Longitudinal Study of Australian Children.

Conclusions: As multiple imputation becomes further established as a standard approach for handling missing data, it will become increasingly important that researchers employ appropriate model checking approaches to ensure that reliable results are obtained when using this method.

Keywords: Cross-validation; Diagnostics; Missing data; Model checking; Multiple imputation; Posterior predictive checking.