Missing data methods for intensive care unit SOFA scores in electronic health records studies: results from a Monte Carlo simulation

J Comp Eff Res. 2022 Jan;11(1):47-56. doi: 10.2217/cer-2021-0079. Epub 2021 Nov 2.

Abstract

Aim: Missing data cause problems through decreasing sample size and the potential for introducing bias. We tested four missing data methods on the Sequential Organ Failure Assessment (SOFA) score, an intensive care research severity adjuster. Methods: Simulation study using 2015-2017 electronic health record data, where the complete dataset was sampled, missing SOFA score elements imposed and performance examined of four missing data methods - complete case analysis, median imputation, zero imputation (recommended by SOFA score creators) and multiple imputation (MI) - on the outcome of in-hospital mortality. Results: MI performed well, whereas other methods introduced varying amounts of bias or decreased sample size. Conclusion: We recommend using MI in analyses where SOFA score component values are missing in administrative data research.

Keywords: Monte Carlo method; administrative data; critical care; health services research; risk adjustment; severity of illness.

Publication types

  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Electronic Health Records*
  • Humans
  • Intensive Care Units
  • Monte Carlo Method
  • Organ Dysfunction Scores*
  • Retrospective Studies