Prognostic models based on literature and individual patient data in logistic regression analysis

Stat Med. 2000 Jan 30;19(2):141-60. doi: 10.1002/(sici)1097-0258(20000130)19:2<141::aid-sim334>3.0.co;2-o.

Abstract

Prognostic models can be developed with multiple regression analysis of a data set containing individual patient data. Often this data set is relatively small, while previously published studies present results for larger numbers of patients. We describe a method to combine univariable regression results from the medical literature with univariable and multivariable results from the data set containing individual patient data. This 'adaptation method' exploits the generally strong correlation between univariable and multivariable regression coefficients. The method is illustrated with several logistic regression models to predict 30-day mortality in patients with acute myocardial infarction. The regression coefficients showed considerably less variability when estimated with the adaptation method, compared to standard maximum likelihood estimates. Also, model performance, as distinguished in calibration and discrimination, improved clearly when compared to models including shrunk or penalized estimates. We conclude that prognostic models may benefit substantially from explicit incorporation of literature data.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Age Factors
  • Aged
  • Diabetes Complications
  • Female
  • Humans
  • Hypercholesterolemia / complications
  • Hypertension / complications
  • Likelihood Functions
  • Logistic Models*
  • Male
  • Middle Aged
  • Myocardial Infarction / mortality*
  • Prognosis*
  • Risk Factors
  • Sex Factors
  • Smoking / adverse effects
  • Time Factors