COVID-19 machine learning model predicts outcomes in older patients from various European countries, between pandemic waves, and in a cohort of Asian, African, and American patients

Behrooz Mamandipoor; Raphael Romano Bruno; Bernhard Wernly; Georg Wolff; Jesper Fjølner; Antonio Artigas; Bernardo Bollen Pinto; Joerg C Schefold; Malte Kelm; Michael Beil; Sviri Sigal; Susannah Leaver; Dylan W De Lange; Bertrand Guidet; Hans Flaatten; Wojciech Szczeklik; Christian Jung; Venet Osmani

doi:10.1371/journal.pdig.0000136

COVID-19 machine learning model predicts outcomes in older patients from various European countries, between pandemic waves, and in a cohort of Asian, African, and American patients

PLOS Digit Health. 2022 Nov 8;1(11):e0000136. doi: 10.1371/journal.pdig.0000136. eCollection 2022 Nov.

Authors

Behrooz Mamandipoor¹, Raphael Romano Bruno², Bernhard Wernly^{3

4}, Georg Wolff², Jesper Fjølner⁵, Antonio Artigas⁶, Bernardo Bollen Pinto⁷, Joerg C Schefold⁸, Malte Kelm², Michael Beil⁹, Sviri Sigal⁹, Susannah Leaver¹⁰, Dylan W De Lange¹¹, Bertrand Guidet^{12

13}, Hans Flaatten¹⁴, Wojciech Szczeklik¹⁵, Christian Jung², Venet Osmani¹

Affiliations

¹ Digital Health Centre, Fondazione Bruno Kessler Research Institute, Trento, Italy.
² Heinrich-Heine-University Duesseldorf, Medical Faculty, Department of Cardiology, Pulmonology and Vascular Medicine, Duesseldorf, Germany.
³ Department of Internal Medicine, General Hospital Oberndorf, Teaching Hospital of the Paracelsus Medical University Salzburg, 5020 Salzburg, Austria.
⁴ Institute of General Practice, Family Medicine and Preventive Medicine, Paracelsus Medical University, Salzburg, Austria.
⁵ Department of Anaesthesia and Intensive Care, Viborg Regional Hospital, Viborg, Denmark.
⁶ Department of Intensive Care Medicine, CIBER Enfermedades Respiratorias, Corporacion Sanitaria Universitaria Parc Tauli, Autonomous University of Barcelona, Sabadell, Spain.
⁷ Department of Acute Medicine, Geneva University Hospitals, Geneva, Switzerland.
⁸ Department of Intensive Care Medicine, Inselspital, Universitätsspital, University of Bern, Bern, Switzerland.
⁹ Dept. of Medical Intensive Care, Hadassah Medical Center and Faculty of Medicine, Hebrew University of Jerusalem, Israel.
¹⁰ General Intensive care, St George's University Hospitals NHS Foundation trust, London, United Kingdom.
¹¹ Department of Intensive Care Medicine, University Medical Center, University Utrecht, the Netherlands.
¹² Sorbonne Universités, UPMC Univ Paris 06, INSERM, UMR_S 1136, Institut Pierre Louis d'Epidémiologie et de Santé Publique, Equipe: épidémiologie hospitalière qualité et organisation des soins, F-75012, Paris, France.
¹³ Assistance Publique-Hôpitaux de Paris, Hôpital Saint-Antoine, service de réanimation médicale, Paris, France.
¹⁴ Department of Clinical Medicine, University of Bergen, Department of Anaesthesia and Intensive Care, Haukeland University Hospital, Bergen, Norway.
¹⁵ Jagiellonian University Medical College, Center for Intensive Care and Perioperative Medicine, Krakow, Poland.

Abstract

Background: COVID-19 remains a complex disease in terms of its trajectory and the diversity of outcomes rendering disease management and clinical resource allocation challenging. Varying symptomatology in older patients as well as limitation of clinical scoring systems have created the need for more objective and consistent methods to aid clinical decision making. In this regard, machine learning methods have been shown to enhance prognostication, while improving consistency. However, current machine learning approaches have been limited by lack of generalisation to diverse patient populations, between patients admitted at different waves and small sample sizes.

Objectives: We sought to investigate whether machine learning models, derived on routinely collected clinical data, can generalise well i) between European countries, ii) between European patients admitted at different COVID-19 waves, and iii) between geographically diverse patients, namely whether a model derived on the European patient cohort can be used to predict outcomes of patients admitted to Asian, African and American ICUs.

Methods: We compare Logistic Regression, Feed Forward Neural Network and XGBoost algorithms to analyse data from 3,933 older patients with a confirmed COVID-19 diagnosis in predicting three outcomes, namely: ICU mortality, 30-day mortality and patients at low risk of deterioration. The patients were admitted to ICUs located in 37 countries, between January 11, 2020, and April 27, 2021.

Results: The XGBoost model derived on the European cohort and externally validated in cohorts of Asian, African, and American patients, achieved AUC of 0.89 (95% CI 0.89-0.89) in predicting ICU mortality, AUC of 0.86 (95% CI 0.86-0.86) for 30-day mortality prediction and AUC of 0.86 (95% CI 0.86-0.86) in predicting low-risk patients. Similar AUC performance was achieved also when predicting outcomes between European countries and between pandemic waves, while the models showed high calibration quality. Furthermore, saliency analysis showed that FiO2 values of up to 40% do not appear to increase the predicted risk of ICU and 30-day mortality, while PaO2 values of 75 mmHg or lower are associated with a sharp increase in the predicted risk of ICU and 30-day mortality. Lastly, increase in SOFA scores also increase the predicted risk, but only up to a value of 8. Beyond these scores the predicted risk remains consistently high.

Conclusion: The models captured both the dynamic course of the disease as well as similarities and differences between the diverse patient cohorts, enabling prediction of disease severity, identification of low-risk patients and potentially supporting effective planning of essential clinical resources.

Trial registration number: NCT04321265.

Copyright: © 2022 Mamandipoor et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Associated data

ClinicalTrials.gov/NCT04321265

Grants and funding

The study was supported by a grant from Fondation Assistance Publique-Hôpitaux de Paris pour la recherche. The Health Region West also supported this study. EOSCsecretariat.eu provided support and has received funding from the European Union's Horizon Programme (grant agreement number: 831644); (H2020-INFRAEOSC-05-2018-2019). This work was supported by the Forschungskommission of the Medical Faculty of the Heinrich-Heine-University Düsseldorf, No. 2018-32 to G.W. and No. 2020-21 to RRB for a Clinician Scientist Track. The funders had no role in this work.