A hybrid machine learning/deep learning COVID-19 severity predictive model from CT images and clinical data

Sci Rep. 2022 Mar 14;12(1):4329. doi: 10.1038/s41598-022-07890-1.

Abstract

COVID-19 clinical presentation and prognosis are highly variable, ranging from asymptomatic and paucisymptomatic cases to acute respiratory distress syndrome and multi-organ involvement. We developed a hybrid machine learning/deep learning model to classify patients in two outcome categories, non-ICU and ICU (intensive care admission or death), using 558 patients admitted in a northern Italy hospital in February/May of 2020. A fully 3D patient-level CNN classifier on baseline CT images is used as feature extractor. Features extracted, alongside with laboratory and clinical data, are fed for selection in a Boruta algorithm with SHAP game theoretical values. A classifier is built on the reduced feature space using CatBoost gradient boosting algorithm and reaching a probabilistic AUC of 0.949 on holdout test set. The model aims to provide clinical decision support to medical doctors, with the probability score of belonging to an outcome class and with case-based SHAP interpretation of features importance.

MeSH terms

  • Algorithms
  • COVID-19* / diagnostic imaging
  • Deep Learning*
  • Humans
  • Machine Learning
  • Tomography, X-Ray Computed / methods