Machine learning-based predictive models for perioperative major adverse cardiovascular events in patients with stable coronary artery disease undergoing noncardiac surgery

Liang Shen; YunPeng Jin; AXiang Pan; Kai Wang; RunZe Ye; YangKai Lin; Safraz Anwar; WeiCong Xia; Min Zhou; XiaoGang Guo

doi:10.1016/j.cmpb.2024.108561

Machine learning-based predictive models for perioperative major adverse cardiovascular events in patients with stable coronary artery disease undergoing noncardiac surgery

Comput Methods Programs Biomed. 2024 Dec 13:260:108561. doi: 10.1016/j.cmpb.2024.108561. Online ahead of print.

Authors

Liang Shen¹, YunPeng Jin², AXiang Pan¹, Kai Wang², RunZe Ye², YangKai Lin², Safraz Anwar², WeiCong Xia², Min Zhou³, XiaoGang Guo⁴

Affiliations

¹ Department of Information Technology, The First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou 310003, China.
² Department of Cardiovascular Medicine, The First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou 310003, China.
³ Department of Information Technology, The First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou 310003, China. Electronic address: minzhou@zju.edu.cn.
⁴ Department of Cardiovascular Medicine, The First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou 310003, China. Electronic address: gxg22222@zju.edu.cn.

PMID: 39708562
DOI: 10.1016/j.cmpb.2024.108561

Abstract

Background and objective: Accurate prediction of perioperative major adverse cardiovascular events (MACEs) is crucial, as it not only aids clinicians in comprehensively assessing patients' surgical risks and tailoring personalized surgical and perioperative management plans, but also for information-based shared decision-making with patients and efficient allocation of medical resources. This study developed and validated a machine learning (ML) model using accessible preoperative clinical data to predict perioperative MACEs in stable coronary artery disease (SCAD) patients undergoing noncardiac surgery (NCS).

Methods: We collected data from 9171 adult SCAD patients who underwent NCS and extracted 64 preoperative variables. First, the optimal data imputation, resampling, and feature selection methods were compared and selected to deal with missing data values and imbalances. Then, nine independent machine learning models (logistic regression (LR), support vector machine, Gaussian Naive Bayes (GNB), random forest, gradient boosting decision tree (GBDT), extreme gradient boosting (XGBoost), light gradient boosting machine, categorical boosting (CatBoost), and deep neural network) and a stacking ensemble model were constructed and compared with the validated Revised Cardiac Risk Index's (RCRI) model for predictive performance, which was evaluated using the area under the receiver operating characteristic curve (AUROC), the area under the precision-recall curve (AUPRC), calibration curve, and decision curve analysis (DCA). To reduce overfitting and enhance robustness, we performed hyperparameter tuning and 5-fold cross-validation. Finally, the Shapley additive interpretation (SHAP) method and a partial dependence plot (PDP) were used to determine the optimal ML model.

Results: Of the 9,171 patients, 514 (5.6 %) developed MACEs. 24 significant preoperative features were selected for model development and evaluation. All ML models performed well, with AUROC above 0.88 and AUPRC above 0.39, outperforming the AUROC (0.716) and AUPRC (0.185) of RCRI (P < 0.001). The best independent model was XGBoost (AUROC = 0.898, AUPRC = 0.479). The calibration curve accurately predicted the risk of MACEs (Brier score = 0.040), and the DCA results showed that XGBoost had a high net benefit for predicting MACEs. The top-ranked stacking ensemble model, consisting of CatBoost, GBDT, GNB, and LR, proved to be the best (AUROC 0.894, AUPRC 0.485). We identified the top 20 most important features using the mean absolute SHAP values and depicted their effects on model predictions using PDP.

Conclusions: This study combined missing-value imputation, feature screening, unbalanced data processing, and advanced machine learning methods to successfully develop and verify the first ML-based perioperative MACEs prediction model for patients with SCAD, which is more accurate than RCRI and enables effective identification of high-risk patients and implementation of targeted interventions to reduce the incidence of MACEs.

Keywords: Feature selection; Imbalance data; Machine learning; Major adverse cardiovascular events; Noncardiac surgery; Prediction model.