Machine learning algorithm approach to complete blood count can be used as early predictor of COVID-19 outcome

J Leukoc Biol. 2024 Oct 21:qiae223. doi: 10.1093/jleuko/qiae223. Online ahead of print.

Abstract

Although the SARS-CoV-2 infection has established risk groups, identifying biomarkers for disease outcomes is still crucial to stratify patient risk and enhance clinical management. Optimal efficacy of COVID-19 antiviral medications relies on early administration within the initial five days of symptoms, assisting high-risk patients in avoiding hospitalization and improving survival chances. The complete blood count can be an efficient and affordable option to find biomarkers that predict the COVID-19 prognosis due to infection-induced alterations in various blood parameters. This study aimed to associate hematological parameters with different COVID-19 clinical forms and utilize them as disease outcome predictors. We performed a complete blood count in blood samples from 297 individuals with COVID-19 from Belo Horizonte, Brazil. Statistical analysis, as well as ROC Curves and machine learning Decision Tree algorithms were used to identify correlations, and their accuracy, between blood parameters and disease severity. In the initial four days of infection, traditional hematological COVID-19 alterations, such as lymphopenia, were not yet apparent. However, the monocyte percentage and granulocyte-to-lymphocyte ratio proved to be reliable predictors for hospitalization, even in cases where patients exhibited mild symptoms that later progressed to hospitalization. Thus, our findings demonstrate that COVID-19 patients with monocyte percentages lower than 7.7% and a granulocyte-to-lymphocyte ratio higher than 8.75 are assigned to the hospitalized group with a precision of 86%. This suggests that these variables can serve as important biomarkers in predicting disease outcomes and could be used to differentiate patients at hospital admission for managing therapeutic interventions, including early antiviral administration. Moreover, they are simple parameters that can be useful in minimally equipped health care units.

Keywords: Complete Blood Count; Machine Learning; Monocytes and GLR.