The utility of a machine learning model in identifying people at high risk of type 2 diabetes mellitus

Abdullah Alkattan; Abdullah Al-Zeer; Fahad Alsaawi; Alanoud Alyahya; Raghad Alnasser; Raoom Alsarhan; Mona Almusawi; Deemah Alabdulaali; Nagla Mahmoud; Rami Al-Jafar; Faisal Aldayel; Mustafa Hassanein; Alhan Haji; Abdulrahman Alsheikh; Amal Alfaifi; Elfadil Elkagam; Ahmed Alfridi; Amjad Alfaleh; Khaled Alabdulkareem; Nashwa Radwan; Edward W Gregg

doi:10.1080/17446651.2024.2400706

The utility of a machine learning model in identifying people at high risk of type 2 diabetes mellitus

Expert Rev Endocrinol Metab. 2024 Nov;19(6):513-522. doi: 10.1080/17446651.2024.2400706. Epub 2024 Sep 8.

Authors

Abdullah Alkattan^{1

2}, Abdullah Al-Zeer^{3

4}, Fahad Alsaawi⁴, Alanoud Alyahya⁴, Raghad Alnasser⁴, Raoom Alsarhan¹, Mona Almusawi¹, Deemah Alabdulaali⁴, Nagla Mahmoud¹, Rami Al-Jafar^{4

5}, Faisal Aldayel¹, Mustafa Hassanein¹, Alhan Haji¹, Abdulrahman Alsheikh^{4

6}, Amal Alfaifi¹, Elfadil Elkagam¹, Ahmed Alfridi¹, Amjad Alfaleh¹, Khaled Alabdulkareem^{1

6}, Nashwa Radwan^{1

7}, Edward W Gregg⁸

Affiliations

¹ Department of Research, Training and Development, Assisting Deputyship for Primary Health Care, Ministry of Health, Riyadh, Saudi Arabia.
² Department of Biomedical Sciences, College of Veterinary Medicine, King Faisal University, Al-Ahsa, Saudi Arabia.
³ Department of Clinical Pharmacy, College of Pharmacy, King Saud University, Riyadh, Saudi Arabia.
⁴ Data Services Sector, Lean Business Services, Riyadh, Saudi Arabia.
⁵ Department of Epidemiology and Biostatistics, School of Public Health, Imperial College London, London, UK.
⁶ Department of Family Medicine, College of Medicine, Al-Imam Mohammad Bin Saud Islamic University, Riyadh, Saudi Arabia.
⁷ Department of Public Health and Community Medicine, Faculty of Medicine, Tanta University, Tanta, Egypt.
⁸ School of Population Health, RCSI University of Medicine and Health Sciences, Dublin, Ireland.

PMID: 39245968
DOI: 10.1080/17446651.2024.2400706

Abstract

Background: According to previous reports, very high percentages of individuals in Saudi Arabia are undiagnosed for type 2 diabetes mellitus (T2DM). Despite conducting several screening and awareness campaigns, these efforts lacked full accessibility and consumed extensive human and material resources. Thus, developing machine learning (ML) models could enhance the population-based screening process. The study aims to compare a newly developed ML model's outcomes with the validated American Diabetes Association's (ADA) risk assessment regarding predicting people with high risk for T2DM.

Research design and methods: Patients' age, gender, and risk factors that were obtained from the National Health Information Center's dataset were used to build and train the ML model. To evaluate the developed ML model, an external validation study was conducted in three primary health care centers. A random sample (N = 3400) was selected from the non-diabetic individuals.

Results: The results showed the plotted data of sensitivity/100-specificity represented in the Receiver Operating Characteristic (ROC) curve with an AROC value of 0.803, 95% CI: 0.779-0.826.

Conclusions: The current study reveals a new ML model proposed for population-level classification that can be an adequate tool for identifying those at high risk of T2DM or who already have T2DM but have not been diagnosed.

Keywords: Machine learning; Saudi Arabia; health informatics; high risk; type-2 diabetes mellitus.

MeSH terms

Adult
Aged
Diabetes Mellitus, Type 2* / diagnosis
Diabetes Mellitus, Type 2* / epidemiology
Female
Humans
Machine Learning*
Male
Mass Screening / methods
Middle Aged
ROC Curve
Risk Assessment / methods
Risk Factors
Saudi Arabia / epidemiology
Sensitivity and Specificity