A Novel Sensitivity Maximization at a Given Specificity Method for Binary Classifications

Cancer Prev Res (Phila). 2024 Dec 2. doi: 10.1158/1940-6207.CAPR-24-0236. Online ahead of print.

Abstract

In the cancer early detection field, logistic regression is a frequently used approach to establish a combination rule that differentiates cancer from non-cancer. However, the application of logistic regression relies on a maximum likelihood approach, which may not yield optimal combination rules for maximizing sensitivity at a clinically desirable specificity and vice versa. Here, we have developed an improved regression framework, Sensitivity Maximization At a Given Specificity, SMAGS, for binary classification that finds the linear decision rule yielding the maximum sensitivity for a given specificity or the maximum specificity for a given sensitivity. We additionally expand the framework for feature selection that satisfies sensitivity and specificity maximizations. We compare our SMAGS method with normal logistic regression using two synthetic datasets and reported data for colorectal cancer (CRC) from the 2018 CancerSEEK study. In the CRC CancerSEEK dataset, we report 14% improvement in sensitivity at 98.5% specificity (0.31 vs 0.57; p-value:<0.05). The SMAGS method provides an alternative to logistic regression for modeling combination rules for biomarkers and early detection applications.