Background: We aimed to evaluate and compare the performance of multiple myeloma (MM) selection algorithms for use in Veterans Affairs (VA) research.
Methods: Using the VA Corporate Data Warehouse (CDW), the VA Cancer Registry (VACR), and VA pharmacy data, we randomly selected 500 patients from 01/01/1999 to 06/01/2021 who had (1) either one MM diagnostic code OR were listed in the VACR as having MM AND (2) at least one MM treatment code. A team reviewed oncology notes for each veteran to annotate details regarding MM diagnosis and initial treatment within VA. We evaluated inter-annotator agreement and compared the performance of four published algorithms (two developed and validated external to VA data and two used in VA data).
Results: A total of 859 patients were reviewed to obtain 500 patients who were annotated as having MM and initiating MM treatment in VA. Agreement was high among annotators for all variables: MM diagnosis (98.3% agreement, Kappa = 0.93); initial treatment in VA (91.8% agreement; Kappa = 0.77); and initial treatment classification (87.6% agreement; Kappa = 0.86). VA Algorithms were more specific and had higher PPVs than non-VA algorithms for both MM diagnosis and initial treatment in VA. We developed the "VA Recommended Algorithm," which had the highest PPV among all algorithms in identifying patients diagnosed with MM (PPV = 0.98, 95% CI = 0.95-0.99) and in identifying patients who initiated their MM treatment in VA (PPV = 0.93, 95% CI = 0.90-0.96).
Conclusion: Our VA Recommended Algorithm optimizes sensitivity and PPV for cohort selection and treatment classification.
Keywords: algorithm; cohort selection; multiple myeloma; sensitivity; veterans.
© 2022 John Wiley & Sons Ltd. This article has been contributed to by U.S. Government employees and their work is in the public domain in the USA.