Objectives: Endoscopists' abilities to diagnose early gastric cancers (EGCs) vary, especially between specialists and nonspecialists. We developed an artificial intelligence (AI)-based diagnostic support tool "Tango" to differentiate EGCs and compared its performance with that of endoscopists.
Methods: The diagnostic performance of Tango and of endoscopists (34 specialists, 42 nonspecialists) was compared using still images of 150 neoplastic and 165 non-neoplastic lesions. Neoplastic lesions included EGCs and adenomas. The primary outcome was the noninferiority of Tango to specialists in terms of sensitivity. The secondary outcomes were the noninferiority of Tango to specialists in terms of accuracy and the superiority of Tango over nonspecialists in terms of sensitivity and accuracy. For the primary outcome, the lower limit of the 95% confidence interval (CI) of the difference in sensitivity between Tango and the specialists was calculated, with >-10% defined as noninferiority and >0% defined as superiority. For the secondary outcomes, the comparable differences between Tango and the endoscopists for each performance measure were calculated, with >10% defined as superiority and >0% defined as noninferiority.
Results: Tango achieved superiority over the specialists based on sensitivity (84.7% vs. 65.8%, difference 18.9%, 95% CI 12.3-25.3%) and demonstrated noninferiority based on accuracy (70.8% vs. 67.4%). Tango achieved superiority over the nonspecialists based on sensitivity (84.7% vs. 51.0%) and accuracy (70.8% vs. 58.4%).
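The decision rules stated in the Methods can be checked against the reported figures with a minimal sketch. The function names below are hypothetical and not part of the study. The sketch also assumes that the secondary thresholds are applied directly to the differences in percentage points, which is one reading of the Methods. It only classifies values already reported in the abstract and does not recompute any confidence interval.

```python
def classify_primary(ci_lower_pct: float) -> str:
    """Primary-outcome rule: judge by the lower limit of the 95% CI
    of the sensitivity difference (Tango minus specialists)."""
    if ci_lower_pct > 0.0:
        return "superiority"
    if ci_lower_pct > -10.0:
        return "noninferiority"
    return "not shown"


def classify_secondary(diff_pct: float) -> str:
    """Secondary-outcome rule, assuming the thresholds apply to the
    difference itself (Tango minus endoscopists), in percentage points."""
    if diff_pct > 10.0:
        return "superiority"
    if diff_pct > 0.0:
        return "noninferiority"
    return "not shown"


if __name__ == "__main__":
    # Reported 95% CI for the sensitivity difference vs. specialists: 12.3-25.3%
    print("sensitivity vs. specialists:", classify_primary(12.3))
    # Reported accuracy: 70.8% (Tango) vs. 67.4% (specialists)
    print("accuracy vs. specialists:", classify_secondary(70.8 - 67.4))
    # Reported sensitivity: 84.7% (Tango) vs. 51.0% (nonspecialists)
    print("sensitivity vs. nonspecialists:", classify_secondary(84.7 - 51.0))
    # Reported accuracy: 70.8% (Tango) vs. 58.4% (nonspecialists)
    print("accuracy vs. nonspecialists:", classify_secondary(70.8 - 58.4))
```

Under these assumptions, each classification matches the conclusion reported in the Results.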
Conclusions: The AI-based diagnostic support tool for EGCs demonstrated robust performance and may help reduce misdiagnosis.
Keywords: Helicobacter pylori; artificial intelligence; diagnosis; endoscopy; gastric cancer.
© 2022 Japan Gastroenterological Endoscopy Society.