ItpCtrl-AI: End-to-end interpretable and controllable artificial intelligence by modeling radiologists' intentions

Trong-Thang Pham; Jacob Brecheisen; Carol C Wu; Hien Nguyen; Zhigang Deng; Donald Adjeroh; Gianfranco Doretto; Arabinda Choudhary; Ngan Le

doi:10.1016/j.artmed.2024.103054

ItpCtrl-AI: End-to-end interpretable and controllable artificial intelligence by modeling radiologists' intentions

Artif Intell Med. 2024 Dec 12:160:103054. doi: 10.1016/j.artmed.2024.103054. Online ahead of print.

Authors

Trong-Thang Pham¹, Jacob Brecheisen², Carol C Wu³, Hien Nguyen⁴, Zhigang Deng⁵, Donald Adjeroh⁶, Gianfranco Doretto⁷, Arabinda Choudhary⁸, Ngan Le⁹

Affiliations

¹ AICV Lab, Department of EECS, University of Arkansas, AR 72701, USA. Electronic address: tp030@uark.edu.
² AICV Lab, Department of EECS, University of Arkansas, AR 72701, USA. Electronic address: jmbreche@uark.edu.
³ MD Anderson Cancer Center, Houston, TX 77079, USA. Electronic address: ccwu1@mdanderson.org.
⁴ Department of ECE, University of Houston, TX 77204, USA. Electronic address: hvnguy35@central.uh.edu.
⁵ Department of CS, University of Houston, TX 77204, USA. Electronic address: zdeng4@entral.uh.edu.
⁶ Department of CSEE, West Virginia University, WV 26506, USA. Electronic address: donald.adjeroh@mail.wvu.edu.
⁷ Department of CSEE, West Virginia University, WV 26506, USA. Electronic address: gianfranco.doretto@mail.wvu.edu.
⁸ University of Arkansas for Medical Sciences, Little Rock, AR 72705, USA. Electronic address: achoudhary@uams.edu.
⁹ AICV Lab, Department of EECS, University of Arkansas, AR 72701, USA. Electronic address: thile@uark.edu.

PMID: 39689443
DOI: 10.1016/j.artmed.2024.103054

Abstract

Using Deep Learning in computer-aided diagnosis systems has been of great interest due to its impressive performance in the general domain and medical domain. However, a notable challenge is the lack of explainability of many advanced models, which poses risks in critical applications such as diagnosing findings in CXR. To address this problem, we propose ItpCtrl-AI, a novel end-to-end interpretable and controllable framework that mirrors the decision-making process of the radiologist. By emulating the eye gaze patterns of radiologists, our framework initially determines the focal areas and assesses the significance of each pixel within those regions. As a result, the model generates an attention heatmap representing radiologists' attention, which is then used to extract attended visual information to diagnose the findings. By allowing the directional input, our framework is controllable by the user. Furthermore, by displaying the eye gaze heatmap which guides the diagnostic conclusion, the underlying rationale behind the model's decision is revealed, thereby making it interpretable. In addition to developing an interpretable and controllable framework, our work includes the creation of a dataset, named Diagnosed-Gaze++, which aligns medical findings with eye gaze data. Our extensive experimentation validates the effectiveness of our approach in generating accurate attention heatmaps and diagnoses. The experimental results show that our model not only accurately identifies medical findings but also precisely produces the eye gaze attention of radiologists. The dataset, models, and source code will be made publicly available upon acceptance.

Keywords: Computer-aided diagnosis; Gaze intention; Interpretable deep learning; Radiologist’s intention; Radiology; Vision-language model.