PheW²P2V: a phenome-wide prediction framework with weighted patient representations using electronic health records

Jia Guo; Krzysztof Kiryluk; Shuang Wang

doi:10.1093/jamiaopen/ooae084

JAMIA Open. 2024 Sep 14;7(3):ooae084. doi: 10.1093/jamiaopen/ooae084. eCollection 2024 Oct.

Authors

Jia Guo¹, Krzysztof Kiryluk², Shuang Wang¹

Affiliations

¹ Department of Biostatistics, Columbia University, New York, NY 10032, United States.
² Department of Medicine, Columbia University, New York, NY 10032, United States.

Abstract

Objective: Electronic health records (EHRs) provide opportunities for developing computable predictive tools. Conventional machine learning and deep learning methods have been widely used for this task, typically with one tool designed for one clinical outcome. Here we developed PheW²P2V, a Phenome-Wide prediction framework using Weighted Patient Vectors. PheW²P2V makes tailored predictions for phenotypes across the phenome using numeric representations of patients' past medical records, weighted by their similarity to each individual phenotype.

Materials and methods: PheW²P2V defines clinical disease phenotypes using Phecode mapping based on International Classification of Diseases codes, which reduces redundancy and case-control misclassification in real-life EHR datasets. By upweighting the parts of a patient's medical record that are more relevant to a phenotype of interest when calculating the patient vector, PheW²P2V achieves tailored incidence risk prediction for that phenotype. The calculation of weighted patient vectors is computationally efficient, and the weighting mechanism ensures tailored predictions across the phenome. We evaluated the prediction performance of PheW²P2V and baseline methods in simulation studies and in clinical applications using the MIMIC-III database.
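
The weighting idea can be illustrated with a minimal sketch. The abstract does not specify the weighting function, so the cosine similarity, the softmax normalization, the `temperature` parameter, the function names, and the toy two-dimensional embeddings below are all assumptions made for illustration, not the authors' implementation.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def weighted_patient_vector(record_vectors, phenotype_vector, temperature=1.0):
    """Average a patient's record embeddings, upweighting those closest
    to the target phenotype.

    Assumption: weights are a softmax over cosine similarities. The
    paper's actual weighting scheme may differ.
    """
    sims = [cosine(r, phenotype_vector) for r in record_vectors]
    top = max(sims)  # subtract the max for numerical stability
    exps = [math.exp((s - top) / temperature) for s in sims]
    total = sum(exps)
    weights = [e / total for e in exps]
    dim = len(phenotype_vector)
    vector = [
        sum(w * r[i] for w, r in zip(weights, record_vectors))
        for i in range(dim)
    ]
    return vector, weights


# Hypothetical 2-D concept embeddings for three past medical codes.
embeddings = {
    "code_a": [1.0, 0.0],
    "code_b": [0.0, 1.0],
    "code_c": [0.7, 0.7],
}
phenotype = [1.0, 0.1]  # hypothetical embedding of the target phenotype
history = ["code_a", "code_b", "code_c"]

patient_vec, weights = weighted_patient_vector(
    [embeddings[c] for c in history], phenotype
)
for code, w in zip(history, weights):
    print(f"{code}: weight {w:.3f}")
```

In this toy setup the same patient history yields a different vector for each target phenotype, which is what allows one set of concept embeddings to serve predictions across the phenome.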

Results: Across 942 phenome-wide predictions using the MIMIC-III database, PheW²P2V achieved a median area under the receiver operating characteristic curve (AUC-ROC) of 0.74 (baseline methods: ≤0.72), a median max F₁-score of 0.20 (baseline methods: ≤0.19), and a median area under the precision-recall curve (AUC-PR) of 0.10 (baseline methods: ≤0.10).
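
For readers unfamiliar with the three reported metrics, the following sketch computes them for a single phenotype on toy data. The labels and scores are invented for illustration, and AUC-PR is approximated here by average precision, which is an assumption: the abstract does not state which estimator was used. Note that a random classifier's AUC-PR equals the phenotype's prevalence, so low absolute AUC-PR values are expected for rare phenotypes.

```python
def auc_roc(y_true, scores):
    """Probability that a random case outscores a random control
    (ties count as one half)."""
    pos = [s for y, s in zip(y_true, scores) if y == 1]
    neg = [s for y, s in zip(y_true, scores) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def precision_recall_points(y_true, scores):
    """(precision, recall) at each distinct score threshold, from the
    highest threshold to the lowest."""
    n_pos = sum(y_true)
    points = []
    for t in sorted(set(scores), reverse=True):
        tp = sum(1 for y, s in zip(y_true, scores) if s >= t and y == 1)
        fp = sum(1 for y, s in zip(y_true, scores) if s >= t and y == 0)
        points.append((tp / (tp + fp), tp / n_pos))
    return points


def max_f1(y_true, scores):
    """Largest F1-score attainable over all thresholds."""
    best = 0.0
    for p, r in precision_recall_points(y_true, scores):
        if p + r > 0:
            best = max(best, 2 * p * r / (p + r))
    return best


def average_precision(y_true, scores):
    """Step-wise area under the precision-recall curve."""
    ap, prev_recall = 0.0, 0.0
    for p, r in precision_recall_points(y_true, scores):
        ap += (r - prev_recall) * p
        prev_recall = r
    return ap


# Toy data: 2 cases and 4 controls with hypothetical predicted risks.
y_true = [1, 0, 1, 0, 0, 0]
scores = [0.9, 0.8, 0.7, 0.4, 0.3, 0.1]

roc = auc_roc(y_true, scores)
f1 = max_f1(y_true, scores)
ap = average_precision(y_true, scores)
print(f"AUC-ROC={roc:.3f}  max F1={f1:.3f}  AUC-PR={ap:.3f}")
```

A phenome-wide summary such as the medians reported above would repeat this calculation for each phenotype and take the median of each metric across phenotypes.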

Discussion: PheW²P2V can predict phenotypes efficiently by using medical concept embeddings and upweighting relevant past medical histories. By leveraging both labeled and unlabeled data, PheW²P2V reduces overfitting and improves predictions for rare phenotypes, making it a useful screening tool for early diagnosis of high-risk conditions. Further research is needed to assess the transferability of embeddings across different databases.

Conclusions: PheW²P2V is fast and flexible, and it outperforms several popular baseline methods in predicting many clinical disease phenotypes across the phenome of the MIMIC-III database.

Keywords: electronic health records (EHRs); patient representations; phenome-wide prediction.