Objective: Computational drug re-purposing has received a lot of attention in the past decade. However, methods developed to date focused on established compounds for which information on both, successfully treated patients and chemical and genomic impact, were known. Such information does not always exist for first-in-class drugs under development.
Methods: To identify indications (diseases) for drugs under development we extended and tested several unsupervised computational methods that utilize Electronic Health Record (EHR) data.
Results: We tested the methods on known drugs with multiple indications and show that a variant of matrix factorization leads to the best performance for first-in-line drugs improving upon prior methods that were developed for established drugs. The method also identifies novel predictions for key immunology and oncology drugs. Our results show that the performance of re-purposing methods differ greatly between oncology and inflammation/immunology. We hypothesize that the lower performance in oncology can be explained by the fact that many chemotherapies are not targeted therapies.
Conclusion: Finding new indications for drugs is extremely valuable. Our results explore how to best use EHR data for finding new indications for first in class drugs drug using a phenotypical-similarity driven approach. Our methods can be integrated with others methods using multiple data modalities such as chemical, molecular, genetic data.
Keywords: D000069558; D057286; D058492; Drug repositioning; Electronic health records; Unsupervised machine learning.
Copyright © 2024 The Authors. Published by Elsevier Ltd.. All rights reserved.