Web-based tools for mining the NCI databases for anticancer drug discovery

J Chem Inf Comput Sci. 2004 Jan-Feb;44(1):249-57. doi: 10.1021/ci034209i.

Abstract

In this paper, we describe the development of a set of integrated Web-based tools for mining the National Cancer Institute's (NCI) anticancer databases for anticancer drug discovery. For data mining, three different correlation algorithms were implemented, which included the commonly used Pearson's correlation algorithm available from the NCI's COMPARE program, the Spearman's and Kendall's correlation algorithms. In addition, we implemented the p-value test to evaluate the significance of the correlation results. These Web-based data mining tools allow robust analysis of the correlation between the in vitro anticancer activity of the drugs in the NCI anticancer database, the protein levels and mRNA levels of molecular targets (genes) in the NCI 60 human cancer cell lines for identification of potential lead compounds for a specific molecular target and for study of the molecular mechanism action of a drug. Examples were provided to identify PKC ligands using a lead compound and to identify potential ErbB-2 inhibitors using the mRNA levels of ErbB-2 in the NCI 60 tumor cell lines.

MeSH terms

  • Antineoplastic Agents*
  • Internet*
  • National Institutes of Health (U.S.)
  • Protein Kinase C / metabolism
  • United States

Substances

  • Antineoplastic Agents
  • Protein Kinase C