PathSys: integrating molecular interaction graphs for systems biology

BMC Bioinformatics. 2006 Feb 7:7:55. doi: 10.1186/1471-2105-7-55.

Abstract

Background: The goal of information integration in systems biology is to combine information from a number of databases and data sets, which are obtained from both high and low throughput experiments, under one data management scheme such that the cumulative information provides greater biological insight than is possible with individual information sources considered separately.

Results: Here we present PathSys, a graph-based system for creating a combined database of networks of interaction for generating integrated view of biological mechanisms. We used PathSys to integrate over 14 curated and publicly contributed data sources for the budding yeast (S. cerevisiae) and Gene Ontology. A number of exploratory questions were formulated as a combination of relational and graph-based queries to the integrated database. Thus, PathSys is a general-purpose, scalable, graph-data warehouse of biological information, complete with a graph manipulation and a query language, a storage mechanism and a generic data-importing mechanism through schema-mapping.

Conclusion: Results from several test studies demonstrate the effectiveness of the approach in retrieving biologically interesting relations between genes and proteins, the networks connecting them, and of the utility of PathSys as a scalable graph-based warehouse for interaction-network integration and a hypothesis generator system. The PathSys's client software, named BiologicalNetworks, developed for navigation and analyses of molecular networks, is available as a Java Web Start application at http://brak.sdsc.edu/pub/BiologicalNetworks.

MeSH terms

  • Computer Graphics*
  • Computer Simulation
  • Databases, Protein*
  • Information Storage and Retrieval / methods*
  • Models, Biological
  • Protein Interaction Mapping / methods*
  • Saccharomyces cerevisiae Proteins / metabolism
  • Signal Transduction / physiology
  • Software*
  • Systems Biology / methods*
  • Systems Integration
  • User-Computer Interface*

Substances

  • Saccharomyces cerevisiae Proteins