PRIDE: quality control in a proteomics data repository

Database (Oxford). 2012 Mar 20:2012:bas004. doi: 10.1093/database/bas004. Print 2012.

Abstract

The PRoteomics IDEntifications (PRIDE) database is a large public proteomics data repository, containing over 270 million mass spectra as of November 2011. PRIDE is an archival database, providing the proteomics data supporting specific scientific publications in a computationally accessible manner. While PRIDE faces rapid growth in both the size and the number of data depositions, the major challenge is to ensure high-quality depositions in the context of highly diverse proteomics workflows and data representations. Here, we describe the PRIDE curation pipeline and its practical application in quality control of complex data depositions. Database URL: http://www.ebi.ac.uk/pride/.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Database Management Systems / standards*
  • Databases, Protein / standards*
  • Humans
  • Mass Spectrometry
  • Proteins / chemistry*
  • Proteins / classification
  • Proteomics / methods*
  • Proteomics / standards*
  • Quality Control

Substances

  • Proteins