PRINCESS, a protein interaction confidence evaluation system with multiple data sources

Mol Cell Proteomics. 2008 Jun;7(6):1043-52. doi: 10.1074/mcp.M700287-MCP200. Epub 2008 Jan 29.

Abstract

Advances in proteomics technologies have enabled novel protein interactions to be detected at high speed, but they come at the expense of relatively low quality. Therefore, a crucial step in utilizing the high throughput protein interaction data is evaluating their confidence and then separating the subsets of reliable interactions from the background noise for further analyses. Using Bayesian network approaches, we combine multiple heterogeneous biological evidences, including model organism protein-protein interaction, interaction domain, functional annotation, gene expression, genome context, and network topology structure, to assign reliability to the human protein-protein interactions identified by high throughput experiments. This method shows high sensitivity and specificity to predict true interactions from the human high throughput protein-protein interaction data sets. This method has been developed into an on-line confidence scoring system specifically for the human high throughput protein-protein interactions. Users may submit their protein-protein interaction data on line, and the detailed information about the supporting evidence for query interactions together with the confidence scores will be returned. The Web interface of PRINCESS (protein interaction confidence evaluation system with multiple data sources) is available at the website of China Human Proteome Organisation.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Animals
  • Area Under Curve
  • Bayes Theorem
  • Computational Biology
  • Data Interpretation, Statistical
  • Databases, Protein
  • Gene Expression Regulation
  • Genome
  • Humans
  • Internet
  • Protein Interaction Mapping
  • Proteomics / methods*
  • ROC Curve
  • Software