Enumeration of condition-dependent dense modules in protein interaction networks

Elisabeth Georgii; Sabine Dietmann; Takeaki Uno; Philipp Pagel; Koji Tsuda

doi:10.1093/bioinformatics/btp080

Enumeration of condition-dependent dense modules in protein interaction networks

Bioinformatics. 2009 Apr 1;25(7):933-40. doi: 10.1093/bioinformatics/btp080. Epub 2009 Feb 11.

Authors

Elisabeth Georgii¹, Sabine Dietmann, Takeaki Uno, Philipp Pagel, Koji Tsuda

Affiliation

¹ Max Planck Institute for Biological Cybernetics, Tübingen, Germany.

Abstract

Motivation: Modern systems biology aims at understanding how the different molecular components of a biological cell interact. Often, cellular functions are performed by complexes consisting of many different proteins. The composition of these complexes may change according to the cellular environment, and one protein may be involved in several different processes. The automatic discovery of functional complexes from protein interaction data is challenging. While previous approaches use approximations to extract dense modules, our approach exactly solves the problem of dense module enumeration. Furthermore, constraints from additional information sources such as gene expression and phenotype data can be integrated, so we can systematically mine for dense modules with interesting profiles.

Results: Given a weighted protein interaction network, our method discovers all protein sets that satisfy a user-defined minimum density threshold. We employ a reverse search strategy, which allows us to exploit the density criterion in an efficient way. Our experiments show that the novel approach is feasible and produces biologically meaningful results. In comparative validation studies using yeast data, the method achieved the best overall prediction performance with respect to confirmed complexes. Moreover, by enhancing the yeast network with phenotypic and phylogenetic profiles and the human network with tissue-specific expression data, we identified condition-dependent complex variants.

Availability: A C++ implementation of the algorithm is available at http://www.kyb.tuebingen.mpg.de/~georgii/dme.html.

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Algorithms
Computational Biology / methods*
Humans
Multiprotein Complexes / metabolism*
Phenotype
Protein Interaction Mapping / methods*
Systems Biology

Substances

Multiprotein Complexes