Modifying the DPClus algorithm for identifying protein complexes based on new topological structures
Open Access
- 25 September 2008
- journal article
- Published by Springer Nature in BMC Bioinformatics
- Vol. 9 (1), 398
- https://doi.org/10.1186/1471-2105-9-398
Abstract
Background: Identification of protein complexes is crucial for understanding the principles of cellular organization and function. As the set of known protein-protein interactions grows, a general trend is to represent the interactions as a network and to develop effective algorithms to detect significant complexes in such networks. Results: Based on a study of known complexes in protein networks, this paper proposes a new topological structure for protein complexes that combines subgraph diameter (or average vertex distance) with subgraph density. Following the approach of the previously proposed clustering algorithm DPClus, which expands clusters starting from seeded vertices, we present a clustering algorithm, IPCA, based on the new topological structure for identifying complexes in large protein interaction networks. IPCA is applied to the protein interaction network of Saccharomyces cerevisiae and identifies many well-known complexes. Experimental results show that IPCA recalls more known complexes than previously proposed clustering algorithms, including DPClus, CFinder, LCMA, MCODE, RNSC and STM. Conclusion: The proposed algorithm based on the new topological structure makes it possible to identify dense subgraphs in protein interaction networks, many of which correspond to known protein complexes. The algorithm is robust to the known high rate of false positives and false negatives in data from high-throughput interaction techniques. The program is available at http://netlab.csu.edu.cn/bioinformatics/limin/IPCA.
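The abstract names two ingredients, subgraph density and subgraph diameter, together with seed-based cluster expansion. The Python sketch below illustrates how these could fit together; it is not the published IPCA procedure. The function names, the greedy candidate ordering, and the thresholds `max_diameter` and `min_density` are all assumptions introduced for illustration, and the paper's actual vertex-admission rule is not reproduced here.

```python
from collections import deque
from itertools import combinations


def density(adj, nodes):
    """Edges inside `nodes` divided by the maximum possible number of edges."""
    nodes = set(nodes)
    n = len(nodes)
    if n < 2:
        return 0.0
    edges = sum(1 for u, v in combinations(nodes, 2) if v in adj[u])
    return 2.0 * edges / (n * (n - 1))


def diameter(adj, nodes):
    """Longest shortest path in the induced subgraph (inf if disconnected)."""
    nodes = set(nodes)
    worst = 0
    for src in nodes:
        # Breadth-first search restricted to vertices in `nodes`.
        dist = {src: 0}
        queue = deque([src])
        while queue:
            u = queue.popleft()
            for v in adj[u]:
                if v in nodes and v not in dist:
                    dist[v] = dist[u] + 1
                    queue.append(v)
        if len(dist) < len(nodes):
            return float("inf")
        worst = max(worst, max(dist.values()))
    return worst


def expand_from_seed(adj, seed, max_diameter=2, min_density=0.6):
    """Greedily grow a cluster from `seed` (hypothetical thresholds).

    A neighbouring vertex is admitted only if the enlarged cluster still
    satisfies both the diameter bound and the density bound.
    """
    cluster = {seed}
    while True:
        frontier = {v for u in cluster for v in adj[u]} - cluster
        # Prefer candidates with the most links into the current cluster;
        # ties are broken by name so the result is deterministic.
        ranked = sorted(frontier, key=lambda v: (-len(adj[v] & cluster), v))
        for candidate in ranked:
            trial = cluster | {candidate}
            if (diameter(adj, trial) <= max_diameter
                    and density(adj, trial) >= min_density):
                cluster = trial
                break
        else:
            # No candidate keeps the cluster within both bounds: stop.
            return cluster


# Toy network: a 4-clique (a, b, c, d) with a tail d - e - f.
toy = {
    "a": {"b", "c", "d"},
    "b": {"a", "c", "d"},
    "c": {"a", "b", "d"},
    "d": {"a", "b", "c", "e"},
    "e": {"d", "f"},
    "f": {"e"},
}
complex_guess = expand_from_seed(toy, "a", max_diameter=2, min_density=0.8)
```

On this toy network the expansion absorbs the clique but rejects the tail vertex `e`, because admitting it would push the density below the assumed threshold. Using the average vertex distance instead of the diameter, as the abstract also mentions, would only require swapping out the `diameter` function.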