The relative vertex clustering value--a new criterion for the fast discovery of functional modules in protein interaction networks
- PMID: 25734691
- PMCID: PMC4347617
- DOI: 10.1186/1471-2105-16-S4-S3
The relative vertex clustering value--a new criterion for the fast discovery of functional modules in protein interaction networks
Abstract
Background: Cellular processes are known to be modular and are realized by groups of proteins implicated in common biological functions. Such groups of proteins are called functional modules, and many community detection methods have been devised for their discovery from protein interaction networks (PINs) data. In current agglomerative clustering approaches, vertices with just a very few neighbors are often classified as separate clusters, which does not make sense biologically. Also, a major limitation of agglomerative techniques is that their computational efficiency do not scale well to large PINs. Finally, PIN data obtained from large scale experiments generally contain many false positives, and this makes it hard for agglomerative clustering methods to find the correct clusters, since they are known to be sensitive to noisy data.
Results: We propose a local similarity premetric, the relative vertex clustering value, as a new criterion allowing to decide when a node can be added to a given node's cluster and which addresses the above three issues. Based on this criterion, we introduce a novel and very fast agglomerative clustering technique, FAC-PIN, for discovering functional modules and protein complexes from a PIN data.
Conclusions: Our proposed FAC-PIN algorithm is applied to nine PIN data from eight different species including the yeast PIN, and the identified functional modules are validated using Gene Ontology (GO) annotations from DAVID Bioinformatics Resources. Identified protein complexes are also validated using experimentally verified complexes. Computational results show that FAC-PIN can discover functional modules or protein complexes from PINs more accurately and more efficiently than HC-PIN and CNM, the current state-of-the-art approaches for clustering PINs in an agglomerative manner.
Figures
Similar articles
-
A fast hierarchical clustering algorithm for functional modules discovery in protein interaction networks.IEEE/ACM Trans Comput Biol Bioinform. 2011 May-Jun;8(3):607-20. doi: 10.1109/TCBB.2010.75. IEEE/ACM Trans Comput Biol Bioinform. 2011. PMID: 20733244
-
Identification of functional modules in a PPI network by clique percolation clustering.Comput Biol Chem. 2006 Dec;30(6):445-51. doi: 10.1016/j.compbiolchem.2006.10.001. Epub 2006 Nov 13. Comput Biol Chem. 2006. PMID: 17098476
-
From Function to Interaction: A New Paradigm for Accurately Predicting Protein Complexes Based on Protein-to-Protein Interaction Networks.IEEE/ACM Trans Comput Biol Bioinform. 2014 Jul-Aug;11(4):616-27. doi: 10.1109/TCBB.2014.2306825. IEEE/ACM Trans Comput Biol Bioinform. 2014. PMID: 26356332
-
Functional annotations for the Saccharomyces cerevisiae genome: the knowns and the known unknowns.Trends Microbiol. 2009 Jul;17(7):286-94. doi: 10.1016/j.tim.2009.04.005. Epub 2009 Jul 2. Trends Microbiol. 2009. PMID: 19577472 Free PMC article. Review.
-
Maximizing cohesion and separation for detecting protein functional modules in protein-protein interaction networks.PLoS One. 2020 Oct 13;15(10):e0240628. doi: 10.1371/journal.pone.0240628. eCollection 2020. PLoS One. 2020. PMID: 33048996 Free PMC article. Review.
Cited by
-
Matrix metalloproteinase expression and molecular interaction network analysis in gastric cancer.Oncol Lett. 2016 Oct;12(4):2403-2408. doi: 10.3892/ol.2016.5013. Epub 2016 Aug 16. Oncol Lett. 2016. PMID: 27698806 Free PMC article.
-
Topological and functional comparison of community detection algorithms in biological networks.BMC Bioinformatics. 2019 Apr 27;20(1):212. doi: 10.1186/s12859-019-2746-0. BMC Bioinformatics. 2019. PMID: 31029085 Free PMC article.
References
-
- Pei P, Zhang A. A 'Seed-Refine' Algorithm for Detecting Protein Complexes from Protein Interaction Data. IEEE Transcations of Nanobioscience. 2007;6(1):43–50. - PubMed
-
- Blondel VD, Guillaume JL, Lambiotte R, Lefebvre E. Fast Unfolding of Communities in Large Networks. Journal of Statistical Mechanics: Theory and Experiment. 2008;2008(10):P1000.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Molecular Biology Databases