A novel function prediction approach using protein overlap networks
- PMID: 23866986
- PMCID: PMC3720179
- DOI: 10.1186/1752-0509-7-61
Abstract
Background: Construction of a reliable network remains the bottleneck for network-based protein function prediction. We built an artificial network model, called a protein overlap network (PON), for each of the entire genomes of yeast, fly, worm, and human. Each node of the network represents a protein, and two proteins are connected if they share a domain according to the InterPro database.
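The construction rule described in the Background can be illustrated with a minimal sketch. It assumes the input is a mapping from protein identifiers to sets of InterPro domain accessions; the function name `build_pon` and the toy data are hypothetical and are not taken from the paper.

```python
from collections import defaultdict
from itertools import combinations


def build_pon(protein_domains):
    """Build an undirected network as adjacency sets.

    Two proteins are linked when they share at least one domain.
    `protein_domains` maps a protein ID to a set of domain IDs.
    """
    # Invert the mapping: for each domain, collect the proteins that carry it.
    domain_members = defaultdict(set)
    for protein, domains in protein_domains.items():
        for domain in domains:
            domain_members[domain].add(protein)

    # Every protein is a node, even if it ends up with no neighbors.
    network = {protein: set() for protein in protein_domains}

    # Connect every pair of proteins that co-occur under the same domain.
    for members in domain_members.values():
        for a, b in combinations(sorted(members), 2):
            network[a].add(b)
            network[b].add(a)
    return network


# Hypothetical toy input; the domain accessions are placeholders.
toy_domains = {
    "P1": {"IPR000001", "IPR000002"},
    "P2": {"IPR000002"},
    "P3": {"IPR000003"},
    "P4": {"IPR000001", "IPR000003"},
}
pon = build_pon(toy_domains)
```

Grouping proteins by domain first avoids comparing every protein pair directly, although a domain shared by many proteins still produces a dense module of pairwise links.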
Results: The function of a protein can be predicted by counting the occurrence frequency of GO (Gene Ontology) terms associated with the domains of its direct neighbors. The average success rate and coverage were 34.3% and 43.9%, respectively, for the test genomes, and increased to 37.9% and 51.3% when a composite PON of the four species was used for the prediction. By comparison, the success rate was 7.0% in the random control procedure. We also made predictions with GO term annotations of the second-layer nodes using the composite network and obtained an impressive success rate (>30%) and coverage (>30%), even for small genomes. Further improvement was achieved by statistical analysis of manually annotated GO terms for each neighboring protein.
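The neighbor-counting idea in the Results can be sketched as follows. This is an illustration under stated assumptions, not the authors' implementation: the abstract does not say how ties are broken or whether a term is counted once per neighbor or once per domain, so this sketch counts each term once per neighbor and breaks ties alphabetically. The function name `predict_go_terms`, the `top_n` parameter, and the toy mappings are hypothetical.

```python
from collections import Counter


def predict_go_terms(protein, network, protein_domains, domain_go, top_n=3):
    """Rank GO terms by how many direct neighbors carry them via their domains.

    Assumption: each term is counted at most once per neighbor.
    """
    counts = Counter()
    for neighbor in network.get(protein, ()):
        # Collect the GO terms linked to any domain of this neighbor.
        terms = set()
        for domain in protein_domains.get(neighbor, ()):
            terms.update(domain_go.get(domain, ()))
        counts.update(terms)

    # Most frequent first; ties broken by term ID so the output is deterministic.
    ranked = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return ranked[:top_n]


# Hypothetical toy data: query protein "Q" with three direct neighbors.
toy_network = {"Q": {"A", "B", "C"}, "A": {"Q"}, "B": {"Q"}, "C": {"Q"}}
toy_protein_domains = {"A": {"D1"}, "B": {"D1", "D2"}, "C": {"D3"}}
toy_domain_go = {
    "D1": {"GO:0005524"},
    "D2": {"GO:0004672", "GO:0005524"},
    "D3": {"GO:0003677"},
}

prediction = predict_go_terms(
    "Q", toy_network, toy_protein_domains, toy_domain_go, top_n=2
)
```

In this toy case the term carried by two of the three neighbors ranks first. Extending the loop to neighbors of neighbors would give a rough analogue of the second-layer prediction mentioned above.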
Conclusions: The PONs are composed of dense modules accompanied by a few long-distance connections. Based on the PONs, we developed multiple approaches that are effective for protein function prediction.