Is protein classification necessary? Toward alternative approaches to function annotation
- PMID: 19269161
- PMCID: PMC2745633
- DOI: 10.1016/j.sbi.2009.02.001
Is protein classification necessary? Toward alternative approaches to function annotation
Abstract
The current nonredundant protein sequence database contains over seven million entries and the number of individual functional domains is significantly larger than this value. The vast quantity of data associated with these proteins poses enormous challenges to any attempt at function annotation. Classification of proteins into sequence and structural groups has been widely used as an approach to simplifying the problem. In this article we question such strategies. We describe how the multifunctionality and structural diversity of even closely related proteins confounds efforts to assign function on the basis of overall sequence or structural similarity. Rather, we suggest that strategies that avoid classification may offer a more robust approach to protein function annotation.
Figures
Similar articles
-
In silico characterization of proteins: UniProt, InterPro and Integr8.Mol Biotechnol. 2008 Feb;38(2):165-77. doi: 10.1007/s12033-007-9003-x. Epub 2007 Oct 4. Mol Biotechnol. 2008. PMID: 18219596 Review.
-
Identification of subfamily-specific sites based on active sites modeling and clustering.Bioinformatics. 2010 Dec 15;26(24):3075-82. doi: 10.1093/bioinformatics/btq595. Epub 2010 Oct 26. Bioinformatics. 2010. PMID: 20980272
-
ArchDB: automated protein loop classification as a tool for structural genomics.Nucleic Acids Res. 2004 Jan 1;32(Database issue):D185-8. doi: 10.1093/nar/gkh002. Nucleic Acids Res. 2004. PMID: 14681390 Free PMC article.
-
Protein family classification and functional annotation.Comput Biol Chem. 2003 Feb;27(1):37-47. doi: 10.1016/s1476-9271(02)00098-1. Comput Biol Chem. 2003. PMID: 12798038 Review.
-
A biocurator perspective: annotation at the Research Collaboratory for Structural Bioinformatics Protein Data Bank.PLoS Comput Biol. 2006 Oct 27;2(10):e99. doi: 10.1371/journal.pcbi.0020099. PLoS Comput Biol. 2006. PMID: 17069453 Free PMC article. Review. No abstract available.
Cited by
-
Toward a "structural BLAST": using structural relationships to infer function.Protein Sci. 2013 Apr;22(4):359-66. doi: 10.1002/pro.2225. Epub 2013 Feb 21. Protein Sci. 2013. PMID: 23349097 Free PMC article. Review.
-
Issues in bioinformatics benchmarking: the case study of multiple sequence alignment.Nucleic Acids Res. 2010 Nov;38(21):7353-63. doi: 10.1093/nar/gkq625. Epub 2010 Jul 17. Nucleic Acids Res. 2010. PMID: 20639539 Free PMC article. Review.
-
Structural relationships among proteins with different global topologies and their implications for function annotation strategies.Proc Natl Acad Sci U S A. 2009 Oct 13;106(41):17377-82. doi: 10.1073/pnas.0907971106. Epub 2009 Sep 24. Proc Natl Acad Sci U S A. 2009. PMID: 19805138 Free PMC article.
-
Maps of protein structure space reveal a fundamental relationship between protein structure and function.Proc Natl Acad Sci U S A. 2011 Jul 26;108(30):12301-6. doi: 10.1073/pnas.1102727108. Epub 2011 Jul 7. Proc Natl Acad Sci U S A. 2011. PMID: 21737750 Free PMC article.
-
Composite structural motifs of binding sites for delineating biological functions of proteins.PLoS One. 2012;7(2):e31437. doi: 10.1371/journal.pone.0031437. Epub 2012 Feb 8. PLoS One. 2012. PMID: 22347478 Free PMC article.
References
-
- Taylor WR. Evolutionary transitions in protein fold space. Current opinion in structural biology. 2007;17:354–361. - PubMed
-
- Commichau FM, Stulke J. Trigger enzymes: bifunctional proteins active in metabolism and in controlling gene expression. Molecular microbiology. 2008;67:692–702. - PubMed
-
- Reeves GA, Dallman TJ, Redfern OC, Akpor A, Orengo CA. Structural Diversity of Domain Superfamilies in the CATH Database. Journal of Molecular Biology. 2006;360:725–741. This paper, along with reference [5], describe the surprising amount of structural diversity that can arise in proteins that are related evolutionarily as a result of variations functional necessities, such as novel oligomeric states or binding of ligands with different moieties. - PubMed
-
- Andreeva A, Murzin AG. Evolution of protein fold in the presence of functional constraints. Current opinion in structural biology. 2006;16:399–408. - PubMed
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources