Structure-based active site profiles for genome analysis and functional family subclassification
- PMID: 14623182
- DOI: 10.1016/j.jmb.2003.09.062
Structure-based active site profiles for genome analysis and functional family subclassification
Abstract
In previous work, structure-based functional site descriptors, fuzzy functional forms (FFFs), were developed to recognize structurally conserved active sites in proteins. These descriptors identify members of protein families according to active-site structural similarity, rather than overall sequence or structure similarity. FFFs are defined by a minimal number of highly conserved residues and their three-dimensional arrangement. This approach is advantageous for function assignment across broad families, but is limited when applied to detailed subclassification within these families. In the work described here, we developed a method of three-dimensional, or structure-based, active-site profiling that utilizes FFFs to identify residues located in the spatial environment around the active site. Three-dimensional active-site profiling reveals similarities and differences among active sites across protein families. Using this approach, active-site profiles were constructed from known structures for 193 functional families, and these profiles were verified as distinct and characteristic. To achieve this result, a scoring function was developed that discriminates between true functional sites and those that are geometrically most similar, but do not perform the same function. In a large-scale retrospective analysis of human genome sequences, this profile score was shown to identify specific functional families correctly. The method is effective at recognizing the likely subtype of structurally uncharacterized members of the diverse family of protein kinases, categorizing sequences correctly that were misclassified by global sequence alignment methods. Subfamily information provided by this three-dimensional active-site profiling method yields key information for specific and selective inhibitor design for use in the pharmaceutical industry.
Similar articles
-
Method for prediction of protein function from sequence using the sequence-to-structure-to-function paradigm with application to glutaredoxins/thioredoxins and T1 ribonucleases.J Mol Biol. 1998 Sep 4;281(5):949-68. doi: 10.1006/jmbi.1998.1993. J Mol Biol. 1998. PMID: 9719646
-
An integrated approach to the analysis and modeling of protein sequences and structures. III. A comparative study of sequence conservation in protein structural families using multiple structural alignments.J Mol Biol. 2000 Aug 18;301(3):691-711. doi: 10.1006/jmbi.2000.3975. J Mol Biol. 2000. PMID: 10966778
-
Prediction of deleterious functional effects of amino acid mutations using a library of structure-based function descriptors.Proteins. 2003 Dec 1;53(4):806-16. doi: 10.1002/prot.10458. Proteins. 2003. PMID: 14635123
-
[The active site of human glucocerebrosidase: structural predictions and experimental validations].J Soc Biol. 2002;196(2):151-60. J Soc Biol. 2002. PMID: 12360744 Review. French.
-
Unification of protein families.Curr Opin Struct Biol. 1998 Jun;8(3):372-9. doi: 10.1016/s0959-440x(98)80072-9. Curr Opin Struct Biol. 1998. PMID: 9666334 Review.
Cited by
-
Redox biology: computational approaches to the investigation of functional cysteine residues.Antioxid Redox Signal. 2011 Jul 1;15(1):135-46. doi: 10.1089/ars.2010.3561. Epub 2011 Apr 14. Antioxid Redox Signal. 2011. PMID: 20812876 Free PMC article. Review.
-
Detecting evolutionary relationships across existing fold space, using sequence order-independent profile-profile alignments.Proc Natl Acad Sci U S A. 2008 Apr 8;105(14):5441-6. doi: 10.1073/pnas.0704422105. Epub 2008 Apr 2. Proc Natl Acad Sci U S A. 2008. PMID: 18385384 Free PMC article.
-
New computational approaches to understanding molecular protein function.PLoS Comput Biol. 2018 Apr 5;14(4):e1005756. doi: 10.1371/journal.pcbi.1005756. eCollection 2018 Apr. PLoS Comput Biol. 2018. PMID: 29621256 Free PMC article. No abstract available.
-
Functional site profiling and electrostatic analysis of cysteines modifiable to cysteine sulfenic acid.Protein Sci. 2008 Feb;17(2):299-312. doi: 10.1110/ps.073096508. Protein Sci. 2008. PMID: 18227433 Free PMC article.
-
De-orphaning the structural proteome through reciprocal comparison of evolutionarily important structural features.PLoS One. 2008 May 7;3(5):e2136. doi: 10.1371/journal.pone.0002136. PLoS One. 2008. PMID: 18461181 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources