Fragment ranking in modelling of protein structure. Conformationally constrained environmental amino acid substitution tables
- PMID: 8421300
- DOI: 10.1006/jmbi.1993.1018
Fragment ranking in modelling of protein structure. Conformationally constrained environmental amino acid substitution tables
Abstract
Conformationally constrained environment-dependent amino acid residue substitution tables have been constructed from a database comprising 33 homologous families of protein sequences aligned on the basis of their three-dimensional structures. Residues are allotted to one of 216 (or 54) classes of combinations of structural features. These include nine main-chain conformation classes, three classes of side-chain accessibility and eight (or two) classes of side-chain involvement in three types of hydrogen bond. Seven different main-chain conformational classes outside of regions of regular structure were identified in an analysis of the distributions of phi-psi torsion angles in 84 high-resolution crystallographic structures. Residue substitutions at equivalent positions in the structural alignments are included where the main-chain conformational class is conserved. Frequency data in the form of 216 (or 54) environment specific (20 x 20 residue type) matrices are then converted to probabilities. Two smoothing regimes incorporating entropy-driven weights were applied to the set of 54 tables. Predicted residue substitutions have been generated for individual residue positions in beta-hairpins and the hypervariable regions of the immunoglobulins. These have been compared with the observed sequence variation at the same positions using rank correlation methods. Measurements of chi 2 distances demonstrate the considerable improvement in predictive power at key residue positions identified from interactive graphics studies when compared to the Dayhoff MDM250 mutation matrix. An illustrative example is given of an application of the method in the ranking of loop fragments in model building studies of structurally variable regions in two subtilisins. A combined template scoring procedure is found to be 26-fold more discriminatory than the Dayhoff matrix. The success rate is approximately 85%.
Similar articles
-
Conformational analysis and clustering of short and medium size loops connecting regular secondary structures: a database for modeling and prediction.Protein Sci. 1996 Dec;5(12):2600-16. doi: 10.1002/pro.5560051223. Protein Sci. 1996. PMID: 8976569 Free PMC article.
-
An integrated approach to the analysis and modeling of protein sequences and structures. III. A comparative study of sequence conservation in protein structural families using multiple structural alignments.J Mol Biol. 2000 Aug 18;301(3):691-711. doi: 10.1006/jmbi.2000.3975. J Mol Biol. 2000. PMID: 10966778
-
Alignment and searching for common protein folds using a data bank of structural templates.J Mol Biol. 1993 Jun 5;231(3):735-52. doi: 10.1006/jmbi.1993.1323. J Mol Biol. 1993. PMID: 8515448
-
[A turning point in the knowledge of the structure-function-activity relations of elastin].J Soc Biol. 2001;195(2):181-93. J Soc Biol. 2001. PMID: 11727705 Review. French.
-
Are knowledge-based potentials derived from protein structure sets discriminative with respect to amino acid types?Proteins. 1998 May 15;31(3):225-46. Proteins. 1998. PMID: 9593195 Review.
Cited by
-
Local structural differences in homologous proteins: specificities in different SCOP classes.PLoS One. 2012;7(6):e38805. doi: 10.1371/journal.pone.0038805. Epub 2012 Jun 22. PLoS One. 2012. PMID: 22745680 Free PMC article.
-
Derivation of rules for comparative protein modeling from a database of protein structure alignments.Protein Sci. 1994 Sep;3(9):1582-96. doi: 10.1002/pro.5560030923. Protein Sci. 1994. PMID: 7833817 Free PMC article.
-
Modeling of loops in protein structures.Protein Sci. 2000 Sep;9(9):1753-73. doi: 10.1110/ps.9.9.1753. Protein Sci. 2000. PMID: 11045621 Free PMC article.
-
Comparative protein structure modeling using Modeller.Curr Protoc Bioinformatics. 2006 Oct;Chapter 5:Unit-5.6. doi: 10.1002/0471250953.bi0506s15. Curr Protoc Bioinformatics. 2006. PMID: 18428767 Free PMC article.
-
Domainal organization of the lower eukaryotic homologs of the yeast RNA polymerase II core subunit Rpb7 reflects functional conservation.Nucleic Acids Res. 2004 Jan 2;32(1):201-10. doi: 10.1093/nar/gkh163. Print 2004. Nucleic Acids Res. 2004. PMID: 14704357 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources