Structural diversity of domain superfamilies in the CATH database
- PMID: 16780872
- DOI: 10.1016/j.jmb.2006.05.035
Structural diversity of domain superfamilies in the CATH database
Abstract
The CATH database of domain structures has been used to explore the structural variation of homologous domains in 294 well populated domain structure superfamilies, each containing at least three sequence diverse relatives. Our analyses confirm some previously detected trends relating sequence divergence to structural variation but for a much larger dataset and in some superfamilies the new data reveal exceptional structural variation. Use of a new algorithm (2DSEC) to analyse variability in secondary structure compositions across a superfamily sheds new light on how structures evolve. 2DSEC detects inserted secondary structures that embellish the core of conserved secondary structures found throughout the superfamily. Analysis showed that for 56% of highly populated superfamilies (>9 sequence diverse relatives), there are twofold or more increases in the numbers of secondary structures in some relatives. In some families fivefold increases occur, sometimes modifying the fold of the domain. Manual inspection of secondary structure insertions or embellishments in 48 particularly variable superfamilies revealed that although these insertions were usually discontiguous in the sequence they were often co-located in 3D resulting in a larger structural motif that often modified the geometry of the active site or the surface conformation promoting diverse domain partnerships and protein interactions. These observations, supported by automatic analysis of all well populated CATH families, suggest that accretion of small secondary structure insertions may provide a simple mechanism for evolving new functions in diverse relatives. Some layered domain architectures (e.g. mainly-beta and alpha-beta sandwiches) that recur highly in the genomes more frequently exploit these types of embellishments to modify function. In these architectures, aggregation occurs most often at the edges, top or bottom of the beta-sheets. Information on structural variability across domain superfamilies has been made available through the CATH Dictionary of Homologous Structures (DHS).
Similar articles
-
Evolution of function in protein superfamilies, from a structural perspective.J Mol Biol. 2001 Apr 6;307(4):1113-43. doi: 10.1006/jmbi.2001.4513. J Mol Biol. 2001. PMID: 11286560
-
The CATH Domain Structure Database and related resources Gene3D and DHS provide comprehensive domain family information for genome analysis.Nucleic Acids Res. 2005 Jan 1;33(Database issue):D247-51. doi: 10.1093/nar/gki024. Nucleic Acids Res. 2005. PMID: 15608188 Free PMC article.
-
Progress of structural genomics initiatives: an analysis of solved target structures.J Mol Biol. 2005 May 20;348(5):1235-60. doi: 10.1016/j.jmb.2005.03.037. Epub 2005 Apr 2. J Mol Biol. 2005. PMID: 15854658
-
Protein folds, functions and evolution.J Mol Biol. 1999 Oct 22;293(2):333-42. doi: 10.1006/jmbi.1999.3054. J Mol Biol. 1999. PMID: 10529349 Review.
-
The CATH protein family database: a resource for structural and functional annotation of genomes.Proteomics. 2002 Jan;2(1):11-21. Proteomics. 2002. PMID: 11788987 Review.
Cited by
-
Chemogenomic approaches to rational drug design.Br J Pharmacol. 2007 Sep;152(1):38-52. doi: 10.1038/sj.bjp.0707307. Epub 2007 May 29. Br J Pharmacol. 2007. PMID: 17533416 Free PMC article. Review.
-
CUSP: an algorithm to distinguish structurally conserved and unconserved regions in protein domain alignments and its application in the study of large length variations.BMC Struct Biol. 2008 May 31;8:28. doi: 10.1186/1472-6807-8-28. BMC Struct Biol. 2008. PMID: 18513436 Free PMC article.
-
Structural phylogenomics reveals gradual evolutionary replacement of abiotic chemistries by protein enzymes in purine metabolism.PLoS One. 2013;8(3):e59300. doi: 10.1371/journal.pone.0059300. Epub 2013 Mar 13. PLoS One. 2013. PMID: 23516625 Free PMC article.
-
Towards a comprehensive structural coverage of completed genomes: a structural genomics viewpoint.BMC Bioinformatics. 2007 Mar 9;8:86. doi: 10.1186/1471-2105-8-86. BMC Bioinformatics. 2007. PMID: 17349043 Free PMC article.
-
The CATH hierarchy revisited-structural divergence in domain superfamilies and the continuity of fold space.Structure. 2009 Aug 12;17(8):1051-62. doi: 10.1016/j.str.2009.06.015. Structure. 2009. PMID: 19679085 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources