The CATH domain structure database: new protocols and classification levels give a more comprehensive resource for exploring evolution
- PMID: 17135200
- PMCID: PMC1751535
- DOI: 10.1093/nar/gkl959
The CATH domain structure database: new protocols and classification levels give a more comprehensive resource for exploring evolution
Abstract
We report the latest release (version 3.0) of the CATH protein domain database (http://www.cathdb.info). There has been a 20% increase in the number of structural domains classified in CATH, up to 86 151 domains. Release 3.0 comprises 1110 fold groups and 2147 homologous superfamilies. To cope with the increases in diverse structural homologues being determined by the structural genomics initiatives, more sensitive methods have been developed for identifying boundaries in multi-domain proteins and for recognising homologues. The CATH classification update is now being driven by an integrated pipeline that links these automated procedures with validation steps, that have been made easier by the provision of information rich web pages summarising comparison scores and relevant links to external sites for each domain being classified. An analysis of the population of domains in the CATH hierarchy and several domain characteristics are presented for version 3.0. We also report an update of the CATH Dictionary of homologous structures (CATH-DHS) which now contains multiple structural alignments, consensus information and functional annotations for 1459 well populated superfamilies in CATH. CATH is directly linked to the Gene3D database which is a projection of CATH structural data onto approximately 2 million sequences in completed genomes and UniProt.
Figures
Similar articles
-
The CATH Domain Structure Database and related resources Gene3D and DHS provide comprehensive domain family information for genome analysis.Nucleic Acids Res. 2005 Jan 1;33(Database issue):D247-51. doi: 10.1093/nar/gki024. Nucleic Acids Res. 2005. PMID: 15608188 Free PMC article.
-
The CATH classification revisited--architectures reviewed and new ways to characterize structural divergence in superfamilies.Nucleic Acids Res. 2009 Jan;37(Database issue):D310-4. doi: 10.1093/nar/gkn877. Epub 2008 Nov 7. Nucleic Acids Res. 2009. PMID: 18996897 Free PMC article.
-
CATH: comprehensive structural and functional annotations for genome sequences.Nucleic Acids Res. 2015 Jan;43(Database issue):D376-81. doi: 10.1093/nar/gku947. Epub 2014 Oct 27. Nucleic Acids Res. 2015. PMID: 25348408 Free PMC article.
-
The history of the CATH structural classification of protein domains.Biochimie. 2015 Dec;119:209-17. doi: 10.1016/j.biochi.2015.08.004. Epub 2015 Aug 4. Biochimie. 2015. PMID: 26253692 Free PMC article. Review.
-
Diversity in protein domain superfamilies.Curr Opin Genet Dev. 2015 Dec;35:40-9. doi: 10.1016/j.gde.2015.09.005. Epub 2015 Nov 3. Curr Opin Genet Dev. 2015. PMID: 26451979 Free PMC article. Review.
Cited by
-
The use of evolutionary patterns in protein annotation.Curr Opin Struct Biol. 2012 Jun;22(3):316-25. doi: 10.1016/j.sbi.2012.05.001. Epub 2012 May 24. Curr Opin Struct Biol. 2012. PMID: 22633559 Free PMC article. Review.
-
NMR structure of lipoprotein YxeF from Bacillus subtilis reveals a calycin fold and distant homology with the lipocalin Blc from Escherichia coli.PLoS One. 2012;7(6):e37404. doi: 10.1371/journal.pone.0037404. Epub 2012 Jun 5. PLoS One. 2012. PMID: 22693626 Free PMC article.
-
The crystal structure of the Dachshund domain of human SnoN reveals flexibility in the putative protein interaction surface.PLoS One. 2010 Sep 23;5(9):e12907. doi: 10.1371/journal.pone.0012907. PLoS One. 2010. PMID: 20957027 Free PMC article.
-
Extraction of human kinase mutations from literature, databases and genotyping studies.BMC Bioinformatics. 2009 Aug 27;10 Suppl 8(Suppl 8):S1. doi: 10.1186/1471-2105-10-S8-S1. BMC Bioinformatics. 2009. PMID: 19758464 Free PMC article.
-
Defining structural and evolutionary modules in proteins: a community detection approach to explore sub-domain architecture.BMC Struct Biol. 2013 Oct 16;13:20. doi: 10.1186/1472-6807-13-20. BMC Struct Biol. 2013. PMID: 24131821 Free PMC article.
References
-
- Todd A.E., Marsden R.L., Thornton J.M., Orengo C.A. Progress of structural genomics initiatives: an analysis of solved target structures. J. Mol. Biol. 2005;348:1235–1260. - PubMed
-
- Chandonia J.M., Brenner S.E. The impact of structural genomics: expectations and outcomes. Science. 2006;311:347–351. - PubMed
-
- Pearl F., Todd A., Sillitoe I., Dibley M., Redfern O., Lewis T., Bennett C., Marsden R., Grant A., Lee D., et al. The CATH domain structure database and related resources Gene3D and DHS provide comprehensive domain family information for genome analysis. Nucleic Acids Res. 2005;33:247–251. - PMC - PubMed