The SUPERFAMILY database in structural genomics
- PMID: 12393919
- DOI: 10.1107/s0907444902015160
The SUPERFAMILY database in structural genomics
Abstract
The SUPERFAMILY hidden Markov model library representing all proteins of known structure predicts the domain architecture of protein sequences and classifies them at the SCOP superfamily level. This analysis has been carried out on all completely sequenced genomes. The ways in which the database can be useful to crystallographers is discussed, in particular with a view to high-throughput structure determination. The application of the SUPERFAMILY database to different target-selection strategies is suggested: novel folds, novel domain combinations and targeted attacks on genomes. Use of the database for more general inquiry in the context of structural studies is also explained. The database provides evolutionary relationships between target proteins and other proteins of known structure through the SCOP database, genome assignments and multiple sequence alignments.
Similar articles
-
SUPERFAMILY: HMMs representing all proteins of known structure. SCOP sequence searches, alignments and genome assignments.Nucleic Acids Res. 2002 Jan 1;30(1):268-72. doi: 10.1093/nar/30.1.268. Nucleic Acids Res. 2002. PMID: 11752312 Free PMC article.
-
SUPFAM--a database of potential protein superfamily relationships derived by comparing sequence-based and structure-based families: implications for structural genomics and function annotation in genomes.Nucleic Acids Res. 2002 Jan 1;30(1):289-93. doi: 10.1093/nar/30.1.289. Nucleic Acids Res. 2002. PMID: 11752317 Free PMC article.
-
The SUPERFAMILY database in 2004: additions and improvements.Nucleic Acids Res. 2004 Jan 1;32(Database issue):D235-9. doi: 10.1093/nar/gkh117. Nucleic Acids Res. 2004. PMID: 14681402 Free PMC article.
-
Protein families and their evolution-a structural perspective.Annu Rev Biochem. 2005;74:867-900. doi: 10.1146/annurev.biochem.74.082803.133029. Annu Rev Biochem. 2005. PMID: 15954844 Review.
-
Towards a covering set of protein family profiles.Prog Biophys Mol Biol. 2000;73(5):321-37. doi: 10.1016/s0079-6107(00)00013-4. Prog Biophys Mol Biol. 2000. PMID: 11063778 Review.
Cited by
-
Pseudomonas putida Metallothionein: Structural Analysis and Implications of Sustainable Heavy Metal Detoxification in Madinah.Toxics. 2023 Oct 16;11(10):864. doi: 10.3390/toxics11100864. Toxics. 2023. PMID: 37888714 Free PMC article.
-
Evolutionarily conserved properties of CLCA proteins 1, 3 and 4, as revealed by phylogenetic and biochemical studies in avian homologues.PLoS One. 2022 Apr 13;17(4):e0266937. doi: 10.1371/journal.pone.0266937. eCollection 2022. PLoS One. 2022. PMID: 35417490 Free PMC article.
-
Length variations amongst protein domain superfamilies and consequences on structure and function.PLoS One. 2009;4(3):e4981. doi: 10.1371/journal.pone.0004981. Epub 2009 Mar 31. PLoS One. 2009. PMID: 19333395 Free PMC article.
-
Evolutionary analysis of rhodopsin and cone pigments: connecting the three-dimensional structure with spectral tuning and signal transfer.FEBS Lett. 2003 Nov 27;555(1):151-9. doi: 10.1016/s0014-5793(03)01152-9. FEBS Lett. 2003. PMID: 14630336 Free PMC article.
-
TFCat: the curated catalog of mouse and human transcription factors.Genome Biol. 2009;10(3):R29. doi: 10.1186/gb-2009-10-3-r29. Epub 2009 Mar 12. Genome Biol. 2009. PMID: 19284633 Free PMC article.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources