The Protein Information Resource
- PMID: 12520019
- PMCID: PMC165487
- DOI: 10.1093/nar/gkg040
Abstract
The Protein Information Resource (PIR) is an integrated public resource of protein informatics that supports genomic and proteomic research and scientific discovery. PIR maintains the Protein Sequence Database (PSD), an annotated protein database containing over 283 000 sequences covering the entire taxonomic range. Family classification is used for sensitive identification, consistent annotation, and detection of annotation errors. The superfamily curation defines signature domain architecture and categorizes memberships to improve automated classification. To increase the amount of experimental annotation, the PIR has developed a bibliography system for literature searching, mapping, and user submission, and has conducted retrospective attribution of citations for experimental features. PIR also maintains NREF, a non-redundant reference database, and iProClass, an integrated database of protein family, function, and structure information. PIR-NREF provides a timely and comprehensive collection of protein sequences, currently consisting of more than 1 000 000 entries from PIR-PSD, SWISS-PROT, TrEMBL, RefSeq, GenPept, and PDB. The PIR web site (http://pir.georgetown.edu) connects data analysis tools to underlying databases for information retrieval and knowledge discovery, with functionalities for interactive queries, combinations of sequence and text searches, and sorting and visual exploration of search results. The FTP site provides free download for PSD and NREF biweekly releases and auxiliary databases and files.
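The idea behind a non-redundant reference database such as PIR-NREF can be illustrated with a small sketch. The abstract does not spell out the merging rule, so the code below assumes, purely for illustration, that records from different source databases are collapsed into one entry when they share the same organism and the same sequence. The `Record` type, the `build_nonredundant` function, and all accessions and sequences are hypothetical and are not taken from PIR.

```python
from collections import defaultdict
from typing import NamedTuple


class Record(NamedTuple):
    source: str     # source database name, e.g. "PIR-PSD"
    accession: str  # made-up accession, for illustration only
    organism: str
    sequence: str


def build_nonredundant(records):
    """Collapse records with identical (organism, sequence) into one entry.

    Assumption: identity of organism plus sequence is the merge key.
    The real PIR-NREF procedure may differ.
    """
    groups = defaultdict(list)
    for rec in records:
        # Normalize case so "mkta" and "MKTA" count as the same sequence.
        key = (rec.organism, rec.sequence.upper())
        groups[key].append((rec.source, rec.accession))
    return [
        {"organism": org, "sequence": seq, "sources": sorted(members)}
        for (org, seq), members in groups.items()
    ]


# Toy input: two databases report the same human sequence,
# and a third reports the same sequence for a different organism.
records = [
    Record("PIR-PSD", "X0001", "Homo sapiens", "MKTAYIAK"),
    Record("SWISS-PROT", "Y0001", "Homo sapiens", "mktayiak"),
    Record("TrEMBL", "Z0001", "Mus musculus", "MKTAYIAK"),
]

entries = build_nonredundant(records)
for entry in entries:
    print(entry["organism"], entry["sequence"], entry["sources"])
```

Each merged entry keeps the list of contributing source records, so a single non-redundant entry can still be traced back to every database that reported it.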
Similar articles
- The Protein Information Resource: an integrated public resource of functional annotation of proteins. Nucleic Acids Res. 2002 Jan 1;30(1):35-7. doi: 10.1093/nar/30.1.35. PMID: 11752247. Free PMC article.
- iProClass: an integrated database of protein family, function and structure information. Nucleic Acids Res. 2003 Jan 1;31(1):390-2. doi: 10.1093/nar/gkg044. PMID: 12520030. Free PMC article.
- The protein information resource (PIR). Nucleic Acids Res. 2000 Jan 1;28(1):41-4. doi: 10.1093/nar/28.1.41. PMID: 10592177. Free PMC article.
- Protein family classification and functional annotation. Comput Biol Chem. 2003 Feb;27(1):37-47. doi: 10.1016/s1476-9271(02)00098-1. PMID: 12798038. Review.
- Update on genome completion and annotations: Protein Information Resource. Hum Genomics. 2004 Mar;1(3):229-33. doi: 10.1186/1479-7364-1-3-229. PMID: 15588483. Free PMC article. Review.
Cited by
- Prediction of catalytic residues using Support Vector Machine with selected protein sequence and structural properties. BMC Bioinformatics. 2006 Jun 21;7:312. doi: 10.1186/1471-2105-7-312. PMID: 16790052. Free PMC article.
- SPINE 2: a system for collaborative structural proteomics within a federated database framework. Nucleic Acids Res. 2003 Jun 1;31(11):2833-8. doi: 10.1093/nar/gkg397. PMID: 12771210. Free PMC article.
- Recent additions and improvements to the Onto-Tools. Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W762-5. doi: 10.1093/nar/gki472. PMID: 15980579. Free PMC article.
- Generation and gene ontology based analysis of expressed sequence tags (EST) from a Panax ginseng C. A. Meyer roots. Mol Biol Rep. 2010 Oct;37(7):3465-72. doi: 10.1007/s11033-009-9938-z. Epub 2009 Nov 27. PMID: 19943115.
- Systematic identification of cancer-specific MHC-binding peptides with RAVEN. Oncoimmunology. 2018 Jul 23;7(9):e1481558. doi: 10.1080/2162402X.2018.1481558. eCollection 2018. PMID: 30228952. Free PMC article.