The Protein Information Resource

doi:10.1093/nar/gkg040

Nucleic Acids Res. 2003 Jan 1;31(1):345-7.

Cathy H Wu¹, Lai-Su L Yeh, Hongzhan Huang, Leslie Arminski, Jorge Castro-Alvear, Yongxing Chen, Zhangzhi Hu, Panagiotis Kourtesis, Robert S Ledley, Baris E Suzek, C R Vinayaka, Jian Zhang, Winona C Barker

Affiliation

¹ Department of Biochemistry and Molecular Biology, Georgetown University Medical Center, 3900 Reservoir Road, NW, Box 571414, Washington, DC 20057-1414, USA. pirmail@georgetown.edu

PMID: 12520019
PMCID: PMC165487
DOI: 10.1093/nar/gkg040

Cathy H Wu et al. Nucleic Acids Res. 2003.


Abstract

The Protein Information Resource (PIR) is an integrated public resource of protein informatics that supports genomic and proteomic research and scientific discovery. PIR maintains the Protein Sequence Database (PSD), an annotated protein database containing over 283 000 sequences covering the entire taxonomic range. Family classification is used for sensitive identification, consistent annotation, and detection of annotation errors. The superfamily curation defines signature domain architecture and categorizes memberships to improve automated classification. To increase the amount of experimental annotation, the PIR has developed a bibliography system for literature searching, mapping, and user submission, and has conducted retrospective attribution of citations for experimental features. PIR also maintains NREF, a non-redundant reference database, and iProClass, an integrated database of protein family, function, and structure information. PIR-NREF provides a timely and comprehensive collection of protein sequences, currently consisting of more than 1 000 000 entries from PIR-PSD, SWISS-PROT, TrEMBL, RefSeq, GenPept, and PDB. The PIR web site (http://pir.georgetown.edu) connects data analysis tools to underlying databases for information retrieval and knowledge discovery, with functionalities for interactive queries, combinations of sequence and text searches, and sorting and visual exploration of search results. The FTP site provides free download for PSD and NREF biweekly releases and auxiliary databases and files.
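
The abstract does not say how PIR-NREF removes redundancy. As a rough illustration only, the sketch below assumes that records from different source databases are merged when they share the same organism and an identical sequence. The function name, the record layout, and the accessions and sequences are all invented for this example and are not PIR's schema or data.

```python
from collections import defaultdict


def build_nonredundant(records):
    """Collapse records that share (organism, sequence) into one entry.

    `records` is an iterable of (source_db, accession, organism, sequence)
    tuples. This layout is hypothetical, chosen only for the illustration.
    """
    merged = defaultdict(list)
    for source_db, accession, organism, sequence in records:
        # Normalize case and whitespace so trivially different spellings
        # of the same organism or sequence land on the same key.
        key = (organism.strip().lower(), sequence.strip().upper())
        merged[key].append((source_db, accession))
    # One output entry per distinct key, keeping every contributing source.
    return [
        {"organism": org, "sequence": seq, "sources": sorted(srcs)}
        for (org, seq), srcs in merged.items()
    ]


# Made-up records: two databases report the same human sequence,
# and a third reports the same sequence for a different organism.
records = [
    ("PIR-PSD", "A00001", "Homo sapiens", "MKTAYIAK"),
    ("SWISS-PROT", "P00001", "Homo sapiens", "mktayiak"),
    ("TrEMBL", "Q00001", "Mus musculus", "MKTAYIAK"),
]
entries = build_nonredundant(records)
```

Under this assumption the two human records collapse into a single entry that lists both source accessions, while the mouse record stays separate because the organism differs.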

Cited by

Prediction of catalytic residues using Support Vector Machine with selected protein sequence and structural properties.
Petrova NV, Wu CH. BMC Bioinformatics. 2006 Jun 21;7:312. doi: 10.1186/1471-2105-7-312. PMID: 16790052. Free PMC article.
SPINE 2: a system for collaborative structural proteomics within a federated database framework.
Goh CS, Lan N, Echols N, Douglas SM, Milburn D, Bertone P, Xiao R, Ma LC, Zheng D, Wunderlich Z, Acton T, Montelione GT, Gerstein M. Nucleic Acids Res. 2003 Jun 1;31(11):2833-8. doi: 10.1093/nar/gkg397. PMID: 12771210. Free PMC article.
Recent additions and improvements to the Onto-Tools.
Khatri P, Sellamuthu S, Malhotra P, Amin K, Done A, Draghici S. Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W762-5. doi: 10.1093/nar/gki472. PMID: 15980579. Free PMC article.
Generation and gene ontology based analysis of expressed sequence tags (EST) from a Panax ginseng C. A. Meyer roots.
Sathiyamoorthy S, In JG, Gayathri S, Kim YJ, Yang DC. Mol Biol Rep. 2010 Oct;37(7):3465-72. doi: 10.1007/s11033-009-9938-z. Epub 2009 Nov 27. PMID: 19943115.
Systematic identification of cancer-specific MHC-binding peptides with RAVEN.
Baldauf MC, Gerke JS, Kirschner A, Blaeschke F, Effenberger M, Schober K, Rubio RA, Kanaseki T, Kiran MM, Dallmayer M, Musa J, Akpolat N, Akatli AN, Rosman FC, Özen Ö, Sugita S, Hasegawa T, Sugimura H, Baumhoer D, Knott MML, Sannino G, Marchetto A, Li J, Busch DH, Feuchtinger T, Ohmura S, Orth MF, Thiel U, Kirchner T, Grünewald TGP. Oncoimmunology. 2018 Jul 23;7(9):e1481558. doi: 10.1080/2162402X.2018.1481558. eCollection 2018. PMID: 30228952. Free PMC article.
