The InterPro protein families database: the classification resource after 15 years
- PMID: 25428371
- PMCID: PMC4383996
- DOI: 10.1093/nar/gku1243
The InterPro protein families database: the classification resource after 15 years
Abstract
The InterPro database (http://www.ebi.ac.uk/interpro/) is a freely available resource that can be used to classify sequences into protein families and to predict the presence of important domains and sites. Central to the InterPro database are predictive models, known as signatures, from a range of different protein family databases that have different biological focuses and use different methodological approaches to classify protein families and domains. InterPro integrates these signatures, capitalizing on the respective strengths of the individual databases, to produce a powerful protein classification resource. Here, we report on the status of InterPro as it enters its 15th year of operation, and give an overview of new developments with the database and its associated Web interfaces and software. In particular, the new domain architecture search tool is described and the process of mapping of Gene Ontology terms to InterPro is outlined. We also discuss the challenges faced by the resource given the explosive growth in sequence data in recent years. InterPro (version 48.0) contains 36,766 member database signatures integrated into 26,238 InterPro entries, an increase of over 3993 entries (5081 signatures), since 2012.
© The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Figures
Similar articles
-
InterPro in 2011: new developments in the family and domain prediction database.Nucleic Acids Res. 2012 Jan;40(Database issue):D306-12. doi: 10.1093/nar/gkr948. Epub 2011 Nov 16. Nucleic Acids Res. 2012. PMID: 22096229 Free PMC article.
-
InterPro, progress and status in 2005.Nucleic Acids Res. 2005 Jan 1;33(Database issue):D201-5. doi: 10.1093/nar/gki106. Nucleic Acids Res. 2005. PMID: 15608177 Free PMC article.
-
New developments in the InterPro database.Nucleic Acids Res. 2007 Jan;35(Database issue):D224-8. doi: 10.1093/nar/gkl841. Nucleic Acids Res. 2007. PMID: 17202162 Free PMC article.
-
The InterPro Database, 2003 brings increased coverage and new features.Nucleic Acids Res. 2003 Jan 1;31(1):315-8. doi: 10.1093/nar/gkg046. Nucleic Acids Res. 2003. PMID: 12520011 Free PMC article.
-
In silico characterization of proteins: UniProt, InterPro and Integr8.Mol Biotechnol. 2008 Feb;38(2):165-77. doi: 10.1007/s12033-007-9003-x. Epub 2007 Oct 4. Mol Biotechnol. 2008. PMID: 18219596 Review.
Cited by
-
A genome assembly of decaploid Houttuynia cordata provides insights into the evolution of Houttuynia and the biosynthesis of alkaloids.Hortic Res. 2024 Jul 30;11(9):uhae203. doi: 10.1093/hr/uhae203. eCollection 2024 Sep. Hortic Res. 2024. PMID: 39308792 Free PMC article.
-
A near-complete chromosome-level genome assembly of looseleaf lettuce (Lactuca sativa var. crispa).Sci Data. 2024 Sep 4;11(1):961. doi: 10.1038/s41597-024-03830-y. Sci Data. 2024. PMID: 39231996 Free PMC article.
-
Comparative transcriptomics identifies genes underlying growth performance of the Pacific black-lipped pearl oyster Pinctada margaritifera.BMC Genomics. 2024 Jul 24;25(1):717. doi: 10.1186/s12864-024-10636-0. BMC Genomics. 2024. PMID: 39049022 Free PMC article.
-
SNP and Structural Study of the Notch Superfamily Provides Insights and Novel Pharmacological Targets against the CADASIL Syndrome and Neurodegenerative Diseases.Genes (Basel). 2024 Apr 23;15(5):529. doi: 10.3390/genes15050529. Genes (Basel). 2024. PMID: 38790158 Free PMC article.
-
SAFPred: synteny-aware gene function prediction for bacteria using protein embeddings.Bioinformatics. 2024 Jun 3;40(6):btae328. doi: 10.1093/bioinformatics/btae328. Bioinformatics. 2024. PMID: 38775729 Free PMC article.
References
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources