COG database update: focus on microbial diversity, model organisms, and widespread pathogens
- PMID: 33167031
- PMCID: PMC7778934
- DOI: 10.1093/nar/gkaa1018
COG database update: focus on microbial diversity, model organisms, and widespread pathogens
Abstract
The Clusters of Orthologous Genes (COG) database, also referred to as the Clusters of Orthologous Groups of proteins, was created in 1997 and went through several rounds of updates, most recently, in 2014. The current update, available at https://www.ncbi.nlm.nih.gov/research/COG, substantially expands the scope of the database to include complete genomes of 1187 bacteria and 122 archaea, typically, with a single genome per genus. In addition, the current version of the COGs includes the following new features: (i) the recently deprecated NCBI's gene index (gi) numbers for the encoded proteins are replaced with stable RefSeq or GenBank\ENA\DDBJ coding sequence (CDS) accession numbers; (ii) COG annotations are updated for >200 newly characterized protein families with corresponding references and PDB links, where available; (iii) lists of COGs grouped by pathways and functional systems are added; (iv) 266 new COGs for proteins involved in CRISPR-Cas immunity, sporulation in Firmicutes and photosynthesis in cyanobacteria are included; and (v) the database is made available as a web page, in addition to FTP. The current release includes 4877 COGs. Future plans include further expansion of the COG collection by adding archaeal COGs (arCOGs), splitting the COGs containing multiple paralogs, and continued refinement of COG annotations.
Published by Oxford University Press on behalf of Nucleic Acids Research 2020.
Similar articles
-
Clusters of orthologous genes for 41 archaeal genomes and implications for evolutionary genomics of archaea.Biol Direct. 2007 Nov 27;2:33. doi: 10.1186/1745-6150-2-33. Biol Direct. 2007. PMID: 18042280 Free PMC article.
-
Expanded microbial genome coverage and improved protein family annotation in the COG database.Nucleic Acids Res. 2015 Jan;43(Database issue):D261-9. doi: 10.1093/nar/gku1223. Epub 2014 Nov 26. Nucleic Acids Res. 2015. PMID: 25428365 Free PMC article.
-
Updated clusters of orthologous genes for Archaea: a complex ancestor of the Archaea and the byways of horizontal gene transfer.Biol Direct. 2012 Dec 14;7:46. doi: 10.1186/1745-6150-7-46. Biol Direct. 2012. PMID: 23241446 Free PMC article.
-
Sensing of environmental signals: classification of chemoreceptors according to the size of their ligand binding regions.Environ Microbiol. 2010 Nov;12(11):2873-84. doi: 10.1111/j.1462-2920.2010.02325.x. Epub 2010 Aug 25. Environ Microbiol. 2010. PMID: 20738376 Review.
-
A genomic perspective on protein families.Science. 1997 Oct 24;278(5338):631-7. doi: 10.1126/science.278.5338.631. Science. 1997. PMID: 9381173 Review.
Cited by
-
Characterization of gut microbiota dynamics in an Alzheimer's disease mouse model through clade-specific marker-based analysis of shotgun metagenomic data.Biol Direct. 2024 Oct 30;19(1):100. doi: 10.1186/s13062-024-00541-7. Biol Direct. 2024. PMID: 39478626 Free PMC article.
-
Genomic insights and functional evaluation of Lacticaseibacillus paracasei EG005: a promising probiotic with enhanced antioxidant activity.Front Microbiol. 2024 Oct 14;15:1477152. doi: 10.3389/fmicb.2024.1477152. eCollection 2024. Front Microbiol. 2024. PMID: 39469458 Free PMC article.
-
Sequencing-guided re-estimation and promotion of cultivability for environmental bacteria.Nat Commun. 2024 Oct 20;15(1):9051. doi: 10.1038/s41467-024-53446-4. Nat Commun. 2024. PMID: 39426960 Free PMC article.
-
Halosquirtibacter laminarini gen. nov., sp. nov. and Halosquirtibacter xylanolyticus sp. nov., marine anaerobic laminarin and xylan degraders in the phylum Bacteroidota.Sci Rep. 2024 Oct 17;14(1):24329. doi: 10.1038/s41598-024-74787-6. Sci Rep. 2024. PMID: 39414901 Free PMC article.
-
Quest for Orthologs in the Era of Biodiversity Genomics.Genome Biol Evol. 2024 Oct 9;16(10):evae224. doi: 10.1093/gbe/evae224. Genome Biol Evol. 2024. PMID: 39404012 Free PMC article. Review.
References
-
- Tatusov R.L., Koonin E.V., Lipman D.J.. A genomic perspective on protein families. Science. 1997; 278:631–637. - PubMed
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources