NCBI Reference Sequences: current status, policy and new initiatives
- PMID: 18927115
- PMCID: PMC2686572
- DOI: 10.1093/nar/gkn721
NCBI Reference Sequences: current status, policy and new initiatives
Abstract
NCBI's Reference Sequence (RefSeq) database (http://www.ncbi.nlm.nih.gov/RefSeq/) is a curated non-redundant collection of sequences representing genomes, transcripts and proteins. RefSeq records integrate information from multiple sources and represent a current description of the sequence, the gene and sequence features. The database includes over 5300 organisms spanning prokaryotes, eukaryotes and viruses, with records for more than 5.5 x 10(6) proteins (RefSeq release 30). Feature annotation is applied by a combination of curation, collaboration, propagation from other sources and computation. We report here on the recent growth of the database, recent changes to feature annotations and record types for eukaryotic (primarily vertebrate) species and policies regarding species inclusion and genome annotation. In addition, we introduce RefSeqGene, a new initiative to support reporting variation data on a stable genomic coordinate system.
Similar articles
-
NCBI Reference Sequences (RefSeq): current status, new features and genome annotation policy.Nucleic Acids Res. 2012 Jan;40(Database issue):D130-5. doi: 10.1093/nar/gkr1079. Epub 2011 Nov 24. Nucleic Acids Res. 2012. PMID: 22121212 Free PMC article.
-
Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation.Nucleic Acids Res. 2016 Jan 4;44(D1):D733-45. doi: 10.1093/nar/gkv1189. Epub 2015 Nov 8. Nucleic Acids Res. 2016. PMID: 26553804 Free PMC article.
-
RefSeq: an update on mammalian reference sequences.Nucleic Acids Res. 2014 Jan;42(Database issue):D756-63. doi: 10.1093/nar/gkt1114. Epub 2013 Nov 19. Nucleic Acids Res. 2014. PMID: 24259432 Free PMC article.
-
NCBI reference sequences (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins.Nucleic Acids Res. 2007 Jan;35(Database issue):D61-5. doi: 10.1093/nar/gkl842. Epub 2006 Nov 27. Nucleic Acids Res. 2007. PMID: 17130148 Free PMC article.
-
NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins.Nucleic Acids Res. 2005 Jan 1;33(Database issue):D501-4. doi: 10.1093/nar/gki025. Nucleic Acids Res. 2005. PMID: 15608248 Free PMC article.
Cited by
-
Tracing evolutionary footprints to identify novel gene functional linkages.PLoS One. 2013 Jun 25;8(6):e66817. doi: 10.1371/journal.pone.0066817. Print 2013. PLoS One. 2013. PMID: 23825567 Free PMC article.
-
Evolutionary change driven by metal exposure as revealed by coding SNP genome scan in wild yellow perch (Perca flavescens).Ecotoxicology. 2013 Jul;22(5):938-57. doi: 10.1007/s10646-013-1083-8. Epub 2013 May 31. Ecotoxicology. 2013. PMID: 23722603
-
Systems genomics evaluation of the SH-SY5Y neuroblastoma cell line as a model for Parkinson's disease.BMC Genomics. 2014 Dec 20;15(1):1154. doi: 10.1186/1471-2164-15-1154. BMC Genomics. 2014. PMID: 25528190 Free PMC article.
-
Widespread recognition of 5' splice sites by noncanonical base-pairing to U1 snRNA involving bulged nucleotides.Genes Dev. 2012 May 15;26(10):1098-109. doi: 10.1101/gad.190173.112. Genes Dev. 2012. PMID: 22588721 Free PMC article.
-
Complete genome sequence of Burkholderia sp. Strain GG4, a betaproteobacterium that reduces 3-oxo-N-acylhomoserine lactones and produces different N-acylhomoserine lactones.J Bacteriol. 2012 Nov;194(22):6317. doi: 10.1128/JB.01578-12. J Bacteriol. 2012. PMID: 23105060 Free PMC article.