RefSeq Release 225 Now Available!

RefSeq Release 225 Now Available!

Check out RefSeq release 225, now available online and from the FTP site. You can access RefSeq data through NCBI Datasets.

What’s included in this release?

As of July 8, 2024, this full release incorporates genomic, transcript, and protein data containing:

  • 448,507,905 records
  • 334,845,613 proteins
  • 63,542,774 RNAs
  • Sequences from 152,668 organisms

The release is provided in several directories as a complete dataset and also as divided by logical groupings.

New eukaryotic genome annotations

This release contains new annotations generated by NCBI’s eukaryotic genome annotation pipeline for 35 species, including:

WormBase and PomBase db_xrefs

This release does not contain db_xref qualifiers for Caenorhabditis elegans (WormBase) and Schizosaccharomyces pombe (PomBase). We expect to restore these cross-references in the next release.

GenBank qualifier updated

As previously announced, the name of the GenBank qualifier /country was updated to /geo_loc_name in June 2024, to better represent the diversity of sample collection location types.

Stay up to date

RefSeq is part of the NIH Comparative Genomics Resource (CGR)CGR facilitates reliable comparative genomics analyses for all eukaryotic organisms through an NCBI Toolkit and community collaboration. Follow us on social @NCBI and join our mailing list to keep up to date with RefSeq and other CGR news.

Questions?

If you have questions or would like to provide feedback, please reach out to us! 

Leave a Reply