Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Jan 8;47(D1):D259-D264.
doi: 10.1093/nar/gky1022.

The UNITE database for molecular identification of fungi: handling dark taxa and parallel taxonomic classifications

Affiliations

The UNITE database for molecular identification of fungi: handling dark taxa and parallel taxonomic classifications

Rolf Henrik Nilsson et al. Nucleic Acids Res. .

Abstract

UNITE (https://unite.ut.ee/) is a web-based database and sequence management environment for the molecular identification of fungi. It targets the formal fungal barcode-the nuclear ribosomal internal transcribed spacer (ITS) region-and offers all ∼1 000 000 public fungal ITS sequences for reference. These are clustered into ∼459 000 species hypotheses and assigned digital object identifiers (DOIs) to promote unambiguous reference across studies. In-house and web-based third-party sequence curation and annotation have resulted in more than 275 000 improvements to the data over the past 15 years. UNITE serves as a data provider for a range of metabarcoding software pipelines and regularly exchanges data with all major fungal sequence databases and other community resources. Recent improvements include redesigned handling of unclassifiable species hypotheses, integration with the taxonomic backbone of the Global Biodiversity Information Facility, and support for an unlimited number of parallel taxonomic classification systems.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
A screenshot of a UNITE SH Digital Object Identifier (DOI) page for Tomentella atroarenicolor (https://plutof.ut.ee/#/datacite/10.15156%2FBIO%2FSH009889.07FU). (A) The most accurate taxon name chosen automatically (or manually, if the default was overridden by an expert user) from the available sequence identifications. (B) Short ID of the DOI. (C) Data on reference sequence chosen to represent this SH. (D) Placement of the SH in the fungal classification and identification records for individual sequences. The number after the taxon name indicates how many sequences that carry that name. (E) Select statistics on the SH. The minimum distance 3.0% is the mandatory genetic difference between sister SHs. (F) Distribution map of the individual sequences. (G) Information on ecology (interacting taxa) if associated with the individual sequences. (H) DataCite-specific data on the DOI. (I) Images of the specimen or sample from which the DNA was extracted. Only a limited number of sequences have images attached to them. (J) Graphical overview of the SH with detailed information. (K) SH inclusiveness across sequence similarity threshold values. A threshold value ( = minimum distance) of 1.5% will split these sequences into two SHs, shown here in different colours. (L) A threshold value of 2.5% will lump all sequences into a single SH. Each such SH is hyperlinked to its own unique web page. (M) Scrollable multiple sequence alignment of the SH. ‘RefSeq’ indicates that the sequence was selected manually to be the representative sequence for the SHs. RefSeqs stem from type specimens or other authentic and particularly trustworthy material. This particular SH contains both INSDC sequences (brown) and sequences that are only found in UNITE (yellow). Some 29 000 sequences are only found in UNITE at this stage, but will be submitted to the INSDC upon publication of the underlying studies. These sequences are included in the various UNITE sequence releases and download files.

Similar articles

Cited by

References

    1. Hawksworth D.L., Lücking R.. Fungal diversity revisited: 2.2 to 3.8 million species. Microbiol. Spec. 2017; 5:1–17. - PubMed
    1. Schoch C.L., Seifert K.A., Huhndorf S., Robert V., Spouge J.L., Levesque C.A., Chen W.. Nuclear ribosomal internal transcribed spacer (ITS) region as a universal DNA barcode marker for Fungi. Proc. Natl. Acad. Sci. Biol. 2012; 109:6241–6246. - PMC - PubMed
    1. Karsch-Mizrachi I., Takagi T., Cochrane G.. The international nucleotide sequence database collaboration. Nucleic Acids Res. 2017; 46:D48–D51. - PMC - PubMed
    1. Kõljalg U., Nilsson R.H., Abarenkov K., Tedersoo L., Taylor A.F., Bahram M., Bates S.T., Bruns T.D., Bengtsson-Palme J., Callaghan T.M. et al. . Towards a unified paradigm for sequence-based identification of fungi. Mol. Ecol. 2013; 22:5271–5277. - PubMed
    1. Blaxter M., Mann J., Chapman T., Whitton T.F., Floyd R., Abebe E.. Defining operational taxonomic units using DNA barcode data. Philos. Trans. R. Soc. B. 2005; 360:1935–1943. - PMC - PubMed

Publication types