The UNITE database for molecular identification of fungi: handling dark taxa and parallel taxonomic classifications

Nucleic Acids Res. 2019 Jan 8;47(D1):D259-D264. doi: 10.1093/nar/gky1022.

Abstract

UNITE (https://unite.ut.ee/) is a web-based database and sequence management environment for the molecular identification of fungi. It targets the formal fungal barcode-the nuclear ribosomal internal transcribed spacer (ITS) region-and offers all ∼1 000 000 public fungal ITS sequences for reference. These are clustered into ∼459 000 species hypotheses and assigned digital object identifiers (DOIs) to promote unambiguous reference across studies. In-house and web-based third-party sequence curation and annotation have resulted in more than 275 000 improvements to the data over the past 15 years. UNITE serves as a data provider for a range of metabarcoding software pipelines and regularly exchanges data with all major fungal sequence databases and other community resources. Recent improvements include redesigned handling of unclassifiable species hypotheses, integration with the taxonomic backbone of the Global Biodiversity Information Facility, and support for an unlimited number of parallel taxonomic classification systems.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Computational Biology / methods*
  • DNA Barcoding, Taxonomic / methods*
  • Databases, Nucleic Acid*
  • Fungi / classification*
  • Fungi / genetics*
  • Genome, Fungal*
  • Genomics* / methods
  • Software
  • Web Browser