We are happy to announce that a new GBIF Backbone has just gone live; it is also available for download as an improved Darwin Core Archive. Here are some facts highlighting the important changes.
Apart from continuously updated sources such as the Catalogue of Life or WoRMS, these are the new datasets we used as sources to build the backbone.
- New Type specimen checklist listing all distinct names of type specimens found in GBIF occurrences, contributing 252,410 new species and 57,410 infraspecific names.
- ZooBank joined GBIF and was added as a nomenclator with 175,775 names, contributing 3,460 new generic and 39,695 new species names.
- Added phylum Myzozoa with 136 families under kingdom Chromista to GBIF Algae Classification to fill the classification gap for Dinoflagellates.
- A tiny new dataset listing species that are named after famous people and often appear in the news.
![Backbone sources][nub-sources.png] The 43 sources used in this backbone build
- Merging of duplicate taxa across kingdoms, especially with taxa from the incertae sedis kingdom. Examples
- Exclude genus & species synonyms for taxa at a higher rank: http://dev.gbif.org/issues/browse/POR-3169
- Restrict name normalisation with double letters to bi/trinomials. Finally the fish Lota lota is a fish again. Examples of other previously wrongly conflated families that have been reported:
- Stable identifiers for pro parte taxa in the backbone.
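To illustrate the idea behind restricting double-letter normalisation to bi/trinomials, here is a minimal sketch. It is not the actual backbone code: the function name `normalise_name`, the regular expression, and the choice to collapse doubled letters only in epithets are all assumptions made for illustration.

```python
import re

# Hypothetical helper, NOT GBIF's actual implementation: matches a run of
# the same lowercase letter repeated two or more times (e.g. "ll").
_DOUBLED = re.compile(r"([a-z])\1+")


def normalise_name(name: str) -> str:
    """Return a comparison key for a scientific name.

    Assumption: doubled letters are collapsed only in the epithets of
    binomials and trinomials (names with 2 or 3 words). Uninomials such
    as genera and families keep their spelling, so two distinct
    uninomials that differ only by a doubled letter are never conflated.
    """
    parts = name.split()
    if len(parts) not in (2, 3):
        # Uninomials (and anything longer) are only whitespace-normalised.
        return " ".join(parts)
    genus, *epithets = parts
    collapsed = [_DOUBLED.sub(r"\1", e.lower()) for e in epithets]
    return " ".join([genus, *collapsed])


if __name__ == "__main__":
    for example in ["Lota lota", "Puma concollor", "Lotta", "Lota"]:
        print(example, "->", normalise_name(example))
```

Under these assumptions, spelling variants of an epithet map to the same key, while a uninomial is returned with its spelling untouched.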
All other fixed issues in the source code that generates the backbone can be found in our Jira epic and GitHub milestone.
The new backbone has a total of 5,887,500 names of which it treats 2,818,534 species names as accepted (up from 5,307,978 and 2,525,274 respectively). More backbone metrics are available through our portal and in more detail through our API.
- 105,296 deleted names, many of them previous erroneous duplicates
- 685,853 new names
- Animalia: 164 families; 6,616 genera; 257,196 species; 87,660 infraspecific
- Archaea: 2 families; 6 genera; 48 species
- Bacteria: 27 families; 225 genera; 2,470 species; 615 infraspecific
- Chromista: 2 phyla; 13 classes; 58 orders; 54 families; 767 genera; 12,124 species; 2,953 infraspecific
- Fungi: 2 families; 269 genera; 8,703 species; 2,993 infraspecific
- Plantae: 3 families; 795 genera; 63,617 species; 33,282 infraspecific
- Protozoa: 4 families; 65 genera; 1,412 species; 280 infraspecific
- Viruses: 8 families; 1,227 genera; 8,488 species
- Unknown: 4 families; 2,708 genera; 13,076 species; 2,237 infraspecific
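As a quick check of the headline totals quoted above, the following sketch computes the absolute and relative growth in all names and in accepted species between the previous and the new backbone. The variable and function names are only illustrative; the four input figures are the ones given in this post.

```python
# Totals quoted in this post: new backbone vs. previous backbone.
names_now, names_before = 5_887_500, 5_307_978
accepted_now, accepted_before = 2_818_534, 2_525_274


def growth(now: int, before: int) -> tuple[int, float]:
    """Return the absolute increase and the increase in percent."""
    diff = now - before
    return diff, 100 * diff / before


names_diff, names_pct = growth(names_now, names_before)
accepted_diff, accepted_pct = growth(accepted_now, accepted_before)

print(f"All names:        +{names_diff:,} ({names_pct:.1f}%)")
print(f"Accepted species: +{accepted_diff:,} ({accepted_pct:.1f}%)")
```

This gives an increase of 579,522 names overall and 293,260 accepted species.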
A very large and detailed log of the backbone build is also available.
The largest taxonomic groups in the backbone, those exceeding 3% of all accepted species, are shown in the following diagram:
All contributors to the backbone, ordered by the number of names for which each source serves as the primary reference:
- 3,330,535 Catalogue of Life
- 685,831 Interim Register of Marine and Nonmarine Genera
- 312,746 World Register of Marine Species
- 309,820 GBIF Type Specimen Names
- 285,859 The Plant List with literature
- 140,937 Fauna Europaea
- 136,981 Index Fungorum
- 134,285 GBIF Backbone Taxonomy
- 126,960 The Paleobiology Database
- 114,089 International Plant Names Index
- 53,848 Integrated Taxonomic Information System
- 44,732 ZooBank
- 30,482 GRIN Taxonomy
- 29,267 Plazi
- 25,749 Artsnavnebasen
- 24,996 Afromoths, online database of Afrotropical moth species
- 15,007 Species Files
- 13,818 Brazilian Flora 2020 project - Projeto Flora do Brasil 2020
- 8,923 [Dyntaxa - Svensk taxonomisk databas](http://www.gbif.org/dataset/de8934f4-a136-481c-a87a-b0b202b80a31)
- 6,807 [DiversityTaxonNames Lists](http://www.gbif.org/publisher/0674aea0-a7e1-11d8-9534-b8a03c50a862)
- 5,696 [Official Lists and Indexes of Names in Zoology](http://www.gbif.org/dataset/80b4b440-eaca-4860-aadf-d0dfdd3e856e)
- 5,317 [Prokaryotic Nomenclature Up-to-date](http://www.gbif.org/dataset/52a423d2-0486-4e77-bcee-6350d708d6ff)
- 4,617 [International Cichorieae Network](http://www.gbif.org/dataset/ded724e7-3fde-49c5-bfa3-03b4045c4c5f)
- 4,611 [Catalogue of Afrotropical Bees](http://www.gbif.org/dataset/da38f103-4410-43d1-b716-ea6b1b92bbac)
- 4,416 [Database of Vascular Plants of Canada](http://www.gbif.org/dataset/3f8a1297-3259-4700-91fc-acc4170b27ce)
- 4,312 [ICTV Master Species List](http://www.gbif.org/dataset/e01b0cbb-a10a-420c-b5f3-a3b20cc266ad)
- 3,874 [The Clements Checklist](http://www.gbif.org/dataset/47f16512-bf31-410f-b272-d151c996b2f6)
- 2,702 [Checklist of Beetles of Canada and Alaska. Second Edition.](http://www.gbif.org/dataset/7a9bccd4-32fc-420e-a73b-352b92267571)
- 1,198 [IOC World Bird List, v6.3](http://www.gbif.org/dataset/c696e5ee-9088-4d11-bdae-ab88daffab78)
- 1,087 [GBIF Algae Classification](http://www.gbif.org/dataset/7ea21580-4f06-469d-995b-3f713fdcc37c)
- 578 [ION Taxonomic Hierarchy](http://www.gbif.org/dataset/8dc469b3-8e61-4f6f-b9db-c70dbbc8858c)
- 272 [Mammal Species of the World](http://www.gbif.org/dataset/672aca30-f1b5-43d3-8a2b-c1606125fa1b)
- 144 [GBIF Backbone Patch](http://www.gbif.org/dataset/daacce49-b206-469b-8dc2-2257719f3afa)
- 39 [Species named after famous people](http://www.gbif.org/dataset/00e791be-36ae-40ee-8165-0b2cb0b8c84f)
- 36 [True Fruit Flies of the Afrotropical Region](http://www.gbif.org/dataset/bd25fbf7-278f-41d6-bc17-9f08f2632f70)
- 7 [Backbone Family Classification Patch](http://www.gbif.org/dataset/6e4c3b6f-0126-4c5f-bd63-fe6ffd3b29fa)
- 7 [TAXREF](http://www.gbif.org/dataset/0e61f8fe-7d25-4f81-ada7-d970bbb2c6d6)
With a new backbone we have reprocessed all of our 712 million occurrences.
The distribution of the major taxonomic groups exceeding 3%, i.e. those with at least 36,800 species, is shown in this last diagram:
The 1,226,520 accepted species in GBIF occurrences (140 fewer than before) represent 44% of all accepted backbone species.
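The two occurrence figures above follow from simple arithmetic, sketched here with illustrative variable names. The 3% threshold is taken relative to the accepted species found in occurrences, which is an assumption that matches the rounded 36,800 quoted in the text.

```python
# Figures quoted in this post.
accepted_in_occurrences = 1_226_520
accepted_in_backbone = 2_818_534

# Share of accepted backbone species that appear in GBIF occurrences.
share_pct = 100 * accepted_in_occurrences / accepted_in_backbone

# Minimum number of species a group needs to exceed the 3% cut-off,
# assuming the cut-off refers to the species found in occurrences.
threshold = 0.03 * accepted_in_occurrences

print(f"Share of backbone species in occurrences: {share_pct:.1f}%")
print(f"3% threshold: about {threshold:,.0f} species")
```

The share comes out at roughly 43.5%, which the post rounds to 44%, and the threshold is just under 36,800 species.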