Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Global Coordination and Standardisation in Marine Biodiversity through the World Register of Marine Species (WoRMS) and Related Databases

Abstract

The World Register of Marine Species is an over 90% complete open-access inventory of all marine species names. Here we illustrate the scale of the problems with species names, synonyms, and their classification, and describe how WoRMS publishes online quality assured information on marine species.

Within WoRMS, over 100 global, 12 regional and 4 thematic species databases are integrated with a common taxonomy. Over 240 editors from 133 institutions and 31 countries manage the content. To avoid duplication of effort, content is exchanged with 10 external databases. At present WoRMS contains 460,000 taxonomic names (from Kingdom to subspecies), 368,000 species level combinations of which 215,000 are currently accepted marine species names, and 26,000 related but non-marine species. Associated information includes 150,000 literature sources, 20,000 images, and locations of 44,000 specimens. Usage has grown linearly since its launch in 2007, with about 600,000 unique visitors to the website in 2011, and at least 90 organisations from 12 countries using WoRMS for their data management.

By providing easy access to expert-validated content, WoRMS improves quality control in the use of species names, with consequent benefits to taxonomy, ecology, conservation and marine biodiversity research and management. The service manages information on species names that would otherwise be overly costly for individuals, and thus minimises errors in the application of nomenclature standards. WoRMS' content is expanding to include host-parasite relationships, additional literature sources, locations of specimens, images, distribution range, ecological, and biological data. Species are being categorised as introduced (alien, invasive), of conservation importance, and on other attributes. These developments have a multiplier effect on its potential as a resource for biodiversity research and management. As a consequence of WoRMS, we are witnessing improved communication within the scientific community, and anticipate increased taxonomic efficiency and quality control in marine biodiversity research and management.

Introduction

The taxonomic challenge

Taxonomy, the science of discovering and naming species, must have been one of the earliest human activities. Names are given to species when they are recognised as distinctive and important to human culture, whether because of their value for food, ecology (e.g. habitat forming), recreation, potential hazards they may pose, and as objects of admiration. Today, biological diversity is threatened with mass extinction due to climate change, over-hunting, species introductions (especially to islands), and habitat loss [1], [2], [3]. Indeed, some authors worry that the rate of species extinction is exceeding their rate of scientific description [4]. Species are the fundamental practical units of biology, and thus the accurate naming of species is critical for all biology [5]. However, progress in their description and classification is slower than for elements in chemistry and particles in physics simply because there are thousands times more species than of these non-biological units. Thus a major challenge in taxonomy is to accelerate the process of species discovery.

Local and regional species checklists are in demand for conservation and fisheries management, ecological surveys, and training in marine ecology and environmental management. However, these lists are inevitably compromised by either by not being updated by experts, by inheriting past misuse of names, by using the same name for dissimilar species in different locations, by using differing names for the same species in different regions, or, finally by combinations of these problems. The simplest solution to this confusion would be a single authoritative world register routinely updated by experts that is freely accessible on the World Wide Web. The lack of such a world register partly reflected the local and regional focus of biology in the past. It also reflected the high diversity of species, and the hundreds of publications in which they are described, all problems that made collating a checklist beyond the capability of even a modest group of scientists.

Without standardised names for species, the management and use of biodiversity is compromised [6]. Even within different languages and countries, species may have different common or vernacular names, and the same names applied to different species. For example, the ‘common blue’ is a damselfly, a butterfly or a thistle in the UK. The ‘green sea urchin’ is Psammechinus miliaris (Müller, 1771) in the North-East Atlantic, but in the North-West Atlantic it is the commercial species Strongylocentrotus droebachiensis (Müller, 1776); note that the latter was not only described from Europe, but remains common there.

Linnaeus's binominal system for naming species in Latin, taken to have commenced in 1753 for Botany and 1758 for Zoology [7], overcame the problem of vernacular names in different languages but introduced its own set of problems. In the Annual checklist of the Catalogue of Life (CoL) [8] the name vulgaris occurs 1,106 times and is used for many plants (including seaweeds, conifers, legumes), insects (including flies, aphids, weevils, fleas, grasshoppers, lepidopterans, wasps), octopus, starfish, crustaceans, bacteria, viruses, fish, and reptiles. Even when synonyms are excluded, it occurs 382 times as ‘Accepted Names’. Similarly, virginea is the specific epithet of a rush (plant), mollusc, sea squirt, fly, weevil, butterfly, and several fungi, and occurs 92 times and 52 times under Accepted Names. The same words have been used for different genera, e.g. Morus is a genus of marine bird (the gannet) and the mulberry plant; Crepis a genus of Bryozoa and a composite plant; Sphenopus is a zoanthid (Cnidaria: Anthozoa: Hexacorallia: Zoantharia: Sphenopidae) and a plant; and Ficus is a genus of gastropod and fig tree. Other names are used as both genus names and specific epithets, e.g. the name Veronica is a genus of plants (speedwells), and the specific epithet of a species of butterfly and a legume. Generic names from different kingdoms can also be similar: Cantharellus is a genus of mushrooms (terrestrial fungi belonging to the Basidiomycota: Agaricomycetes: Cantharellales: Cantharelaceae) but also of mushroom corals (Cnidaria: Anthozoa: Hexacorallia: Scleractinia: Fungiidae); Turbinaria is a scleractinian genus (Dendrophylliidae) occurring on Indo-Pacific coral reefs with a genus of brown alga of the same name (Ochrophyta: Phaeophyceae: Fucales: Sargassaceae). Species can also be named for people and geographic places, further complicating searches for information unless they are clearly context specific. In these cases, confusion is usually avoided because the genus name must always be used in combination with the ‘specific epithet’ and it is unusual for the same genus and specific epithet to be combined (but see below under Homonyms). However, unintentionally, species have often been given more than one scientific name, or the same name may have been used for more than one species, a species may have been described in one genus and later moved to one or more other genera, or often names are misspelled.

Choosing the correct name is governed by international codes, the International Code of Zoological Nomenclature [9], [10], the International Code of Nomenclature for algae, fungi, and plants [11], [12] and the International Code of Nomenclature of Bacteria [13]. Future discoveries often find that what was once considered one species is now several, so the application of a name may change over time, and it may be re-classified. New genus assignations can confuse users because it appears to be a new name often for a well-known species. The rules of nomenclature also require the specific epithet to agree in gender with the genus, so a change in genus may mean that the ending of the specific epithet changes (e.g. –um to –a). Additionally, the higher classifications of life have significantly changed in recent decades owing to discoveries of relationships. New kingdoms and phyla have been recognized and more than one phylum has been merged to another (e.g., the formerly recognized phyla Vestimentifera, Pogonophora, Echiura and Sipuncula are now included in the phylum Annelida), and groups of species re-allocated within classes, orders and families. For example, the Microsporidia were transferred from the protists (protozoans), or animals, to the fungi [14], [15], [16]. Changing species names, especially reclassification, is not a fault of the system but reflects the nature of discovery. Indeed, we may know most species in Europe [17], [18] (but see [19]), and amongst vertebrates and higher plants, but one third to four fifths of all species may remain to be described [20], [21], [22], [23]. Thus we expect more species to be discovered, species reclassified into different genera and families, and some currently recognized species to be synonymised.

To further standardise species nomenclature, all new bacteria species must be described in a particular journal [13], [24], and from 2013 scientific names of fungi will have to be registered in a recognized repository (e.g., MycoBank) [25]. In contrast, animal and plant species can be named in any print publication and no mandatory register of names exists. Having an online inventory of all accepted species names is an essential precursor to such a registration system for animal and plant names. The International Plant Names Index provides such a register for flowering plants [26] of which few occur in the ocean. The International Commission on Zoological Nomenclature, which is responsible for the International Code of Zoological Nomenclature, has established ‘ZooBank’, as an online registration system for animal names [27], [28], [29]. A further opportunity is for zoologists to standardise the nomenclature of particular taxa by restricting availability of names to a ‘List of Available Names’, as proposed for the 3,570 names in the Phylum Rotifera [30]. This could help taxonomy by making names applied to uncertain species (e.g., species poorly described and/or without type specimens) unavailable and thus no longer usable. Already having an expert validated list of species names is a prerequisite for such an initiative.

Synonyms

Synonyms arise where different specimens that later are found to be the same species have been given different names, i.e. subjective (in zoology) or heterotypic (in botany) synonyms. The fraction of junior synonyms has been reported to be: 7 to 80% (32% overall) in different insect orders and families [31], [32]; 37% for molluscs [33]; 81% in European freshwater fish [34]; 27% for fossil North American mammals [35]; 33% to 88% for groups of seed plants [36], [37]; and 50% for marine fish [38]. At first, it seems that the most popular taxa, which are most intensively studied and by most people, have more synonyms. However, it is possible that similar proportions of synonyms occur in other taxa that are less well studied. Furthermore, some of these taxa may be very species-rich. The only way to discover these problems is for specialists to revise the taxonomy of each group of species, including re-examining type specimens, usually more thoroughly describing species (including genetic analysis) to avoid future confusion. A first step in a taxonomic revision is to review a list of species named and ask whether some may be synonyms.

Synonyms can also be discovered for taxa above the species level, resulting in changed classifications of species. For example, Johnson et al. [39] found that three families of fish, two known only from the deep-sea (>1,000 m), namely (1) bignose fish (Megalomycteridae Myers & Freihofer, 1966), (2) whalefish (Cetomimidae Goode & Bean, 1895), and (3) the shallow-water (<200 m) hairy and tape-fish (Mirapinnidae Bertelsen & Marshall, 1956), represented males, females and juveniles of just one family. Thus two families were subsumed as synonyms of the first described family. However, synonyms are more common at the species level. Male and female cuckoo wrasse look very different and Linnaeus described them under two different names in the same book, namely Labrus mixtus Linnaeus, 1758 and L. bimaculatus Linnaeus, 1758, and until recently both names were in common use. The distinctive and widely known sperm whale Physeter macrocephalus Linnaeus, 1758 has been described as 19 different species: three each from Linnaeus in 1758, Bonaterre in 1789, Lacépède in 1804, and Gray in 1846, 1850 and 1856; two from Borowski in 1780; and one each from G. Cuvier, Kerr, Desmoulins, Fleming and Risso [40]. However, even when scientists have clarified synonyms, old names still exist in the past literature so a reader needs to know which names may have been used for a species. One of the most popular fish in research and aquaculture, the rainbow trout, was known as Salmo gairdneri Richardson, 1836, but is correctly named Oncorhynchus mykiss (Walbaum, 1792), an older available name. A search of Google Scholar in 2009 found 39,000 citations of the incorrect name and 18,000 of the correct one; in 2012, 38,900 and 60,600 hits respectively (276,000 and 1,050,000 in Google). A sponge widely used in medical research into cell biology and cancer is widely named Microciona prolifera (Ellis & Solander, 1786) but should be called Clathria prolifera (Ellis & Solander, 1786). In this case, the species epithet is unchanged but the genus to which it belongs has been changed. Thus, information about this species needs to be sought under both names.

Homonyms

Homonyms are identical accepted names applied to unrelated species. Examples are Paridotea munda Hale, 1924 and Paridotea munda Nunomura, 1988, both similar isopods, one in Australia and the second in Japan. As yet, a replacement name has not been proposed for the second usage. Homonyms exist within marine species, and between marine and non-marine species (Table 1). In many cases the names can be distinguished if the authority and year of description are included in the citation. Thus most journals require that the species name includes the authority and year of publication.

thumbnail
Table 1. Examples of the same names being used for different species (including a marine species) found by Rees [114].

https://doi.org/10.1371/journal.pone.0051629.t001

The same name may be used for different genera. Marine examples include Duplicaria Dall, 1908 [Gastropoda] and Duplicaria Vine, 1972 [Polychaeta]; Luetkenia Duncan, 1878 [Ophiuroidea] and Luetkenia Claus, 1864 [Copepoda]; and Acanthopharynx Marion, 1870 [Nematoda] and Acanthopharynx Reisinger, 1924 [Platyhelminthes]. Replacement names must be proposed for the more junior name if they occur with the same code of nomenclature. One example is the case of Singula Blazewicz-Paszkowycz, 2005, a new name for the tanaidacean Singularia Blazewicz-Paszkowycz, 2005 and Biuncus Huys, 1995 a replacement name for Singularia Huys, 1995, both preoccupied by Singularia Arenberger, 1988, a moth.

Some accepted species names may be so similar to each other that they resemble misspelled homonyms and may cause confusion as well, such as the solitary ascidians Polycarpa aurata (Quoy & Gaimard, 1834) and P. aurita (Sluiter, 1890), or the shrimp genera Allopontonia Bruce, 1972, and Altopontonia Bruce, 1990. If they are all included in a common database then these distinctions become more apparent and reduce confusion. Thus to find information on a species one needs to know which names may be in fact referring to the same species. When a comprehensive review of a species is undertaken, a search on synonyms, misspellings and homonyms is required.

Misspellings

Misspellings abound in the literature and are perpetuated because authors neither check the original descriptions nor even validated lists of names when available. Some misspellings are not surprising considering the similarities and peculiarities of some accepted species names. For example, Acipenser oxyrinchus Mitchill, 1815, Amblycirrhitus oxyrhynchos (Bleeker, 1858), Cheilinus oxyrhynchus Bleeker, 1862, Arnoglossus oxyrhynchus Amaoka, 1969, Coregonus oxyrinchus (Linnaeus, 1758), Cestraeus oxyrhyncus Valenciennes, 1836, Dipturus oxyrinchus (Linnaeus, 1758), Facciolella oxyrhyncha (Bellotti, 1883) and Himantura oxyrhyncha (Sauvage, 1878), Rhynchopelates oxyrhynchus (Temminck & Schlegel, 1842) are nine species of fish; and the amphipod Westwoodilla oxyrhyncha Bulycheva, 1952 stomatopod Raoulserenea oxyrhyncha (Borradaile, 1898) and decapod Oxyrhynchaxius Parisi, 1917 comprise two species and one genus of crustaceans with similar names. With such similar-sounding and -spelt specific epithets it is no wonder that misspellings abound in the literature. A common misspelling for the Atlantic sturgeon Acipenser oxyrinchus Mitchill, 1815 is Acipenser oxyrhynchus, and similarly for houting Coregonus oxyrinchus (Linnaeus, 1758) is C. oxyrhynchus. What all these species have in common is that they have something that could be called a pointed “nose”, which the original describers found so striking that they named their species for it.

Another issue is that spelling errors from the literature may be entered into databases, perhaps the error may occur during data entry, and then perpetuated, sometimes unknowingly, but on other occasions intentionally. For example the spelling error Ammothea sextarticulata (instead of Ammothea sexarticulata Munilla, 1990) was first published in 1994, and was entered as such in WoRMS in 2005. It was later corrected by the taxonomic editor but has already had 410 Google hits, whereas the correct spelling has had only 118 (checked 19 April 2012). In this case, the incorrect spelling is present in several online databases and continues to perpetuate in the literature; even the original author used the misspelling in 2008 [41]. To enable tracking of such errors, a taxonomic database should retain all published spellings but indicate which are in error.

Economic consequences

The problems arising from incorrectly applying species names are not only of academic interest but have economic and conservation consequences. A species must have a scientific name to be included in the IUCN Red List which assesses the conservation status of species. Failure to correctly name pests and pathogens has resulted in wasted control measures [42], [43]. A major problem in tracking the status of fish populations is that catches are often mislabelled owing to reporters being unaware of related species and their correct names. FAO (UN Food and Agriculture Organisation) produced species identification guides so countries could better identify, and thus report, actual catches by species; instead of just listing ‘shark’ for example which could refer to any of hundreds of species. This correction resulted in an improvement from 46% to 95% of catch being reported at species level [44]. In Europe, five species of large skates have been landed under two species names, so the status of the stocks was unknown [45]. One species, the well-known European common skate, previously known as Dipturus batis (Linnaeus, 1758), became locally extinct in parts of Europe owing to overfishing but was recently proposed to consist of two previously described but synonymised species, D. flossada and the flapper skate, D. intermedia; the conservation status of both is now unclear [45]. The European sturgeon Acipenser sturio Linnaeus, 1758 is near extinction in Europe. It was assumed that it was the only sturgeon species in Europe, but examination of museum records found that sturgeons from the Baltic Sea, now extinct, were A. oxyrinchus which survives in NE America [46]. Thus, the species could be restocked to the Baltic. Many more cases of the importance of correct identification and naming of species are provided on the BioNET website.

Biodiversity informatics

Several initiatives to better organise species names have been undertaken. In the early 1990s, van der Land [47] began to list species names through contacting experts and published the UNESCO-IOC Register of Marine Organisms (URMO) on diskette. In 1972 in the USA, NOAA's National Ocean Data Center developed a list of marine species names with code numbers, the NODC Taxonomic Code. This became part of the Integrated Taxonomic Information system (ITIS) in 1996 (http://www.nodc.noaa.gov/General/CDR-detdesc/taxonomic-v8.html). In 1997, Frank Bisby and colleagues launched a global effort called Species 2000 to link together and publish Global Species Databases (GSDs) on the internet, and subsequently also as the ‘Annual Checklist’ on CD-ROM. Most of the GSDs had not been previously published on the internet but sat on individual scientists' computers. The application of information technologies (IT) to biodiversity data, called biodiversity informatics, enables international collaboration and data management to be fast at low cost [48], [49], [50], [51], [52], [53].

The compilation of species names is aided by the ability of computers to search names from the literature and other databases [6]. Indeed, several important compilations of names exist, such as the Index of Organism Names (ION) which includes Zoological Record (www.organismnames.com) and the Global Names Index (http://gni.globalnames.org/). The latter now has 20 million name strings but this represents about 1.5 million accepted species when as yet unrecognised synonyms are accounted for [22]. ION has 1.5 million names and 1.2 million species and subspecies gathered from publications it regularly checks. Neither resource is revised by taxonomic experts so the validity of the names is not known. Gathering and classifying such names is essential, but finding the correct name to use for each species is more difficult. The same names may be used for an animal, plant or bacterium but because each of these groups is subject to different codes of nomenclature they are not considered homonyms.

Resolving taxonomic issues requires informed individuals who understand how the problems have arisen, know the rules and the literature well and have access to type specimens. The diversity of species limits the knowledge of any one expert to a particular taxon, sometimes with hundreds to thousands of species, and often only to the representatives of that taxon in a particular environment (e.g. marine) or geographic area. Thus, it takes many experts to cover all species, and some less popular or economically unimportant groups may have few or no experts. Species have been described in thousands of journals and books, so gathering the literature has also been expensive and time consuming. Here again the internet can help; for example by getting the old literature online, as underway by the Biodiversity Heritage Library. Not all species were well-described, especially those recognized early in the 19th century. Accurately applying species names often requires physical examination of the type specimens in a natural history museum or herbarium collection and their re-description. Knowing where these type specimens are located and accessing them is time-consuming and sometimes impossible. Thus Moretzsohn [54] proposed a special online database called TaxonBank, to register the location and other details of type specimens. The Australian Faunal Directory [55] includes type specimen information. Such a resource is needed for all species.

Scientific natural history museums and herbaria are depositories for reference collections of botanical, zoological, and paleontological specimens used in taxonomy and other life science disciplines. Synonymies are difficult to establish without reference to type specimens. These are kept in such collections and are accessible for that purpose [56]. Museum collections store specimens with collection data indicating locality and date of sampling. When there is uncertainty about species records if similar species are involved (including sibling species), then the study of museum specimens may yield solutions. This is also relevant when species have become locally extinct and past distribution ranges have to be reconstructed as for the Baltic Sea sturgeon [46]. Species that were believed to be endemic and became locally extinct would be considered globally extinct. However, they may be rediscovered in recent collections from elsewhere in the world indicating that they are still extant and that their status as endemics was erroneous [57]. The history of populations of non-native species reaching pest proportions in certain areas may also be traced back with the help of specimens deposited in museums. Comparisons of species lists of recent surveys and historical collections of the same areas, like in the proximity of large cities such as Jakarta or Singapore, may indicate that species have disappeared from their local faunas [58], [59], [60]. Thus these collections can be used to re-establish baselines in the context of historical ecology. They can also be increasingly important if they contain material from protected areas where species are not allowed to be sampled anymore [61]. It is the combined, complementary availability of marine biological collections worldwide that makes them useful for global change studies, which is enhanced as data pertaining to such specimens are made available in digital electronic form [56], [62]. Thus an online resource that indicates the location of specimens will aid researchers in correctly naming, identifying and classifying species; and improve quality control in taxonomy. For example, in the Swedish Museum of Natural History a Department of Biodiversity Informatics has been established which, amongst other things, will manage information about the collections.

Many authors have argued that the management and quality control of taxonomic and biodiversity data requires an online register of species [28], [63], [64], [65], [66], [67], [68]. However, there are practical limitations to what a group of scientists can achieve with limited resources. Providing a full web-based taxonomy, including expert-validated species nomenclatures and information on all species, is beyond the scope of a few scientists. However, clusters of scientists can contribute the parts of the ultimate resource, which is exactly what was achieved with the European Register of Marine Species (ERMS) [69], the Gulf of Mexico biodiversity inventory [70], AFD since the 1980s [55], and the New Zealand inventory of biodiversity [71], [72], [73]. In 1997–1999, ERMS was published on the internet and subsequently as a book [69]. This was notable in (a) bringing together over 170 experts to pool their knowledge on what species occurred in European seas into one database, (b) legally establishing the Society for the Management of Electronic Biodiversity Data (SMEBD) to hold the Intellectual Property Rights (IPR) of the contributors and thus facilitate the systems succession planning, and (c) having all the content in one standardised database [69], [74]. In 2000, the A. P. Sloan Foundation launched the Census of Marine Life (CoML), a decade of globally coordinated discovery in marine biology. CoML established an Ocean Biogeographic Information System (OBIS), which published species distribution data over the internet. This used a similar standard to, and is the largest marine contributor to the Global Biodiversity Information Facility (GBIF), established in 2001. These initiatives, and the increased use of databases to manage biological data, increased the demand for a standard checklist of marine species names and their relationships to synonyms. Following the completion of its start-up project, ERMS became hosted by a professional marine data centre at the Flanders Marine Institute (VLIZ). This provided the computing and data management infrastructure and support team on which to expand ERMS to become a World Register of Marine Species (WoRMS), and SMEBD provided the community of experts who invited colleagues to expand the content [51], [74], [75]. WoRMS was thus independent of, but collaborated with and contributed to, CoML and Species 2000. In this paper, we report how WoRMS has become an established part of the global biodiversity infrastructure, and is playing an increasingly important role in taxonomic data management.

Methods

Expert community

WoRMS editors were selected by their peers through knowledge of their publications and expertise in a taxon. The advantage of this approach was that the best known and most senior experts were first involved. They provided leadership and example to younger researchers and the wider community. However, the editors were encouraged to invite their colleagues to spread the workload and provide succession, including young researchers who may be more comfortable with using online databases for publication. Engaging potential editors was greatly helped by personal relationships and contacts at scientific meetings. In particular, the frequent workshops and meetings of the Census of Marine Life significantly helped such interactions, and most of the WoRMS Steering Committee (SC) members were involved in CoML. Two special WoRMS editors workshops have been held to determine policy and direction (Figure 1), but most coordination has been by email.

thumbnail
Figure 1. Some of the WoRMS editors at workshops in Ostend in 2008 (upper panel) and Aberdeen in 2011.

https://doi.org/10.1371/journal.pone.0051629.g001

Communication

The website is the primary method of communication. It includes News items which provide a history of WoRMS progress with links to further documents (e.g., reports of meetings), and Twitter feed with brief news items. Users can sign up to RSS feeds that will notify them of updates to WoRMS content. An email list provides the primary method for communication with editors.

Host institution

Another important factor in attracting editors is the security a professional data centre provides for the continuity of the database. The data centre provides a database support team, so that changes in staff do not interrupt WoRMS development, and can provide 24/7 support, archiving, and professional IT system design and management. The host institution, VLIZ, is a leading Ocean Data Centre within the Intergovernmental Oceanographic Commission's (IOC) International Oceanographic Data and Information Exchange (IODE) programme, and a certified member of the World Data System of the International Council of Scientific Unions (ICSU). It finds the WoRMS database invaluable in its wider data-management activities. Thus it can support WoRMS as it supports other projects.

Cost

The development of WoRMS, including ERMS since 1997, is estimated to have involved about €2 million in project funding for IT, editors' time, and meetings. However, the in-kind cost of SC members and editors' time directly involved in WoRMS is estimated at over €3 million. At present, the effort is equivalent to two full-time staff at the host-institution and similar in-kind effort by the editors, so including allowance for additional expenses, including overheads, a total annual cost of about €500,000 is estimated.

Role of SMEBD

SMEBD was established to hold the Intellectual Property Rights of the ERMS, the precursor to WoRMS. The WoRMS SC was established within SMEBD to manage WoRMS. All contributors to WoRMS have the right to become honorary life-members of SMEBD. WoRMS editors nominate and elect people to the SC. As a legal entity and holder of the contributors' IPR, SMEBD has a key role in formally approving the host institution of the database, how it is disseminated, negotiating exceptional uses of the database, and following up on misuse of the data. For example, SMEBD successfully had a book withdrawn from publication because it had largely republished a WoRMS GSD without attribution of the source. SMEBD can also act as a contractor in research projects and manage their finances. It has been a full partner in two European Commission research contracts. SMEBD thus provides the governance for WoRMS. Its legal incorporation in Ireland requires it to have a detailed annual audit, and limits the financial liability of its Directors and members from any claims made against them in relation to the activities and assets of SMEBD. In contributing to the database, past and present, the editors have agreed to voluntarily provide data, information, opinion, or other expert assistance to the database. They retain the right to use and publish any data and intellectual property created by themselves, but authorise SMEBD to store, compile, modify, revise, and disseminate the data provided and derived by any means (e.g. electronic, World Wide Web, book). This includes appointing new editors who may add to and modify the original contributions of previous editors. They recognise that products of the database are the copyright of SMEBD, and they exercise control over the databases through election of the SMEBD Council. The WoRMS SC is elected from members nominated by its editors (SMEBD members).

Content

The minimum requirement for WoRMS is an accepted full species name (i.e. accepted combination of genus, specific epithet, author, year) placed in an accepted higher taxon group (at least family) and environment (e.g. marine, brackish, terrestrial and/or freshwater). Desirable additional information is original genus-species combination (called basionym in plants), alternative past combinations, junior (subjective or otherwise) synonyms, key literature (ideally a link to the original publication), location of type material, and type locality. However, some species pages include considerable additional information, from biology to distributions and images. A system to label species fossil status and time stratigraphy is being added. Considerable data is entered by assistants, some at the host institution and others at editors' offices. This content is ‘quarantined’ until it is approved by the appropriate taxonomic editor.

Citability

We recognised the importance of making the editors responsible for WoRMS visible on the web pages for two reasons. First, doing this indicates the authority behind the database content. Second, it was recognised that the editors wished their work to be recognised and attributed to them. We thus follow the well-established method of citing publications [76]. Each species and higher-taxon page has a citation at the foot of the page. Thus a user is expected to cite the species page, a higher-taxon page (e.g., Amphipoda), a GSD or the database as a whole, that is [77] depending on how they use it. WoRMS may be the first online biodiversity database to provide multiple levels of citation.

Glossary

In preparation for the further expansion of the content to include ecological information, a glossary has been developed by a group of ecologists, geologists and taxonomists [78]. This is the first step to provide consistent definitions for use within WoRMS, i.e. a controlled vocabulary. This glossary is a collaboration between the scientists of the GEOHAB (Marine Geological and Biological Habitat Mapping) and WoRMS communities. It is authoritative in that definitions are approved by scientists who are well-established in the subject areas; peer-reviewed by both prior approval of experts and exposure to feedback from users; open-access (freely available online) for others to use; transparent by contributors and persons responsible being acknowledged; expert controlled by a small editorial group that approves changes to the definitions; and participatory in encouraging users to criticize definitions and suggest additional terminology for inclusion. The glossary can be expanded as users demand and experts are willing; modified based on feedback and changing use of terminology; contributes to data management by providing definitions for use of terminology in databases, and assists the development of ontologies that relate terms to each other. It is permanent with editors being replaced as their availability changes and new expertise is desirable; and contributes to associated initiatives including the Encyclopedia of Life (EoL), CoML, WoRMS, GBIF, OBIS, and IODE of IOC. It does not intend to provide a review or history of all uses of particular terms, nor how they may be used in other fields of research. However, a further development may be to make relationships between terms apparent in a ‘semantic ontology’. The definitions are those recommended for use in marine biology, ecology and geology. Where a term has different uses that the editors feel require clarification, these will be included. At present, this glossary excludes terminology specific to the following areas: names of marine species and higher taxa as these are in WoRMS; place names (see gazetteers at www.vliz.be/vmdcdata/vlimar and www.gebco.net/data_and_products/undersea_feature_names); taxonomy; physiology; archaeology; fisheries; legal and regulatory terms; and acronyms.

Higher classification

The WoRMS editors determine the classification within the taxa for which they are responsible. However, the overall higher classification needed a standard approach to simplify data management. The first WoRMS editors' workshop discussed a proposal for a classification of Animalia to aid data management [79]. This, and the modification of all the other eukaryote kingdoms, has been adopted by the Species 2000 Catalogue of Life (CoL) [80], with a rationale explained by Gordon [81], and is implemented in WoRMS. The use of a common classification greatly aids data exchange. Its principles include: adoption following discussion with experts and consensus building; not implementing proposals for modifications to classification until there has been a year or two for them to be discussed by the taxonomic community; and only altering the classification at perhaps 5-year intervals. This conservative approach is designed to provide stability for data management, and so users do not get confused by new classifications and terminology.

Infrastructure

The WoRMS data are stored in a relational MS SQL 2008 database called Aphia. An MS Access front end is built for administration purposes to control edit rights and perform quality control. The database contains over 440 fields, of which accepted species name is the most complete (100%). These fields are organised into 79 related tables described on the website at http://www.marinespecies.org/structure.

AphiaID provides a unique and permanent number for every species name within WoRMS (e.g. AphiaID 127160). It enables users to match up names in their databases with future versions of WoRMS, particularly where the status of a name may have changed (e.g. become a synonym) or the classification of the species may have changed. The AphiaID is included within the WoRMS LifeSience Identifier (LSID, http://sourceforge.net/projects/lsid), which is an implementation of a persistent Globally Unique Identifier (GUID). An example of a GUID is urn:lsid:marinespecies.org:taxname:127160. In addition, these LSIDs are resolvable and that they can produce structured taxonomic information in RDF (Resource Description Framework) format.

The editorial board has direct access to the database via a PHP (Hypertext Preprocessor) web interface. If editors prefer to work off-line they can use an MS Excel template, which is often also used for bulk updates. The WoRMS website is running on an Apache2 windows server, which backs up the data on a daily basis. The entire database is archived each month and users can download previous versions upon request. Copies of the database can be downloaded by organisations or individuals following approval by SMEBD. This involves completing a request form in which the recipient agrees not to further distribute the database or make it available online. These limitations are to avoid multiple or corrupted versions appearing on other websites, and to encourage users to contact WoRMS directly.

Distribution maps

WoRMS stores published species distributions by using location names. The status of the location name (including different spellings and languages), coordinates, shapefiles, and geographic hierarchy is provided by linking to the VLIZ Marine Gazetteer (VLIMAR, http://www.vliz.be/vmdcdata/vlimar). The coordinates and shapefiles can be used to build species distribution maps, as currently implemented on the sponge database [82]. Maps are built using OpenLayers (www.openlayers.org), an open source javascript library to display dynamic maps in any web page. The back-end of both occurrence types is GeoServer (www.geoserver.org), an open source implementation of WMS that implements the Open Geospatial Consortium (OGC) standards.

Photogallery

The WoRMS image library is a user-controlled facility for the upload and display of images adjusted for online publication (i.e. 800 px, 72–96 DPI) [83]. It is not necessary to create an account and log-in to do so, but account holders have edit privileges. It automatically resizes the image while storing the original size, to a 800 px wide ‘thumbnail’ image. If permitted, the original size can be provided online or made publicly available upon request. Video files can also be stored and displayed. The user must add some minimal metadata: including title, author, email, keywords; and terms of use (e.g. Creative Commons licence). It can automatically read embedded camera capture metadata (i.e. exif, gps) from uploaded pictures. The keywords are part of a controlled vocabulary and multiple entries are possible. A drop-down list of taxonomic names avoids users entering misspellings.

Editors can link images to specimens, which can have additional metadata (e.g. details on code number, storage, identification, locality, biology etc). Because many images are not uploaded by the taxonomic editors, the species they contain may not be correctly identified or the image may not be of sufficient quality for species recognition. Thus whether or not the image has been verified by an editor is indicated.

Images can then be searched on species name, title, author and other keywords. There is an option to allow users to provide comments, which are moderated by the database administrator. Because they are associated with species names, the images are thus available to all Regional Species Databases in WoRMS, and can be accessed by external organisations, such as the Encyclopedia of Life. The number of times an image has been viewed is tracked.

Results

Content

In 2012, WoRMS contained almost 100 global, 12 regional and 4 thematic species databases overseen by 240 editors (Tables 2, 3). The editors are located in 133 institutions and 31 countries (Table 4). Of the GSDs, 22 have their own entry web page which provides scope for the editors to provide additional background and profile for their taxon (Table 2). Regional Species Databases (RSDs) cover less than half the oceans (Figure 2), but additional RSDs are planned. The RSD editors add distributional context to WoRMS, and work with the GSDs taxonomic editors to resolve nomenclatural discrepancies and omissions. Some editors are involved in a GSD, RSD and/or Thematic Species Database (TSD).

thumbnail
Figure 2. The approximate present geographic coverage of the larger Regional Species Databases (i.e. all-species inventories) within WoRMS.

https://doi.org/10.1371/journal.pone.0051629.g002

thumbnail
Table 2. The Global Species Databases hosted within WoRMS. Those with their own web entry page are underlined.

https://doi.org/10.1371/journal.pone.0051629.t002

thumbnail
Table 3. The (a) Regional Species Databases (RSD) and (b) Thematic Species Databases (TSD), hosted within WoRMS, and their editors.

https://doi.org/10.1371/journal.pone.0051629.t003

thumbnail
Table 4. The countries and institutes represented by the editors of WoRMS and its associated databases. These are mapped at http://www.marinespecies.org/imis.php?module=gmap&spcolid=507.

https://doi.org/10.1371/journal.pone.0051629.t004

One principle in setting up WoRMS was to not ask taxonomists to repeat their work. Thus WoRMS built on authoritative registers of all-taxon marine species lists that existed at regional levels (e.g. Europe) and for particular taxa at global levels. Several GSDs were incorporated into WoRMS, including the world databases on all marine, freshwater and terrestrial Copepoda and Isopoda developed at the Smithsonian Institution, and world databases on Cumacea, Brachiopoda and Phoronida. Externally sourced content is from the collaborating databases: Biogeoinformatics of the Hexacorals (sea anemones and their relatives), World list of marine Fungi (from Index Fungorum), World list of Marine Pisces (from Catalog of Fishes via FishBase), World list of Algae (from AlgaeBase) [84], World list of free-living Nematodes – NeMys, World list of Marine Rotifers (from FADA), World list of marine reptiles (From Reptile Database), World list of Turbellaria (from Turbellarian Taxonomic Database), World list of Recent and Fossil Bryozoa, and the World list of Ctenophora. WoRMS is updated by content from these scholarly resources, and, in turn WoRMS provides its content and/or services to other resources that might otherwise need to recreate it.

At present, WoRMS contains 460,000 taxonomic names (from kingdom to species), and 368,000 species names. The latter include synonyms, nomina dubia, nomen nuda, misspellings, and old genus combinations. The species with the most synonyms is the breadcrumb sponge Halichondria panicea (Pallas, 1766), with 64. There are 215,000 accepted species names (Table 5). About 10% of the species names, entered by data assistants or editors, remain to be checked by editors. Some editors make their taxon complete across all environments, so there are 26,000 non-marine species also in the database (Table 5). Associated information includes about 150,000 literature sources, 20,000 pictures, and information on 44,000 specimens, of which over 5,000 are holotypes. Specimen information in museum collections can be matched to species. For example, WoRMS has over 40,000 linked taxon records to the invertebrates deposited at the Smithsonian Institution, National Museum of Natural History. ERMS was moved to the present host institution in 2004 and once WoRMS was launched in 2008 significantly more content was entered (Figure 3). Since 2010 there have been fewer additional species to enter and thus effort has shifted to other content, notably vernacular names and distribution data (Figure 3).

thumbnail
Figure 3. Annual numbers of taxa names (species and above including synonyms), accepted species names, vernacular names, distribution data, and specimens, added to WoRMS and its precursor ERMS.

https://doi.org/10.1371/journal.pone.0051629.g003

thumbnail
Table 5. The number of marine (a) taxa (infra-species and above), (b) species names (includes synonyms), (c) accepted species names (excluding synonyms); and (d) additional non-marine species in the WoRMS database (note that there are additional non-marine species of many taxa not in WoRMS).

https://doi.org/10.1371/journal.pone.0051629.t005

Matching taxa

Determining the correct spelling of a scientific name is not always a trivial task (e.g., which one is correct: Cirrhitichthys, Cirrhitychthys or Cirritichthys?) and it is very difficult for non-taxonomists to keep up with the status of species names. WoRMS has an online, semi-automated name validation tool called Taxon Match, to cross-check the spelling and taxonomic status of species against the WoRMS database. The tool is an implementation of the TaxaMatch algorithm which comprises a suite of custom filters and tests used in succession on genus, species epithet, plus authority where supplied [85]. It also uses the Scientific Names Parser [86]. The tool returns standard taxonomic information in a user-friendly format (e.g., MS Excel or tab delimited text file). The user needs to upload a list of species names, match the columns with the fields in the database and the system will return the file with valid names. The tool corrects the spelling if there are close matches found, notifies when the name is an unaccepted synonym, and provides the authority and publication date, the hierarchical classification, quality status (expert validated or not), and the WoRMS LSID. Up to 95% of common spelling mistakes are captured. When there are multiple potential matches the system provides a pick-list. It is a very popular tool, already appreciated by thousands of users (with on average 14 files uploaded on a daily basis).

Web services

In contrast to the Taxon Match, where the user has to upload a species list, the portal also provides a platform-independent web service, that is it can run on PC, Mac and Linux operating systems. It uses the Web Services Description Language (WSDL) and Simple Object Access Protocol (SOAP) to enable data exchange. This web service allows users to dynamically link their own applications to the WoRMS database and will allow them to match a locally stored species list and add taxonomic and additional information derived from WoRMS (Table 6). WoRMS is also linking with other online data systems (link-out). They may use the web-services and/or use the WoRMS Taxon Match tool to cross-link names in their database andlink back to WoRMS (link-in) (Table 7).

thumbnail
Table 6. Examples of possible applications of the WoRMS web service.

https://doi.org/10.1371/journal.pone.0051629.t006

thumbnail
Table 7. Examples of how WoRMS links with other online biodiversity resources.

https://doi.org/10.1371/journal.pone.0051629.t007

Usage

The WoRMS web service is being used by at least 28 organisations from 12 countries (Table 8). Permission is not required to use the Web service so there may be additional users we are not aware of. Copies of the database have been licensed out to 61 organisations in 21 countries (Table 9) with demand growing steadily (Figure 4). Since 2007, all website-use metrics show a steady increase in access (Figure 5). There were about 600,000 unique visitors in 2011, and 57,000 in December 2011 alone, and on average >3,000 unique visitors per day (based on IP addresses). This represents over 3 million hits per month. Google scholar (24 April 2012) found over 800 citations for “World Register of Marine Species” and that it was cited in over 100 publications.

thumbnail
Figure 4. The number of organisations using WoRMS for their research and/or data management as listed in Table 9.

https://doi.org/10.1371/journal.pone.0051629.g004

thumbnail
Figure 5. The annual use of the WoRMS website since January 2007 in terms of pages viewed and web page hits (left axis), and numbers of visits, unique visitors and bandwidth (MB) (right axis).

The data for 2012 are estimated based on the trend until October.

https://doi.org/10.1371/journal.pone.0051629.g005

thumbnail
Table 8. Some of the organisations using the WoRMS web service for their data management systems and/or research.

https://doi.org/10.1371/journal.pone.0051629.t008

thumbnail
Table 9. Organisations with licensed copies of WoRMS for their research and/or data management.

https://doi.org/10.1371/journal.pone.0051629.t009

User feedback

Typical benefits of WoRMS to users were that: (1) the process of reconciling names was automated; (2) the entry of names in a database could use a drop-down menu of existing names from WoRMS, so errors in manual entry could be avoided; (3) the names followed a standardized taxonomic hierarchy, thus aiding a user's classification of species in their own database and publications; (4) it was a single standard authoritative and time-saving resource to reference names and their classification; (5) it has an efficiently responding editorial system (Neil Holdsworth, personal communication, 25 November 2010); and (6) checking of names from collaborators and the literature could be automated. Including researching names not in WoRMS that would need to be checked from other sources, WoRMS saved users significant time compared to manually checking names using search engines and the literature (e.g. 14 times less time, Karen Stocks personal communication, 18 November 2010). Thus the availability of WoRMS not only saves users time but will improve quality control in the use of marine species names. WoRMS is also used as a naming standard for semantic frameworks used in databases for different projects (Roy Lowry, personal communication, 4 November 2010).

Discussion

WoRMS was formally launched to the world media in 2008. A press release in collaboration with the Census of Marine Life resulted in remarkable media uptake in 27 countries and nine languages. By June 2008, it was covered in 298 online stories, 23 newswires, 23 newspapers, and interviews on eight radio and two television stations. This was impressive for an online biodiversity database, and reflected the great media and popular interest in discoveries of marine biodiversity found by the Census of Marine Life [87].

The development of WoRMS has accelerated the availability of taxonomically authoritative GSDs for Species 2000, OBIS and GBIF. A growing number of GSDs are provided to Species 2000, sometimes replacing earlier GSDs that are no longer updated. In addition to WoRMS being directly provided to other organisations, many more use WoRMS GSDs via Species 2000 and its Catalogue of Life (CoL), and through the EOL website. Uniquely amongst species name systems, WoRMS provides environmental context (i.e. marine) for species. The European component of WoRMS, ERMS, has been recommended as a standard when using species names in the European Union under the Infrastructure for Spatial Information in the European Community (INSPIRE) Directive.

Benefits

Some of the most important benefits of WoRMS will be improved taxonomic efficiency, and greater quality control in the use of species names in the wider literature and environmental management. For example, EurOBIS corrected 28% of the names in its database by using WoRMS [88]. By making a single inventory of all marine species names easily accessible on the internet it is anticipated that people will use it to correct spelling mistakes, use the currently accepted names rather than synonyms, and bring omissions, errors and anomalies to the attention of the taxonomic editors to address. The authors of popular species identification guides will find it easy to update the species names they use, and ecologists, conservationists and environmental managers will be using species names more consistently. The increasing usage of WoRMS indicates this is happening.

Taxonomic research will also benefit. Duplicate descriptions of the same species will be reduced because researchers will have a checklist of related species to compare their specimens and observations with, and contact details of experts to discuss their findings with. Authors of species descriptions can check if similar names are already in use, and thus may choose more unique names and avoid homonyms. The production of WoRMS has added benefits in fostering collaboration between experts at a global scale. Easy access to the register allows ecologists and local observers to correct their use of taxonomic names. In turn, this stimulates biogeographic and evolutionary research.

Use in research

Although initially established to provide open-access information on marine species nomenclature, the aggregation of so much content is providing unanticipated benefits to researchers. ERMS provided the basis for (1) a review of taxonomic expertise and resources, including a list of species identification guides [89], [90], and (2) an analysis of trends in species discoveries and predictions of how many more species remain to be discovered [17], [91]. This research stimulated the development of a new statistical approach, unusual in that it allowed calculation of confidence limits, to predict species richness from past rates of discovery [92]. This work was then extended to WoRMS and CoL to predict global species richness [22]. Other approaches to estimate species richness used the rate of discovery of higher taxa in WoRMS and other databases [21], and developed a software tool to provide a structured approach to using expert knowledge to estimate richness [93]. WoRMS has also contributed to the annual reports of species discoveries [94], [95]. Fisher et al. [96] matched 2,380 species names from WoRMS to a bibliographic database so as to identify bias in research on coral, kelp, seagrass and mangrove habitats.

Groups of WoRMS taxonomic editors have begun to synthesise knowledge on their taxon, including a major collaborative paper co-authored by over 100 editors [91]. These studies form the basis for a special collection of papers in PLoS ONE. To date, they review the global diversity of several taxa: (1) Crustacea: Remipedia [97], Monstrilloida copepods [98], Tanadiacea [99], and non-asellote isopods [100], [101]; (2) Cnidaria: Stylasteridae corals [102] and Pennatulacea corals [103]; (3) Echinodermata: Ophiuroidea [104] and Asteroidea [105]; as well as (4) Porifera [106] Ascidacea [107]; Oligotrichea protists [108]; Reptilia [109]; and Placozoa [110]. The present paper provides the introduction and context for this collection. It complements other PLoS ONE collections, notably those from the Census of Marine Life, e.g., [111] and one paper fits two collections [112]. The study synthesises how many species are described, the number of accepted species names and synonyms, estimates of how many molecular cryptic species may exist, how many undescribed species are already in specimen collections, how many undescribed species have been found in field samples, and predictions of how many more species may yet be discovered [91]. This study provides a baseline of current knowledge of marine biodiversity at the species level, summarises the rate of progress in discovering species, and should be reviewed every few years.

Future prospects

Several initiatives are underway within WoRMS but not yet visible. These include new GSDs websites, and Thematic Species Databases on introduced species and parasite-host relationships. The major taxonomic gaps are amongst Mollusca, but no doubt there are omissions in other taxa and continuing updates needed. In the absence of alternative infrastructure and for taxonomic convenience, editors may add freshwater and terrestrial relatives to their marine GSDs, as already the case for Copepoda, Isopoda, Porifera, Gastrotricha, and Tardigrada. Users are encouraged to contact editors regarding possible omissions and errors in the database content. Continual improvements to content and database functionality are required. For example, about 5% of the literature sources are estimated to be duplicate entries and need to be manually rationalised. Species' fossil status is being categorised using a standard stratigraphy following a proposal from the editors for Foraminifera and Echinoidea. Linking of literature references to electronic copies of the publication is being implemented through hosting documents within WoRMS, and linking out to sources, such as the Biodiversity Heritage Library. Thus the content continually expands at the initiative of editors, or users, and may be funded by special projects with particular research goals.

Users may like all content on one page but this is increasingly being provided from different experts (e.g. taxonomist, ecologist, biogeographer). Thus developments can present challenges for web page design, distinguishing which experts have validated which content, agreement on controlled vocabularies, and patience to reconcile different perspectives. We expect greater linkage with species distribution data in OBIS and GBIF. Several editors have developed online species identification resources. The future may see an online guide to all marine species. Some species' conservation status is indicated and WoRMS updates species names for the IUCN Red List. Thus, there is potential to create a thematic database on marine species of special conservation interest. New tools and online resources are materialising that provide opportunities for WoRMS to be more interoperable with online journals (e.g. using DOI or other identifiers), and other databases; such as the FilterPush (http://etaxonomy.org/mw/FilteredPush) that networks species names.

The classification of species by their biological (e.g. body size, parasites, dispersal), ecological (e.g. habitat), and other (e.g. invasive, threatened) attributes, has a multiplier effect on the potential research and user audience for WoRMS. Already there are improvements in the ability to sample and analyse marine species. As a consequence of WoRMS, we are already witnessing improved communication within the scientific community, and anticipate increased taxonomic efficiency and quality control in marine biodiversity research and management.

Acknowledgments

This paper is dedicated to the late Jacob (Jaap) van der Land (Netherlands Centre for Biodiversity Naturalis, formerly National Museum of Natural History at Leiden) who began a checklist of all marine species in the early 1990s, and was a contributor to ERMS, WoRMS, Species 2000 and CoL. He was employed as Head of the Department of Invertebrates and Curator of ‘worms’ at the Leiden museum [113].

We thank Karen Stocks (San Diego Supercomputer Center, La Jolla, USA), Neil Holdsworth (ICES Data Centre, Denmark), and Roy Lowry (British Oceanographic Data Centre, UK) for helpful comments as database users. We acknowledge the important roles of past and present editors of WoRMS, and previous members of WoRMS Steering Committee for their collaboration: Charles Griffiths (University of Cape Town, South Africa), Michael D. Guiry (NUI, Galway, Ireland), Maria Lourdes D. Palomares (Los Baños, Philippines), David (Paddy) J. Patterson (Marine Biological Laboratory, Woods Hole, USA), Gary Rosenberg (Academy of Natural Sciences, Philadelphia, USA), Tony Rees (CSIRO Marine and Atmospheric Research, Hobart, Australia), Edward Vanden Berghe (Rutgers State University of New Jersey, USA). We also thank the following for helpful comments in aspects of this paper: Sabine Agatha (University of Strasbourg, Austria), Nicole Boury-Esnault (CNRS Centre d'Océanologie de Marseille), James Reimer (University of the Ryukyus, Japan), Bernd Schierwater (School of Veterinary Medicine Hannover), Eduardo Suárez-Morales (Unidad Chetumal), William Perrin (National Oceanic and Atmospheric Administration, Southwest Fisheries Science Center), Hendrik Segers (Royal Belgian Institute of Natural Science), Christian Emig (Brachnet), Roger Bamber (ARTOO Marine Biology Consultants), Geoff Read (NIWA), Mary Kennedy (Fisheries and Oceans, Canada), Chris Boyko (Dowling College, USA), and Peter Uetz (Virginia Commonwealth University, USA).

Author Contributions

Conceived and designed the experiments: MJC PB GB KF DG BWH GCBP RWMvS SS TCW BV WA. Analyzed the data: MJC BV WD WA. Wrote the paper: MJC PB GB KF DG BWH GCBP RWMvS SS TCW BV WD WA.

References

  1. 1. Carpenter KE, Abrar M, Aeby G, Aronson RB, Banks S, et al. (2008) One-third of reef-building corals face elevated extinction risk from climate change and local impacts. Science 321: 560–563.
  2. 2. Hoffmann M, Hilton-Taylor C, Angulo A, Böhm M, Brooks TM, et al. (2010) The impact of conservation on the status of the world's vertebrates. Science 330: 1503–1509.
  3. 3. Stork NE (2010) Re-assessing current extinction rates. Biodiversity Conservation 19: 357–371.
  4. 4. Thomas JD (1997) Using marine invertebrates to establish research and conservation priorities. In: Reaka-Kudla ML, Wilson DE, Wilson EO, editors. Biodiversity II. Washington Joseph Henry Press. pp. 357–369.
  5. 5. Agapow P-M, Bininda-Edonds ORP, Crandall KA, Gittleman JL, Mace GM, et al. (2004) The impact of the species concept on biodiversity studies. Quarterly Review of Biology 79: 161–179.
  6. 6. Patterson DJ, Cooper J, Kirk PM, Pyle RL, Remsen DP (2010) Names are key to the big new biology. Trends in Ecology & Evolution 1297: 1–6.
  7. 7. Linnaeus C (1758) Systema Naturae per Regna tria Naturae, secundum classes, ordines, genera, species, cum characteribus, differentiis, synonymis, locis. 10th edn. Vol. 1 Animalia. Holmiae: Laurentii Salvii. iv, 823 p.
  8. 8. Bisby FA, Roskov YR, Orrell TM, Nicolson D, Paglinawan LE, et al. (2010) Species 2000 & ITIS Catalogue of Life: 2010. Annual Checklist. CD-ROM. Reading: Species 2000.
  9. 9. International Commision on Zoological Nomemclature (1999) International Code of Zoological Nomenclature. 4th edn. London: International Trust for Zoological Nomemclature.
  10. 10. International Commision on Zoological Nomemclature (2012) Amendment of Articles 8, 9, 10, 21 and 78 of the International Code of Zoological Nomenclature to expand and refine methods of publication. Zootaxa 3450: 1–7.
  11. 11. McNeill J, Barrie FR, Burdet HM, Demoulin V, L. HD, et al.. (2006) International Code of Botanical Nomenclature (Vienna Code) Electronic version of the original English text.adopted by the Seventeenth International Botanical Congress Vienna, Austria, July 2005. International Association for Plant Taxonomy. http://ibot.sav.sk/icbn/main.htm. Accessed 22 Jun 2012.
  12. 12. Knapp S, McNeill J, Turland NJ (2011) Changes to publication requirements made at the XVIII International Botanical Congress in Melbourne – what does e-publication mean for you. :. Phyto Keys 6: 5–11.
  13. 13. Lapage SP, Sneath PH, Skerman VBD, Lessel EF, Seeliger HPR, et al.. (1992) International Code of Nomenclature of Bacteria, 1990 Revision. (Bacteriological Code). Washington: ASM Press.
  14. 14. Cavalier-Smith T (1998) A revised six-kingdom system of life. Biological Reviews 73: 203–266.
  15. 15. Cavalier-Smith T (2001) What are fungi? In: Esser K, Lemke PA, editors. The Mycota VII Systematics and Evolution Part A. Berlin: Springer-Verlag. pp. 3–38.
  16. 16. Kirk PM, Cannon PF, David JC, Stalpers JA (2001) Ainsworth & Bisby's Dictionary of the Fungi. 9th edn. Wallingford: CAB International. 655 p.
  17. 17. Costello MJ, Wilson SP (2011) Predicting the number of known and unknown species in European seas using rates of description. Global Ecology and Biogeography 20: 319–330.
  18. 18. Essl F, Rabitsch W, Dullinger S, Moser D, Milasowszky N (2012) How well do we know species richness in a well-known continent? Temporal patterns of endemic and widespread species descriptions in the European fauna. Global Ecology and Biogeography in press
  19. 19. Fontaine B, van Achterberg K, Alonso-Zarazaga MA, Araujo R, Asche M, et al. (2012) New species in the Old World: Europe as a frontier in biodiversity exploration, a test bed for 21st century taxonomy. PLoS ONE 7(5): e36881 .
  20. 20. Hamilton AJ, Basset Y, Benke KK, Grimbacher PS, Miller SE, et al. (2010) Quantifying uncertainty of tropical arthropod species richness. American Naturalist 176: 90–95.
  21. 21. Mora C, Tittensor DP, Adl S, Simpson AGB, Worm B (2011) How many species are there on Earth and in the ocean? PLoS Biol 9: e1001127 .
  22. 22. Costello MJ, Wilson S, Houlding B (2012) Predicting total global species richness using rates of species description and estimates of taxonomic effort. Systematic Biology 61: 871–883.
  23. 23. Scheffers BR, Joppa LN, Pimm SL, Laurance WF (2012) What we know and don't know about Earth's missing biodiversity. Trends in Ecology and Evolution 27: 501–510.
  24. 24. Euzéby JP (2012) LPSN. List of Prokaryotic names with Standing in Nomenclature. Formerly List of Bacterial names with Standing in Nomenclature (LBSN) Last full update: Oct 5, 2012. http://www.bacterio.net. Accessed 6 Oct 2012.
  25. 25. Hawksworth DL (2011) A new dawn for the naming of fungi: impacts of decisions made in Melbourne in July 2011 on the future publication and regulation of fungal names. IMA Fungus 2: 55–162.
  26. 26. IPNI (2012) The International Plant Names Index. http://www.ipni.org. Accessed 15 January 2012.
  27. 27. Polaszek A, Agosit D, Alonso-Zarazaga M, Beccaloni G, Bjorn PP, et al. (2005) A universal register for animal names. Nature 437: 477.
  28. 28. Wheeler QD, Krell FT (2007) Codes must be updated so that names are known to all. Nature 447: 142.
  29. 29. Pyle R, Michel E (2008) Zoobank: Developing a nomenclatural tool for unifying 250 years of biological information. Zootaxa 1950: 39–50.
  30. 30. Segers H, De Smet WH, Fischer C, Fontaneto D, Michaloud E, et al. (2012) Towards a list of available names in zoology, partim phylum Rotifera. Zootaxa 3179: 61–68.
  31. 31. Gaston KJ, Mound LA (1993) Taxonomy, hypothesis testing and the biodiversity crisis. Proceedings of the Royal Society B: Biological Sciences 251: 139–142.
  32. 32. Stork NE (1997) Measuring global biodiversity and its decline. In: Reaka-Kudla ML, Wilson DE, Wilson EO, editors. Biodiversity II. Washington Joseph Henry Press. pp. 41–68.
  33. 33. Bouchet P (1997) Inventorying the molluscan diversity in the world: what is our rate of progress? The Veliger 40: 1–11.
  34. 34. Kottelat M (1997) European freshwater fishes. Biologia, Bratislava 52/Supplement 5: 1–23.
  35. 35. Alroy J (2002) How many named species are valid? Proceedings of the National Academy of Sciences, USA 99: 3706–3711.
  36. 36. Govaerts R (2003) How many species of seed plants are there? – a response. Taxon 52: 583–584.
  37. 37. Scotland RW, Wortley AH (2003) How many species of seed plants are there? Taxon 52: 101–104.
  38. 38. Eschmeyer WN, Fricke R, Fong JD, Polack D (2010) Marine fish biodiversity: a history of knowledge and discovery (Pisces). Zootaxa 2525: 19–50.
  39. 39. Johnson GD, Paxton JR, Sutton TT, Satoh TP, Sado T, et al. (2009) Deep-sea mystery resolved: astonishing larval transformations and extreme sexual dimorphism unite three fish families. Biology Letters 5: 235–239.
  40. 40. Perrin WF (2012) World Cetacea Database. http://www.marinespecies.org/cetacea. Accessed 25 Jan 2012.
  41. 41. Munilla T, Soler A (2008) Check-list of the pycnogonids from Antarctic and sub-Antarctic waters: zoogeographic implications. Antarctic Science 2: 99–111.
  42. 42. Bowman DD (2005) What's in a name? Trends in Parasitology 21: 267–269.
  43. 43. Bartolus A (2008) Error cascades in the biological sciences: the unwanted consequences of using bad taxonomy in ecology. Ambio 37: 114–118.
  44. 44. Lleonart J, Taconet M, Lamboeuf M (2006) Integrating information on marine species identification for fishery purposes. Marine Ecology Progress Series 316: 231–238.
  45. 45. Iglésias SP, Toulhoat L, Sellos DY (2010) Taxonomic confusion and market mislabelling of threatened skates: important consequences for their conservation status. Aquatic Conservation: Marine and Freshwater Ecosystems 20: 319–333.
  46. 46. Ludwig A, Debus L, Benecke N, Jenneckens I, Williot P, et al. (2002) When the American sea sturgeon swam east. Nature 419: 447.
  47. 47. van der Land J, editor (1994) UNESCO-IOC Register of Marine Organisms. A common base for biodiversity inventories. Families and bibliography of keyworks. DOS-formatted floppy disk. Leiden: National Museum of Natural History (Naturalis).
  48. 48. Bisby FA (2000) The quiet revolution: biodiversity informatics and the internet. Science 289: 2309–2312.
  49. 49. Edwards JL, Lane MA, Nielsen ES (2000) Interoperability of biodiversity databases: biodiversity information on every desktop. Science 289: 2312–2314.
  50. 50. Wilson EO (2000) A global biodiversity map. Science 289: 2279.
  51. 51. Costello MJ (2004) A new infrastructure for marine biology in Europe: marine biodiversity informatics. MARBEF Newsletter 1: 22–24.
  52. 52. Costello MJ, Grassle JF, Zhang Y, Stocks K, Vanden Berghe E (2005) Where is what, and what is where? Online mapping of marine species. MARBEF Newsletter 2: 20–22.
  53. 53. Costello MJ, Vanden Berghe E (2006) “Ocean Biodiversity Informatics” enabling a new era in marine biology research and management. Marine Ecology Progress Series 316: 203–214.
  54. 54. Moretzsohn F (2002) TaxonBank - proposal for a new online database for taxonomic research on type specimens. In: Shimura J, Wilson KL, Gordon D, editors. To the interoperable “Catalog of Life” with partners - Species 2000 Asia Oceania Proceedings of the Second International Workshop of Species 2000, Tsukuba, Japan, July 1999 Research Report from the National Institute for Environmental Studies, Japan 171: : 142–147.
  55. 55. ABRS (2009) Australian Faunal Directory. Canberra: Australian Biological Resources Study. http://www.environment.gov.au/biodiversity/abrs/online-resources/fauna/afd/index.html. Accessed 6 Oct 2012.
  56. 56. Rainbow PS (2009) Marine biological collections in the 21st century. Zoologica Scripta 38 (Suppl. 1): 33–40.
  57. 57. Hoeksema BW, van der Land J, van der Meij SET, Ofwegen LP, Reijnen BT, et al. (2011) Unforeseen importance of historical collections as baselines to determine biotic change of coral reefs: the Saba Bank case. Marine Ecology 32: 135–141.
  58. 58. Hoeksema BW, Koh EGL (2009) Depauperation of the mushroom coral fauna (Fungiidae) of Singapore (1860s–2006) in changing reef conditions. Raffles Bulletin of Zoology Supplement 22: 91–101.
  59. 59. Van der Meij SET, Moolenbeek RG, Hoeksema BW (2009) Decline of the Jakarta Bay molluscan fauna linked to human impact. Marine Pollution Bulletin 59: 101–107.
  60. 60. Van der Meij SET, Suharsono , Hoeksema BW (2010) Long-term changes in coral assemblages under natural and anthropogenic stress in Jakarta Bay (1920–2005). Marine Pollution Bulletin 60: 1442–1454.
  61. 61. Johnson KG, Brooks SJ, Fenberg PB, James KE, Glover AG, et al. (2011) Climate change and biosphere response: unlocking the collections vault. BioScience 61: 147–153.
  62. 62. Drew J (2011) The role of natural history institution and bioinformatics in conservation biology. Conservation Biology 25: 1250–1252.
  63. 63. Pennisi E (2006) Taxonomy revival. Science 289: 2306–2308.
  64. 64. Godfray HCJ (2002) Challenges for taxonomy. Nature 417: 17–19.
  65. 65. Godfray HCJ (2007) Linnaeus in the information age. Nature 446: 259–260.
  66. 66. Mallett J, Willmott K (2003) Taxonomy: renaissance or Tower of Babel? Trends in Ecology and Evolution 18: 57–59.
  67. 67. Wilson EO (2003) The encyclopedia of life. Trends in Ecology and Evolution 18: 77–78.
  68. 68. Knapp S, Polaszek A, Watson M (2007) Spreading the word. Nature 446: 261–262.
  69. 69. Costello MJ, Emblow C, White R, editors (2001) European register of marine species : a check-list of the marine species in Europe and a bibliography of guides to their identification. Paris: Muséum National d'Histoire Naturelle. 463 p.
  70. 70. Felder DL, Camp DK, editors (2009) Gulf of Mexico. Origin, waters, and biota. Vol. 1, Biodiversity. College Station: Texas A&M University Press. 1393 p.
  71. 71. Gordon DP, editor (2009) New Zealand Inventory of Biodiversity Vol. 1. Kingdom Animalia: Radiata, Lophotrochozoa, Deuterostomia. Christchurch: Canterbury University Press. 566 p.
  72. 72. Gordon DP, editor (2010) New Zealand Inventory of Biodiversity. Vol. 2. Kingdom Animalia: Chaetognatha, Ecdysozoa, ichnofossils. Christchurch: Canterbury University Press. 528 p.
  73. 73. Gordon DP, editor (2012) New Zealand Inventory of Biodiversity. Vol. 3. Kingdoms Bacteria, Protozoa, Chromista, Plantae, Fungi. Christchurch: Canterbury University Press. 616 p.
  74. 74. Costello MJ (2000) Developing species information systems: the European Register of Marine Species. Oceanography 13: 48–55.
  75. 75. Cuvelier D, Claus S, Appeltans W, Vanhoorne B, Vanden Berghe E, et al. (2006) European Register of Marine Species (ERMS) – plans turning into reality!. MarBEF Newsletter 4: 14–15.
  76. 76. Costello MJ (2009) Motivation of online data publication. BioScience 59: 418–427.
  77. 77. Appeltans W, Bouchet P, Boxshall GA, Fauchald K, Gordon D, et al.. (2012) World register of marine species. http://www.marinespecies.org. Accessed 6 Oct 2012.
  78. 78. Costello MJ, Harris P, Pearce B, Fauchald K, Fiorentino A, et al.. (2010) A glossary of terminology used in marine biology, ecology, and geology. Version 1.0. http://www.marinespecies.org/glossary/. Accessed 4 May 2012.
  79. 79. Gordon DP (2009) Towards a management hierarchy (classification) for the Catalogue of Life: draft discussion document. In: Bisby FA, Roskov YR, Orrell TM, Nicolson D, Paglinawan LE et al.., editors. Species 2000 & ITIS Catalogue of Life: 2009 Annual Checklist (CD-ROM). Reading: Species 2000.
  80. 80. Ruggiero M, Gordon DP, Bailly N, Kirk P, Nicolson D (2012) The Catalogue of Life Taxonomic Classification, Edition 2, Part A. In: Bisby FA, Roskov YR, Cullham A, Orrell TM, Nicolson D et al.., editors. Species 2000 & ITIS Catalogue of Life: 2012 Annual Checklist (CD-ROM). Reading: Species 2000.
  81. 81. Gordon DP (2012) Scoping the tree of life. In: Gordon DP, editor. New Zealand Inventory of Biodiversity. Vol. 3. Kingdoms Bacteria, Protozoa, Chromista, Plantae, Fungi. Christchurch: Canterbury University Press. pp. 17–21.
  82. 82. Van Soest RWM, Boury-Esnault N, Hooper JNA, Rützler K, De Voogd NJ, et al.. (2012) World Porifera database. http://www.marinespecies.org/porifera/porifera.php?p=taxdetails&id=169025. Accessed 4 May 2012.
  83. 83. Kennedy MK, Nozères C, Miller R, Vanhoorne B, Appeltans W (2011) The Canadian Register of Marine Species Photo Gallery: A User's Guide, Version 1. Canadian Technical Reports of Fisheries and Aquatic Sciences 2933: 1–47.
  84. 84. Nic Donnacha E, Guiry MD (2002) AlgaeBase: documenting seaweed biodiversity in Ireland and the world. Biology & Environment: Proceedings of the Royal Irish Academy 102: 185–188.
  85. 85. Rees T (2011) TAXAMATCH - fuzzy matching algorithm for genus and species scientific names. http://www.cmar.csiro.au/datacentre/taxamatch.htm. Accessed 4 May 2012.
  86. 86. Mozzherin D (2012) Scientific Names Parser. https://github.com/GlobalNamesArchitecture/biodiversity. Accessed 4 May 2012.
  87. 87. Bouchet P (2006) The magnitude of marine biodiversity. In: Duarte CM, editor. The exploration of marine biodiversity Scientific and technological challenges. Fundación BBVA. pp. 33–63.
  88. 88. Vandepitte L, Hernandez F, Claus S, Vanhoorne B, De Hauwere N, et al. (2011) Analysing the content of the European Ocean Biogeographic Information System (EurOBIS): available data, limitations, prospects and a look to the future. Hydrobiologia 667: 1–14.
  89. 89. Costello MJ (2001) To know, research, manage, and conserve marine biodiversity. Océanis 24: 25–49.
  90. 90. Costello MJ, Bouchet P, Emblow CS, Legakis A (2006) European marine biodiversity inventory and taxonomic resources: state of the art and gaps in knowledge. Marine Ecology Progress Series 316: 257–268.
  91. 91. Appeltans W, Ahyong ST, Anderson G, Angel MV, Artois T, et al. (in press) The magnitude of global marine species diversity. Current Biology
  92. 92. Wilson SP, Costello MJ (2005) Predicting future discoveries of European marine species by using a non-homogeneous renewal process. Applied Statistics 54: 897–918.
  93. 93. Fisher R, O'Leary RA, Low-choy S, Mengersen K, Caley MJ (2012) A software tool for elicitation of expert knowledge about species richness or similar counts. Environmental Modelling & Software 30: 1–14.
  94. 94. Wheeler QD, Pennak S (2010) The State of Observed Species Report. International Institute for Species Exploration. http://species.asu.edu. Accessed 24 Feb 2011.
  95. 95. Arizona State University International Institute for Species Exploration (2012) Top 10 New Species - 2012. http://species.asu.edu/Top10.
  96. 96. Fisher R, Knowlton N, Brainard RE, Caley MJ (2011) Difference among major taxa in the extent of ecological knowledge across four major ecosystems. PLoS ONE 6 (11): e26556 .
  97. 97. Neiber MT, Hartke TR, Stemme T, Bergmann A, Rust J, et al. (2011) Global biodiversity and phylogenetic evaluation of Remipedia (Crustacea). PLoS ONE 6(5): e19627 .
  98. 98. Suarez-Morales E (2011) Diversity of the Monstrilloida (Crustacea: Copepoda). PLoS ONE 6(8): e22915 .
  99. 99. Błażewicz-Paszkowycz M, Bamber R, Anderson G (2012) Diversity of Tanaidacea (Crustacea: Peracarida) in the world's oceans – how far have we come? PLoS ONE 7(4): e33068 .
  100. 100. Williams JD, Boyko CB (2012) The global diversity of parasitic isopods associated with crustacean hosts (Isopoda: Bopyroidea and Cryptoniscoidea). PLoS ONE 7(4): e35350 .
  101. 101. Poore GCB, Bruce NL (2012) Global diversity of marine isopods (except Asellota and crustacean symbionts). PLoS ONE 7(8): e43529 .
  102. 102. Cairns SD (2011) Global diversity of the Stylasteridae (Cnidaria: Hydrozoa: Athecatae). PLoS ONE 6(7): e21670 .
  103. 103. Williams GC (2011) The global diversity of sea pens (Cnidaria: Octocorallia: Pennatulacea). PLoS ONE 6(7): e22747 .
  104. 104. Stöhr S, O'Hara TD, Thuy B (2012) Global diversity of brittle stars (Echinodermata: Ophiuroidea). PLoS ONE 7(3): e31940 .
  105. 105. Mah CL, Blake DB (2012) Global diversity and phylogeny of the Asteroidea (Echinodermata). PLoS ONE 7(4): e35644 .
  106. 106. Van Soest RWM, Boury-Esnault N, Vacelet J, Dohrmann M, Erpenbeck D, et al. (2012) Global diversity of sponges (Porifera). PLoS ONE 7(4): e35105 .
  107. 107. Shenkar N, Swalla BJ (2011) Global diversity of Ascidiacea. PLoS ONE 6(6): e20657 .
  108. 108. Agatha S (2011) Global diversity of aloricate Oligotrichea (Protista, Ciliophora, Spirotricha) in marine and brackish sea water. PLoS ONE 6(8): e22466 .
  109. 109. Rasmussen AR, Murphy JC, Ompi M, Gibbons JW, Uetz P (2011) Marine reptiles. PLoS ONE 6(11): e27373 .
  110. 110. Eitel M, Osigus H-J, DeSalle R, Schierwater B (submitted) Global diversity of the Placozoa. PLoS ONE
  111. 111. Costello MJ, Danovaro R, Halpin PN, Ojaveer H, Miloslavich P (2010) A census of marine biodiversity knowledge, resources, and future challenges. PLoS ONE 5(8): e12110 .
  112. 112. Gollner S, Ivanenko VN, Arbizu PM, Bright M (2010) Advances in taxonomy, ecology, and biogeography of Dirivultidae (Copepoda) associated with chemosynthetic environments in the deep sea. PLoS ONE 5(8): e9801 .
  113. 113. van Bruggen AC (2001) Dr Jacob van der Land, marine biologist extraordinary. Zoologische Verhandelingen, Leiden 334: 7–20.
  114. 114. Rees T (2012) IRMNG homonyms list - species level. Available: http://www.marine.csiro.au/mirrorsearch/ir_search.list_homonyms?hlevel=species. Accessed 2012 May 4.