The SPECIES and ORGANISMS Resources for Fast and Accurate Identification of Taxonomic Names in Text
Figure 1. Speed and memory efficiency of the LINNAEUS and SPECIES taggers.
The major advantage of the SPECIES tagger over existing methods is its efficiency. Compared to the methodologically similar LINNAEUS tagger, it starts up and loads its dictionary 55× faster (6 seconds vs. 6 minutes 35 seconds), tags Medline abstracts 15× faster (0.26 vs. 4.05 seconds per 1000 documents), and uses 5× less memory in the process (0.5 GB vs. 3.0 GB).
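As a quick sanity check on the comparison above, the improvement factors can be recomputed from the measurements quoted in the text. The sketch below is illustrative only: the helper `ratio` and the variable names are assumptions introduced here, not part of either tagger. Because the quoted timings and memory figures appear to be rounded, the recomputed ratios need not match the stated 55×, 15×, and 5× exactly.

```python
def ratio(baseline: float, improved: float) -> float:
    """Return how many times larger the baseline value is than the improved one."""
    return baseline / improved


# Values as quoted in the text, ordered (LINNAEUS, SPECIES).
startup_s = (6 * 60 + 35, 6)   # startup and dictionary loading, in seconds
tagging_s = (4.05, 0.26)       # seconds per 1000 Medline abstracts
memory_gb = (3.0, 0.5)         # memory use, in GB

ratios = {
    "startup": ratio(*startup_s),
    "tagging": ratio(*tagging_s),
    "memory": ratio(*memory_gb),
}

for name, value in ratios.items():
    print(f"{name}: {value:.1f}x")
```

With the rounded inputs this gives roughly 66× for startup, 16× for tagging, and 6× for memory, the same order of magnitude as the factors reported above; the gap is presumably due to rounding of the quoted measurements.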