Dataset search in biodiversity research: Do metadata in data repositories reflect scholarly information needs?
Table 10
NLP analysis: Number of datasets with named entities (out of 10,000 processed files in a reduced OAI-DC structure) per repository.
Each file contains a subset of the original metadata, namely, dc:title, dc:description, dc:subject and dc:date.