Establishing vocabulary tests as a benchmark for evaluating large language models
- Gonzalo Martínez
- Javier Conde
- Elena Merino-Gómez
- Beatriz Bermúdez-Margaretto
- José Alberto Hernández
- Pedro Reviriego
- Marc Brysbaert
- Published: December 12, 2024
- https://doi.org/10.1371/journal.pone.0308259