Table 1.

The criteria used to evaluate LLMs’ responses.

Fig 1.

APP architecture.

Fig 2.

Platform features.

Table 2.

Differences between the large language models (GPT-4o vs. Llama 3.1-8B) in mixed-effects models adjusted for evaluator (n = 816).

Table 3.

Example of five pairs of consecutive interactions for the large language models (GPT-4o vs. Llama 3.1-8B).
