Skip to main content
Advertisement

< Back to Article

Fig 1.

Model architecture.

In this example, the speaker has to communicate the concept of a blue circle and the listener has to identify all blue circles. The loss function rewards shorter messages from the speaker (efficiency) and correct selections from the listener (informativeness). This figure was created by the first author.

More »

Fig 1 Expand

Fig 2.

Game scenarios in Experiment 1.

A: Context-unaware - The speaker agent only observes the set of target objects, i.e., the target concept (displayed in a green frame for visualization purposes). B: Context-aware - The speaker agent observes the target concept in a context, i.e., a set of target objects (visualized in blue) and a set of distractor objects (visualized in green). This figure was created by the first author.

More »

Fig 2 Expand

Fig 3.

Game scenarios added in Experiment 2.

A: Context-unaware + RSA - Speaker agents trained in the context-unaware condition reason about the listener’s likely interpretation of their message. B: Context-aware + RSA - Speaker agents trained in the context-aware condition reason about the listener’s likely interpretation of their message. This figure was created by the first author.

More »

Fig 3 Expand

Table 1.

Summary of predictions and measures.

More »

Table 1 Expand

Table 2.

Mean accuracies on the training, validation and test datasets.

More »

Table 2 Expand

Fig 4.

Message lengths per concept hierarchy level.

Concept specificity, or the amount of information that needs to be communicated, increases with the number of fixed attributes that is shared among target objects from more generic concepts (with fewer fixed attributes) on the left, to more specific concepts (with more fixed attributes) on the right.

More »

Fig 4 Expand

Table 3.

Mean entropy-based scores, i.e., NMI, effectiveness and consistency.

More »

Table 3 Expand

Fig 5.

NMI, consistency and effectiveness scores for each level of the conceptual hierarchy.

More »

Fig 5 Expand

Table 4.

Accuracies on the test dataset.

More »

Table 4 Expand

Fig 6.

Distribution of message lengths for different levels of the conceptual hierarchy.

More »

Fig 6 Expand

Table 5.

Lexicon sizes.

More »

Table 5 Expand

Fig 7.

Tradeoff between lexicon size and informativeness.

The lexicon size is normalized by the number of concepts in a dataset.

More »

Fig 7 Expand

Fig 8.

Zipf’s law like distribution of message frequency plotted for dataset D(4,4).

D(4,4) is a medium-sized dataset where objects consist of four attributes which can each take four different values.

More »

Fig 8 Expand

Fig 9.

Zipf’s law like distribution of message length plotted for dataset D(4,4).

More »

Fig 9 Expand

Table 6.

Qualitative examples.

More »

Table 6 Expand

Fig 10.

Dataset examples.

The three objects in the top row are the target objects that form a target concept together. The three objects in the bottom row are the distractor objects that form the context. A: An example for a specific concept LARGE BLUE CIRCLE in a fine context (two attributes shared). B: An example for a specific concept LARGE BLUE CIRCLE in a coarse context (no attribute shared). C: An example for a generic concept CIRCLE in a coarse context.

More »

Fig 10 Expand

Fig 11.

Information-theoretic scores and ambiguity in language.

The set of concepts C is the red circle on the left (red and purple), the set of messages M is the full blue circle on the right (blue and purple). The mutual information I(C,M) (only purple) captures one-to-one mappings between messages and concepts. The conditional entropy (only red) captures many-to-one mappings between messages and concepts. The conditional entropy (only blue) captures one-to-many mappings between messages and concepts.

More »

Fig 11 Expand