Maynard Smith revisited: A multi-agent reinforcement learning approach to the coevolution of signalling behaviour

Fig 3

Beneficiary resulting strategies.

Percentage of runs in which the Beneficiary learns each strategy. Not thirsty state, . Strategies are described in Table 5.