Maynard Smith revisited: A multi-agent reinforcement learning approach to the coevolution of signalling behaviour
Fig 3
Beneficiary resulting strategies.
Percentage of runs that the Beneficiary learns each strategy. Not thirsty state, . Strategies described in Table 5.