Maynard Smith revisited: A multi-agent reinforcement learning approach to the coevolution of signalling behaviour
Table 6
Strategies most often learned by the Donor with parameters S = 0.2, V = 0.2, r = 0.5 (Case 1), see Fig 4.