Maynard Smith revisited: A multi-agent reinforcement learning approach to the coevolution of signalling behaviour

Fig 5

Q-values for Case 2. The darker line is the average across all states; the faint lines are the averages for each individual state.

doi: https://doi.org/10.1371/journal.pcbi.1013302.g005