Skip to main content
Advertisement
  • Loading metrics

Emergence of an unpredictable evolution in a spatial prisoner’s dilemma via a player’s multiple perspectives

  • Tomoko Sakiyama ,

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Project administration, Resources, Software, Validation, Visualization, Writing – original draft, Writing – review & editing

    sakiyama@soka.ac.jp

    Affiliation Department of Information Systems Science, Faculty of Science and Engineering, Soka University, Tokyo, Japan

  • Ken’ichi Kojo

    Roles Methodology, Software, Writing – review & editing

    Affiliation Department of Information Systems Science, Faculty of Science and Engineering, Soka University, Tokyo, Japan

Abstract

Spatial prisoner’s dilemma (SPD) has attracted researchers’ attention as a model of conflict for players. In SPD, players have two different strategies, namely, defectors and cooperators. A defector earns a high payoff from an opponent co-operator while getting nothing from an opponent defector. On the contrary, cooperators promote a win–win relationship between the two cooperators. These mechanisms influence population dynamics in SPD, and many SPD models have been developed. However, little is known about the emergence of an unstable or unpredictable evolution in population dynamics using an SPD model, which may be observed in living systems. In addressing this issue, two SPD models were proposed. In both models, players change the neighborhood definition in accordance with their strategies and sometimes select the rule for this change using probability or local information. Result showed that our models generated characteristic population patterns that may be linked to a self-organized criticality (SOC), a term referring to many systems of interconnected, nonlinear elements that evolve over time into a critical state. In fact, the second model could be spontaneously close to the critical point using local information.

Author summary

In this study, we proposed two different Spatial prisoner’s dilemma (SPD) models. In the first model, the system presents a phase transition using parameter tuning and evolves dynamically around a critical parameter value. The other model, where players coordinate their actions not by an external parameter but by local information, can be spontaneously close to a critical point in the former proposed model.

Introduction

Spatial prisoner’s dilemma (SPD) is a well-known spatial game theory that addresses the evolution of two types of strategies [15]. In SPD, defectors and cooperators interact with each other and update their strategies with the opponent. When two defectors meet, they get a punishment payoff P, but when two cooperators meet, they have a win–win relationship by earning a reward payoff R. If a defector and a co-operator meet, then a defector receives temptation payoff T, while a co-operator receives sucker payoff S. Defectors become dominant when they obtain a larger payoff T during interaction with cooperators [18]. However, evolutions of cooperative behavior have received attention from researchers because some animals show cooperative behaviors, which may flourish the species [9,10]. In addressing this problem, many SPD models have been proposed [1120]. For example, environmental conditions support the evolution of cooperators [11,12], memory use enhances cooperation [1719], and the use of incentives has been considered for promoting cooperation [21].

The evolution of cooperators in SPD is an important issue, but it is only one aspect of moral behavior [22,23]. Therefore, focusing the discussion on the evolution/maintaining of cooperators in SPD is not always necessary. Living systems, including animals, demonstrate unpredictable dynamics with regard to evolution [2426]. Therefore, SPD can be used as an evolutionary system model that represents and describes the population dynamics observed in social or biological systems. In fact, several studies have focused on evolutionary game dynamics to reveal complex population dynamics [27,28]. In SPD, population dynamics can vary in accordance with time evolution. In some time, defectors can be dominant.

Self-organized criticalities (SOCs) describe a situation in which systems evolve over time into a critical state, and represent a key feature of some living systems [24]. Around a critical point, the system shows some fluctuations and behaves unpredictably because it presents a phase transition around a critical point. Several studies have reported the occurrence of a phase transition in SPD by considering parameter effects [29,30]. Criticality or SOC has also been observed in evolutionary game theory [3133]. Unpredictable population dynamics can emerge in SOC systems. However, previous studies have not focused on SOCs achieved via bottom-up self-organization but have instead focused on SOCs that have developed using critical parameter values [31]. On the contrary, Mahmoodi et al. (2017) reported that a bottom-up self-organization achieved SOC using an evolutionary game theory, although the decision-making model was used for strategy update, and the PD model was only partly used [33].

Thus, few studies have investigated the emergence of SOC using the extension model of pure SPD. In this study, two SPD models were developed: one was created by focusing on parameter tuning and the other involved a bottom-up (i.e., ‘agent perspective’) process. Moreover, from a game theory perspective, it is likely that players change the neighbors they can interact with if they want improve their circumstances. Therefore, in both models, players increase or decrease neighboring areas if they become losers in competitions with these neighbors. In both models, players’ strategies determine the way of change of neighboring areas. Cooperators are open-minded and have a wide perspective while defectors are selfish and have a narrow-perspective [34]. However, it is not always reasonable for them to do so. Therefore, cooperators/defectors in our models sometimes switch expansion/reduction of neighboring area with reduction/expansion of it.

In the first model, the system presents a phase transition using parameter tuning regarding the switching of neighboring areas’ expansion/reduction and evolves dynamically around a critical parameter value. The other model, where players coordinate their actions regarding the switching of neighboring areas’ expansion/reduction not by an external parameter but by local information, can be spontaneously close to a critical point in the former proposed model.

Materials and methods

Space and agent

We set 10,000 players on a two-dimensional lattice (100 × 100 sizes). Individual players were randomly assigned to either a co-operator or defector, that is, the initial defector density was 0.50. Periodic boundaries were used [34]. We run 10,000 time steps for each simulation trial. 100 trials were conducted. In this study, we considered the weak version of the PD game [35]. Therefore, we set R = 1, P = S = 0 and T = b where 1<b<2 according to values reported in previous studies [34,35] (Table 1).

Model description

Probabilistic switching (PS) model.

In this model, a variable known as neighborhood, which refers to the number of neighborhoods, was assigned to every player before starting each trial [34]. This variable is expressed as follows:

,

which shows the neighborhood for player (i, j).

At each time step, individual players randomly select neighborhood neighboring players from the Moore neighborhood (eight players) and play against individually selected neighboring players using the payoff matrix (Table 1). For example, when neighborhood(i, j) = 3, player (i, j) randomly selects three neighbors from the following: and (i+1, j+1). The selected neighbors are then elements of ωt(i, j), a variable that specifies the set of neighborhoods of player (i, j) at time t, such that the following equation is satisfied:

Then, the score is earned as follows: which shows the score of the player (i, j) at time t. Here, payofft(i,j)(k) indicates a payoff that the player (i, j) earns from its neighboring player k at time t.

After all players get their score, they compare them with their selected neighboring players and update their strategy to the neighboring player’s strategy, which earns the highest score among the neighborhoods. At that time, players become losers if they are not the player earning the highest score in their neighborhood. In that case, the variable state changes to state = loser from state = null. This is because state is set to state = null for all players at the beginning of the strategy update event. For example, for player (i, j),

Here, strategyt+1(i, j) and statet+1(i, j) are the strategy and state values for player (i, j) at time t+1 while max _strategyt(i, j) and max _scoret(i, j) indicate the strategy and score values of the player who earns the highest score at time t among player (i, j) and all other players in its neighborhoods.

Moreover, for all players for whom the variable state is now set to loser from null, they can then change the variable neighborhood at the beginning of their next iteration [34]. According to a previous study [34], cooperators are likely to expand their neighboring area while opponent players are likely to decrease the number of neighbors. In this model however, we introduce a reverse situation where cooperators decrease the number of neighbors while defectors increase the number of neighbors. Furthermore, these two situations are controlled by a parameter p.

More precisely, player (i, j) replaces the variable neighborhood(i, j) at the beginning of time t whenever statet(i, j) = loser as follows:

With a probability p,

if

if

With a probability 1−p,

if

if

In that case, updated neighborhood is used for score calculation and strategy update afterward.

Switching via local information (SLI) model

This model operates in a similar way to the PS model except for the replacement of the variable neighborhood, which is expressed as follows:

With a probability ,

if

if

With a probability

if

Here, max _scoret−1(i, j) and min _scoret−1(i, j) indicate the highest and lowest score in the neighborhood of the player (i, j) calculated at one previous iteration, respectively (considering that neighborhood updates occur at the beginning of one iteration, if needed). Therefore, individual players are able to modulate the variable neighborhood using a probability in accordance with their local information (i.e., max _score and min _score).

Increasing or decreasing neighboring areas may promote players to improve their situation since they can interact with different players from the current set after modulating neighboring areas. This is why players in our model are allowed to increase/decrease their neighboring areas when they become losers. It is likely for cooperative players to expanding their neighboring areas, which indicating that cooperators establish win–win relationships with others due to their unselfish nature. Conversely, defectors are egocentric, reflecting their narrow perspective [34]. At the same time however, it may be unclear for players whether increasing their neighboring areas or decreasing their neighboring areas brings better results.

In the SLI model, the ratio indicates the degree of score difference among players within the neighborhood. If that difference is high (i.e., has a relatively large ratio of , a cooperative player who is a loser becomes more cooperative by expanding its neighboring area and tries earning a higher score. Similarly, a defective player who is a loser becomes more selfish by decreasing its neighboring area in order to improve its situation and win the next round. In contrast, if score differences among players within the neighborhood is not so large, players adopt a reverse condition and tries keeping the size balance of their neighboring area. Thus, they change their neighboring area in an opposite way from the above situation.

Please see S1 and S2 Figs for the flowcharts.

Results

Here, we set parameter b to high payoff values (i.e., b = 1.7, 1.8 and 1.9) since a defector facing a co-operator can earn a high payoff. In fact, SPD models sometimes focus on such a high parameter value [34]. As shown in Fig 1, which demonstrates the fraction of defectors after each trial obtained from 100 trials (Fig 1A) and its standard deviations (Fig 1B), the PS model presents a phase transition with p≈0.8. The SD of the fraction of defectors around that critical value showed the highest SD, suggesting that the fraction of defectors present at the end of the trial can vary and is dependent on the trials themselves. Interestingly, the SLI model exhibits similar deviations in the fraction of defectors. This is true for all cases except b = 1.7, in which the SD of the SLI model is lower than the highest SD of the PS model; however, it still shows a high SD. Thus, these data confirm that the PS model presents a phase transition with p≈0.8 and that the SLI model can evolve criticality in some b values. However, according to Fig 2, which depicts the relationship between parameter b and fraction of defectors at the end of trials in the SLI model, during which parameter b decreases, defectors vanish, and no fluctuations in population dynamics occur thereafter. Therefore, we speculate that the SLI model can evolve criticality at high b values.

thumbnail
Fig 1. Fraction of defectors at the end of simulation trials (averaged over 100 trials) and its standard deviation (b = 1.9, 1.8 and 1.7).

(A) Fraction of defectors. (B) Standard deviation of fraction of defectors.

https://doi.org/10.1371/journal.pcsy.0000003.g001

thumbnail
Fig 2. Fraction of defectors at the end of simulation trials (averaged over 100 trials) regarding parameter b in the SLI model.

https://doi.org/10.1371/journal.pcsy.0000003.g002

Next, we evaluated system dynamics according to time evolution using both models. Here, we set b = 1.9. Fig 3 demonstrates the time evolution of defectors obtained from one trial using the SLI model and PS model with p = 0.8 and p = 0.85. The PS model with p = 0.8 and the SLI model show some fluctuations in the time evolution of defectors. Furthermore, we evaluated the time interval between becoming a loser and becoming a loser once again. Fig 4 shows the relationship between the abovementioned time interval obtained from a single player over one trial and its cumulative distribution in both models. As shown in this figure, both models exhibit a power law tailed distribution (Fig 4A: PS model with p = 0.8, μ = 2.29, AIC weights for power law against exponential law = 1.00, Fig 4B: SLI model, μ = 2.37, AIC weights for power law against exponential law = 1.00). These results indicate that the SLI model and the PS model with p = 0.8 evolve criticality because self-organized systems exhibit power law tailed distributions [36,37]. We also evaluated the emergence of power-law tailed distributions from a different perspective, i.e., the time elapsed during operation (e.g., time persistence for cooperation) [38]. S3 Fig confirms that the SLI model exhibits power-law tailed distributions with respect to this variable (μ = 1.67, AIC weights for power law against exponential law = 1.00).

thumbnail
Fig 3. Time evolutions of defectors using the PS model with p = 0.8, p = 0.85, and the SLI model (b = 1.9).

https://doi.org/10.1371/journal.pcsy.0000003.g003

thumbnail
Fig 4. Time interval between becoming a loser and becoming a loser once again (b = 1.9). (A) The PS model with p = 0.8.

(B) The SLI model.

https://doi.org/10.1371/journal.pcsy.0000003.g004

Next, we determined that the SLI model was not affected by varying initial defector density. We replaced the initial defector density from 0.50 with 0.35 or 0.65 (Fig 5A and 5B). As shown in figures, the system evolves as time progresses, suggesting that the SLI model is flexible to the initial defector densities to some extent.

thumbnail
Fig 5.

Time evolution of defectors in two different initial defector densities (b = 1.9) (A) The initial defector density: r = 0.35. (B) The initial defector density: r = 0.65.

https://doi.org/10.1371/journal.pcsy.0000003.g005

Finally, we evaluated the performance of these two proposed models for the evolution of cooperators. Fig 6 presents the results regarding the fraction of defectors at the end of trials and its relationship with parameter b. Both proposed models outperformed the classical SPD model when parameter b increased. Moreover, the SLI model outperformed the PS model. Here, players in the classical SPD model always interact with the players defined as the Von Neumann neighborhood and never change the interacting opponents until the end of the trial. The other procedures are the same as those of the proposed models. Notably, the SLI model showed similar results for this analysis even after the calculation time is extended to 20,000 time steps (Fig 7). These results confirm that calculation time of 10,000 time steps is enough. Moreover, the SLI also exhibited similar results when the system size was expanded from 100 × 100 to 200 × 200 (Fig 7).

thumbnail
Fig 6. Fraction of defectors at the end of simulation trials (averaged over 100 trials) regarding parameter b (p = 0.80 for the PS model).

https://doi.org/10.1371/journal.pcsy.0000003.g006

thumbnail
Fig 7. Fraction of defectors at the end of simulation trials (averaged over 100 trials) after varying calculation time and/or system size.

https://doi.org/10.1371/journal.pcsy.0000003.g007

Discussion

In this paper, two different SPD models were proposed to induce criticality or SOC in the population. The PS model, in which players reversibly switched the rule of the modification of neighborhood with a fixed probability, showed a phase transition regarding population dynamics. Moreover, the SLI model where players switched reversibly the rule of variable neighborhood without using a fixed probability but via local information was likely to be close to the critical point autonomously. We also found that the time interval between once becoming a loser and again becoming a loser was consistent with a power law tailed distribution. This result indicates that players in these models likely become a loser at irregular time intervals. This scale-free property is linked to the SOC [36,37].

Numerous SPD models have been proposed to address the evolution of cooperators [1120]. Nevertheless, few studies have focused on the critical behavior in population dynamics for SPD, although some studies have reported the occurrence of phase transitions in SPD [28,30]. Our proposed models have two important aspects. First, a phase transition can be induced if players use two perspectives in response to one event. In our models, players updates variable neighborhood in accordance with their strategy [34]. However, players sometimes switch the neighborhood update rule reversibly in accordance with their strategy. Thus, players have two perspectives regarding one event (the rule for the neighborhood update). Second, the players in the SLI model use local information to set the switching probability instead of using a fixed parameter (p). Therefore, individual players use different values regarding the switching probability, which can vary over time for every player. Such effects promote cooperative behavior in spatial game theories [39,40]. No parameter tuning is required to bring the system SOC [33]. In the SLI model, if max _score and min _score vary remarkably, then players likely increase or decrease the area of their neighborhood if they are cooperators/defectors. Therefore, cooperative players expand their neighborhood area to seek other cooperators because a single defector around them can be a winner who takes all. On the contrary, defector players likely decrease their neighborhood area to avoid competition with other defectors around them, who have higher scores than them. In this study, we introduced criticality or SOC in two SPD models. In inducing the SOC in SPD, players have to have dual perspectives regarding input information. Moreover, populations can move to the critical point using local information to trigger the dual perspectives.

As mentioned, SOCs describe a key feature of some living systems [24]. Our model may reveal what kind of decision-making processes of the players bring such properties to the system.

In future studies, we will modify and apply these models such that they can be used to study network topologies. This will permit further consideration of the link connection/disconnection and its relation to emerging criticalities.

Supporting information

S3 Fig. Cumulative duration of cooperation for a single player (b = 1.9).

https://doi.org/10.1371/journal.pcsy.0000003.s003

(TIF)

References

  1. 1. Nowak MA, Sigmund K. Evolutionary dynamics of biological games. Science. 2004;303: 793–799. pmid:14764867
  2. 2. Perc M, Szolnoki A. Coevolutionary games—A mini review. Biosystems. 2010;99: 109–125. pmid:19837129
  3. 3. Smith JM, Price GR. The logic of animal conflict. Nature. 1973;246: 15–18.
  4. 4. Axelrod A. The evolution of cooperation. New York: Basic Books; 1984.
  5. 5. Doebeli M, Hauert C. Models of cooperation based on the prisoner’s Dilemma and the snowdrift game. Ecol Lett. 2005;8: 748–766.
  6. 6. Szabó G, Tőke C. Evolutionary prisoner’s dilemma game on a square lattice. Phys Rev E. 1998;58: 69–73.
  7. 7. Szabó G, Vukov J, Szolnoki A. Phase diagrams for an evolutionary prisoner’s dilemma game on two-dimensional lattices. Phys Rev E Stat Nonlin Soft Matter Phys. 2005;72: 047107. pmid:16383580
  8. 8. Sakiyama T, Arizono I. An adaptive replacement of the rule update triggers the cooperative evolution in the hawk–dove game. Chaos Solitons Fract. 2019;121: 59–62.
  9. 9. Dugatkin LA. Cooperation among animals: an evolutionary perspective. Oxford: Oxford University Press; 1997.
  10. 10. Enquist M, Leimar O. The evolution of cooperation in mobile organisms. Anim Behav. 1993;45: 747–757.
  11. 11. Qin J, Chen Y, Fu W, Kang Y, Perc M. Neighborhood diversity promotes cooperation in social dilemmas. IEEE Access. 2018;6: 5003–5009.
  12. 12. Shu G, Du X, Li Y. Surrounding information consideration promotes cooperation in prisoner’s dilemma game. Chaos Solitons Fract. 2016;91: 689–694.
  13. 13. Shen C, Chu C, Shi L, Perc M, Wang Z. Aspiration-based coevolution of link weight promotes cooperation in the spatial prisoner’s dilemma game. R Soc Open Sci. 2018;5: 180199. pmid:29892454
  14. 14. Li Y, Zhang J, Perc M. Effects of compassion on the evolution of cooperation in spatial social dilemmas. Appl Math Comput. 2018;320: 437–443.
  15. 15. Perc M. Uncertainties facilitate aggressive behavior in a spatial hawk–dove game. Int J Bifurcation Chaos. 2007;17: 4223–4227.
  16. 16. Jiménez R, Lugo H, Cuesta JA, Sánchez A. Emergence and resilience of cooperation in the spatial prisoner’s dilemma via a reward mechanism. J Theor Biol. 2008;250: 475–483. pmid:18045621
  17. 17. Javarone MA. Statistical physics of the spatial prisoner’s dilemma with memory-aware agents. Eur Phys J B. 2016;89: 1–6.
  18. 18. Danku Z, Perc M, Szolnoki A. Knowing the past improves cooperation in the future. Sci Rep. 2019;9: 262. pmid:30670732
  19. 19. Cheng H, Meng X. Evolution of cooperation in multigame with environmental space and delay. Biosystems. 2023;223: 104801. pmid:36372199
  20. 20. Cardillo A, Gómez-Gardeñes J, Vilone D, Sánchez A. Co-evolution of strategies and update rules in the prisoner’s dilemma game on complex networks. New J Phys. 2010;12: 103034.
  21. 21. Wang S, Chen X, Xiao Z, Szolnoki A, Vasconcelos VV. Optimization of institutional incentives for cooperation in structured populations. Journal of the Royal Society Interface. 2023; 20(199): 20220653. pmid:36722070
  22. 22. Capraro V, Perc M. Mathematical foundations of moral preferences. J R Soc Interface. 2021;18: 20200880. pmid:33561377
  23. 23. Jusup M, Holme P, Kanazawa K, Takayasu M, Romić I, Wang Z, et al. Social physics. Phys Rep. 2022;948: 1–148.
  24. 24. Bak P, Tang C, Wiesenfeld K. Self-organized criticality. Phys Rev A Gen Phys. 1988;38: 364–374. pmid:9900174
  25. 25. Kauffman SA. The origins of order self-organization and selection in evolution. Oxford: Oxford University Press; 1993.
  26. 26. Sigmund K. Games of life. Oxford: Oxford University Press; 1993.
  27. 27. Wang Q, He N, Chen X. Replicator dynamics for public goods game with resource allocation in large populations. Applied Mathematics and Computation. 2017; 328: 162–170.
  28. 28. Liu L, Chen X, Szolnoki A. Evolutionary dynamics of cooperation in a population with probabilistic corrupt enforcers and violators. Mathematical Models and Methods in Applied Sciences. 2019; 29(11): 2127–2149.
  29. 29. Szabó G, Hauert C. Phase transitions and volunteering in spatial public goods games. Phys Rev Lett. 2002;89: 118101. pmid:12225171
  30. 30. Szolnoki A, Perc M. Correlation of positive and negative reciprocity fails to confer an evolutionary advantage: phase transitions to elementary strategies. Phys Rev X. 2013;3: 041021.
  31. 31. Killingback T, Doebeli M. Self-organized criticality in spatial evolutionary game theory. J Theor Biol. 1998;191: 335–340. pmid:9631572
  32. 32. Mahmoodi K, Grigolini P. Evolutionary game theory and criticality. J Phys A: Math Theor. 2017;50: 015101.
  33. 33. Mahmoodi K, West BJ, Grigolini P. Self-organizing complex networks: individual versus global rules. Front Physiol. 2017;8: 478. pmid:28736534
  34. 34. Kojo KI, Sakiyama T. Restructuring of neighborhood definition based on strategies will enhance the cooperation in a spatial prisoner’s dilemma. Chaos Solitons Fract. 2024;179: 114404.
  35. 35. Nowak MA, May RM. Evolutionary games and spatial chaos. nature, 1992; 359(6398), 826–829.
  36. 36. Sakiyama T. A game of life shifted toward a critical point. Complex Syst. 2023;32: 57–70.
  37. 37. Sakiyama T, Gunji YP. Uncertain density balance triggers scale-free evolution in game of life. Complex Syst. 2017;26: 31–38.
  38. 38. Rong Z, Wu ZX, Li X, Holme P, Chen G. Heterogeneous cooperative leadership structure emerging from random regular graphs. Chaos: An Interdisciplinary Journal of Nonlinear Science, 2019; 29(10).
  39. 39. Sakiyama T. A power law network in an evolutionary hawk–dove game. Chaos Solitons Fract. 2021;146: 110932.
  40. 40. Takahara A, Sakiyama T. Twisted strategy may enhance the evolution of cooperation in spatial prisoner’s dilemma. Phys A. 2023;629: 129212.