Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Cascades in capacity constrained agents

Abstract

Many sorts of contagious phenomenon, such as music, do not exist in isolation but as part of a competitive marketplace. In these settings there are often superstars with out-sized popularity along with a large number of flops with little popularity. It could be the case that superstars are more popular because they are higher quality but I suggest that capacity constraints may be a structural factor that influences these disparities. In this agent-based model, there are multiple potentially cascading states that the agent can potentially occupy. The agents have a certain capacity of states that they can occupy at once. For example, suppose someone has a workout playlist that lasts 1 hour. As they discover new music to add to the playlist, they have to remove songs currently in the playlist to keep the playlist 1 hour. Thus, in this setting, the states indirectly trade off with each other by virtue of the capacity constraint. The key question is whether the indirect trade offs imposed by the capacity constraint are enough to induce disparities in popularity, even when the states are otherwise identical. I find that increasing the number of states in excess of capacity increases the disparities between popular and unpopular states. This suggests that capacity constraints may be a structural factor in explaining market concentration and superstar phenomenon.

Introduction

Diffusion processes are an important social phenomenon that has been studied across multiple domains such as consumer goods [1], adoption of hybrid corn [2], the spread of disease [3], information on social media [4], revolutions [5], as well as many theoretical explorations [611].

Cascades often do not occur in isolation, but exist in an environment with multiple potential cascades that could occur simultaneously. One way to model multiple simultaneous cascades or contagions is through direct interaction between the contagions. These models are typically used in an epidemiological setting where there is some sort of infection that is spreading and a social behavior like vaccination or distancing also spreads through the population, which modulates the spread of the infection but is also driven by the infection [1215]. The interaction between multiple contagions can become computationally challenging but in well mixed populations it has been shown that these models are equivalent to complex contagion models [16].

Simultaneous cascade models can help clarify why some things before popular while others do not. The first class of explanations is around quality: things are popular because they are better. The implication here is that popularity is proportional to quality, so if one musician sells 100 times more tickets than another one would expect their music to be 100 times better. The second class of explanations is based on market structure: things are popular because they are somehow advantaged. The implication here is that one would be able to find components of the market structure that could generate disparities in popularity even among identical products. On way to think about identical products is: products which can be differentiated via a label but are identical in use (generic drugs vs name brand drugs in a world where they are also the same price).

When considering identical valued products, it is natural to think of them as substitutes for each other. For example no matter how many washing machine brands are on the market, I as a consumer only need one washing machine and would be unlikely to buy two washing machines of different brands. Thus once my capacity for washing machines is met, I have no need to buy any more washing machines. While products that someone only needs one of can easily demonstrate the capacity constraint, the capacity for other products could be greater than one. For example one could regularly listen to a playlist of songs that lasts 1 hour, changing out the different songs over time as their tastes change instead of adding new songs to the end. It it also reasonable that different people could have different capacities for the same product, for example a family of four probably wants more spoons than an individual who lives alone does or different people could be perscribed different doses of the same drug. Given the similarities between this process and the types of congestion White observed in human communication networks, it is reasonable to expect these kinds of dynamics to influence a wide range of products [17].

Models of Oligopolistic competition can be a good way to undestand the market dymanics of identical products. The classic comparison would be between the Cournot Model (simultaneous output choice) and the Stackelberg model (sequential output setting), suggesting that the structural advantage of being able to set prices first drives a disparity between the equilibrium profits of the firms [18, 19]. In this way, capacity constraints could provide a sort of incumbency advantage to products which are adopted first and could serve to add Stackelberg like dynamics to diffusion models.

Given that the music market regularly experiences multiple overlapping cascades, the stylized characteristics of the market can be useful for understanding how multicascade processes empirically function. Notably the music market is somewhere one would expect to find elements of both quality driving popularity and structural popularity. On a structural level, the music industry historically has experienced high levels of firm concentration and this concentration is associated with a lower level musical diversity [20, 21]. At the level of artists the music industry also shows signs of concentration where a small number of popular artists have out sized influence at any given time [2224]. Quality in music has proven difficult to empirically measure, as experimental evidence suggests that social influence and more typical cascade dynamics are at play [25, 26], but more recent analysis suggests that these effects may only be temporary perturbations from songs’ fundamental value [27]. In their experiment [26] presented their participants with 48 different songs to potentially listen to and download for free and found that they listened to only 7 songs on average only downloaded 1 on average. This suggests that the their participants may be capacity constrained in both their interest in listening to songs and downloading them.

Materials and methods

Model

This model is a variation of the Threshold Cascade model as described in [28]. There are N agents in the population who each have a cascade capacity of C. This means that if that agent would adopt a cascading state that would bring the number of states adopted greater than C, one of the currently adopted states is randomly dropped. There are S total states that can cascade and in order for the capacity constraint to be binding S > C. Agents can adopt a new state through either Threshold Cascading or Random adoption. Threshold cascading follows the process described in [7], each agent has a threshold Ti and adopts the state if the number of agents who have already adopted the state is greater than or equal to Ti. Each state is evaluated independently of each other state, so an agent would not consider the number of agents who adopted state 1 when evaluating state 2. The threshold Ti varies across agents, but it does not vary between states. As in [7], the thresholds drawn from a normal distribution with mean μ and variance σ. Since the number of states adopted is discrete, the values of output by the normal distribution were rounded. For Random adoption, each agent is challenged each time step to adopt a random state. If the agent has already adopted the state, nothing changes. If the agent has not already adopted the state it adopts the state with probability p as defined by . A similar formulation for random adoption was used by [28]. At each time step each agent first checks for threshold adoption, then random adoption and finally checks for capacity. Since states are randomly dropped at the end of the round, it is possible for a state to be adopted and dropped within the same round.

When the model is initialized all agents are assigned their threshold but begin having adopted no states. This means that in order for a state to spread via threshold contagion it must first be randomly adopted. All agents have access to the set of states every other agent has adopted at the end of the round (after dropping for capacity) and uses this to determine threshold adoption in the next round. The variation in adoption thresholds, Ti, among the agents represents the key structure in this population. Therefore one could interpret the threshold contagion model as a form of complex contagion model, but on a complete graph [29]. Instead of focusing on spread between individuals this model is better suited for describing population level phenomenon. While it is true than individual and network level dynamics can matter, the presence of small global signals can overpower local diffusion [30].

There have been some attempts to quantify thresholds at the individual and network level [31, 32], but attempts to merge threshold models with more classical diffusion models have also been successful [33, 34]. This suggests that even while thresholds may be difficult to empirically measure the results of these models can be compared to other classes of diffusion models.

Parameter space

This model has 7 parameters. N represents the total number of agents in the population. T represents the total number of time steps. m represents the slope parameter in the random adoption function. C represents the capacity of each agent. S represents the total number of states. μ represents the mean of the threshold distribution and σ represents the variance of the threshold distribution. Table 1 shows the ranges of each of the parameters contained in this experiment.

Since this simulation is designed to test the effect of imposing the capacity constraint it was designed similarly to a randomized control trial with a control arm and two treatment arms. The first arm represents the control, with no capacity constraints. In this arm capacity was set to the number of states, so when states was 5 capacity was 5 and so on. The next arm is a treatment arm, where capacity was set to 3. In this arm there are excess capacities of 2,7,12, and 17. The final arm is a treatment arm where capacity was set to 4. In this arm there are excess capacities of 1,6,11, and 16.

There are 122412 different combinations in this parameter space and each unique parameter combination was replicated 100 times so there were 12241200 total runs in this experiment. This experiment was constructed using the python package AgentPy. [35]

Results

Outcomes of interest

The core outcome of interest is the number of agents who have adopted a given state. Going forward I will refer to one agent adopting a state at a given time as a count. Since counts vary both over time within a simulation run and across states, I need to aggregate in two dimensions. To aggregate across time, I look at the average number of counts over time (average counts), variance in counts over time (variance in counts), the maximum count reached (max counts), the time to reach the maximum (time to max), the sum of counts over time (final counts), the minimum count after the maximum count was reached (min after max), the time between the maximum and the minimum after maximum (decay time), as well as the Shannon entropy [36] in counts (entropy counts). Once these aggregations across time are calculated for each state, I aggregate across state using the mean, median, variance, mean absolute deviation, maximum and range (maximum—minimum).

It is also important to measure concentration/ tailedness in the distributions across states. To do this, I also the kurtosis, the Herfindahl-Hirschman Index [37, 38] across final counts and number of states where final count is zero. Since these are measuring concentration, final counts (sum over time) is the natural way to aggregate over time. Table 2 shows the summary statistics for these outcomes.

Fig 1 shows the average counts averaged across states for each μ and σ value grouped by number of excess states, while Figs 2 and 3 shows the same type of plot for final counts and decay time respectively. These plots have similar fan like shape across both excess states as well as the range of μ and σ combinations for which there are interesting count values. Figs 4 and 5 show the Herfindahl index and kurtosis respectively averaged by each μ and σ pair grouped by number of excess states. Given the similarity between these groups of graphs, it suggests that the Herfindahl index and kurtosis are both effectively functioning as measures of concentration across the states. Notably both the Herfindahl index and kurtosis reach high values in a band across the bottom right corner going from around μ = 40 to σ = 30.

Fig 6 shows the counts over time from four different runs with different configurations of parameters. The lower left graph has μ = 41, σ = 5, and 2 excess states putting it within the high concentration band. There are not many total counts in this run, and since the high concentration bad overlaps with areas of low final and average counts generally this suggest that in this high concentration there is minimal adoption of any states. Thus this region of the parameter space somewhat resembles a natural monopoly, where it is difficult to successfully enter and those that do dominate.

Regressions

The primary independent variable of interest is excess states, which is S-C. The OLS regressions follow the form: Y = β1ExcessStates + β2μ + β3σ + c. Tables 3 and 4 show the regression coefficients for the variable excess states for all outcomes of interest. Each time aggregation—state aggregation pair is the dependant variable. So, the mean across states of the average counts over time is a separate model from the one considering the range across states of the average counts over time. All coefficients listed in this table are statistically significant at the 1% level. In Table 3 the columns Mean, Standard Deviation, Max, and Range refer to the method of aggregation across states within a run.

thumbnail
Table 4. Excess states regression coefficient for global variables.

https://doi.org/10.1371/journal.pone.0280326.t004

As a robustness check, I also preformed aggregations by Median and Mean Absolute Deviation. The Median is used as an outlier robust aggregation for the center of the distribution (compared to the Mean) and Mean Absolute Deviation is used as an outlier robust measure of dispersion (compared to Standard Deviation). The table with regression coefficients for Median and Mean Absolute Deviation aggregations are in the S1 File.

As the results in Table 3 shows the sign for excess states coefficient for all measures except decay time are negative. For decay time this suggests that increasing excess states increases time between the maximum and the minimum after the maximum. For the rest of the outcomes of interest this suggests that increasing the number of excess stats is associated with less overall reductions on average. The reductions in the dispersion measures of variance in counts and entropy in counts suggests that increasing excess states leads to runs which are more consistent in time. On the other hand a negative coefficient for Final Counts, Max Counts, Average Counts and Min after Max suggest less average adoption across states.

Alternatively the coefficient for the range aggregation has a positive coefficient for all of the outcomes of interest. With the exception of Time to Maximum and Decay Time, the coefficients for the maximum aggregation are negative. In these cases given the coefficients for the means, and the range this implies that while the increase in excess states reduces the average and maximum level, the impact disproportional effects the minimum since the range increases. When looking at the number of zeros, the regression coefficient for excess states is also positive, providing additional evidence for disproportionate effect of increasing excess states on the minimum counts. For Time to Maximum and Variance in Counts, it is possible that the positive effect of excess states on the range is due to the increases in the maximum. The broadly suggests that increasing the number of states can widen the disparities between the popularity of the states.

Similar to the range, the regression coefficients for the standard deviation aggregation are positive for all outcomes. [28] use standard deviation as a measure of unpredictability, but while they were measuring unpredictability across runs this measures unpredictability across states. This suggests that increasing the excess states makes all of the outcomes of interest less predictable.

The full regression tables are listed in the S1 File.

Discussion

The parameter configurations shown in Fig 6 demonstrated four different competitive regimes. For example while both μ = 10, σ = 5, C = 3, S = 5 and μ = 20, σ = 5, C = 3, S = 5 are both highly competitive, there is a clear losing state in μ = 20, σ = 5, C = 3, S = 5. On the other hand μ = 30, σ = 40, C = 3, S = 10 is characterized by the emergence of a few clear winners with the rest of the states struggling to compete. As I mentioned earlier, in μ = 41, σ = 5, C = 3, S = 5 adoption is very difficult, so the three states that end up getting adopted are only adopted once. This suggests that the threshold parameters do capture a rich enough space of possible outcomes.

One way to think of these results is in terms of observed and unobserved cascades, in the sense that we are more likely to observe cascades that succeed, but not those that fizzle. In the context of music, consider that for every artist that becomes an “overnight success” there may be many others toiling away in obscurity. In this sense the given states that do not achieve popularity could be thought of counter factually as ones that could have. Each of the cascading states was facing the same distribution of thresholds as each other, with the only differences in their outcomes being due to chance. This shows that large disparities in popularity can occur even without and underlying differences in the “quality” of the cascading state, and purely arise from structure. Due to the trade off between the different states imposed by the capacity constraint, single cascade models and multicascade models without the capacity constraint will miss the effect unpopular states have on successful states. This also suggests that just because something is popular, that does not imply it is high quality.

Since the capacity constraint provides a mechanism for agents to regularly remove states, this can help determine their behavior as they fade away. The decay time results suggest that while on average increases excess states reduces the decay time, it actually increases the maximum decay time. This suggests that the in more competitive environments the most stable states are even more stable. A similar pattern is shown with time to maximum, suggesting that in more competitive environments even though the average peak is earlier, the maximum peak is delayed. It is possible that this could be due to a sort of lock in effect, where once a state is adopted by a certain threshold of agents there is a minimum level it can no longer dip below. The analysis of minimum post maximum, suggest that this may not be the case as adding excess states decreases the maximum minimum post maximum. If there is lock in, this suggests that increasing excess states reduces the floor that states are locked in above.

Conclusion

These results suggest that the capacity constraint may play an important role in the diffusion dynamics of environments with multiple states that could potentially cascade. Increasing the number of total states in excess of capacity is associated with increased concentration of popularity, larger disparities between popular and unpopular states as well as greater unpredictability in which states will become popular, even while the popularity of a given state over time tended to become more predictable. Unsurprisingly increased competition from greater excess states tended to reduce average popularity overall, the heterogeneous impact suggests that capacity constraints may play a role in driving the superstar phenomena that [22] describes. Since each of the states begin equally preferable, this suggests a mechanism for how random chance and structure can drive popularity as opposed to underlying value. Thus, more empirical work is needed to measure people’s capacities as well as determine the influence of the capacity constraint on real systems.

Acknowledgments

I’d like to acknowledge valuable advice from Nicholas Christakis, Scott Boorman, Edoardo Airoldi, and Michael Macy. I’d also like to thank the Human Nature Lab and the Yale Institute for Network Science for providing computational resources and feedback. Finally, thanks to Derek Schultz.

References

  1. 1. Bass FM. A new product growth for model consumer durables. Management science. 1969;15(5):215–227.
  2. 2. Griliches Z. Hybrid corn: An exploration in the economics of technological change. Econometrica, Journal of the Econometric Society. 1957; p. 501–522.
  3. 3. Kermack WO, McKendrick AG. A contribution to the mathematical theory of epidemics. In: Proceedings of the Royal Society of London A: mathematical, physical and engineering sciences. vol. 115. The Royal Society; 1927. p. 700–721.
  4. 4. Goel S, Anderson A, Hofman J, Watts DJ. The structural virality of online diffusion. Management Science. 2015;62(1):180–196.
  5. 5. Kuran T. Now out of never: The element of surprise in the East European revolution of 1989. World politics. 1991;44(1):7–48.
  6. 6. Boorman SA. Island models for takeover by a social trait facing a frequency-dependent selection barrier in a Mendelian population. Proceedings of the National Academy of Sciences. 1974;71(5):2103–2107.
  7. 7. Granovetter M. Threshold models of collective behavior. American journal of sociology. 1978;83(6):1420–1443.
  8. 8. Granovetter M, Soong R. Threshold models of diffusion and collective behavior. Journal of Mathematical sociology. 1983;9(3):165–179.
  9. 9. Macy MW. Chains of cooperation: Threshold effects in collective action. American Sociological Review. 1991; p. 730–747.
  10. 10. Centola D, Eguíluz VM, Macy MW. Cascade dynamics of complex propagation. Physica A: Statistical Mechanics and its Applications. 2007;374(1):449–456.
  11. 11. Watts DJ. In: A simple model of global cascades on random networks. Princeton University Press; 2011. p. 497–502.
  12. 12. Epstein JM, Parker J, Cummings D, Hammond RA. Coupled contagion dynamics of fear and disease: mathematical and computational explorations. PLoS One. 2008;3(12):e3955. pmid:19079607
  13. 13. Perra N, Balcan D, Gonçalves B, Vespignani A. Towards a characterization of behavior-disease models. PloS one. 2011;6(8):e23084. pmid:21826228
  14. 14. Bauch CT, Galvani AP. Social factors in epidemiology. Science. 2013;342(6154):47–49. pmid:24092718
  15. 15. Fu F, Christakis NA, Fowler JH. Dueling biological and social contagions. Scientific Reports. 2017;7. pmid:28252663
  16. 16. Hébert-Dufresne L, Scarpino SV, Young JG. Macroscopic patterns of interacting contagions are indistinguishable from social reinforcement. Nature Physics. 2020; p. 1–6. pmid:34221104
  17. 17. White HC. Everyday Life in Stochastic Networks. Sociological Inquiry. 1973;43(1):43–49.
  18. 18. Anderson SP, Engers M. Stackelberg versus Cournot oligopoly equilibrium. International Journal of Industrial Organization. 1992;10(1):127–135.
  19. 19. Hamilton JH, Slutsky SM. Endogenous timing in duopoly games: Stackelberg or Cournot equilibria. Games and Economic Behavior. 1990;2(1):29–46.
  20. 20. Peterson RA, Berger DG. Cycles in symbol production: The case of popular music. American sociological review. 1975; p. 158–173.
  21. 21. Peterson RA, Berger DG. Measuring industry concentration, diversity, and innovation in popular music. American Sociological Review. 1996;61(1):175–178.
  22. 22. Rosen S. The economics of superstars. The American economic review. 1981;71(5):845–858.
  23. 23. Hamlen WA Jr. Superstardom in popular music: Empirical evidence. The Review of Economics and Statistics. 1991; p. 729–733.
  24. 24. Krueger AB. The economics of real superstars: The market for rock concerts in the material world. Journal of Labor Economics. 2005;23(1):1–30.
  25. 25. Salganik MJ, Dodds PS, Watts DJ. Experimental study of inequality and unpredictability in an artificial cultural market. science. 2006;311(5762):854–856. pmid:16469928
  26. 26. Salganik MJ, Watts DJ. Leading the herd astray: An experimental study of self-fulfilling prophecies in an artificial cultural market. Social psychology quarterly. 2008;71(4):338–355.
  27. 27. Van de Rijt A. Self-correcting dynamics in social influence processes. American journal of sociology. 2019;124(5):1468–1495.
  28. 28. Macy M, Evtushenko A. Threshold Models of Collective Behavior II: The Predictability Paradox and Spontaneous Instigation. Sociological Science. 2020;7:628–648.
  29. 29. Centola D, Macy M. Complex contagions and the weakness of long ties. American journal of Sociology. 2007;113(3):702–734.
  30. 30. Rossman G, Fisher JC. Network hubs cease to be influential in the presence of low levels of advertising. Proceedings of the National Academy of Sciences. 2021;118(7). pmid:33579818
  31. 31. Valente TW. Social network thresholds in the diffusion of innovations. Social networks. 1996;18(1):69–89.
  32. 32. Romero DM, Meeder B, Kleinberg J. Differences in the mechanics of information diffusion across topics: idioms, political hashtags, and complex contagion on twitter. In: Proceedings of the 20th international conference on World wide web; 2011. p. 695–704.
  33. 33. Hedström P. Contagious collectivities: On the spatial diffusion of Swedish trade unions, 1890-1940. American journal of sociology. 1994;99(5):1157–1179.
  34. 34. Braun N. Individual thresholds and social diffusion. Rationality and Society. 1995;7(2):167–182.
  35. 35. Foramitti J. AgentPy: A package for agent-based modeling in Python. Journal of Open Source Software. 2021;6(62):3065.
  36. 36. Shannon CE. A mathematical theory of communication. The Bell system technical journal. 1948;27(3):379–423.
  37. 37. Herfindahl OC. Concentration in the steel industry [Thesis]; 1950.
  38. 38. Hirschman AO. National power and the structure of foreign trade. vol. 105. Univ of California Press; 1945.