Figures
Abstract
In an era where digital communication accelerates the global spread of false narratives, understanding how misinformation and disinformation propagate, especially during crises such as the COVID-19 pandemic, is vital to public health and policy. To delve into the diffusion mechanisms of misinformation (unintentionally false information) and disinformation (intentionally false information), we introduce a novel enhanced agent-based model (ABM) that integrates psycho-social factors and communication networks, which are elements often overlooked in traditional equation-based models (EBMs). We assess the two distinct techniques (ABMs and EBMs) through the lens of six classical SEIRS-class models (S susceptible, E exposed, I infected, R recovered). Beside the enhanced ABM, we also develop a simple ABM to emulate the EBM structure. We compare the ABMs with the EBMs over their entire parameter ranges in a total of 11110 experiments. Results show an overall weak equivalence between the two types of models, even if, under certain conditions, the outcomes of the EBMs and ABMs are similar. Furthermore, we evaluate the two model types by fitting them to real-world infodemic data on vaccine acceptance over 36 weeks using a multi-objective optimization procedure. The enhanced ABM shows an exceptionally better fit to real-world data (Pearson’s correlation coefficient and normalized root mean of square error NRMSE = 0.055) than the EBM (
, NRMSE = 0.418) and the simple ABM (
, NRMSE = 0.103). These findings underscore the critical role of model structure in capturing infodemic dynamics, and advocate for the use of ABMs when psycho-social influences and network interactions are central to the phenomenon.
Citation: Berceanu C, Bertolotti F, Arshad N, Patrascu M (2025) Understanding the mechanisms of infodemics: Equation-based vs. agent-based models. PLoS One 20(12): e0338614. https://doi.org/10.1371/journal.pone.0338614
Editor: Siew Ann Cheong, Nanyang Technological University, SINGAPORE
Received: December 20, 2024; Accepted: November 24, 2025; Published: December 17, 2025
Copyright: © 2025 Berceanu et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: Models, experiment configurations, and heatmap results are available in the Github Repository “Understanding the Mechanisms of Infodemics: Equation-Based vs Agent-Based Models” (https://github.com/cristian-berceanu/understanding_the_mechanisms_of_infodemics). The real world data is described by Collis, A., Garimella, K., Moehring, A. et al. Global survey on COVID-19 beliefs, behaviours and norms. Nat Hum Behav 6, 1310–1317 (2022). https://doi.org/10.1038/s41562-022-01347-1, and can be accessed via the dataset “COVID-19 Beliefs, Behaviors & Norms Survey” (https://covidsurvey.mit.edu/).
Funding: No direct funding is declared for this article. MP would like to thank the GC Rieber Foundation, the Norwegian Government, and the Research Council of Norway (288164) for supporting their work at the Centre for Elderly and Nursing Home Medicine, University of Bergen, Norway. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
In 1995, a year that highlights the rising wave of internet-related development, Kevin Kelly was writing [1]: “The central act of the coming era is to connect everything to everything.” Nearly 30 years later, we are witnessing an unprecedented upsurge in internet-based social media platforms and instruments for person-to-person and person-to-group communication. Humans, more so than before, are now connected in ways that foster high and ubiquitous access to information and to each other [2]. Our social networks are larger than in previous decades, in both number of actors (nodes) and ties (edges), which makes way for new questions and calls for new analysis methodologies [3]. The field of complex systems provides such opportunities in the form of network science, simulation of computational models, and specific concepts (e.g., small worlds, co-evolution, emergence, etc.) [4,5].
Social communities can be considered multi-layered networks in which people interact through a variety of relationships. This interpretation allows them to be investigated as complex networks [6,7]. When considering the available communication channels (phone, messaging apps, social media platforms, etc.), the complexity of these systems increases, both in size and relationship heterogeneity [8]. Topology-based modeling is already intricate [9], but when we must also account for the content of the information exchange between nodes, modeling social networks overlaid onto social media or communication networks becomes even more challenging. In a complex systems perspective, humans can be seen as systems with agency (capable of autonomous decision-making and reasoning), while the interactions between them are interpreted as signals: either information, energy, or matter over time. For the specific case of information diffusion, this approach allows us to consider the exchanged content.
In complex systems science, the chosen level of analysis typically depends on the modeling purpose and can significantly impact the overall results [10,11]. To enhance scientific communication and ensure replicability, researchers often employ a simplifying interpretative method, which results into a model on one of three scales [12]: a) macroscopic, when the whole network is represented as a black-box (i.e., without understanding its internal behavior), and only large-scale variations of outcomes are observed; b) mesoscopic, in which some parts of the network are differentiated, but the level of granularity does not permit modeling individual nodes; and c) microscopic, when each entity in the network has agency and manages its own dynamics and psycho-social interactions.
Given this framework, the study of social-network information diffusion on a microscopic level is of increasing interest. Network science has been investigating the emergence of social ties and how ideas could travel through internet-enabled media, such as political blog analysis [13] and election manipulation [14]. During the COVID-19 pandemic, the diffusion of untrue or malicious information resulted in an infodemic [15], and a co-evolution effect has been observed with the spread of the virus itself [16]. Consequently, analyzing and predicting the effect of infodemics is of high interest [17], and efforts have been focused on integrating epidemiological models of viral spread with those of information diffusion [16,18].
Study aim and hypothesis
The aim of this paper is to investigate two model types for the spread of mis- and disinformation: macroscopic equation-based models (EBMs) and microscopic agent-based models (ABMs). We hypothesize that microscopic agent-based models are necessary to capture the elements of the human psycho-social context. To achieve this, we define a set of key objectives:
- Model development based on six classical viral spread EBMs that comprise the SEIRS-class, with S susceptible, E exposed, I infected, R recovered:
- Develop a simple ABM parameterized to entail mis- and disinformation behaviors at microscopic level, which emulates the assumptions of the classical EBMs.
- Develop an enhanced ABM that, aside from the simple ABM functionality, accounts for the communication network and psycho-social interactions.
- Inter-model equivalence of EBMs and ABMs:
- Perform a detailed cross-correlation analysis over the entire parameter range for all models
- Compare the models via a fitting experiment using real-world infodemic data
Background
Information diffusion models
A diffusion model is a framework, usually implemented mathematically or computationally, that represents the spreading of entities or features in a space [19], which can be either topological or relational (i.e., a network) [20]. Even if they were first developed in physical sciences, social sciences have widely adopted them, assuming that different kinds of features could spread within a population [21]. Diffusion models provide a systematic way to understand and analyze the dynamics of spread processes as a means of predicting their outcomes or investigating the factors that influence diffusion [22]. Diffusion models are well established in different disciplines, such as epidemics [23,24], information spread [25,26], opinion and strategy dynamics [27], and the economics of innovation [28,29]. In particular, the epidemic models derived from Kermack and McKendrick’s seminal works [30–32] have received increased attention. The underlying assumptions of these models are: a) an entity within a population could exist in one specific state (i.e., susceptible, infected, recovered, or exposed); b) an infection could be transmitted by proximity to another individual (i.e., in the classic susceptible-infected-recovered SIR model the number of newly infected individuals at time t depends on the product of susceptible and infected individuals, which stands for the number of interactions); and c) any state could be reached by internal dynamics. Infodemic models equate “infection” with “knowledge”, thus proposing an analogous interpretation [33].
Diffusion models are generally characterized by four elements: the structure of the interactions, dynamics of the interactions, possible states that model the diffused feature, and their dynamics. A classical approach to the diffusion model (for instance, the one adopted by Kermack and McKendrick [30] which is derived from Newtonian mechanics), consists of modeling a population as state variables. Each state typically varies over time according to a differential equation, such that the structure of the model is an oriented graph where each node is a population subset in a given state, and the connections depict functional dependencies. This approach is often called equation-based modeling [34,35]. In contrast, in the last 30 years, an opposite paradigm has emerged, where the atomic unit of the model is not the state but the individual decision-making entity itself [36], usually represented as a computational object [37]. When these computational entities are agents (i.e., systems with agency [38]), the methodology is known as agent-based modeling [39,40].
In spread modeling, equation- and agent-based approaches are often seen as concurrent, with complementary strengths and weaknesses [41,42], and there has been a long discussion regarding the preference for each of these methods [43], as well as the effect of the interaction structure on diffusion over a network [21,25] or more generally on its dynamics [44]. On the one hand, EBMs are much less computationally expensive, at the cost of assuming homogeneity of sub-populations and using a mean-field approximation to the structure of interactions [45]. On the other hand, the smaller the population, the more individual features matter, and employing ABMs adheres closer to reality [46,47].
In epidemiology, there is an observable interest in assessing which methodology is better [48–52] and in identifying strategies to develop hybrid models that encompass the best of both [46,53,54]. However, very few studies establish a correlation between epidemics and infodemics [55].
Infodemics
According to the World Health Organization, “an infodemic is too much information including false or misleading information in digital and physical environments during a disease outbreak” [56]. Based on the degree of misleading intentionality, diffused untruths can be classified into [57,58]: a) misinformation, which spreads without the intention to mislead; b) disinformation, which is produced purposely to cause harm (e.g., reporting manipulated statistics); c) malinformation, which refers to broadcasting true information to cause harm, such as circulating a report without its original context; d) rumor, which concerns the distribution of unverified information; and e) fake news, which are fabricated information mimicking news content.
On the one hand, gossip and rumor-sharing contribute to building and maintaining social ties. The drive to express views and perspectives in unreliable circumstances fuels the spread of mis- and disinformation [59], which has had detrimental effects on public health and politico-economic issues [60]. Notably, the waves of misinformation associated with the COVID-19 pandemic have raised multiple psychological and psycho-social issues that led to inappropriate measures, political instability, and mistrust in governing bodies [61–64]. Even when scientific inaccuracies are confined within seemingly closed communities [65], they have widespread and destructive effects on social groups, especially marginalized ones, leading to widescale societal changes and unrest [66].
On the other hand, misinformation and disinformation affect reliability and trust in social media networks as avenues for the dissemination of news or other important verified facts. In general, the spread of untruths intended to manipulate the perceptions of users has been recognized as a fundamental issue in democratic societies [67]. Subsequently, detecting falsity in diffused information has become an important concern, and studies are looking to minimize affected users and reduce propagation [68].
We surmise that proper analysis tools and models are necessary, to study not only the paths of misinformation but also its long-term impact on human psycho-social behavior.
Methods
In this section we describe the equation- and agent-based models for mis- and disinformation diffusion, the comparative analysis method, and tools for model implementation and development. All models and results are available in the Github repository [69], including the scripts to create a docker image that runs the experiments for the two cases automatically.
Preliminary analysis: A psychological perspective on the infodemic infection mechanisms
Infodemic models have been relying on the concept of “misinformation epidemic” [70], with subsequent deterministic models based on the epidemic SIR-class equations [42]. While this assumptions is not without merit, information travels in an inherently different manner than biological viruses, being affected by how communication is carried out and by individualized factors such as the illusion of knowledge [71].
Thus, in this section we analyze whether the four states of susceptible, exposed, infected and recovered are suitable for describing the spread of mis- and disinformation. Research on the psychology of misinformation has provided much insight into the psychological processes underlying susceptibility to misinformation in multiple domains [72,73]. Cognitive factors that contribute to supporting false beliefs include intuitive thinking (a lack of analytical thinking), cognitive failures (forgetting sources), and illusory truth (familiarity).
A recent model [74] proposes four psychological processes underlying susceptibility to health misinformation. Not only individuals with a capability to reason accurately are less susceptible, but also their resilience increases with the motivation to reason accurately. In contrast, directionally- and identity-motivated reasoning increases susceptibility, derived from a desire to reach a preferred conclusion that is often consistent with one’s pre-existing views. These intertwining mechanisms are not possible to be explicitly included in EBMs, but they are suitable for ABMs.
The exposed state is that in which a piece of information (e.g., news, blog posts, communication from social contacts) takes a while to be processed, especially when its complexity makes it difficult to grasp, or requires repeated exposures to generate belief. However, the mechanism behind the establishment of false and accurate belief is the same [75]. People are often biased to believe in the validity of information and “go with their gut” and intuition when deciding what is true instead of deliberating [76]. For instance, 31% respondents in a U.S. survey (n = 2023) in March 2020 agreed that COVID-19 was purposefully created and spread, despite the absence of any plausible evidence for its intentional development [77]. People might have encountered conspiracy theories about the source of the virus many times, which might have contributed to this widespread belief because simply repeating a claim makes it more believable than presenting it only once [78]. Repetition increases belief in both misinformation and facts, and people get “infected”. Regardless of cognitive ability and despite contradictory advice from an accurate source or accurate previous knowledge, there is a possibility that illusory truth persists months after the first exposure [79]. EBMs do not differentiate between repeated exposures, even though they do account for incubation time, whereas in an ABMs it would only be a matter of adding a condition to a state transition trigger. We thus put forward that the incubation rate should be redefined for infodemics.
But does belief really equate infection? [80] The exposure to false information is a strong contributor to the formation of false beliefs. Access to high-quality information, whether true or not, is not necessary; instead, a range of precursors, cognitive and socio-affective drivers, influence the formation and storage of false beliefs [75], which draws a parallel with the properties of cells that determine whether they are or are not hosts to viral multiplication. But misinformation and its siblings are not singular pieces that are passed on from person to person, nor are they easily identifiable by the “symptoms” of the “disease” they cause. Information and the knowledge it is stored as always come with related facts, untruths, beliefs, consequences, premises, and relationships that either support or restrain the “infection”, thus affecting incubation, recovery, and loss of immunity. Where do we draw the line between infection and other beliefs that are not necessarily true but not harmful either? For instance, the belief that leaving scissors open on the table leads to discord is not true, but it serves the purpose of avoiding accidents caused by exposed blades. Perhaps the definition of the infected state in infodemic models needs more interdisciplinary investigation.
Even with these limitations, there are enough arguments to analyze infodemics based on the four epidemiological states. In what follows, we thus utilize the four SEIR states in the development of a simple ABM that emulates the corresponding EBMs, following the parallel with biological virus spreads. The psychological perspective paints a larger picture, for which we propose an enhanced ABM design that takes into account the properties of how information diffuses among humans. Based on this preliminary analysis, we select the agent state variables to encode social navigation (coordinates, orientation), the information they share with others (information status), and their cognitive engagement in communicating with others (energy). In terms of behavior, we choose to implement information-seeking based on access (amount in observable vicinity), affinity to peer groups (homophily), one-on-one exchange, and communication through different media (as information does not need physical contact between people to spread). The next sections describe in detail the mechanisms employed for the models, with a conceptual representation in Fig 1 and with pseudocode available in the supplementary material.
The equation-based model (SEIRS) is structured as a finite-state machine in which population subsets mass-transition between states based on probabilities. The agent-based model (ABM) is structured as layered complex networks in which individuals transition between states based on local interactions.
Equation-based models (SEIRS)
The prevalent equation-based models (EBMs) for infodemics are based on epidemic interpretations of how information is transmitted throughout a population. They are macroscopic models, in which the persons engaged in receiving or relaying information lose their individuality. The main variables of the models become numbers reflecting portions of the population in one of several distinct states: susceptible, infected, recovered or exposed (Fig 1a). In this study, we analyze six such diffusion models: SI, SIS, SIR, SIRS, SEIR and SEIRS [42]. They are differentiated by how many states are defined within the population and how many state transitions are allowed. For instance, in the SI model, the states are susceptible and infected, with the only transition ; the SIS model also allows the reverse transition
. For brevity, we only describe model SEIRS in this section; all six models are listed in Appendix I.
SEIRS [81] consists of four equations as dynamic representations of the timewise state transitions of the four variables (Fig 1a), defined as shown in Eq (1).
Where S(t), E(t), I(t), R(t) are the numbers of individuals with susceptible, exposed, infected, or recovered state, respectively, at time t. Notations represent the first order derivatives of these variables and they model how the states of the population change between two moments in time. N is the total number of individuals in the population. The model parameters are: infection rate β, incubation rate σ, recovery rate γ, and immunity loss rate
.
Although SEIRS-class models can be expanded to include vital dynamics (asymmetrical birth and natural-cause death rates) [82], the total population is constant in the classic version: .
The susceptible state is the nominal state of individuals and it models the population that can become either infected with (models SI(S) and SIR(S)) or exposed to (models SEIR(S)) mis/disinformation. The population with exposed state cannot infect other individuals, but can become infected after an incubation period. Those with infected state are able to spread the “infection” to the susceptible individuals and can become recovered after a period of time. The individuals with recovered state are immune to the “infection” but can lose this immunity over time, at which point they transition to the susceptible state.
Parameter β represents the infection rate and is defined as the population percentage switching states from susceptible to exposed (or infected) in the unit of time. Parameter σ represents the incubation rate and is defined as the population percentage switching states from exposed to infected in the unit of time. Parameter γ represents the recovery rate and is defined as the population percentage switching states from infected to recovered in the unit of time. Parameter represents the immunity loss rate and is defined as the population percentage switching states from recovered to susceptible in the unit of time.
For the SEIRS models, the unit of time is chosen to match the modeled epidemic, e.g. days for rapidly spreading infections, or months for slower dynamics. In this study, we consider the unit of time to be equivalent to the unit of time of the agent-based models, measured in ticks. Thus, we maintain the generality and scalability of the models.
Agent-based models
Agent-based models are built with two types of entities: (a) an agent is a simulated system capable of perception, action, communication, and reasoning; (b) a cell is a simulated system part of the environment. The ABM unit of time is called a tick. Its real-world equivalence is selected based on problem context and dynamics. During one tick, all behaviors defined in the ABM (agents, cells, etc.) are executed in parallel asynchronously, by one step.
For this study, we propose an ABM structure to include the separation and dependence between the psycho-social component and the communication medium. We base this choice on the argument that one network alone is not sufficient to embody these types of dynamics, put forth by studies on epidemic-infodemic interactions [83] and opinion dynamics [84]. Thus, agents model individuals (who can interact with the environment or each other) and the psycho-social network, while cells model the communication channels and/or network (Fig 1b).
Network structures.
The psycho-social network is formed of all agents in the ABM. From an information diffusion perspective, this network appears as a graph with temporary edges, which are formed when two agents meet to directly transmit information to each other. This reflects the real world in which two persons open ad-hoc point-to-point communication sessions (e.g., face-to-face, phone call) with each other, forming a connection, but are not actively transmitting information to each other around-the-clock.
The social media/communication network (further referred to as the environment) is formed of all the cells in the ABM. In this study, we interpret each cell as a platform (e.g., blogging, micro-blogging, forums, social apps), broadcasting entity (e.g., newspaper website, television) or communication channel (e.g., VoIP apps). Equivalent to the real world, an agent “navigates” among these platforms; in this paper, we visualize this process by allowing agents to travel across the cell grid. While the cells are connected via adjacency to each other, we do not consider the network edges relevant in this situation, as platforms themselves do not exchange information; instead, the agents carry it across media.
The environment thus represents the online (social) media network overlaid onto the direct point-to-point communication of the agents. The difference in behavior between direct communication and online media is that the latter functions as a broadcast instead of a conversation. Blog posts, news articles, etc. are stored online and available for anyone in the network. In the real world, this type of broadcast is not received by all individuals at the same time, as navigating through personal media dashboards limits exposure (e.g., local news relayed within a region and not internationally, or social media clusters formed of family members, etc.): we achieve this effect by implementing the agent movement functions as they seek interactions and limiting their perception to a radius around themselves.
Information modeling.
The relationship humans have with information has been studied at length [85,86]. The interpretation we adhere to in this paper associates this interaction process with foraging behaviors, through which people seek, select, consume and avoid information [87,88]. For agents, cells are both information sources and sinks. Agents generate information by “posting” on cells (interpreted as either new content, or interactions with existing content such as commenting). When agents are infected, they might deposit mis- or disinformation, which is then picked up by other agents. In complex systems, this process of communicating through the environment is known as stigmergy [89]. Agents gain energy by consuming information and lose energy by posting, thus simulating engagement (e.g., scrolling through dashboards, commenting) or exposure fatigue (e.g., avoiding news applets).
Aside from content deposited by agents on cells, we implement a generation mechanism through which information “grows”. The underlying hypothesis is that information available in the environment can be generated by sources outside that environment. Thus, there is an external component to the model which agents cannot affect, but that can influence the agent or its group. For instance, the social cluster of an individual being influenced by the media, over which single individuals do not have, in turn, an effect. Moreover, if cells are instances of online media platforms where agents interact with content, the information they carry can be influenced by recommendation engine algorithms, which promote any type of engagement, whether positive or negative [90], and thus become vectors for mis- and disinformation.
We developed two models: (a) the simple model is a direct translation of the finite-state machine of the EBMs into agent behaviors in which misinformation spreads point-to-point between agents; (b) the enhanced model is built upon the simple model by adding psycho-social components to the agent behaviors (e.g., homophily), as well as the misinformation spread through the online (social) media network. All algorithms associated with the two models are detailed in Appendices II and III. The two models are briefly described in the following sections Simple ABM and Enhanced ABM using the ODD (Overview, Design Concepts and Details) Protocol [91].
Simple ABM.
Overview:
The purpose of the simple ABM is to model infodemics taking into account complex interactions between agents based on the information model.
The state of an agent Ai (i = 1..N) is described by four variables: coordinates , orientation hi, information status si, and energy level ei. Coordinates are discrete (integer) and spatially define the position of the agent in the environment. Orientation is defined by the heading angle. Information status is a categorical variable
with four categories equivalent to the SEIRS model. Energy level is a continuous variable modeling the interest of agents to communicate (receive and relay information) and thus to move through the environment.
The state of a cell Cj (j = 1..M2) is described by two variables: coordinates and information quantity qj. Cells are static, arranged on a torus, and visible in the model interface as an M × M square grid (agents leaving one border reappear on the opposite side). Together, cells form the communication network. Coordinates are discrete (integer) and define the position of the cell inside the grid. Information quantity is a continuous variable representing how much information is available to agents in one particular communication channel (displayed in shades of green: lighter for less information, darker for more). For this model all available information is considered true.
Fig 2 shows the initialization of the simple ABM at t = 0 ticks and the visualization of the model at t = 50 ticks.
The color of each cell (shades of green) represents the level of information present in that specific area, while agents are depicted by arrows in a visual representation that shows both the position and the current orientation: orange susceptible, magenta infected, blue recovered, and violet exposed.
The scheduler triggers the two main behaviors of the agent: Agent movement and Agent information status. The agent information status describes how the agent’s state changes as a consequence of direct (point-to-point communication) and indirect (information posting and consumption) diffusion of information. The implementation of the scheduler is explained in Appendix II.
Design concepts:
Agent movement. Agents move through the environment toward a new position with a specified heading. The agent’s target is computed based on vicinity: choosing the cell (patch) with the highest amount of information in agent radius r and viewing angle θ. The meeting of two agents on the same cell represents two individuals simultaneously using the same instance of a communication channel (e.g., a phone call, a messaging app, face-to-face conversation, etc.).
Agent information status. The categorical agent state variable defines the set of actions it can perform, i.e. its behaviors (algorithms in Appendix III). The transitions between the four categories follow the rules of the EBMs finite-state machine and are implemented via threshold tests with the probabilities of infection β (relative to world and population size), successful incubation σ, recovery γ, and immunity loss
.
The simple model implements diffusion using point-to-point communication between agents, meaning that mis- and disinformation can only be transmitted when agents talk directly to each other. The point-to-point communication is established by infected agents with closest susceptible agents. This is the epidemiological interpretation in which biological viruses are transmitted through direct contact. Agents lose energy by moving through the world. This ABM is designed to allow for disinterest or sudden interest, and thus implement vital dynamics (e.g., “death” by leaving the network or “birth” by joining the network). However, to match the ABM to the classical SEIRS EBMs, we deactivated these functions. Because in this case the environment does not provide or store false information, it does not affect the dynamics of the infection spread, thus allowing for a fair comparison between the two modeling approaches.
While the movement path of agents is random, the starting location of all agents is deterministic. The agents sense their environment and interact with other agents in their vicinity () directly (point-to-point communication).
Details:
Each agent is initialized with a starting position, heading, and state (S,E,I, or R, depending on the submodel type). No data is exchanged with the outside world. We consider the following 6 submodels for the simple ABM: SI, SIS, SIR, SIRS, SEIR and SEIRS.
Enhanced ABM.
Overview:
The purpose of the enhanced ABM is to model infodemics taking into account complex interactions between agents based on the information model and communication network structure.
The state of an agent Ai (i = 1..N) is described by five variables: coordinates, orientation, information status, energy level, and group gi, which is a categorical variable that tags the agent as belonging to one of two social groups: .
The state of a cell Cj (j = 1..M) is described by three variables: coordinates, information quantity, and information type cj, which is a categorical variable . Truthful information ctrue is displayed as green, while falsity cfalse is displayed as red, both in shades based on quantity. Cells are arranged in the same grid as the simple model.
The scheduler triggers the main behaviors of the agent: Agent movement and information status. The agent information status describes how the agent’s state changes as a consequence of direct (point-to-point communication) and indirect (information posting and consumption) diffusion of information. The implementation of the scheduler is explained in Appendix II.
Design concepts:
Agent movement and information status. Agent movement and information status are similar to the simple model. The difference is that the and
transitions can be triggered externally by false information from the environment. The infection rate β is also adjusted to account for two infection sources (other agents and cells).
Homophily. In this extension, we implemented a simple mechanic based on group adherence: agents in groups G1 and G2 will only believe false information from agents in the same group. Thus, the and
transitions become conditional.
The enhanced model implements diffusion using both point-to-point and stigmergy communication between agents. Thus, mis- and disinformation can be transmitted when agents talk directly to each other and when they interact with the information shared through the communication network. Agents lose energy by moving through the world, but also when posting information. The point-to-point communication is similar to the simple model, while stigmergy allows for mis- and disinformation to spread through the shared media stored online, adding another informational dimension to the epidemiological interpretation.
While the movement path of agents is random, the starting location of all agents is deterministic. The agents sense their environment and interact with other agents in their vicinity () directly (point-to-point communication) or indirectly (information posting and consumption) through the environment.
Details:
Each agent is initialized with a starting position, heading, and state (S,E,I, or R, depending on the submodel type). No data is exchanged with the outside world. We consider the following 6 submodels for the enhanced ABM: SI, SIS, SIR, SIRS, SEIR and SEIRS.
The differences in characteristics [92] between the EBM, the simple ABM, and the enhanced version of the ABM are summarized in Table 1.
Implementation.
Both the simple and enhanced ABM function on the same base structure (scheduling), detailed in Appendix II together with the two model interfaces.
The scheduling of the model can be summarized into phases. The initialization phase loads the agent list and positions, and the cell distribution with information quantities into the model (here, we choose the same initialization for all model runs). The next phases execute agent behaviors.
First is the movement phase, in which an agent Ai chooses a target to move toward. The decision-making mechanism considers a given radius de around agent Ai. Agents deplete a fixed amount of energy ec for each movement. Second, the spreading phase, in which an agent Ai with status si = I spreads mis- or disinformation to the nearest neighbor Ak with status sk = S. To be infected or exposed, Ak should be at a distance (preset radius). Third, agent Ak changes status to sk = E for models SEIR(S) or to sk = I for the others. Fourth, an agent Ai with si = I can recover to si = R for SIR(S), SEIR(S). Model selection is possible through the ABM interface. Fifth, a recovered agent Ai with si = R can become susceptible in the case of models SIS, SIRS, SEIRS. (Deactivated function: an agent Ai with negative energy ei is removed from the agent list, i.e., leaves the simulated society, which occurs whenever it expends more energy than it collects and stores.) The cell behavior execution phase consists of the information qj on each cell Cj replenishing with a preset growth rate up to a maximum limit.
Comparative analysis
Directly comparing EBMs and ABMs is crucial because these two approaches rely on fundamentally different assumptions about population structure and individual behavior. While EBMs offer analytical tractability by averaging over large populations, ABMs explicitly represent individual heterogeneity and local interactions, which can strongly influence diffusion dynamics but could be difficult to simulate when the number of agents exceeds a threshold. Thus, we compare pairs of homonymous outcome variables from the same-class model (e.g., the SI EBM’s susceptible vs. the SI ABM’s susceptible): evolution over time and scale. We focus on the key parameters common to the two frameworks (), which we vary across their entire ranges. This approach allows us to highlight the conditions under which the two representations diverge.
To reduce the effect of randomness inherent to ABMs, we fix all initializations and the other ABM parameters. We thus define the same starting coordinates and orientation for each agent in the initial population, and specify the same initial distribution of information on cells and size of the homophily group. To account for probability-based choices (e.g., assigning an agent to a group), we apply the standard practice in complex systems simulation of repeating ABM experiments under the same initial conditions and averaging the set of outcomes before comparison with the EBM. In what concerns the implementation framework, NetLogo is deterministic in the sense that one simulation returns the same results every time on different devices, if it is initialized with the same seed (specifiable parameter).
The comparative analysis we perform in this study has two outcome measures which describe how well the ABM-generated variables match the corresponding EBM-generated variables ye over a predetermined period of simulated time, corresponding to a number of q samples (i.e., ticks within the chosen period).
The Pearson correlation coefficient ρ [93] is given by Eq (2).
Where and
are the standard deviations. By calculating the sample-to-sample cross-sectional correlation between ya and ye over q samples, we investigate whether this relationship is linear, which would mean that the transient responses of the models are similar over time. This measure thus shows how well the two variables match longitudinally (shape over time) and is defined over the interval [−1;1], where 1 is best match, –1 mirrored evolution, and 0 is complete mismatch.
The normalized root mean of square error (NRMSE) is defined as in Eq (3).
Where q is the number of samples, with and
the maximum and minimum values of ya and ye (in this case, for a constant total population size, the same for both variables). NRMSE is a dimensionless score for scale-independent comparisons of model outcomes, defined over the interval [0;1], where 0 means best match and 1 worst [94].
Note that ρ cannot be calculated when one of the contributing variables is zero; to avoid skewing the summary calculations, these were adjusted to match (1) or not-a-match (0) based on the NRMSE of the pair.
In this study, we use the combined outcome measures ρ and NRMSE to assess the similarity of the model outcomes as discrete signals: ρ quantifies how well ya and ye match in shape, whereas NRMSE offers an estimate of the relative differences between model outcomes. Together, the two metrics allow for an analysis of both transient and steady state responses of the models, while providing an intuitive descriptor of why the models do not match. For instance, the combination of and NRMSE = 0.8 means that the models show a similar outcome variation over time, but with large differences in amplitude. In practical terms, this would mean that the two populations adapt in almost the same manner to the spread of misinformation, but the amount of people affected differs widely.
Fitting to real-world data: Case study on vaccine acceptance
We also evaluate the two types of models in a fitting experiment. The real world data we choose for this case study describes vaccine acceptance and disapproval during the pandemic. The model type is SIS, in which we consider the susceptible state as “accepting the vaccine” and the infected state as “not accepting the vaccine”, with the possibility to recover to susceptible state.
Infodemic real-world dataset. The data was collected via a global survey on COVID-19 beliefs, described in [95] from 23 countries between June 2020 and March 2021. The data are available in the section Data Availability of the same paper [95]. One of the questions asked: “If a vaccine for COVID-19 becomes available, would you choose to get vaccinated?”. Martinelli and Veltri [96] then conducted a study on COVID-19 vaccine acceptance, which produced a dataset expressing the percentage of the populating accepting of the vaccine. The dataset is longitudinal over 36 weeks, with a sample of 2 weeks between measurements. From this dataset, we choose Romania as a case study.
Fitting method. We search for the enhanced ABM and the EBM parameters so that the model outputs Si(k) and Ii(k), with , match the real world data described by the real world susceptible population SRW(k) and the real world infected population IRW(k). Considering that the two signals are mirrored, for a parameter set
and discrete time step k = 0..19, the multi-objective optimization problem is given by Eq (4).
The ABM parameter set is , where
is the infection rate and
is the recovery rate.
is the population multiplier and scales the number of agents from a initialization baseline of 100.
∩
is the time factor and determines the size of an ABM tick relative to the real world data sampling time; thus, TF represents how many ABM ticks pass for each week in the real world. IR is the information regrowth rate.
and
represent the energy loss of agents from movement and from posting information, for susceptible and infected agents, respectively. Finally,
represent the delay between agents consuming and posting information, for susceptible and infected agents, respectively.
The EBM parameter set is , where
is the infection rate and
is the recovery rate. TF represents how many data points we are considering for each week in the real world,
.
For both, the population size is normalized to percentages and the two outcome variables of all models (susceptible and infected) are initialized to approximate the first datapoint of the real world data. To calculate the criteria ρ and NRMSE, we bin the model outputs to the size of the real-world dataset (19 points) by averaging.
The optimization procedure is performed using the Python hyper-optimization library Optuna, which provides samplers and pruners and determines the importance of parameters and their interdependence. In this case study, we choose the Tree-structured Parzen Estimator (TPE) sampler and the hyperband pruner.
Tools, frameworks, and software
The ABMs were implemented and simulated using NetLogo 6.3.0 [97]. The EBMs were implemented in Python 3.11.0 in discrete form, with a time step equivalent to one ABM tick. All visualizations, plots, heatmaps and calculations were also performed in Python 3.11.0. The following Python packages were used: matplotlib 3.7.1, numpy 1.24.3, scikit-learn 1.2.2, scipy 1.10.1, optuna 3.5.0, optuna-dashboard 0.14.0.
Results
Evaluation of intermodel equivalence
In this section we present a selection of relevant results (limited by space considerations), with more examples in Appendices IV and V, and a full set of simulation results in the repository. Results are generated from three instances of the ABMs: a) a small variant of the simple ABM with a world size of M = 33 × 33 cells (the default world size in NetLogo), b) a large variant of the simple ABM with cells, and c) the enhanced ABM with
cells. These environment sizes fall within research findings on the numbers of news outlets [98] in North America. For each of the three ABM instances, we set up two experiments: Case I and Case II, with a difference in initialization values for E (zero for I, nonzero for II). These cases along with the full lists of initialization and configuration parameters are included in the Github repository [69].
Figs 3 and 4 show two examples of matching and non-matching model outcomes, respectively, as variations over 200 ticks, for the SEIRS model using 1000 runs of the simple ABM (small variant). In the matching case, the means of the ABM-runs closely follow the outcomes of the EBM (equivalence measures for all six models are in Appendix IV). Even though the ABM was initialized in the same manner at every run, the deviations of models outcomes show that infodemic dynamics are sensitive to small individual variations happening at local agent level (caused by the probabilistic conditions), whereas the EBM applies the same probabilities of state transitions at group level, i.e., mean.
Table 2 shows cumulative results (means and standard deviations) for parameter variations: with a step of 0.1 over 2000 ticks, resulting in 10 experiments for SI, 100 SIS and SIR each, 1000 SIRS and SEIR each, 10000 SEIRS. Results are consistent for model size in the simple ABM case. Some models fare better (e.g., SIR) than others (e.g., SIS) on average, but even so, none of the EBMs reproduce exactly the ABM response across the entire parameter range. The enhanced ABM shows similar results, except for SIR, which is considerably less well matched.
Fig 5 shows the intermodel equivalence outcomes NRMSE and ρ for models SIS and SI, obtained using parameter variations with a step of 0.05 over 10 ABM runs each and 2000 ticks. The results for S and I are similar due to the mirror-effect in these two outcomes, which is expected. While there are parameter combinations and intervals for which results match, this is not consistent over their entire range. The heatmaps for the SIS model show combinations for which the ABM and the EBM are profoundly distinct in dynamics: an infodemic with high infection rate β and mid-range recovery rate γ will cause the ABM and EBM models to behave differently. The addition of the online social media network in the enhanced ABM visibly shifts the matching/non-matching coverages.
Case I configuration.
Fig 6 shows the effect of homophily (conditional infection) on the enhanced ABM vs. simple ABM outcomes for groups of different sizes ,
(total N = 1350 agents), over 200 ticks and 1000 runs (means and deviations). While the exposed outcome seems similar enough, the others (susceptible, infected, recovered) show considerable change in infodemic dynamics. Group sizes do not show an effect (see Appendix V), but this observation is enough to raise questions regarding the many social group dynamics of the real world.
Fitting to real world data: Results
First, we applied the fitting procedure to the enhanced ABM. After 700 iterations, the best result has and
. The model parameter are:
,
, IR = 0.41, PM = 10.35, TF = 19,
,
,
,
,
,
. Second, we applied the fitting procedure to the EBM. After 700 iterations, the best result has
and
. The model parameter are:
,
, TF = 30. Fig 7 shows the results of these two fitting procedures against the real world data. While the EBM outputs seem to follow an overall trend of decrease or increase, they do not capture the dynamics of people changing their opinions over time. In this case the enhanced ABM manages to show how the proportions of the population invert their ratio at weeks 18-22. This period corresponds to October 2020 when Romania faced a considerable increase in cases.
Further, we perform a comparative test with the parameters resulting from the enhanced ABM fitting procedure (where applicable). Fig 8 shows the responses of the three models (enhanced and simple ABM, EBM) against real world data. Table 3 presents the two outcomes (ρ, NRMSE), with the best fit for the enhanced ABM, which shows that even when the infection and recovery rates are determined to match the dynamics of real world data, the EBM response does not manage to illustrate these transient fluctuations. The simple ABM response comes closer, but the fit is still poor, highlighting the importance of the online communication network as a misinformation facilitator.
,
. Both ABM outputs are averaged over 100 runs and have a variance of 5498.17 for the enhanced and 1577.25 for the simple versions.
Notations: ABM agent-based model, EBM equation-based model.
Discussion
In this study we investigated EBMs and ABMs for the spread of mis- and disinformation. For this, we designed two models: a) a simple ABM as a direct translation of SEIRS-type EBMs’ underlining logic, and b) an enhanced ABM to reflect communication and online media networks, as well as homophily. Then, we performed an intermodel equivalence analysis. Results show that EBMs and ABMs display both matching and non-matching outcomes, depending on parameter ranges. Our hypothesis that microscopic models (e.g., ABMs) are necessary to capture the elements of the human psycho-social context is confirmed.
ABM and EBM discrepancies
The critical difference between these types of models is that EBMs assume individual homogeneity, whereas the ABM structure allows for more complex reasoning, such as homophily, multiple pieces/types of information, believing views opposite their own group, etc. While there are attempts to rework EBMs for various epidemics [51,99], in some cases combining epidemics and infodemics in one state machine [100], these models have the same issue: losing individuality.
The main reason for the mismatch of these types of models is that the effects observed at population level (e.g., infection rate) are not observed at individual level (e.g., probability of infection) because humans, as psycho-social networks, behave like a complex system in which the infodemic is an emergent process, and to paraphrase Kevin Kelly [1], even made of the same building blocks, we would not find the beehive in the bee. The top-down approach of equation-based modeling transforms a population into one black-box mass, whereas the bottom-up design of complex systems allows for these diffusion processes to emerge naturally, from interactions between individuals. Strong emergence is the “magic” that can explain multi-layered human interactions at large scale [101]. This is why, when the conditions allow it, the ABM approach would be preferable to analyze, for instance, the effects of individual vs. mass-level interventions of information manipulation (e.g., propaganda).
The SEIRS-type EBMs we analyzed here are not necessarily entirely wrong, but they are based on assumptions that do not consider the whole nature of information diffusion. Epidemics driven by biological viruses assume some level of physical contact, whereas information spreads through entirely different media and mechanics. Epidemic research has modified these basic models to better represent various diseases, and the same should happen for infodemics. As they are now, the SEIRS EBM models are missing the communication and online media networks, as well as psycho-social attachments and beliefs, meaning that each person would have different infection rates based on information coming from different sources.
Implications of the results
So, then, what? Models cannot be perfect imitations of the real world [102], but their level of wrongness matters when humans are involved. Infodemic and epidemic models are useful for testing in a safe environment (simulation) various interventions before deployment or adoption, and it is critical that the model predictions are as accurate as possible.
Concerning the corrective interventions on mis- and disinformation, EBMs do not currently offer ways to simulate or evaluate these mechanisms, as they only show the outcomes of a population (or subset). Efficiency estimations of local interventions require ABMs. Often, interventions to resist misinformation must overcome various cognitive and socio-affective barriers. The most common types of correction are individual. Fact-based corrections directly address inaccuracies and provide accurate information [103]. Broader protection against different types of misleading tactics is offered by addressing the logical fallacies common in disinformation [104] or by challenging the plausibility of the misinformation or the credibility of its source. Multiple approaches can be combined into a single correction [105]; however they all have to be applied locally. In fact, effective regulatory actions must be implemented at individual level. Even though some corrections can propagate through specific social groups, there is always an element of internally-generated intent if coercive action at the population level is to be avoided. This is what in systems science is known as decentralized control, as opposed to centralized, in which an authority body applies interventions at societal level.
As it currently stands, EBMs can still be useful. Because they provide mean trends within a population, EBMs can supplement predictive ABM simulations and act as baselines for identifying unusual, unexpected, or outlier behaviors. However, we suggest that EBMs should be revised when it comes to misinformation diffusion, so that the parameters of the wide-scale infodemic would reflect the emergence from local behaviors. As the case study on vaccine acceptance shows, the enhanced ABM fits the real world data better than the simple version. This suggests that adding elements to a model can be helpful and sometimes even necessary to correctly interpret the behavior of the modeled system, especially when it is generated from non-trivial interactions, such as information diffusion in an infodemic. Future studies should consider the limitations and advantages of both ABMs and EBMs, and keep in mind that their outcomes are not always equivalent.
Effect of the number of parameters
The enhanced ABM includes a larger number of parameters compared to the EBM, due to the increase in structural complexity required to simulate individual-level mechanisms, which is absent when a system is studied in mean-field. Parameters such as energy dynamics, group membership, and environmental information flows represent psycho-social and communication-related processes non-modelable in equation-based frameworks. To guarantee the same parameter space would mean to lose adherence to real-world dynamics; thus, the increased dimensionality is not a modeling convenience but a representational necessity.
Moreover, we believe that the superior performance of the ABM in fitting real-world data does not merely arise from parameter abundance, but from a capacity to replicate emergent dynamics enabled by its structure. For instance, the enhanced ABM reproduces non-monotonic behaviors observed in vaccine acceptance data, including an inversion in public sentiment corresponding to a real-world event. These nonlinear transitions are a product of local agent interactions which can not be encoded in the EBM’s aggregate-level formulation while maintaining a correspondence with the modeled system. In this sense, the ABM does not merely fit better—it explains better.
Also, while in classical fitting problems the fitness typically increases with the number of parameters used to tune the model’s behavior, this relationship does not always hold in causal models in general, and in ABMs specifically. If the underlying structure is not well designed—meaning it fails to adequately represent the cause-effect relationships within the system in some stylized functional form—then adding new parameters does not necessarily improve the fit; in fact, it can even reduce it.
Finally, the fitting to empirical data was conducted via a multi-objective optimization procedure that penalizes poor generalization through metrics such as ρ and NRMSE. With regard to intermodel equivalence assessment, the combination of these metrics provide a summary perspective on outcome similarity. We chose them for their capacity to describe the timewise longitudinal relationships between discrete signals through single-scalar quantities that are easily interpretable through interoperable terminology across fields of science (modeling, statistics, sociology, engineering, etc.). These metrics do present limitations. Pearson’s correlation coefficient ρ is famously sensitive to the smallest differences, which we mitigate with the rougher NRMSE. Nonlinear correlations will not be captured by ρ and NRMSE—but because the two model types would represent the same phenomenon/process, we should not find noncausal nonlinearities between their outcomes, which reinforces our conclusion that infodemic EBMs must be redefined.
Conclusions and future developments
In this study, we evaluated the intermodel equivalence for infodemics. We designed agent-based models (ABMs) and compared their outcomes with classical equation-based models (EBMs) inspired by viral epidemics. We found low equivalence over the entire key parameter range, although the outcomes were similar for specific values. We also found that ABMs can capture the dynamics of real world data better than EBMs. We surmise that ABMs and EBMs serve different purposes with widely different structures (one is microscopic, the other macroscopic) and the decision of choosing one over the other should be informed, with awareness to their limitations and the fact that they are not interchangeable. Moreover, EBMs for infodemics should be revised from their counterparts modeling biological viral spreads.
Future developments include building an interaction topology between agents on a network and subsequently assessing how the structure of interactions (i.e., the features of the network) and its dynamics (creating or destroying links between agents) affect the results of the equation- and agent-based diffusion models. Moreover, we will implement and investigate time- or location-variant parameters (e.g., variable infection rate), introduce vital dynamics to both types of models and mitigation mechanisms, as well as different pieces of mis- and disinformation spreading concurrently through the agent population.
Supporting information
Appendix I. Equation-based SEIRS-class models.
List of the equation-based models and their parameters, together with the state transition representations.
https://doi.org/10.1371/journal.pone.0338614.s001
(PDF)
Appendix II. Implementation.
The flow diagram for model scheduling and the interfaces of the two ABMs.
https://doi.org/10.1371/journal.pone.0338614.s002
(PDF)
Appendix III. Algorithms.
Definitions and algorithms specific to the implementation fo the two ABMs.
https://doi.org/10.1371/journal.pone.0338614.s003
(PDF)
Appendix IV. Examples of matching outcomes.
Parameters and visualization of matching outcomes examples for the six SEIRS-class models.
https://doi.org/10.1371/journal.pone.0338614.s004
(PDF)
Appendix V. Selected results (exemplification).
Examples for the effect of homophily.
https://doi.org/10.1371/journal.pone.0338614.s005
(PDF)
Repository. Understanding the Mechanisms of Infodemics: Equation-Based vs Agent-Based Models. Models, experiment configurations, and heatmap results: https://github.com/cristian-berceanu/understanding_the_mechanisms_of_infodemics
Video. Simulation demonstration.
A brief video demonstration of the two ABMs during simulation.
https://doi.org/10.1371/journal.pone.0338614.s006
(MP4)
References
- 1.
Kelly K. Out of control: The new biology of machines, social systems, and the economic world. Hachette UK; 2009.
- 2. Gradoń KT, Hołyst JA, Moy WR, Sienkiewicz J, Suchecki K. Countering misinformation: a multidisciplinary approach. Big Data & Society. 2021;8(1):20539517211013848.
- 3. Smith JE, Pinter-Wollman N. Observing the unwatchable: Integrating automated sensing, naturalistic observations and animal social network analysis in the age of big data. J Anim Ecol. 2021;90(1):62–75. pmid:33020914
- 4.
Hooker C. Introduction to philosophy of complex systems: A: Part A: Towards a framework for complex systems. Philosophy of complex systems. Elsevier; 2011. p. 3–90.
- 5. Roman S, Bertolotti F. A master equation for power laws. R Soc Open Sci. 2022;9(12):220531. pmid:36483760
- 6. Mitchell M. Complex systems: Network thinking. Artificial Intelligence. 2006;170(18):1194–212.
- 7. Broido AD, Clauset A. Scale-free networks are rare. Nat Commun. 2019;10(1):1017. pmid:30833554
- 8. Siegenfeld AF, Bar-Yam Y. An introduction to complex systems science and its applications. Complexity. 2020;2020:1–16.
- 9.
Sen P. Complexities of social networks: a physicist’s perspective. arXiv preprint 2006. https://doi.org/physics/0605072
- 10. Edmonds B, Le Page C, Bithell M, Chattoe-Brown E, Grimm V, Meyer R, et al. Different modelling purposes. JASSS. 2019;22(3).
- 11. Serena L, Marzolla M, D’Angelo G, Ferretti S. A review of multilevel modeling and simulation for human mobility and behavior. Simulation Modelling Practice and Theory. 2023;127:102780.
- 12. Lachowicz M. Microscopic, mesoscopic and macroscopic descriptions of complex systems. Probabilistic Engineering Mechanics. 2011;26(1):54–60.
- 13. Meraz S. The democratic contribution of weakly tied political networks: moderate political blogs as bridges to heterogeneous information pools. Social Science Computer Review. 2013;31(2):191–207.
- 14. Baptista JP, Gradim A. Online disinformation on Facebook: the spread of fake news during the Portuguese 2019 election. Journal of Contemporary European Studies. 2020;30(2):297–312.
- 15. Peng Z, Li M, Wang Y, Ho GTS. Combating the COVID-19 infodemic using prompt-based curriculum learning. Expert Syst Appl. 2023;229:120501. pmid:37274611
- 16. Agley J, Xiao Y. Misinformation about COVID-19: evidence for differential latent profiles and a strong association with trust in science. BMC Public Health. 2021;21(1):89. pmid:33413219
- 17. Raja E, Soni B, Lalrempuii C, Borgohain SK. An adaptive cyclical learning rate based hybrid model for Dravidian fake news detection. Expert Systems with Applications. 2024;241:122768.
- 18. Yang K-C, Pierri F, Hui P-M, Axelrod D, Torres-Lugo C, Bryden J, et al. The COVID-19 infodemic: Twitter versus Facebook. Big Data & Society. 2021;8(1):20539517211013861.
- 19. Fick A. Ueber diffusion. Annalen der Physik. 1855;170(1):59–86.
- 20. Anderson JR. A spreading activation theory of memory. Journal of Verbal Learning and Verbal Behavior. 1983;22(3):261–95.
- 21. Newman MEJ. Spread of epidemic disease on networks. Phys Rev E Stat Nonlin Soft Matter Phys. 2002;66(1 Pt 2):016128. pmid:12241447
- 22. Morris M. Epidemiology and social networks: modeling structured diffusion. Sociological Methods & Research. 1993;22(1):99–126.
- 23. Vespignani A, Tian H, Dye C, Lloyd-Smith JO, Eggo RM, Shrestha M. Modelling covid-19. Nature Reviews Physics. 2020;2(6):279–81.
- 24. Sorapure M. Anthropographics in COVID-19 simulations. Big Data & Society. 2022;9(1):20539517221098414.
- 25.
Gruhl D, Guha R, Liben-Nowell D, Tomkins A. Information diffusion through blogspace. In: Proceedings of the 13th international conference on World Wide Web. 2004. p. 491–501. https://doi.org/10.1145/988672.988739
- 26. Cinelli M, Pelicon A, Mozetič I, Quattrociocchi W, Novak PK, Zollo F. Dynamics of online hate and misinformation. Sci Rep. 2021;11(1):22083. pmid:34764344
- 27. Leo´n-Medina FJ. Social learning and the complex contagion of political symbols in Twitter: the case of the yellow ribbon in Catalonia. Big Data & Society. 2023;10(2):20539517231180569.
- 28. Bass FM. A new product growth for model consumer durables. Management Science. 2004;50(12_supplement):1825–32.
- 29. Wei W, Xin-Gang Z, Jieying W. Can support policies promote the innovative diffusion of waste-to-energy technology?. Environ Sci Pollut Res Int. 2022;29(37):55580–95. pmid:35318603
- 30.
Kermack WO, McKendrick AG. A contribution to the mathematical theory of epidemics. Proceedings of the Royal Society of London Series A, Containing Papers of a Mathematical and Physical Character. 1927;115(772):700–21.
- 31.
Kermack WO, McKendrick AG. Contributions to the mathematical theory of epidemics. II.—The problem of endemicity. Proceedings of the Royal Society of London Series A, containing papers of a mathematical and physical character. 1932;138(834):55–83.
- 32.
Kermack WO, McKendrick AG. Contributions to the mathematical theory of epidemics. III.—Further studies of the problem of endemicity. Proceedings of the Royal Society of London Series A, Containing papers of a mathematical and physical character. 1933;141(843):94–122.
- 33. Woo J, Chen H. Epidemic model for information diffusion in web forums: experiments in marketing exchange and political dialog. Springerplus. 2016;5:66. pmid:26839759
- 34. Daun S, Rubin J, Vodovotz Y, Clermont G. Equation-based models of dynamic biological systems. J Crit Care. 2008;23(4):585–94. pmid:19056027
- 35.
Sayama H. A partial integro-differential equation-based model of adaptive social network dynamics. In: Artificial Life Conference Proceedings. 2022. p. 286–8.
- 36. Bonabeau E. Agent-based modeling: methods and techniques for simulating human systems. Proc Natl Acad Sci U S A. 2002;99 Suppl 3(Suppl 3):7280–7. pmid:12011407
- 37. Jennings NR. On agent-based software engineering. Artificial Intelligence. 2000;117(2):277–96.
- 38. Berceanu C, Banu I, Husebo BS, Patrascu M. Predictive agent-based crowd model design using decentralized control systems. IEEE Systems Journal. 2023;17(1):1383–94.
- 39. Bankes SC. Agent-based modeling: a revolution?. Proc Natl Acad Sci U S A. 2002;99(Suppl 3):7199–200. pmid:11997445
- 40.
Bertolotti F, Locoro A, Mari L. Sensitivity to initial conditions in agent-based models. In: European Conference on Multi-Agent Systems. Springer; 2020. p. 501–8.
- 41. Kiss IZ, Miller JC, Simon PL. Mathematics of epidemics on networks. Springer. 2017;598(2017):31.
- 42. Duan W, Fan Z, Zhang P, Guo G, Qiu X. Mathematical and computational approaches to epidemic modeling: a comprehensive review. Front Comput Sci (Berl). 2015;9(5):806–26. pmid:32288946
- 43.
Van Dyke Parunak H, Savit R, Riolo RL. Agent-based modeling vs. equation-based modeling: a case study and users’ guide. In: International workshop on multi-agent systems and agent-based simulation. 1998. p. 10–25.
- 44. Bertolotti F, Roman S. Balancing long-term and short-term strategies in a sustainability game. iScience. 2024;27(6):110020. pmid:38947507
- 45.
Sun Y, Cheng L. A survey on agent-based modelling and equation-based modelling. Atlanta: Department of Computer Science, Georgia State University; 2005. http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.123.4567
- 46.
Bobashev GV, Goedecke DM, Feng Yu, Epstein JM. A hybrid epidemic model: combining the advantages of agent-based and equation-based approaches. In: 2007 Winter Simulation Conference. 2007. p. 1532–7. https://doi.org/10.1109/wsc.2007.4419767
- 47. Marini M, Brunner C, Chokani N, Abhari RS. Enhancing response preparedness to influenza epidemics: Agent-based study of 2050 influenza season in Switzerland. Simulation Modelling Practice and Theory. 2020;103:102091.
- 48.
Milling P, Schieritz N. Modeling the forest or modeling the trees-a comparison of system dynamics and agent-based simulation. In: Proceedings of the 21st international conference of the System Dynamics Society; 2003. p. 1–15.
- 49. Rahmandad H, Sterman J. Heterogeneity and network structure in the dynamics of diffusion: comparing agent-based and differential equation models. Management Science. 2008;54(5):998–1014.
- 50.
Sukumar SR, Nutaro JJ. Agent-based vs. equation-based epidemiological models: a model selection case study. In: 2012 ASE/IEEE International Conference on BioMedical Computing (BioMedCom). 2012. p. 74–9. https://doi.org/10.1109/biomedcom.2012.19
- 51. Kasereka S, Zohinga G, Kiketa V, Ngoie R-B, Mputu E, Kasoro N, et al. Equation-based modeling vs. agent-based modeling with applications to the spread of COVID-19 outbreak. Mathematics. 2023;11(1):253.
- 52. Schneckenreither G, Popper N, Zauner G, Breitenecker F. Modelling SIR-type epidemics by ODEs, PDEs, difference equations and cellular automata – a comparative study. Simulation Modelling Practice and Theory. 2008;16(8):1014–23.
- 53. Gräbner C, Bale CSE, Furtado BA, Alvarez-Pereira B, Gentile JE, Henderson H, et al. Getting the best of both worlds? Developing complementary equation-based and agent-based models. Comput Econ. 2017;53(2):763–82.
- 54.
Hunter E, Namee BM, Kelleher JD. A hybrid agent-based and equation based model for the spread of infectious diseases. Technological University Dublin; 2020.
- 55. Dunn AG, Purnat TD, Ishizumi A, Nguyen T, Briand S. Measuring the burden of infodemics with a research toolkit for connecting information exposure, trust, and health behaviours. Arch Public Health. 2023;81(1):102. pmid:37277857
- 56.
World Health Organization. World Health Organization WHO infodemic. 2024. https://www.who.int/health-topics/infodemic
- 57. Carmi E, Yates SJ, Lockley E, Pawluczuk A. Data citizenship: rethinking data literacy in the age of disinformation, misinformation, and malinformation. Internet Policy Review. 2020;9(2).
- 58. Lazer DMJ, Baum MA, Benkler Y, Berinsky AJ, Greenhill KM, Menczer F, et al. The science of fake news. Science. 2018;359(6380):1094–6. pmid:29590025
- 59. Wu L, Morstatter F, Carley KM, Liu H. Misinformation in social media: definition, manipulation, and detection. ACM SIGKDD Explorations Newsletter. 2019;21(2):80–90.
- 60. Yang L, Li Z. Deadline-aware misinformation prevention in social networks with time-decaying influence. Expert Systems with Applications. 2024;238:121847.
- 61. Dong W, Tao J, Xia X, Ye L, Xu H, Jiang P, et al. Public emotions and rumors spread during the COVID-19 epidemic in China: web-based correlation study. J Med Internet Res. 2020;22(11):e21933. pmid:33112757
- 62. Pian W, Chi J, Ma F. The causes, impacts and countermeasures of COVID-19 “Infodemic”: a systematic review using narrative synthesis. Inf Process Manag. 2021;58(6):102713. pmid:34720340
- 63.
Garrett HJ. Why does fake news work? On the psychosocial dynamics of learning, belief, and citizenship. Unpacking fake news: an educator’s guide to navigating the media with students. 2019. p. 15–29.
- 64. Charquero-Ballester M, Walter JG, Nissen IA, Bechmann A. Different types of COVID-19 misinformation have different emotional valence on Twitter. Big Data & Society. 2021;8(2).
- 65.
Conrad C. Inside a cult: gender critical. 2023. http://www.caelanconrad.ca/insideacult
- 66. Horne SG, McGinley M, Yel N, Maroney MR. The stench of bathroom bills and anti-transgender legislation: Anxiety and depression among transgender, nonbinary, and cisgender LGBQ people during a state referendum. J Couns Psychol. 2022;69(1):1–13. pmid:34197153
- 67. Franceschi J, Pareschi L, Zanella M. From agent-based models to the macroscopic description of fake-news spread: the role of competence in data-driven applications. SN Partial Differ Equ Appl. 2022;3(6):68. pmid:36213149
- 68. Zareie A, Sakellariou R. Minimizing the spread of misinformation in online social networks: a survey. Journal of Network and Computer Applications. 2021;186:103094.
- 69.
Berceanu C, Bertolotti F, Arshad N, Patrascu M. Repository for understanding the mechanisms of infodemics: equation-based vs agent-based models. 2025. https://github.com/cristian-berceanu/understanding_the_mechanisms_of_infodemics
- 70. d’Andrea V, Artime O, Castaldo N, Sacco P, Gallotti R, De Domenico M. Epidemic proximity and imitation dynamics drive infodemic waves during the COVID-19 pandemic. Phys Rev Research. 2022;4(1):013158.
- 71.
Berceanu C, Arshad N, Patrascu M. Contagion propagation with rule-based reasoning and decentralized control in an agent-based susceptible-infected-recovered-susceptible infodemic model. In: 2023 International Conference on Big Data, Knowledge and Control Systems Engineering (BdKCSE). 2023. p. 1–6.
- 72. van der Linden S. Misinformation: susceptibility, spread, and interventions to immunize the public. Nat Med. 2022;28(3):460–7. pmid:35273402
- 73. Ecker UKH, Lewandowsky S, Cook J, Schmid P, Fazio LK, Brashier N, et al. The psychological drivers of misinformation belief and its resistance to correction. Nat Rev Psychol. 2022;1(1):13–29.
- 74. Nan X, Wang Y, Thier K. Why do people believe health misinformation and who is at risk? A systematic review of individual differences in susceptibility to health misinformation. Soc Sci Med. 2022;314:115398. pmid:36327631
- 75.
Marsh EJ, Cantor AD, Brashier NM. Believing that humans swallow spiders in their sleep: False beliefs as side effects of the processes that support accurate knowledge. Psychology of learning and motivation. Elsevier. 2016. p. 93–132.
- 76. Prike T, Arnold MM, Williamson P. The relationship between anomalistic belief, misperception of chance and the base rate fallacy. Thinking & Reasoning. 2019;26(3):447–77.
- 77. Uscinski JE, Enders AM, Klofstad C, Seelig M, Funchion J, Everett C. Why do people believe COVID-19 conspiracy theories?. Harvard Kennedy School Misinformation Review. 2020;1(3).
- 78. Unkelbach C, Koch A, Silva RR, Garcia-Marques T. Truth by repetition: explanations and implications. Curr Dir Psychol Sci. 2019;28(3):247–53.
- 79. Fazio LK. Repetition increases perceived truth even for known falsehoods. Collabra: Psychology. 2020;6(1).
- 80. Bonnevie E, Sittig J, Smyser J. The case for tracking misinformation the way we track disease. Big Data & Society. 2021;8(1).
- 81. Bjørnstad ON, Shea K, Krzywinski M, Altman N. The SEIRS model for infectious disease dynamics. Nat Methods. 2020;17(6):557–8. pmid:32499633
- 82. Otunuga OM. Estimation of epidemiological parameters for COVID-19 cases using a stochastic SEIRS epidemic model with vital dynamics. Results Phys. 2021;28:104664. pmid:34395184
- 83. Hong X, Han Y, Tanaka G, Wang B. Co-evolution dynamics of epidemic and information under dynamical multi-source information and behavioral responses. Knowledge-Based Systems. 2022;252:109413.
- 84. Aghbolagh HD, Ye M, Zino L, Chen Z, Cao M. Coevolutionary dynamics of actions and opinions in social networks. IEEE Trans Automat Contr. 2023;68(12):7708–23.
- 85. Shannon CE. A mathematical theory of communication. Bell System Technical Journal. 1948;27(3):379–423.
- 86. Gontier N, Hartmann S, Pleyer M, Rodrigues ED. Combinatoriality and compositionality in communication, skills, tool use, and language. International Journal of Primatology. 2024;45(3):473–81.
- 87. Whittaker S. Personal information management: from information consumption to curation. Annual review of information science and technology. 2011;45(1):1.
- 88. Cooke NA. Posttruth, truthiness, and alternative facts: information behavior and critical information consumption for a new age. The Library Quarterly. 2017;87(3):211–21.
- 89. Holland O, Melhuish C. Stigmergy, self-organization, and sorting in collective robotics. Artif Life. 1999;5(2):173–202. pmid:10633575
- 90. Bryant LV. The YouTube algorithm and the alt-right filter Bubble. Open Information Science. 2020;4(1):85–90.
- 91. Grimm V, Railsback SF, Vincenot CE, Berger U, Gallagher C, DeAngelis DL, et al. The ODD protocol for describing agent-based and other simulation models: a second update to improve clarity, replication, and structural realism. JASSS. 2020;23(2).
- 92.
Kasereka SK. Dynamics of a tuberculosis outbreak model in a multi-scale environment. 2024. https://doi.org/10.48550/arXiv.241104297
- 93.
Benesty J, Chen J, Huang Y, Cohen I. Pearson correlation coefficient. Noise reduction in speech processing. Springer; 2009. p. 1–4.
- 94. Shcherbakov MV, Brebels A, Shcherbakova NL, Tyukov AP, Janovsky TA, Kamaev VA. A survey of forecast error measures. World Applied Sciences Journal. 2013;24(24):171–6.
- 95. Collis A, Garimella K, Moehring A, Rahimian MA, Babalola S, Gobat NH, et al. Global survey on COVID-19 beliefs, behaviours and norms. Nat Hum Behav. 2022;6(9):1310–7. pmid:35606513
- 96. Martinelli M, Veltri GA. COVID-19 vaccine acceptance: a comparative longitudinal analysis of the association between risk perception, confidence, and the acceptance of a COVID-19 vaccine. Risk Anal. 2024;44(4):802–16. pmid:37496470
- 97.
Wilensky U. Netlogo. 1999. http://ccl.northwestern.edu/netlogo/
- 98.
Quantitative Social Science HI. The future of media project. 2024. http://www.projects.iq.harvard.edu/futureofmedia/about
- 99. Hunter E, Mac Namee B, Kelleher J. A hybrid agent-based and equation based model for the spread of infectious diseases. JASSS. 2020;23(4).
- 100.
Owoeye SM. Epidemiological game-theory dynamics of the co-evolution of infodemic and pandemic with vaccination. Georgia State University; 2022.
- 101. Bedau MA. Weak emergence. Philosophical perspectives. 1997;11:375–99.
- 102. Box GEP. Science and statistics. Journal of the American Statistical Association. 1976;71(356):791–9.
- 103. Brashier NM, Pennycook G, Berinsky AJ, Rand DG. Timing matters when correcting fake news. Proc Natl Acad Sci U S A. 2021;118(5):e2020043118. pmid:33495336
- 104. Cook J, Lewandowsky S, Ecker UKH. Neutralizing misinformation through inoculation: exposing misleading argumentation techniques reduces their influence. PLoS One. 2017;12(5):e0175799. pmid:28475576
- 105. MacFarlane D, Tay LQ, Hurlstone MJ, Ecker UKH. Refuting spurious COVID-19 treatment claims reduces demand and misinformation sharing. J Appl Res Mem Cogn. 2021;10(2):248–58. pmid:33391983