Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

The statistical and dynamic modeling of the first part of the 2013-2014 Euromaidan protests in Ukraine: The Revolution of Dignity and preceding times

  • Yassin Bahid ,

    Roles Formal analysis, Methodology, Software, Validation, Visualization, Writing – original draft, Writing – review & editing

    yassin.bahid@colorado.edu

    Affiliation Department of Applied Mathematics, University of Colorado, Boulder, Colorado, United State of America

  • Olga Kutsenko,

    Roles Conceptualization, Writing – review & editing

    Affiliations Department of Sociology, Taras Shevchenko National University of Kyiv, Kyiv, Ukraine, Berlin Technical University, Berlin, Germany

  • Nancy Rodríguez,

    Roles Conceptualization, Formal analysis, Methodology, Supervision, Writing – original draft, Writing – review & editing

    Affiliation Department of Applied Mathematics, University of Colorado, Boulder, Colorado, United State of America

  • David White

    Roles Data curation, Formal analysis, Methodology, Software, Writing – original draft, Writing – review & editing

    Affiliation Department of Mathematics, Denison University, Granville, Ohio, United States of America

Abstract

Ukraine’s tug-of-war between Russia and the West has had significant and lasting consequences for the country. In 2013, Viktor Yanukovych, the Ukrainian president aligned with Russia, opted against signing an association agreement with the European Union. This agreement aimed to facilitate trade and travel between the EU and Ukraine. This decision sparked widespread protests that coalesced in Kyiv’s Maidan Square, eventually becoming known as the Euromaidan protests. In this study, we analyze the protest data from 2013, sourced from Ukraine’s Center for Social and Labor Research. Despite the dataset’s limitations and occasional inconsistencies, we demonstrate the extraction of valuable insights and the construction of a descriptive model from such data. Our investigation reveals a pre-existing state of self-excitation within the system even before the onset of the Euromaidan protests. This self-excitation intensified during the Euromaidan protests. A statistical analysis indicates that the government’s utilization of force correlates with increased future protests, exacerbating rather than quelling the protest movement. Furthermore, we introduce the implementation of Hawkes process models to comprehend the spatiotemporal dynamics of the protest activity. Our findings highlight that, while protest activities spread across the entire country, the driving force behind the dynamics of these protests was the level of activity in Kyiv. Furthermore, in contrast to prior research that emphasized geographical proximity as a key predictor of event propagation, our study illustrates that the political alignment among oblasts, which are the distinct municipalities comprising Ukraine, had a more profound impact than mere geographic distance. This underscores the significance of social and cultural factors in molding the trajectory of political movements.

Introduction

Ukraine’s historical and cultural ties with Russia forged over centuries and solidified by its emergence as an independent nation from the dissolution of the Soviet Union in December 1991, stand in stark contrast to its more recent inclinations toward Western nations. This dynamic has given rise to a complex and ongoing struggle for national identity [1]. While a significant portion of the Ukrainian population maintains enduring connections with Russia, an equally substantial contingent leans toward Western influences. In 1994, Ukraine entered into an important partnership and cooperation agreement with the European Union, a landmark accord detailed in [2]. This agreement not only facilitated bilateral trade but also encompassed diverse domains such as science, technology, mining, and maritime transport, signifying the comprehensive scope of engagement between Ukraine and the EU.

During the early years of the 21st century, a pressing question emerged within Ukraine’s political discourse: whether to align itself with the European Union. This deliberation was viewed by Russian President Vladimir Putin as a destabilizing development [3]. The extent to which individuals within Ukraine aspired to align with the West has evolved into a central issue catalyzing political polarization within the nation [3]. From 2005 to 2010, the Ukrainian presidency was held by Viktor Yushchenko, who pursued closer integration with the European Union. Yushchenko’s aspiration to formalize this commitment through an association agreement, aiming for a deepened engagement, was hindered by the limitations of his political influence. The outcome of the 2010 presidential election underscored the challenges he faced, as he was defeated by President Viktor Yanukovych, who maintained a more Russia-oriented stance [4]. The year 2013 marked a turning point when President Yanukovych, despite broad parliamentary approval, declined to sign the aforementioned association agreement with the European Union. The interplay of domestic and external influences, including significant pressure from Russia, guided his decision to reject the agreement [5]. This choice was the triggering event for the Euromaidan protests, known as the Revolution of Dignity, which swept across Ukraine, predominantly centered in Kyiv, before spreading throughout the nation [3].

In this study, our primary objective is to discern the underlying determinants propelling these protests. Specifically, our investigation centers on two key aspects: the potential diffusion of protest activities and the ramifications stemming from the frequency of arrests and injuries witnessed within a given protest incident, and the relationship to subsequent protest dynamics. To study these phenomena, we make use of an extensive repository of protest data amassed by the Center for Social and Labor Research in Ukraine, accessible through their https://www.cslr.org.ua/en/protests/ website. Our analytical approach is anchored in the deployment of both regression and ARIMA models. Through these quantitative methodologies, we aim to understand the interplay of variables. Our findings underscore the existence of self-propagation within the realm of protests, with prominence during the latter phase of 2013, coinciding with the critical Euromaidan upheaval.

The second fundamental aim of this study is the development of models of the spatial and temporal dynamics characterizing the escalation of events during the 2013 phase of the Euromaidan revolution. This analysis covers a spectrum of events, encompassing both pro-Maidan and anti-Maidan manifestations. To accomplish this, we make use of Hawkes processes, a class of point processes known for modeling self-excitation. These models can detail the trajectories of escalation, which extend beyond Kyiv. Notably, our investigation unveils Kyiv as the central force behind the spread of events throughout the nation. While geographical distance does not exert a discernible effect, our study discerns a significant impact emerging from the political affinity of the different regions. Moreover, we identify contagion patterns both within Kyiv and across many oblasts, which represent the nation’s principal administrative subdivisions.

The Euromaidan movement has encompassed a diverse set of events, ranging from peaceful protests to more tumultuous episodes such as violent riots. Although there are substantial distinctions among these various event types, as outlined in [6], all of these occurrences contribute to and can be regarded as components of civil unrest. While a more comprehensive discussion could delve into the nuances of each event type, this falls beyond the purview of our research. The deployment of mathematical models has gained recognition as a powerful framework for studying such events, and diverse methodological approaches have been developed. In previous works such as [6, 7], an agent-based paradigm was adopted to model civil unrest, encapsulating the intricate dynamics underlying such phenomena. A distinct avenue of inquiry ventured into the use of non-linear dynamics and chaos theory, as showcased in [8], wherein the Russian Empire’s labor strikes during the period spanning 1895 to 1905 were subjected to rigorous analysis. Other pertinent dynamical approaches can be found in [9, 10]. Meanwhile, evolutionary game theory has found application in the works [11, 12]. Epidemiological models, as exemplified in [1315], have gained traction as tools to understand these societal events. A noteworthy framework is that of kinetic theory, which was used in [16] to model the dynamics of epidemic propagation.

In this work, we opt to use stochastic point processes known as Hawkes processes to model the spatiotemporal dynamics of the 2013 protesting events. While Hawkes processes were initially introduced to model seismic activities, their utility has extended to the modeling of events in diverse fields including finance, neuroscience, social sciences, and computer science. Some examples of the uses of Hawkes processes include but are not limited to EU instances of terrorist attacks [17], gun violence [18], gang-related violence [19], and incidents of disorder that unfolded during the COVID-19 pandemic [20]. Hawkes processes are particularly well suited to model self-propagating events, a pattern observed in the dynamics of Euromaidan.

Historical context and timeline

The Orange Revolution and Ukraine’s pull between the West and Russia

In the aftermath of World War I and the Russian Revolution of 1917, a substantial portion of the Ukrainian territory came under the dominion of the Soviet Union. Notably, certain enclaves of western Ukraine found themselves divided among Poland, Romania, and Czechoslovakia [21]. Ukraine’s independence materialized in 1991, following the dissolution of the Soviet Union. The initial phase of Ukrainian autonomy was marked by numerous challenges as the nation grappled to navigate the implementation of both economic and political reforms. These initial stages were marked by escalating social tensions, which grew into a watershed moment characterized by large-scale demonstrations, now known as the Orange Revolution. The origins of this movement lay in the contested aftermath of the 2004 presidential elections, during which President Viktor Yanukovych proclaimed victory, an assertion contested [3, 22]. The incendiary nature of these events led to the nullification of the election results, forcing Ukraine’s Supreme Court to mandate a new election process. After the new election period, Viktor Yushchenko was named president of Ukraine. However, the support for the protests was not uniform among the Ukrainian people. The support primarily coalesced through a collaborative effort spearheaded by individuals coming from the western and central regions of Ukraine [23].

Certain scholars, including Kuzio, a distinguished professor of political science at the National University of Kyiv-Mohyla Academy, advance the perspective that Ukraine’s trajectory post-independence has led to the emergence of two distinct strands of nationalism. Within this paradigm, civic nationalism takes root in western and central Ukraine, while pro-Russian sentiment gains prominence in the eastern regions [23]. Kuzio underscores that the pronounced mobilization witnessed during the Orange Revolution was propelled by the fervor of civic nationalism.

The 2010 Viktor Yanukovych became the fourth president of Ukraine. This event in itself served as a testament to the enduring fracture in Ukraine’s political landscape. A trend surfaces when studying the political allegiances of the various administrative divisions. Generally, the oblasts situated in the western and central regions align more closely with Western principles and ideologies, while their eastern and southern counterparts tend to exhibit a pro-Russian disposition. This dichotomy is illustrated in Fig 1, which shows the support for the pro-European Union candidate across the 2004, 2010, and 2014 presidential elections. The figure underscores a gradual attenuation in pro-Western inclinations as one traverses from the western to the eastern extremities of the nation, rather than a stark demarcation between the two political inclinations.

thumbnail
Fig 1.

The presidential voting maps of Ukraine: (a) 2004 (b) 2010 (c) 2014.

https://doi.org/10.1371/journal.pone.0301639.g001

Euromaidan: The Revolution of Dignity

In 2009 the Eastern Partnership (EaP), a joint initiative involving the European Union, its Member States, and six eastern European Partner countries: Armenia, Azerbaijan, Belarus, Georgia, the Republic of Moldova, and Ukraine, was launched [24]. A high point of this partnership for Ukraine was the Association Agreement with the European Union, a bilateral agreement between the EU and a third country. It served as a way to open up borders between the EU and the third-party countries for trade and travel [25]. This agreement established political and economic ties between Ukraine and the West [26]. The Ukrainian parliament had approved the final agreement and in late November 2013, President Yanukovych signaled his support for the agreement. However, a visit to Moskow appears to have changed his mind. On November 21, 2013, Ukraine rejected draft laws that aimed to release jailed opposition leader Yulia Tymoshenko, a requirement imposed by the European Union for the association agreement, and suspended plans for the signing of this landmark agreement. Moreover, Ukraine announced that it would renew dialogue with Russia [5]. While President Yanukovych’s support in the western oblasts was always small, his support in the eastern regions started to decline after the contested decisions associated with his presidency, Fig 1c.

The Euromaidan protests began in earnest the evening of November 21, 2013, when more than 1,500 Ukrainians marched to the Maidan Square in Kyiv in disapproval of the decisions taken by their government earlier that day [27]. In the next days, the peaceful civil protests continued in Maidan Square and throughout other regions of the country, with these events achieving a very large number of protesters over the weekend [3]. On the night of November 30, 2013, things became violent when the government ordered the Berkut police, a special unit of the Ukrainian police within the Ministry of Internal Affairs, to disperse the square. It was documented that the unit used force to violently disperse the Maidan Square protesters [28]. The next day the protesters reoccupied the square. That day saw multiple riots in Kyiv and a large number of journalists being injured by the police [29]. On December 11, 2013, the Berkut special police unit and interior ministry troops descended on protesters violently in an attempt to break up the Euromaidan protests [30].

On January 16, 2014, the Ukrainian Parliament signed new anti-protest laws, which restricted freedom of speech and freedom of assembly [31]. In reaction, on Sunday, January 19, 2014, over 200,000 Ukrainians showed up to protest in the center of Kyiv [32]. More violence erupted on January 22, 2014, on Hrushevskoho Street in Kyiv resulting in the deaths of protesters [33]. A few days later, on January 25, 2014, Arsenii Yatsenyuk, the former Economy and Foreign Minister of Ukraine and leader of the Batkivshchyna, an EU-leaning party was offered the Prime Minister’s position, which he declined, citing the need for the Ukrainian citizens to decide on their future leader and not the current government they were protesting against [3]. More deadly protests occurred between February 18-20, leaving more than 100 individuals dead [34]. The European Union became involved on February 21, 2014, and introduced sanctions against the Ukrainian leaders. On February 22, 2014, President Yanukovych fled to Donetsk and then Crimea and parliament voted to remove him from office [35]. Afterward, the Russian army annexed Crimea, which led to the Russo-Ukrainian War. The election, first scheduled for March 2015, was held following Euromaidan. It led to the victory of President Petro Poroshenko [36] who led Ukraine through the integration with the EU by signing the EU Association Agreement [37].

Materials and methods

Data collection and limitations

Created in 2013, the Center for Social and Labor Research is a Ukrainian non-profit organization focused on gathering and analyzing data about socioeconomic problems in Ukraine. To model the spatiotemporal dynamics observed in the Euromaidan protests, we use their protests data set [38] from 2013 (although some events spill into 2014). This data set was created and curated, largely by academics without any explicit political agenda [39], using information aggregated from newspaper articles found through monitoring of news feeds from more than 190 national, regional, and activist web media. The data set has been continually updated from 2013 until the present day. The data set includes protests, rallies, riots, and police crackdowns, among others, as events spanning a range of days [40]. The main unit of the data set is a “protest event” and therefore we use the terms “protest” and “event” interchangeably. The data takes into consideration the bias of certain events and their over-representation. The data curators addressed the over-representation by adhering to certain principles, e.g., including data from national, regional, and local news sources, and taking care not to double-count any event. Potential bias is also taken into consideration, as certain events, e.g., those backed and instigated by the government, were not counted. The data set contains both large and successful events that were reported in mass media, as well as events that were considered as failed, e.g., because of violent law enforcement intervention. In some cases, the data set details the number, demographics, and political leanings of the participants. In every case, the data set details the date(s) and the locations of the events. In some cases, e.g., an extremely large protest or a failed protest, data on the number of protesters is missing. We discuss this in more detail in Appendix 0.1. While the report does give a breakdown of the level at which the events happen, e.g., neighborhoods or cities, we choose to aggregate the events per oblast as we want to look at a holistic view of the events’ spread, but also because certain events’ location is only defined on the oblast level. Note that some oblasts share the same name as certain cities in the region, however, in this work, we use these names only to refer to the oblasts. For example, when we refer to Kyiv we mean the oblast and not the city.

For every event in the data set, we know which oblast it occurred in. However, since the data comes from media reports, it is possible that events happened that were never included in the data set. For each oblast, the data curators drew from eight or nine news sources to determine which events happened in that oblast. It is important to recognize that an event only appears in the data set if it was reported in the news, and this presents the potential of bias in the data. We did find evidence of events that occurred but were not reported in the media, especially in eastern oblasts. We did not find evidence that this phenomenon was widespread, but we nevertheless stress the caveat that, if the data set exhibited significant bias, this would affect our models. We encourage future researchers to replicate our findings on data sets with less bias. Related research (replicating the appropriateness of Hawkes process models for protest dynamics, and the relationship between injuries and the number of protests) on a data set of protests in the United States has been carried out in [41].

The statistical models below demonstrate that events exhibit self-exciting behavior, e.g., more events today are associated with even more tomorrow. The statistical model is based on the entire data set and does not omit any oblasts. There is a variable, “Action type”, that takes values such as “protest”, “positive response”, or “negative response.” The “negative response” events are ones where law enforcement was involved in a way perceived as negative. More precisely, the term “negative response” refers to events featuring repression, suppression, or obstruction of the protesters, either physical or legal [40]. An event is coded as a “negative response” if it features arrests, attacks, beatings, a blockade, a confrontation with police, a conviction, deportation, employees being fired (or students being expelled) for participating in a protest, a fight or gunfight, harassment, hacking, interrogation, imprisonment, lockout in response to the protest, martial law, a criminal case, law enforcement preemptive obstruction, police searches of participants, or a shooting. There is also a variable, “Event series”, that gives an idea of what the event is about. When this variable is valued as “Euromaidan”, the event was about the Euromaidan movement.

Our best model includes covariates detailing the number of “Euromaidan” events per day, the number of “negative response” events per day, and the number of civilian injuries per day. Of the 6627 events, 3220 have the string “Euromaidan” occurring in the “Event series” variable, which denotes that these events were associated with the Euromaidan event. This includes both pro and anti-Euromaidan events, in any oblast in the country. Similarly, there are 1102 events with “negative response” and 405 that are both “Euromaidan” and “negative response.”

Pre-Euromaidan.

Fig 2 depicts the number of events per day from January 1 to December 31, 2013. We observed a relatively small activity before Euromaidan, which lasted between November 21, 2013, and February 23, 2014. The average number of events before the beginning of the Euromaidan protests is very low, less than one per day per oblast. We shall use each oblast’s average number of events before self-excitation as a base number of protests in our model. Most of the events presented during this time were small-scale protests with few protesters.

thumbnail
Fig 2. The number of events including protests, riots, and negative police response in Ukraine from January 1, 2013, to January 04, 2014.

https://doi.org/10.1371/journal.pone.0301639.g002

Euromaidan.

The impact of the EU trade agreements on Ukraine’s relationship with Russia, Ukraine’s biggest trade partner at the time, pushed President Yanukovych to delay signing the agreement. In turn, this act triggered the protests constituting Euromaidan. As seen in Fig 3, protesters started gathering in the Maidan square on November 21, 2013. After the failure of the EU trade agreement on November 29, there were major police crackdowns on protesters, which in turn led to riots in Kyiv. The protests continued for multiple weeks afterward.

thumbnail
Fig 3. The timeline of Euromaidan from November 25, 2013, to February 23, 2014.

https://doi.org/10.1371/journal.pone.0301639.g003

Fig 4 offers a geographic visual of the spread of events for the first six days of protests. The protests were heavily localized in the Kyiv oblast, and more specifically in the Maidan square located in the capital. Once these events reach their peak, we observe the spread of activity throughout the western and central parts of the country. These are the regions that are most pro-EU [42] as one can see in Fig 1 which shows the votes for candidate Petro Poroshenko, the pro-EU candidate, in the 2014 elections following Euromaidan. We shall use these votes as an indicator of the political leanings of each oblast in our model later on. We believe that the 2014 votes are most reflective of the Ukrainian political map as they are from the election that happened in response to the protests and reflect the public opinion’s shift from the 2010 election where President Yanukovych won by a thin margin.

thumbnail
Fig 4. A spatiotemporal illustration of the number of total events per oblast from November 21, 2013, to December 05, 2013.

These events include protests, rallies, riots, and police crackdowns.

https://doi.org/10.1371/journal.pone.0301639.g004

Fig 5 shows the wide difference between the magnitude of events in Kyiv and all other oblasts. Fig 6 details the maximum number of events per day and total number of events per oblast. These graphs purposefully omit Kyiv oblast, as the total number of events and the maximum number of events far exceeds any other oblast (a total of more than 800 events over the entire period with a maximum of 47 events for one day). Moreover, one can see that some regions register little activity, namely: Chernihiv, Donetsk, Dnipropetrovsk, Kharkiv, Kherson, Kirovohrad, Poltava, Sumy, and Zhytomyr. The geopolitical situation, i.e., their proximity to Russia and their political leanings, might be the direct cause of these discrepancies. As there is no represented activity in such oblasts, we choose to omit them from our model. The lack of activity is mainly due to the lack of representation of such oblasts’ activity in the media. There were regular pro-EU protests and later anti-Maidan ones in these oblasts. However, they were not very crowded, rarely manifested violent events, and were not sensational. Thus, they did not regularly make the pages of popular media.

thumbnail
Fig 5. The number of events, such as protests, throughout different oblasts in Ukraine from November 25, 2013 to February 20, 2014.

https://doi.org/10.1371/journal.pone.0301639.g005

thumbnail
Fig 6.

(a) The maximum daily number of events in all oblasts excluding Kyiv. (b) The total number of events from November 11, 2013, to February 02, 2014, in all oblasts excluding Kyiv.

https://doi.org/10.1371/journal.pone.0301639.g006

A careful look at the data, unfortunately, does show some discrepancies. First, the number of protesters is not reported for all events. Second, the number of deceased individuals is heavily underrepresented. For example, the Kharkiv Human Rights Protection Group, one of Ukraine’s oldest and most respected human rights organizations, reports the death of Pavlo Mazurenko, on the 24th of December, 2013, in a Kyiv hospital after a violent police interaction [43]. However, this tragic event is missing from our data set. Third, and most important is the fact that the data set is missing important activity from the first two months of 2014. The only events reported in 2014 are those that began in 2013 and spilled over into 2014. This period has seen a resurgence in protests and clashes between the pro-Euromaidan protesters and police [4446]. The turmoil that Ukraine plunged into after the Euromaidan revolution, namely the Russo-Ukrainian war, made it difficult to collect data or local news reports in the country. The international news unfortunately does not offer the level of granularity needed for this type of analysis we are conducting.

The fact that the data set does not contain events that began in 2014 forces us to focus on modeling only the first half of Euromaidan. Given the fact that there was a clear break in the protests, before the resurgence in mid-January, it is sound to study the first half of Euromaidan as its own contained self-excitation event, which will be further demonstrated in the statistical analysis section. Moreover, we must also model the number of events and not the number of protesters. This can be seen as a limitation given that large protests may have a larger impact than others. However, as seen in [13], the magnitude of a protest does not always correlate to the importance of the event. For example, an event that has only a few protesters can cause a chain reaction and lead to more important events in the future. Note also that we do not differentiate between pro and anti-European Union protests, and do not take into account the different political leanings of all groups involved in the different events. The main reason behind this is that all protests do add to the general tension and, as seen from the statistical analysis, lead to more protests. See the statistical analysis section below for details supporting this assertion. As one can see in Figs 5 and 7, which respectively illustrate the overall number of events in all of Ukraine and each of its oblast per day from November 21, 2013, to January 4, 2013, there is a lack of activity starting from January 1, 2014. This is because the only events in January and February are those that were initiated in 2013. However, as mentioned above, the protests did increase significantly after the new year and resulted in the ousting of the president of Ukraine. Therefore, the data set is missing most key events that happened during the second part of the protests.

thumbnail
Fig 7. The total number of events per day in Ukraine from November 21, 2013, to January 4, 2014, including protests, rallies, riots, and police crackdowns registered in media under scrutiny.

https://doi.org/10.1371/journal.pone.0301639.g007

Statistical analysis

The data set consisted of one row per protest, with columns containing information on the start date, end date, oblast where the protest occurred, and numbers of protesters, arrests, injuries, and fatalities. Unfortunately, due to missing data issues, most of these numerical fields cannot be used, as remarked above, but we do use the number of injuries in our final model, because of its relevance in a similar model in [41]. Our primary goal in this section is to build a model to predict the number of protest events on any given day and to show that the time series “number of events” exhibits self-excitation behavior. Towards that goal, we first wrangle the given data set into a different form, where the fundamental unit is a day, rather than a protest. Each day in this new data set will have information about how many protests occurred that day, as well as three covariates listed below.

To create a variable containing the number of events per day, we transformed the data and extracted several time series:

  • pt is the number of events on day t (that is, all events where t is between the start and end date, inclusive)
  • nrt is the number of events with a “negative response” on day t
  • et is the number of events associated with Euromaidan on day t
  • it is the number of civilians injured on day t

In this context, an event E occurring on day t can influence the number of events on day t + 1 either by extending to last another day, or by spawning subsequent events (e.g., inspired by news coverage regarding E). We will show a positive relationship between pt and pt+1, which justifies the slogan “more events today are associated with even more tomorrow.” Furthermore, we fit a threshold model to pt, inspired by the spread of epidemics, that illustrates that after a certain tipping point, the relationship between pt and pt+1 is even larger. This provides evidence that a Hawkes process model is appropriate for these data.

The first step is to figure out which of these variables leads/lags the others. Is the number of injuries today associated with the number of events tomorrow, or is it the other way around? We use cross-correlation analysis [47] after pre-whitening to answer this question. See Appendix 0.2 for details on this methodology. Carrying out this procedure (that is, cross-correlation analysis after suitable pre-whitening) for the time series above demonstrated that pt is closely and positively associated with lags of the three covariate time series, e.g., most strongly with it−1, et−3, and nrt−5. That is, if there were many injuries today, negative response events today, or Euromaidan events today, then one can expect more events overall tomorrow and in the days to come. This suggests that injuries to protesters are associated with more future protests, and provides evidence that Euromaidan events were driving the protests. It also suggests that government negative responses (e.g., police use of force, beatings, etc.) are associated with more future protests, i.e., have an inflammatory rather than suppressing effect on protests. The cross-correlations of the pre-whitened data are useful to be able to say there is a statistically significant association between pt and the lagged variables but lack a meaningful real-world interpretation. So, we instead report the actual correlations (reiterating the caveat that, without pre-whitening, these correlations can be affected by exogenous variables) to illustrate the strength of the relationships between the variables in question.

This table says, for example, that the correlation between pt and et is 0.91, and between pt and nrt−1 is 0.71, etc. Lags on the other side (e.g., between pt and nrt+1) were smaller, and the pre-whitened cross-correlation showed that such lags lack statistical significance. The pattern in the table above is consistent with pt being affected by each of the three covariates and their lags. As the correlations above are computed without pre-whitening, the lag 2 correlations are likely primarily due to the lag 1 correlation, since et−2 is correlated with pt−1 and hence with pt as we will see below.

We turn now to statistical models. Autoregressive integrated moving average (ARIMA) models are standard statistical models for time series data. They are built of three parts: an AR part, an I part, and an MA part. The methodology for the AR part identifies, for a given time series yt, which lagged time series ytk have a statistically significant correlation with yt, and then builds a linear model where the response variable is yt, the explanatory variables are the relevant ytk, and the residuals now ideally exhibit independence. In situations of multivariate time series, one can model yt as a linear function of its past (ytk’s), of another time series xt, and of the lagged explanatory time series xtk. The MA part also allows explanatory variables of the form ϵtk, meaning the error term of the model for ytk. The purpose of this is to allow large “shocks” from the past (e.g., a day on which the number of events was much larger than expected) to affect the present. The I part involves replacing the starting time series yt with a differenced time series. For example, the first-order differenced time series Δyt is defined as ytyt−1, and differencing can be iterated, so Δ2yt = Δ(Δyt) = Δ(ytyt−1) = (ytyt−1) − (yt−1yt−2). Differencing is used to shift from a non-stationary time series yt to a stationary time series Δiyt, the i-times differenced time series, and then fit a model (using AR and MA terms) to Δiyt. The ARIMA framework assumes the given time series can be made stationary by iterated differencing. To simplify notation, we write zt for Δiyt (where i is chosen to make zt stationary) and we write xt generically to mean any explanatory variable (including a lagged ztk). Seasonal ARIMA models further allow seasonal AR terms (e.g., yt−7, the number of events one week ago), seasonal MA terms (e.g., ϵt−7), and seasonal differencing. The notation SARIMA(p,d,q)x(P,D,Q)[S] represents a model with p regular lags, P seasonal lags (e.g., ytS, yt−2S, etc.), d regular differencing, D seasonal differencing, q regular MA terms, and Q seasonal MA terms. For us, S will be 7, so our time series will depend on non-seasonal lags (like, yesterday or the day before) as well as seasonal ones (like, what happened seven days ago or 14 days ago).

Once a suitable SARIMA model for zt has been fit, whose residual vector is independent random white noise that does not depend on time, the model can be used to forecast future values of zt (and hence of yt). The model coefficients and standard errors can be used to determine which explanatory variables xt are statistically significant (and the direction of their influence), and the Akaite Information Criterion (AIC, from information theory) can be used to decide between competing models. Using these techniques we see, for example, that pt is strongly influenced by nrt and et, with negative reaction events and Euromaidan events both associated with more total events in the subsequent days.

Time series analysis starts from the assumption that the time series of interest is stationary (or that a differencing operation can make it so), which means, informally, that its mean, variance, and autocorrelation structure do not change over time. In the case of protest data from Ukraine in 2013-2014, the time series does change radically, in November of 2013. Hence, differencing was required to make the time series stationary.

We used both classical time series analysis techniques (e.g., inspection of time plots, unit root tests, differencing, (partial) autocorrelation function graphs, cross-correlation functions, Box-Ljung tests, and inspection/tests of residuals) as well as automated model-fitting techniques. Both approaches resulted in the same model.

Statistical analysis results.

Our best models for the “number of events” from its history required first-order differencing to make pt stationary. We let Pt denote Δpt, and note that Pt passes tests of stationarity. Our best ARIMA model for the autocorrelation structure is an ARIMA(2,1,3) model, meaning that two lags, Pt−1, Pt−2 and three moving-average terms ϵt−1, ϵt−2, ϵt−3 are included in the model. However, the residuals still exhibited autocorrelation, so we allowed seasonal terms, resulting in a SARIMA(2,1,3)(1,0,2) [37] model for pt. We use the notation ar to stand for “autoregressive” (e.g., ar1 refers to the coefficient of Pt−1), ma for “moving average” (e.g., ma3 is the coefficient of ϵt−3), sar for “seasonal autoregressive” terms like Pt−7, and sma for “seasonal moving average” terms ϵt−7 and ϵt−14. Below, we provide the coefficients and standard errors for our best model for the time series of protest events, based on its history.

        ar1     ar2     ma1    ma2     ma3   sar1    sma1   sma2
coef 1.4594 -0.7899 -1.8331 1.1981 -0.1720 0.9191 -0.9114 0.0902
s.e. 0.0702  0.0688  0.0913 0.1512  0.0742 0.0867  0.1048 0.0576

That is, the model first transforms pt to Pt = Δpt, then determines that Pt depends, in a statistically significant way, on Pt−1, Pt−2, and Pt−7, as well as ‘shocks’ (e.g., days with an unusual number of events) at times t − 1, t − 2, t − 3, t − 7, and t − 14. One could spell out the model as: (1)

If one wished to predict the number of events tomorrow, one could use this model to predict Pt+1 and then use the fact that Pt+1 = pt+1pt to predict pt+1 as pt + Pt+1, i.e., the number of events today plus the output of this model based on the history of the event time series.

In this model, the residuals exhibit no autocorrelation, all terms are statistically significant, and the AIC is improved relative to the non-seasonal ARIMA model. Because the residuals exhibited mild heteroskedasticity, we investigated fitting GARCH models, but they did not improve on the SARIMA model. The heteroskedasticity was fixed by the final model introduced below, using the covariates. Because the residuals were not normal, non-parametric (bootstrap) methods were used to ascertain statistical significance. This model achieved an AIC of 3162.5.

We turn now to multivariate models. Including it, nrt, and et as explanatory variables for pt, and fitting a SARIMA model to the residuals, resulted in a strong improvement over the model above (AIC = 2680). All three of the “number of injuries,” “number of negative responses,” and “number of Euromaidan events” have statistically significant predictive power for the number of events (as do all the SARIMA terms in the model). Furthermore, this model no longer exhibits heteroskedasticity, thanks to the inclusion of the Euromaidan terms.

The best model begins with it, nrt, and et, then fits a SARIMA(1,1,1)(2,0,0) [7] to the residuals of that model. We list the coefficients and standard errors below:

         ar1     ma1   sar1   sar2   ts_i  ts_nr   ts_e
      0.2341 -0.9466 0.1829 0.2399 1.0326 1.1902 0.7970
s.e.  0.0536  0.0188 0.0497 0.0485 0.2669 0.1144 0.0279

The model can be spelled out as: where the differenced series Xt = Δxt satisfies:

The dependence on the past is now encapsulated by the residual term xt, which makes the model cleaner but does not change our earlier assertion that pt depends in a statistically significant way on the lags of it, nrt, and et.

Interpreting the coefficients, we learn that every injury is associated with 1.03 more events (so, if one event led to 100 people getting injured, we would expect 103 more events), and every negative response event is associated with 1.19 more events. We also fit models including lagged terms such as nrt−1, but this did not improve the model, as the dependence on the past is already encapsulated by the SARIMA term, xt.

Lastly, we fit a threshold model to pt. Threshold models are the statistical analog of a self-excitation process and are often used to model the spread of epidemics. The idea is to pick a threshold r (e.g., using cross-validation), fit one SARIMA model to all ptr, and fit a different SARIMA model to all pt > r. Threshold models have more parameters than simple SARIMA models, but this allows them to model time series that behave differently during periods of high intensity. While such models can be used for prediction, our goal in fitting this model was to determine if there was a moment when the protest dynamics accelerated and to find the date on which this change occurred. Our best threshold model achieved an AIC of 2107, demonstrating that it was a better fit than the preceding models. Nevertheless, in this case, due to the complexity of the threshold model, we prefer the models above for explaining the relationships between the variables. When we fit a threshold model to pt, we found that the only time period where pt was above our threshold (70 events per day) occurred starting on November 22, 2013. Sociologists with expertise in Ukraine selected the date November 29, 2013, as the date on which the protests changed in nature (based on the EU trade agreement and police crackdowns). This perfectly fits our model, which says that the number of events on any given day depends on the days in the preceding week. The two SARIMA models (above/below the threshold, or, equivalently in this case, before/after the cutoff date) were both substantially better fits than using one SARIMA model for the entire 2013 year. We experimented with other cut-off dates in November of 2013 and the choice of cut-off did not significantly change the conclusion.

Dynamic modeling

Hawkes process.

Initially developed to model seismic events [48], Hawkes processes appear well-suited for characterizing these protests. This category of stochastic point processes functions as a counting process, where the history of events shapes the likelihood of future occurrences. For instance, each event excites the process, heightening the probability of upcoming events. Hawkes processes have found applications in describing financial processes [49], mass shootings [50], and the popularity of tweets [51]. While some formulations of Hawkes processes adopt a continuous form, the nature of our dataset, presenting a daily sum of events, necessitates the use of a discretized variation.

In a Hawkes process, events are portrayed as temporal points, with each event having the potential to trigger subsequent occurrences. The initiation of new events is influenced by past events, leading to a cascade-like effect. This self-exciting property implies that the occurrence of an event can increase the likelihood of more events happening in its vicinity, forming clusters or bursts of activity.

Mathematically, a Hawkes process is defined by its intensity function, which gauges the rate of event occurrences at any given time. This intensity function is influenced by both the baseline event rate and the impact of past events. As events unfold, they contribute to the intensity function, subsequently affecting the likelihood of future events. To fully comprehend Hawkes processes, one must grasp their two major components.

1—Kernel: In a Hawkes process, the kernel plays a pivotal role in elucidating how past events influence the occurrence of future events. The kernel, a mathematical function, models the impact of previous events on the process’s intensity, reflecting its inherent self-exciting behavior. It quantifies the temporal influence of past events on the intensity at a given time, measuring how much the occurrence of a past event affects the likelihood of a new event happening shortly afterward. The shape of the kernel function determines the form and strength of this influence, capturing the dynamics of the process by indicating the “memory” of the system and how it responds to past occurrences.

Mathematically, the kernel function is typically non-negative and integrates to a finite value. It is convolved with the history of event occurrences to compute the instantaneous intensity at any given time. As new spikes occur, they contribute to the intensity, influencing the likelihood of subsequent events. Different types of kernel functions, such as exponential, power-law, and Gaussian kernels, can model various temporal patterns and degrees of influence. The choice of the kernel function profoundly impacts the overall behavior of the Hawkes process and its ability to capture real-world dynamics.

Given that the events occur on a fixed time period, a natural assumption is that the expected number of events Nexp(t) and the observed data Y follow a Poisson process. Thus, fitting the model is a matter of minimizing the negative likelihood derived from the said process:

1.1—Classical exponential kernel: The exponential kernel is a fundamental component of the Hawkes process, serving as a common choice to model the temporal influence of past events on the occurrence of future events. This kernel embodies the principle that the influence of a past event on the intensity of the process decreases exponentially over time. Mathematically, the kernel function takes the form: (2)

Thus, the number of expected events at any time t, given self-exciting events happening at the times {t0, t1, ….., tn}, is of the form (3) where N0,ob is the average number of events before self-excitation, which serves as a base that the model will converge to after self-excitation, Nsec represents the maximum magnitude of the spike, Tex is a variable influencing the time needed for complete relaxation and return to the steady state after a spike, and represents the effect of the spike time on the number of expected events at a time t.

1.2—Susceptible population model: The Hawkes process presented in Eq (3) assumes no differences between oblasts. However, the population and their political leaning play a role in the observed number of events. Moreover, the spikes’ magnitude changes over time as the tension grows in the system before relaxing. Thus, we modify the Hawkes process to reflect the population in each oblast and the rise and subsequent drop in magnitude. In addition, we see more activity in the pro-EU regions and consequently make the ansatz that the population that is susceptible to participate in these protests are pro-EU individuals. This leads to the following model: (4) where pr(ob) is the population ratio of each oblast, vr(ob) is the pro-EU vote in each oblast scaled by the entire countries’ vote from 2014, and is a term that reduces the magnitude of the spike with de being a time delay. Note that when Nsec is multiplied by pr(ob) and vr(ob), the model will scale the number of expected events for each oblast. We use population ratio as it allows for the population factor to stay between [0, 1].

1.3—Interaction effect between oblasts: While Eq (4) takes into consideration the specificities of the different regions, it does not reflect their influence on one another. We add a lag term to reflect such such effect to a couple of models, which we discuss below.

1.3.1—Geographical influence: In the study by Bonnasse-Gahot et al., [13], the dispersion of events was found to be influenced by the geographic distance between different regions in France. In our initial model aimed at accounting for spatial interactions between oblasts, we integrated geographical distance as a factor. We thus have the following model: (5) Where In this case, ‖objob‖ is the Euclidean distance of the central point of the two oblasts.

1.3.2—Political influence: The political polarization observed in Ukraine motivates the development of a model designed to examine the hypothesis that the contagion of events between oblasts is not primarily determined by geographic proximity but is instead influenced by political alignment. To investigate this, we employ an alternative factor based on voting ratios to assess such a model: (6) where and vr(ob) is the pro-EU vote in the oblast scaled by the entire countries’ vote from 2014, and c and d are two variables influencing the effect of this factor.

1.4—The effect of the number of injured individuals: The statistical analysis outlined above and previous research [41] demonstrates that the count of injured individuals had a discernible impact on forecasting the daily event count. The effect is observed even though a substantial number of injuries during the protests were not accounted for. Consequently, our model evolves to include such events and takes the form: where I(t, ob) is the number of reported injured people per day per oblast.

2—Spike times: While self-excitation is observed from the statistical analysis, it is not necessarily the case that all events contribute to self-excitation. Including all events would lead to an exponential growth of events with no time for relaxation. Thus, we incorporate thresholding where we consider event time in our model that is above a certain threshold. By implementing a threshold, we can control the events that contribute to the intensity function. This is particularly important in self-exciting processes, as it helps prevent excessive growth in the intensity function due to a cascading effect from numerous closely spaced events. Additionally, thresholding allows us to focus on events that have a significant impact on the process. This enhances the interpretability of the model, as it helps distinguish between events that genuinely contribute to the self-excitation mechanism and those that may be considered background noise. A carefully chosen threshold ensures that only meaningful events are considered in the modeling process. In practice, thresholding can significantly improve the computational efficiency of parameter estimation procedures for Hawkes processes. By excluding events below a certain threshold, the algorithm can concentrate on the most relevant events, reducing the computational burden associated with estimating parameters. Finally, introducing a threshold can contribute to the stability of the modeling process. It helps prevent overfitting and ensures that the model generalizes well to new data. Without thresholding, the model might become overly sensitive to minor fluctuations in the data, potentially leading to poor generalization. We incorporate two thresholding methods: (1) uniform thresholding, where spike times are chosen at regular temporal intervals. While the approach is naive, it does not add much complexity to the system and assumes that the effect of events is delayed. (2) oblast sensitivity, where the threshold deciding whether or not an event is a spike time is decided by the number of events in that particular day relative to the maximum number of events per oblast. While this approach adds a certain level of complexity, it far outperforms the the uniform model from an AIC perspective.

3—Exogenous effect: In complex social phenomena such as protests, it is important to note the complex interplay between internal processes and external stimuli in shaping protest phenomena. Levels of protest are influenced not solely by internal factors but also by external events. The initial incident that sparks a protest, termed the triggering event, is a crucial internal factor that should be incorporated into the model. However, additional occurrences, like government concessions or their absence, further contribute to the intensification of the situation. For example, the study by Varol et al. in [52] examines the impact of external events on a social media uprising associated with the Gezi Park movement in Turkey. In scenarios where political concessions are sought, external political events play a crucial role in shaping the dynamics of protests and should, therefore, be integrated into models. Mathematically, these external events act as external forcing terms in models [9]. Identifying events that influence protests is a nuanced process that requires a multidisciplinary approach and the integration of various data sources. Researchers often rely on comprehensive event databases, such as the Global Database of Events, Language, and Tone, which captures a wide array of events worldwide, including political demonstrations and social movements. Analyzing media reports, social media content, and government records provides valuable insights into the occurrence and context of events leading to protests. Scholars emphasize the importance of triangulating information from diverse sources to enhance the reliability of event identification [53]. As such we devised a comprehensive list of events that influenced the protests Table 1. This list was devised by checking multiple news sources for events adjacent to the protests that were qualitatively judged to influence the protests. One way in which exogenous events can shape the evolution of protests is exemplified by actions such as a ban on protests, which may decrease their occurrence, whereas confrontations with law enforcement might intensify the tension within the system. In mathematical terms, the impact of such events is represented as a δ pulse, and we adjust this pulse by the population ratio and vote ratio of each oblast. This leads to the following model: (7) where Nexo represents a parameter measuring the impact of exogenous events on the system and denotes the set of times when these external occurrences take place.

thumbnail
Table 1. 2013 exogenous events corresponding to the best fitting spike times to model Euromaidan.

https://doi.org/10.1371/journal.pone.0301639.t001

Results

Our investigation encompasses six primary variations of the Hawkes process framework, each building on the previous to address specific characteristics of protest events, including self-excitation, external influences, effects of different regions, and the number of injured individuals. Under the final model, Nsec, de, Tex, c, d, and p are the parameters to be determined by minimizing the negative log-likelihood. By systematically comparing these models and assessing their goodness-of-fit, we gain valuable insights into not only the suitability of Hawkes processes for characterizing the complex temporal dynamics inherent in protest data but also the different protest drivers that governed the Euromaidan revolution.

The classical kernel performs poorly, which is predictable given that it fails to capture the specificities of each oblast. Only including the susceptible population does not lead to a model that better predicts the dynamics of the protests. It is only when accounting for the effect of different oblasts that we see a significant increase in performance in both the negative log-likelihood and Akaike information criterion (AIC). With regards to spatial influence, while the distance-based model presented in Eq (5) performs well, the voting-based interaction model is far more effective at predicting the spread of events (Appendix 0.3). The fit of the Hawkes process experiences a remarkable enhancement when spike times are adjusted to exceed a two-day threshold and surpass 40% frequency. This improvement is consistent across various configurations, primarily because the model encounters difficulties in accurately capturing the self-excitation process when the threshold is set too low. By raising the threshold to a minimum of 40% of the maximum potential events in each oblast, the model aligns more closely with the expected behavior, resulting in a more precise fit. This threshold setting ensures that the model accounts for the underlying dynamics and optimally represents the observed data, thereby enhancing its overall performance. These results highlight the key fact that self-excitation depends on the sensitivity of the different regions and that 40% constitutes a global threshold for events to cause self-excitation. Once we add the effect of identified exogenous events presented in Table 1, the model using the 2014 voting data excels at predicting the protest dynamics as seen in Fig 8 achieving a negative log-likelihood of −1590.08 which stays far better than any configuration as presented in Table 2.

thumbnail
Fig 8. The predicted number of events per day in each oblast weighted by the susceptible population.

Nsec = 102.6., de = 4.18, Tex = 5.8, c = 2.6, d = 2.4, p = 2, and Nexo = 7.22.

https://doi.org/10.1371/journal.pone.0301639.g008

thumbnail
Table 2. Summary of the negative log-likelihood and AIC scores.

https://doi.org/10.1371/journal.pone.0301639.t002

Fitting the parameters for the best-fitting model gives an insight into the behavior of the protests. Given that Tex = 5.8, the effect of each spike decreases by a factor of . The magnitude of spikes that influence the number of expected events at reaches its highest value de = 4 days into the protests, the following major spikes are all due to the self-excitation from the previous event. The voting affinity between different oblasts plays a major role in the spread of protests. The number of injured individuals does affect the number of events as the number of injured to the power of p = 2 is proportional to the number of events.

The model succeeds in capturing the general behavior of protests. It excels at predicting the spatio-temporal spread of events. More specifically, the model captures the massive spike that happened on December 1, 2013, in Kyiv, thanks to the addition of the influence of the number of injured individuals, and its following spread through the western oblasts (Fig 9). The model captures the peaks and following self-excitation behavior in Ivano-Frankivsk, Khmelnytskyi, Kyiv, Luhansk, Lviv, Rivne, Odesa, and Vinnytsia. However, it struggles to capture the spikes in Cherkasy, Chernivtsi, Lutsk, and Ternopil.

thumbnail
Fig 9. A spatiotemporal illustration of the normalized predicted number of total events per oblast from November 21, 2013, to December 05, 2013, where spike times are determined from the average rate of change per oblast between two days.

These events include protests, rallies, riots, and police crackdowns.

https://doi.org/10.1371/journal.pone.0301639.g009

Discussion and conclusions

Euromaidan was a significant political and social movement that unfolded in Ukraine from November 2013 to February 2014. The protests, which were primarily peaceful at first, grew in scale and intensity as they continued. Protesters demanded President Yanukovych’s resignation and a shift towards closer ties with the EU. The protests took a violent turn in January 2014, after the Ukrainian government passed legislation restricting the right to protest. This led to clashes between protesters and law enforcement officials, resulting in several fatalities. Despite the violence, protesters remained steadfast in their efforts, and their resolve ultimately led to President Yanukovych’s removal from power in February 2014. The events of Euromaidan had significant implications, both within Ukraine and beyond. The movement had a profound impact on the country’s political landscape, leading to the formation of a new government and a pivot towards the EU. However, it also contributed to the annexation of Crimea by Russia and a prolonged conflict in eastern Ukraine. As such, the Euromaidan movement is a significant case study from a sociological and mathematical standpoint. Its unfolding provides insight into the dynamics of political movements and the role of civil society in shaping the direction of a country.

We use data from the Center for Social and Labor Research to study the spatiotemporal dynamics of the protest events. While the data set is missing important activity that occurred during the first two months of 2014 and also shows some discrepancies, such as the missing number of injured and deceased individuals, it remains useful for modeling the first half of Euromaidan. We choose to model the number of events and not the number of protesters, as the magnitude of an event does not always correlate with the importance of the event. As pointed out above, all protest events add to the general tension and lead to more events. Additionally, our statistical analysis shows the number of protest events per day was closely associated with the number of injuries, negative response events, and Euromaidan events on the previous day. We found that the number of events per day was strongly influenced by the number of events with a negative response and the number of events associated with Euromaidan, with negative reaction events and Euromaidan events both associated with more total events in the subsequent days. One can derive two main results. First, there is indeed self-excitation in the data, which justifies the use of the Hawkes process. Second, injuries and negative police responses play an important role in the process. The kernel used in our model takes into account the interactions between oblasts and the influence they have on each other due to political affinity.

The model accurately predicts the spatiotemporal dynamics of events during the Euromaidan protests in Ukraine. While the model struggles to accurately predict the magnitude of events in certain regions, it excels in predicting the spread of events. The political affinity between regions was found to be a more significant factor in determining protest spread than geographic distance, highlighting the importance of social and cultural factors. This is in contrast to other studies, such as that of the 2005 French riots where geographic distance was seen to play a key role in the spread of activity [13]. Thus, our study highlights the shift in protest contagion as being less dependent on geography (perhaps due to the immediate spread of information in current days) to being more influenced by political affinity. We note that the under-reporting of police violence negatively affects the model’s accuracy. The best spike times are those that take into account the specific reactions of each region. The differences between the chosen spike models indicate that certain regions are significantly affected by the national upheaval, while others benefit from considering them individually.

Appendix

0.1 Missing data

In this section, we discuss missing data in more detail. The data set consists of 6627 events that began during 2013. Each event has a start date, though only 6591 have an end date recorded. The latest end date is 2014-02-13. Because the events missing an end date appear to be missing at random, we omit them from our analysis. Of the 6627 events, 3044 are missing data on the number of protesters present, and even in cases where this data is not missing, it is frequently an estimate. For this reason, we do not include the “number of protesters” as a variable in any of our models. Similarly, data on the number of arrests, injuries, and deaths is missing from most of the events. For example, the data set contains only 8 deaths total in 2013, a vast undercount. The data set also shows that for the eight most serious events (including one with 800,000 protesters) there were no arrests and no injuries, which is impossible. In general, the “number of civilians injured” variable had much less missing data than the variables for the number of civilians arrested/killed. This variable is coded ‘NA’ in only 65 of the events, but it takes the value zero more often than it should, as noted above. It is only positive in 212 events but takes a range of values, the highest of which is 122. Of the events with the “number of civilians injured” recorded as zero, we do not know how many actually had zero injuries.

The lack of data on the number of protesters is a clear limitation in our model, as is the extent of missing data in the “number of civilians injured” variable. The latter variable has been shown to have a statistically significant relationship to the number of protests in the USA [41], and for this reason, we do include this variable in our model. To our knowledge, no research paper has ever included the former variable, because of the difficulty in reliably measuring the number of protesters. We hope that future researchers will be able to study the relationship between the number of protesters, the extent of media coverage, and the effect on protest dynamics, if technological advances allow for these data to be correctly measured.

In our final model, the “number of civilians injured” variable is a statistically significant explanatory variable, whose inclusion slightly improves the model. Due to the missing data issues, we urge the reader to use caution interpreting this variable as it relates to protests in Ukraine and to consider the model in [41], on a data set that exhibits less cause for concern regarding missing data for this variable. Even without reliable data on the number of protesters, our analysis still shows self-excitation in the number of protests, analogously to the situation of [41].

0.2 Cross-correlation analysis

In this section, we discuss the idea behind cross-correlation analysis [47]. First, we pick either of the two-time series, e.g., it, and compute shifted versions of it. For example, it−1 means we shift the entire column down by one, so there’s no data at time 1, and at time 2 the data recorded is i1, etc. Similarly, it+1 means we shift the column up by one so that at time 1 we record i2, etc. Secondly, we write down all the correlations, first between the column pt and the column it, then between the column pt and the column it−1 (hence, the number p1, of events on the first day, is dropped, as it would be paired with i0, which doesn’t exist), then between pt and it−2, etc., and also between pt and it+1, it+2, etc. For each shift, h, we record the correlation between pt and ith. Finally, we look for a spike in this collection of correlations, e.g., if the strongest correlation is between pt and it−1 then this means the number of civilians injured on any given day (time t − 1) is associated with the number of events on the next day (time t).

Carrying out this plan is slightly more complicated in practice because there could be some exogenous variables that affect both time series. To solve that issue, the standard technique, pre-whitening, is to first model one of the time series (say pt) as a function of its own history, then write down a new time series of residuals rt of that model, which should be white noise (i.e., a random time series with no dependence on its own history, and no dependence on any exogenous variable). This process provides a filter that transforms pt into rt. We apply that filter to it to transform it into , and compute the correlation between rt and . This correlation shows how related pt and it are once we remove any dependence on anything else. We repeat this process for each lag, e.g., ith becomes , and again write down these correlations for every h.

0.3 Voting Based Hawkes Configuration with no Exogenous Events

Fig 10. The predicted number of events per day in each oblast weighted by the susceptible population without exogenous events. Nsec = 100, de = 3.84, Tex = 15.8, c = 2.3, d = 2.6, p = 2.

thumbnail
Fig 10. The predicted number of events per day in each oblast weighted by the susceptible population without exogenous events.

Nsec = 100, de = 3.84, Tex = 15.8, c = 2.3, d = 2.6, p = 2.

https://doi.org/10.1371/journal.pone.0301639.g010

Dedication

An unfortunate consequence of the Euromaidan revolution has been the current war between Ukraine and Russia. We would like to dedicate this work to the people of Ukraine.

Acknowledgments

We are indebted to the anonymous referees whose careful review significantly improved this paper. We would like to express our deepest appreciation to Leonid Berlyand for his help with this project in the fall of 2021, and we thank the Center for Social and Labor Research for providing the data used in this project.

Software archive

The code used to generate all graphs and models presented in this paper can be found at: https://github.com/bahidyassin/Protest-Modeling.

References

  1. 1. Masters J. Ukraine: Conflict at the Crossroads of Europe and Russia Council of Foreign Relations; 2022 https://www.cfr.org/backgrounder/ukraine-conflict-crossroads-europe-and-russia
  2. 2. Ukraine Partnership and Cooperation Agreement. https://cordis.europa.eu/article/id/2672-euukraine-partnership-and-cooperation-agreement, EU/Ukraine Partnership and Cooperation Agreement—Cordis
  3. 3. Marples D. Ukraine’s Euromaidan: Analyses of a Civil Revolution. Ibidem Verlag; 2015.
  4. 4. Polityuk P, Balmforth R. Yanukovich declared Winner in Ukraine Pol. The Independent; 2010. https://www.independent.co.uk/news/world/europe/yanukovich-declared-winner-in-ukraine-poll-1899552.html
  5. 5. Ukraine Drops EU Plans and Looks to Russia. Aljazeera. 2013. https://www.aljazeera.com/news/2013/11/21/ukraine-drops-eu-plans-and-looks-to-russia/
  6. 6. Lemos CM, Coelho H, Lopes RJ. Agent-based Modeling of Social Conflict, Civil Violence and Revolution: State-of-the-art Review and Further Prospects. In:EUMAS; 2013.
  7. 7. Epstein JM. Modeling civil violence: An Agent-based Computational Approach. Proceedings of the National Academy of Sciences. 2002;99(suppl 3):7243–7250. pmid:11997450
  8. 8. Andreev A, Borodkin L, Levandovskii M. Using Methods of Non-linear Dynamics in Historical Social Research: Application of Chaos Theory in the Analysis of the Worker’s Wovement in Pre Revolutionary Russia. Hist Soc Res. 1997;22(3/4 (83)):64–83.
  9. 9. Berestycki H, Nadal J, Rodríguez N. A Model Of Riots Dynamics: Shocks, Diffusion And Thresholds. Networks and Heterogeneous Media. 2015;10.
  10. 10. Alsulami A, Glukhov A, Shishlenin M, Petrovskii S. Dynamical Modelling of Street Protests Using the Yellow Vest Movement and Khabarovsk as Case Studies. Scientific Reports. 2022;12(1):20447. pmid:36443352
  11. 11. Quek HY, Tan KC, Abbass HA. Evolutionary Game Theoretic Approach for Modeling Civil Violence. IEEE Transactions on Evolutionary Computation. 2009;13(4):780–800.
  12. 12. Gavrilets S. Collective Action Problem in Heterogeneous Groups. The Royal Society. Philos Trans R Soc Lond B Biol Sci. 2015;370(1683):20150016. pmid:26503689
  13. 13. Bonnasse-Gahot L, Berestycki H, Depuiset M, Gordon MB, Roché S, Rodríguez N, et al. Epidemiological Modelling of the 2005 French riots: a Spreading Wave and the Role of Contagion. n. Scientific Reports. 2018;8(1):107. pmid:29311553
  14. 14. Khosaeva ZK. The mathematics Model of Protests. Computer Research and Modeling. 2009;13(4):780–800.
  15. 15. Caroca Soto P, Cartes C, Davies TP, Olivari J, Rica S, Vogt-Geisse K. The Anatomy of the 2019 Chilean social Unrest. Chaos: An Interdisciplinary Journal of Nonlinear Science. 2020;30(7). pmid:32752607
  16. 16. Dimarco G, Perthame B, Toscani G, Zanella M. Kinetic Models for Epidemic Dynamics with Social Heterogeneity. Journal of Mathematical Biology. 2021;83(1):4. pmid:34173890
  17. 17. Tench S, Fry H, Gill P. Spatio-temporal Patterns of IED Usage by the Provisional Irish Republican Army. European Journal of Applied Mathematics. 2016;27(3):377–402.
  18. 18. Loeffler C, Flaxman S. Is Gun Violence Contagious? A spatiotemporal Test. Journal of quantitative criminology. 2018;34:999–1017.
  19. 19. Brantingham JP, Yuan B, Herz D. Is Gang Violent Crime More Contagious than Non-Gang Violent Crime? Journal of quantitative criminology. 2021;37:953–977.
  20. 20. Campedelli GM, D’Orsogna MR. Temporal Clustering of Disorder Events During the COVID-19 Pandemic. PLOS One. 2021;16(4):e0250433. pmid:33886656
  21. 21. Ukraine Summary. The Encyclopedia Britannica; 2010. https://www.britannica.com/summary/Ukraine
  22. 22. Reznik O. From the Orange Revolution to the Revolution of Dignity East Eur Polit Soc. 2016;30(4):750–765.
  23. 23. Kutzio T. Nationalism, Identity and Civil Society in Ukraine. Communist and Post Communist Studies; 2010. https://www.britannica.com/summary/Ukraine
  24. 24. Official European Union Site European Union Eastern Partnership. 2009 https://www.eeas.europa.eu/eeas/eastern-partnership_en
  25. 25. European Neighbourhood Policy and Enlargement Negotiations (DG NEAR). https://neighbourhood-enlargement.ec.europa.eu/enlargement-policy/glossary/association-agreement_en
  26. 26. Association Agreement between the European Union and Ukraine. 2017. https://www.kmu.gov.ua/en/yevropejska-integraciya/ugoda-pro-asociacyu
  27. 27. Kotsyuba O. Ukraine’s Battle for Europe. The New York Times. 2013. https://www.nytimes.com/2013/11/30/opinion/ukraines-battle-for-europe.html
  28. 28. Lapatina A. Berkut officers who attacked protesters during Euromaidan given first ever prison sentences. https://www.kyivpost.com/post/8065.
  29. 29. 1 December 2013 Euromaidan riots. Wikipedia. 2013. https://en.wikipedia.org/wiki/1_December_2013_Euromaidan_riots
  30. 30. 11 December 2013 Euromaidan assault Wikipedia. 2013. https://en.wikipedia.org/wiki/11_December_2013_Euromaidan_assault
  31. 31. Ukrainian President Approves Strict Anti-Protest Laws. The Guardian; 2014. https://www.theguardian.com/world/2014/jan/17/ukrainian-president-anti-protest-laws.
  32. 32. 200,000 Mass in Ukraine in Defiance of Protest Curbs The Guardian; 2014. https://www.theguardian.com/world/2014/jan/17/ukrainian-president-anti-protest-laws.
  33. 33. Understanding Ukraine’s Euromaidan Protests Open Society Foundations; 2019. https://www.opensocietyfoundations.org/explainers/understanding-ukraines-euromaidan-protests
  34. 34. Europe’s New Battlefield The Economist; 2014. https://www.economist.com/briefing/2014/02/20/europes-new-battlefield.
  35. 35. Hewitt G. Ukraine Conflict: Tymoshenko Speech Ends Historic Day of Revolution The BBC News; 2024. https://www.bbc.com/news/av/world-europe-26302277
  36. 36. Ukraine Elections: Runners and Risks. BBC; 2014 https://www.bbc.com/news/world-europe-27518989
  37. 37. Smith-Spark L, Brumfield B, Krever M. Ukraine Signs EU Deal that Sparked Months of Upheaval, Extends Cease-Fire; CNN; 2014. https://www.cnn.com/2014/06/27/world/europe/ukraine-crisis/index.html
  38. 38. The Center for Social and Labor Research. Databases of protest events and reports. 2014. https://www.cslr.org.ua/en/databases-of-protest-events-and-reports/
  39. 39. The Center for Social and Labor Research. Ukrainian Protest and Coercion Data Project; 2023. https://www.cslr.org.ua/en/ukrainian-protest-and-coercion-data-project/
  40. 40. The Center for Social and Labor Research Ukrainian Protest And Coercion Data Codebook. 2015 http://cslr.org.ua/wp-content/uploads/2015/01/UPCD_Codebook_31-Dec-2012.pdf
  41. 41. Rodríguez N, White D. An Analysis of Protesting Activity and Trauma through Mathematical and Statistical Models. Crime Science. 2023;12(1):17.
  42. 42. Young T. 10 maps that Explain Ukraine’s Struggle for Independence; 2015. https://www.brookings.edu/blog/brookings-now/2015/05/21/10-maps-that-explain-ukraines-struggle-for-independence/.
  43. 43. EuroMaidan protester allegedly beaten by Berkut dies. 2013. https://khpg.org/en/1387893053
  44. 44. Fisher M. The Three Big Reasons that Protests Reignited in Ukraine The Washington Post; 2021. https://www.washingtonpost.com/news/worldviews/wp/2014/02/18/the-three-big-reasons-that-protests-reignited-in-ukraine/
  45. 45. Fisher M. War on the Streets of Kyiv. Ukrayinska Pravda; 2024. https://www.pravda.com.ua/articles/2014/02/18/7014151/
  46. 46. Interns. EuroMaidan Rallies in Ukraine (Jan. 19 updates); Kyiv Post; 2014.
  47. 47. Shumway RH, Stoffer DS. Time Series Analysis and Its Applications: With R Examples. 4th ed. Springer; 2017.
  48. 48. Ogata Y. Statistical Models for Earthquake Occurrences and Residual Analysis for Point Processes. Journal of the American Statistical Association. 1988;83(401):9–27.
  49. 49. Azizpour S, Giesecke K, Schwenkler G. Exploring the sources of default clustering. Journal of Financial Economics. 2018;129(1):154–183.
  50. 50. Mohler GO, Short MB, Brantingham PJ, Schoenberg FP, Tita GE. Self-Exciting Point Process Modeling of Crime. Journal of the American Statistical Association. 2011;106(493):100–108.
  51. 51. Zadeh A, Sharda R. How Can Our Tweets Go Viral? Point-Process Modelling of Brand Content. Information Management. 2022;59(2):103594.
  52. 52. Varol O, Ferrara E, Ogan CL, Menczer F, Flammini A. Evolution of online user behavior during a social upheaval. In: Proceedings of the 2014 ACM conference on Web science; 2014. p. 81–90.
  53. 53. Earl J, Martin A, McCarthy JD, Soule SA. The Use of Newspaper Data in the Study of Collective Action. Annu Rev Sociol. 2004;30:65–80.