Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Concurrent Bursty Behavior of Social Sensors in Sporting Events

  • Yuki Takeichi,

    Affiliation Department of Complex Systems Science, Nagoya University, Furo-cho, Chikusa-ku, Nagoya 458-8601, Japan

  • Kazutoshi Sasahara ,

    sasahara@nagoya-u.jp

    Affiliation Department of Complex Systems Science, Nagoya University, Furo-cho, Chikusa-ku, Nagoya 458-8601, Japan

  • Reiji Suzuki,

    Affiliation Department of Complex Systems Science, Nagoya University, Furo-cho, Chikusa-ku, Nagoya 458-8601, Japan

  • Takaya Arita

    Affiliation Department of Complex Systems Science, Nagoya University, Furo-cho, Chikusa-ku, Nagoya 458-8601, Japan

Concurrent Bursty Behavior of Social Sensors in Sporting Events

  • Yuki Takeichi, 
  • Kazutoshi Sasahara, 
  • Reiji Suzuki, 
  • Takaya Arita
PLOS
x

Abstract

The advent of social media expands our ability to transmit information and connect with others instantly, which enables us to behave as “social sensors.” Here, we studied concurrent bursty behavior of Twitter users during major sporting events to determine their function as social sensors. We show that the degree of concurrent bursts in tweets (posts) and retweets (re-posts) works as a strong indicator of winning or losing a game. More specifically, our simple tweet analysis of Japanese professional baseball games in 2013 revealed that social sensors can immediately react to positive and negative events through bursts of tweets, but that positive events are more likely to induce a subsequent burst of retweets. We confirm that these findings also hold true for tweets related to Major League Baseball games in 2015. Furthermore, we demonstrate active interactions among social sensors by constructing retweet networks during a baseball game. The resulting networks commonly exhibited user clusters depending on the baseball team, with a scale-free connectedness that is indicative of a substantial difference in user popularity as an information source. While previous studies have mainly focused on bursts of tweets as a simple indicator of a real-world event, the temporal correlation between tweets and retweets implies unique aspects of social sensors, offering new insights into human behavior in a highly connected world.

Introduction

Social media is an increasingly popular communication tool by which people have massive social interactions in cyberspace [1]. These interactions can have a significant effect beyond cyberspace, with real world consequences. A well-known example is that social media helped Arab Spring activists spread and share information, playing a key role in the ensuing revolutionary social movements [2]. As in this case, social media can interface between cyberspace and the physical world by globally connecting people and information in nontrivial ways, thereby leading to novel collective phenomena. The quantitative understanding of such collective phenomena is a central issue in the emerging field of computational social science.

Many social media studies have been conducted using Twitter, a popular social media that allows users to read, post, and forward a short text message of 140 characters or less (called tweets). These studies have focused on the characteristics and effects of Twitter, such as the structural properties of user networks [3, 4], the nature of online social interactions [5, 6] and information diffusion [7, 8], collective attention [9, 10] and collective mood [11, 12], users’ behavior related to particular real-world events [13, 14], and the prediction of the stock markets [15].

In this paper, we focused on Twitter as a network of social sensors to investigate, a novel collective phenomenon empowered by social media. Fig 1 shows a schematic illustration of how social sensors work, in which Twitter users actively sense real-world events and spontaneously mention these events by posting tweets, which immediately spread over user networks in cyberspace. Such information cascades can be amplified by chains of retweets (re-posted tweets) from other users or followers. Consequently, Twitter as a whole can behave like a network of social sensors, exhibiting distinct collective dynamics linked with target events.

thumbnail
Fig 1. Schematic illustration of a network of social sensors.

Nodes in cyberspace represent social sensors (Twitter users). The thick arrows represent social sensors (grey nodes) sensing a real-world event, and thin arrows represent the corresponding information cascades by means of tweet and retweet.

https://doi.org/10.1371/journal.pone.0144646.g001

Similar ideas have been tested in several different settings, most of which are in the context of the real-world event detection on Twitter. For example, Sakaki and Matsuo monitored earthquake-related tweets and trained a statistical learning model with these data; they were successful (96% accuracy) in detecting earthquake events of the Japan Meteorological Agency of a seismic intensity scale three or more [13]. Social sensors under emergency situations such as large earthquakes and Tsunamis were studied to demonstrate distinct retweet interactions [10]. Twitter data during sporting events were also analyzed in a variety of settings. For example, Zhao et al. studied Twitter for real-time event detection during US National Football League (NFL) games and reported a detection accuracy of 90% in the most successful case [16]. Other studies developed methods for event detection from bursts of tweets related to football games by using a keyword frequency approach [17, 18] and tweets about Olympic Games by using a non-negative matrix factorization approach [19].

These studies share the hypothesis that Twitter is a mirror of reality and mainly focus on either bursts of tweets or retweets to identify spontaneous reactions of people to events in the physical world. However, little is known about the more unique nature of social sensors that cannot be explained solely by these bursts of tweets or retweets. Tweets and retweets, by nature, convey different kinds of information: tweets are more linked with what users want to say about real-world events, whereas retweets are more linked with what users are aware of in cyberspace. Thus, the concurrent bursts of tweets and retweets would be a novel indicator of collective behavior. The objective of this study was to determine the function of these concurrent bursts in social sensors.

Materials and Methods

Dataset

We targeted major sporting events for the study of social sensors. This is because, as shown by the previous studies, natural disasters and major sporting events tend to strongly attract people’s attention, which gives rise to a large volume of tweets and retweets. While natural disasters are largely unpredictable events, sporting events are scheduled and therefore allow data to be collected systematically. Therefore, major sporting events were suitable targets for our aim.

Using the Twitter Search API (https://dev.twitter.com/docs/api/), which allows 180 queries per 15-min window, we compiled a dataset of tweets related to Japan’s 2013 Nippon Professional Baseball (NPB) games, including at least one hashtag of NPB teams such as #giants (Yomiuri Giants) and #rakuteneagles (Tohoku Rakuten Golden Eagles). These hashtags were selected by reference to a hashtag cloud site (http://hashtagcloud.net). This hashtag-based crawling with multiple crawlers allowed us to obtain the nearly-complete data regarding these sporting events: 528,501 tweets surrounding 19 baseball games from the Climax Series (the annual playoff series) and from the Japan Series (the annual championship series) in the 2013 NPB. We also collected tweets related to Major League Baseball (MLB) games in 2015, including at least one hashtag of the MLB teams such as #Yankees and #BlueJays. The hashtags were selected by reference to Official Twitter account of the MLB (https://twitter.com/mlb). We sampled 730,142 tweets from 17 games of New York Yankees from September 11 to 27, 2015. The NPB complete dataset was used to address Twitter as a social sensor network and the MLB sampling dataset was used to validate the results of the former analysis. The datasets are publicly available (http://dx.doi.org/10.7910/DVN/42V7E0).

Correlation Between Tweet and Retweet Burst Trains

Burst-like increases in tweets may arise when an event happens in the physical world, and that is what many previous researches on social sensors have shown. However, in such cases, the reaction is not limited to tweets alone. According to our observations, bursts of retweets often follow those of tweets when positive events happen in the physical world. If we assume tweet behaviors during a two-team sport, concurrent bursts of tweets and retweets would be repeatedly generated by the fans of the winning team and as a result tweet and retweet burst trains would be similar to each other with a little time lag. With this point in mind, one of the easiest ways to measure the similarity of tweet and retweet burst trains is to use a cross-correlation function [20]. Suppose xi is a tweet count series and yi is a retweet count series, where i = 1, ⋯, N. The cross-correlation function is defined as follows: where τ is the time lag, and and σx denote mean and variance, respectively. Its value ranges from -1 for complete inverse correlation to +1 for complete direct correlation. If xi and yi are not correlated, its value becomes around zero. In this study, xi and yi were counted by 10 sec for the NPB complete dataset and those were counted by 60 sec for the MLB sampling dataset. We set the maximum time lag 300 sec and adopted the maximum of rxy(τ) as a measure of correlation between the tweet and retweet count series, denoted by rmax. For statistical comparison, Fisher z-transformation [21] was applied to the resulting rmax value to convert to the normally distributed value Rmax. Thus, the greater Rmax indicates that tweet and retweet concurrent bursts highly correlate with each other.

Construction of Retweet Networks

The interactions of social sensors linked with major sporting events are examined using networks [22]. Using retweet data, we construct a retweet network as previously reported [10]. In the retweet network, each node represents a Twitter user and a directed edge is attached from user B to user A, if user B retweets a tweet originally posted by user A. If there is a user C’s retweet “RT @user B... RT @user A...,” then links are made CB and CA. In this network, retweet interactions among social sensors are represented and influential users (also known as hub users) whose tweets are preferentially retweeted by many users are represented as nodes with many incoming edges (in-degrees).

The resulting retweet networks are visualized in a force-directed layout algorithm in Gephi (https://gephi.org), so that users who retweet more frequently (i.e., more connections) can be placed closer together. The size of nodes is proportional to the logarithm of the number of in-degrees. In addition, cumulative in-degree distributions () are calculated from retweet networks to access their structural properties.

Results

Tweet and Retweet Bursts: An Example

Fig 2 shows an example of the tweet and retweet dynamics during a baseball game, the 6th round in the 2013 Japan Series, in which the Yomiuri Giants beat the Tohoku Rakuten Golden Eagles by a score of 4–2. We see many sudden increases of tweet and retweet counts for both teams, which are seemingly random spikes. However, we noticed special cases where the bursts of tweets and those of retweets simultaneously occurred, and each of these cases corresponded to the following events, respectively:

  1. The Eagles scored twice.
  2. The Giants scored third and turned the game around.
  3. The Giants added another run.
  4. The Giants won the game.

thumbnail
Fig 2. Example of tweet and retweet time series (counts per minute) for the sixth round in the 2013 Japan Series.

Red lines denote tweets and blue dashed lines denote retweets. The upper panel shows tweets for the Giants (#giants) and the lower panel for the Eagles (#rakuteneagles). See the main text for event (1)-(4).

https://doi.org/10.1371/journal.pone.0144646.g002

As shown in Fig 2, the concurrent bursts of tweets and retweets were generated more frequently in the context of the Giants (the winning team) than the Eagles (the losing team). Once a particular event happens during a game, users spontaneously post a scream of delight from the winning side and one of disappointment from the losing side. For example, during event (3), positive tweets such as “Oh goody!” and “Go-ahead homer!” were posted with #giants, whereas negative tweets such as “Oh, no...” and “Disaster!” were posted with #rakuteneagles. Without such events in a game, there was no strong bias against a tweet’s polarity, positive or negative.

This example shows that social sensors can immediately show reactions to a positive and a negative event by a burst of tweets; however, a positive event is more likely to induce a subsequent burst of retweets. Therefore, we assume that a correlation between tweet and retweet time series would work as a measure of collective positive reactions of social sensors, which may eventually correlate to the result of a game.

Tweet and Retweet Bursts During Games in the NPB

We study the above-mentioned hypothesis using the NPB dataset. To this end, we computed and compared Rmax for tweet and retweet time series, as defined in the Methods section, in 19 games from the Japan Series and the Climax Series for the Central and Pacific Leagues. Fig 3 shows examples of the correlation function (rxy(τ)) between tweet and retweet count series as a function of the time lag τ for the sixth round in the 2013 Japan Series, in which rxy reached the maximum at τ = 60 sec for the Giants and at τ = 100 sec for the Eagles.

thumbnail
Fig 3. Example of the correlation between tweet and retweet time series (rxy(τ)) for the six round in the 2013 Japan Series (cf. Fig 2).

https://doi.org/10.1371/journal.pone.0144646.g003

Fig 4 (left) shows the values of Rmax in tweet and retweet time series for the Giants (G) and the Eagles (E) across seven games in the Japan Series. In this figure, we can confirm that the winning teams have Rmax greater than that of the losing team in all games. Moreover, two interesting features are shown in Fig 4 (left): in the first round, Rmax for the Eagles was considerably smaller than that of the Giants, because the Eagles created scoring opportunities many times but failed to score a run; in the fifth round, both teams showed an equivalent Rmax value, because it was a closer game. These results seem reasonable because a greater Rmax value is associated with positive events such as a base hit or a home run.

thumbnail
Fig 4. Rmax between tweet and retweet time series for the 2013 Japan Series (A) and the 2013 Climax Series for the Central (B) and Pacific (C) Leagues. Red letters with an underline denote the winning team and blue letters denote the losing team. G: Yomiuri Giants, E: Tohoku Rakuten Golden Eagles, T: Hanshin Tigers, C: Hiroshima Toyo Carp, M: Chiba Lotte Marines, L: Saitama Seibu Lions.

https://doi.org/10.1371/journal.pone.0144646.g004

We then examined whether this property holds for other baseball games in the Climax Series. Fig 4B and 4C reveal that this property holds true, except in the case of three games: the second round in the Central League Climax Series and the fifth and seventh rounds in the Pacific League Climax Series. These exceptions were attributed to the non-stationary nature of tweet and retweet time series. In two of these exceptions, the fans of a losing team generated a single intense concurrent burst of tweets and retweets when a scoring event happened in the late inning of the game. The other exception was based on an extraordinary number of retweets about the Eagles’ victory in the Climax Series, which lowered the Rmax for the Eagles to below that of the losing team. In principle, Rmax cannot be applied to a non-stationary time series; therefore both cases are out of the application range. Overall, Rmax worked as a good indicator of the baseball game’s results in 16 out of 19 games. We also computed the time lag from the NPB dataset and the average time lag was 137±87 sec, at which correlation between tweet and retweet burst trains becomes maximum. There was not a significant difference in the time lag at Rmax between the winning team and the losing team (independent-samples t-test, n = 38, P = 0.059).

In Fig 5, we classified the computed Rmax values into two groups—one is the winning team group and the other the losing team group—and compared their means statistically. The analysis identified a significant difference between the two groups (independent-samples t-test, n = 38, P < 0.05), suggesting that greater Rmax values are related to winning games. Our hypothesis described above has now been statistically confirmed in the NPB dataset.

thumbnail
Fig 5. Boxplots of Rmax in the winning team group and the losing team group in the 2013 Japan Series and the 2013 Climax Series for the Central and Pacific Leagues, with a significant difference between two groups.

https://doi.org/10.1371/journal.pone.0144646.g005

Tweet–retweet Concurrency and Positive Events

Here, we examined how social sensors reacted to positive events in the NPB baseball games. We computed the relative occurrence frequency of ten baseball terms such as “hit” and “homer” (rposi), as probes of positive events, from all of the baseball data. As a result, rposi is 0.07 ± 0.03 for tweets and is 0.28 ± 0.18 for retweets, indicating that retweets are more biased toward positive information than tweets (independent-samples t-test, n = 38, P < 0.001). One expected result was that rposi for retweets would be higher in the winning team than in the losing team since retweets are used to convey positive information in a baseball game. Such correlation, however, was not confirmed (independent-samples t-test, n = 38, P = 0.096); in fact, rposi for retweets was higher in the losing team than in the winning team in 9 out of 19 games. These additional findings indicate that the number of positive tweets is not simply associated with wins or loses and that the timing or concurrency of tweet and retweet spikes (Rmax) are more indicative of positive outcomes of sporting events.

Tweet and Retweet Bursts During Games in the MLB

There is potential concern that the above finding would be an artifact caused by the different Twitter usage or custom of Japanese users. To confirm that this is not the case, we analyzed tweets sampled during New York Yankees games from September 11 to 27, 2015 (n = 17), mostly posted by English-speaking users, with the same setting. Fig 6 shows an example of tweet and retweet series counted by 60 sec for the first game in the above period, in which the Blue Jays beat the Yankees by a score of 11–5. In this figure, concurrent spikes of tweets and retweets were associated with chances to score or scoring events, which is similar to Fig 2. The resulting Rmax values for the MLB dataset in Fig 7A show that the winning team had Rmax values greater than those of the losing team in 15 out of 17 games. The two exceptions were seemingly due to the closeness of scores in the game. There is a significant statistical difference in Rmax values between the winning team and the losing team values in Fig 7B (independent-samples t-test, n = 34, P < 0.01). These results support our findings in that the concurrent bursts of tweets and retweets we observed are not, in fact, coincidental.

thumbnail
Fig 6. Example of tweet and retweet time series (counts per minute) for the Yankees vs. Blue Jays game on September 11, 2015.

Red lines denote tweets and blue dashed lines denote retweets. The upper panel shows tweets for the Yankees (#Yankees) and the lower panel for the Blue Jays (#BlueJays).

https://doi.org/10.1371/journal.pone.0144646.g006

thumbnail
Fig 7. Rmax values between tweet and retweet time series for the 2015 Major League Baseball (Yankees games from September 11 to 27).

(A) Rmax values by games (n = 17). Red letters with an underline denote the winning team and blue letters denote the losing team. Y: New York Yankees, B: Toronto Blue Jays, M: New York Mets, R: Tampa Bay Rays, W: Chicago White Sox. (B) Boxplots of Rmax in the winning team group and the losing team group, with a significant difference between two groups.

https://doi.org/10.1371/journal.pone.0144646.g007

Retweet Interactions Among Social Sensors

To examine active interactions between social sensors, we constructed retweet networks related to different events in the sixth round in the 2013 Japan Series using a combined data set of tweets with #giants and those with #rakuteneagles. As mentioned before, nodes represent Twitter users, who are fans of either team or baseball fans in general, and directed links represent official retweets between them.

In Fig 8, the retweet network (A) corresponds to event (1) where the Eagles got two runs in the second inning, and the network (B) corresponds to events (2) and (3) where the Giants turned the game around. These networks are composed of two main sub-networks, one is a cluster of the Giants fans (green) and the other is a cluster of the Eagles fans (blue). While a large amount of retweets were transferred within the same sub-networks (i.e., the fans of the same team), there were much fewer retweets between the different sub-networks. Interestingly, there were a few retweets with both hashtags. Moreover, the Giants cluster involves several hub users (large nodes) who are preferentially retweeted by many users, whereas only a single hub user existed in the Eagles cluster. It turned out that these hub users are either the official account for the teams or enthusiastic baseball fans.

thumbnail
Fig 8. Retweet networks and their cumulative in-degree distributions (Pcum(k)) in the sixth round of the 2013 Japan Series.

The retweet network (A) consists of data generated during 30 min from 19:17, in which more retweets were generated with #rakuteneagles. The retweet network (B) consists of data generated during 30 min from 20:16, in which more retweets were generated with #giants. Green lines and circles denote #giants and blue lines and circles denote #rakuteneagles.

https://doi.org/10.1371/journal.pone.0144646.g008

The bottom panels in Fig 8 show the cumulative in-degree distributions of the retweet networks (A) and (B), respectively. Both of the distributions exhibit a scale-free property Furthermore, the tails tended to shift to the right (i.e., greater k) on the winning side; that is, the tail is much longer in the Eagles cluster than the Giants cluster in (A), while the situation is opposite in (B).

These structural properties provide additional clues on how social sensors act, react, and interact to generate collective busty behavior during a sporting event. First, the scale-free property of retweet networks is indicative of a substantial difference in the popularity of social sensors as an information source for retweets. Second, the existence of two main sub-networks suggests that social sensors self-organized topic-based groups, in which they had a sense of belonging in their groups by using the same hashtag.

Discussion

We have demonstrated that social sensors respond preferentially to positive events in sporting events by generating concurrent bursts of tweets and retweets, the degree of which can be interpreted as a strong indicator of winning or losing a game. We think that such concurrent reaction occurs in a wide variety of settings but it is often weak or one-time occurrences, neither of which is a condition that fits our approach. Thus, we used major sporting events as ideal exemplars to illustrate the concurrent bursty behavior of social sensors that previous research has not addressed. A burst of tweets reflects a fast process where social sensors respond reflexively to real-world events, whereas that of retweets reflects a slower process where social sensors become aware and react selectively to the information posted about these events in cyberspace. As the latter process requires more attention and is highly context dependent, concurrent bursts of tweets and retweets are seemingly unlikely but possible during positive real-world events, as we have demonstrated. As seen in Fig 8, there are a few hub users (or influentials) who can cause data bias, therefore the amount of tweets (or retweets) cannot be a good measure for social sensors but the degree of the tweet–retweet concurrency can be a much robust measure. By incorporating this nature of concurrency with the conventional measures, we can develop a more accurate, reliable indicator of positive real-world events; otherwise, every single measure alone cannot work. Several exceptions observed in the baseball data suggest that the tweet–retweet concurrency is only one aspect of social sensors and that much remains to be discovered. Therefore, exploring real-world events by focusing on different features is indispensable for understanding the true complexity of social sensors. An extension of this study in this direction is also important for the development of an application of real-time social sensing, using humans as sensors, in the social media system of the future.

Our findings, however, do not necessarily hold true in other sporting events because different sports have different scoring dynamics [23]. For example, two-team sports such as baseball and football have detailed rules with a scoring mechanism that can prompt fans to be more aware of a game’s progress. This situation tends to elicit spontaneous, polarized tweet and retweet reactions to chances to score and scoring events among fans of different teams. In contrast, in multi-team sports like car racing, the rules are much simpler and there is no scoring mechanism, which may deprive fans of a chance to react to the progress of a race. In this situation, tweet and retweet reactions occur in a different fashion than with two-team sports. Furthermore, there are potential disadvantages of this method. As mentioned earlier, the long, stationary time series is necessary for the accurate estimation of Rmax. This is because the cross correlation function is a linear measure and it can poorly capture correlations between nonlinear signals; in such a case more advanced but perhaps more computationally expensive measures need to be employed. Our approach cannot work in non-popular sporting events, because people hardly tweet for such events and hence the amount of available tweets is not enough for analysis. Although several limitations are recognized, we think that the temporal correlation between tweets and retweets is a good measure to explore social sensors, and Rmax can be applied to a wider class of sporting events and probably other social events, such as presidential debates between two candidates.

Furthermore, the retweet networks for the baseball games exhibited a scale-free property of user popularity, with hub sensors (or influentials) who contribute to cascades of retweets, as with other retweet networks for meme diffusion [8] and for collective attention [10]. In addition, these retweet networks had sub-networks depending on the baseball teams, as with user networks for online political activity [6]. These sub-networks are interpreted as “topic-based groups” [24], in which people feel attached to the group or loosely connected to one another, by using the same hashtag. The common structural features of social sensor groups indicate the possibility of the same underlying design principle. To assess the generality of these results, further investigations are necessary using a wide variety of social events across various kinds of social media.

In conclusion, our simple analysis provides evidence that Twitter is a network of social sensors in that it allows people to immediately react to real-time events by tweeting and it is active in that people selectively retweet favorite posts, thereby yielding the spontaneous concurrent bursts of tweets and retweets that spread over scale-free user networks. Contrary to the well-tested analogy that “Twitter is a mirror of reality,” the results of this study imply the more unique aspects of social sensors, few of which have been quantitatively addressed so far. The accumulation of case studies of this kind is fundamental for computational social science to understand the complexity of human behavior in a highly connected world.

Author Contributions

Conceived and designed the experiments: YT KS. Performed the experiments: YT KS. Analyzed the data: YT KS. Wrote the paper: YT KS RS TA.

References

  1. 1. Miller G. Social scientists wade into the tweet stream. Science. 2011;333(6051):1814–1815. pmid:21960603
  2. 2. González-Bailón S, Borge-Holthoefer J, Rivero A, Moreno Y. The Dynamics of Protest Recruitment through an Online Network. Sci Rep. 2011;1:197. pmid:22355712
  3. 3. Kwak H, Lee C, Park H, Moon S. What is Twitter, a Social Network or a News Media? In: Proc. of the 19th International Conference on WWW; 2010. p. 591–600.
  4. 4. Bollen J, Goncalves B, Ruan G, Mao H. Happiness is Assortative in Online Social Networks. Artif Life. 2011;17(3):237–251. pmid:21554117
  5. 5. Grabowicz PA, Ramasco JJ, Moro E, Pujol JM, Eguíluz VM. Social Features of Online Networks: The Strength of Intermediary Ties in Online Social Media. PLoS ONE. 2012;7(1):e29358. pmid:22247773
  6. 6. Conover MD, Gonçalves B, Flammini A, Menczer F. Partisan Asymmetries in Online Political Activity. EPJ Data Sci. 2012;1(1):6.
  7. 7. Romero DM, Meeder B, Kleinberg J. Differences in the Mechanics of Information Diffusion Across Topics: Idioms, Political Hashtags, and Complex Contagion on Twitter. In: Proc. of the 20th International Conference on WWW; 2011. p. 695–704.
  8. 8. Weng L, Flammini A, Vespignani A, Menczer F. Competition Among Memes in a World With Limited Attention. Sci Rep. 2012;2:335. pmid:22461971
  9. 9. Lehmann J, Gonçalves B, Ramasco JJ, Cattuto C. Dynamical Classes of Collective Attention in Twitter. In: Proc. of the 21st International Conference on WWW; 2012. p. 251–260.
  10. 10. Sasahara K, Hirata Y, Toyoda M, Kitsuregawa M, Aihara K. Quantifying Collective Attention from Tweet Stream. PLoS ONE. 2013;8(4):e61823. pmid:23637913
  11. 11. Golder SA, Macy MW. Diurnal and Seasonal Mood Vary with Work, Sleep, and Daylength Across Diverse Cultures. Science. 2011;333(6051):1878–1881. pmid:21960633
  12. 12. Dodds PS, Harris KD, Kloumann IM, Bliss CA, Danforth CM. Temporal Patterns of Happiness and Information in a Global Social Network: Hedonometrics and Twitter. PLoS ONE. 2011;6(12):e26752. pmid:22163266
  13. 13. Sakaki T, Okazaki M, Matsuo Y. Earthquake Shakes Twitter Users: Real-Time Event Detection by Social Sensors. In: Proc. of the 19th International Conference on WWW; 2010. p. 851–860.
  14. 14. Borge-Holthoefer J, Rivero A, García I, Cauhé E, Ferrer A, Ferrer D, et al. Structural and Dynamical Patterns on Online Social Networks: The Spanish May 15th Movement as a Case Study. PLoS ONE. 2011;6(8):e23883. pmid:21886834
  15. 15. Bollen J, Mao H, Zeng X. Twitter Mood Predicts the Stock Market. J Comput Sci. 2011;2(1):1–8.
  16. 16. Zhao S, Zhong L, Wickramasuriya J, Vasudevan V. Human as Real-Time Sensors of Social and Physical Events: A Case Study of Twitter and Sports Games. In: Proceedings of CoRR; 2011.
  17. 17. Lanagan J, Smeaton AF. Using Twitter to Detect and Tag Important Events in Live Sports. Artificial Intelligence. 2011;p. 542–545.
  18. 18. Corney D, Martin C, Göker A. Spot the Ball: Detecting Sports Events on Twitter. In: Advances in Information Retrieval; 2014. p. 449–454.
  19. 19. Panisson Ae, Gauvin L, Quaggiotto M, Cattuto C. Mining Concurrent Topical Activity in Microblog Streams. In: Proc. of 4th Workshop on Making Sense of Microposts; 2014. p. 3–10.
  20. 20. Box G, Edward P, Jenkins G. Time Series Analysis: Forecasting and Control. New Jersey, USA: Wiley; 2015.
  21. 21. Fisher RA. Statistical Methods for Research Workers. Edinburgh, UK: Oliver and Boyd; 1925.
  22. 22. Newman ME. Networks: An introduction. Oxford, UK: Oxford University Press; 2010.
  23. 23. Merritt S, Clauset A. Scoring dynamics across professional team sports: tempo, balance and predictability. EPJ Data Sci. 2014;3(1):4.
  24. 24. Ren Y, Kraut R, Kiesler S. Identity and Bond Theories to Understand Design Decisions for Online Communities. Academy of Management Proceedings. 2006;2006(1):B1–B6.