Figures
Abstract
The response of netizens toward controversial events plays an important guiding role in the development of events. Based on the identification of such responses, this study aimed to determine the critical outbreak time window of events. The microblog texts related to an event were divided into seven emotional categories via multi-emotional analysis to capture the subtle emotions of netizens toward an event, i.e., public opinion. By detecting the characteristics of the text and regional coverage of emotions, an emotional coverage index that reflects the intensity of emotional impact was proposed to determine the mainstream emotion of netizens. By capturing the mutation characteristics of the impact intensity of mainstream emotions, the critical time window of the public opinion toward the event was obtained. The experimental results demonstrated that the proposed method can effectively identify the critical outbreak time window of controversial events, which can help authorities in preventing the further aggravation of events.
Citation: Wang M, Wu H, Zhang T, Zhu S (2020) Identifying critical outbreak time window of controversial events based on sentiment analysis. PLoS ONE 15(10): e0241355. https://doi.org/10.1371/journal.pone.0241355
Editor: Sergi Lozano, Universitat de Barcelona, SPAIN
Received: September 18, 2019; Accepted: October 5, 2020; Published: October 29, 2020
Copyright: © 2020 Wang et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the manuscript and its Supporting Information files.
Funding: This work was supported by the National Natural Science Foundation of China (Grant No. 71473034), the Heilongjiang Provincial Natural Science Foundation of China (Grant No. LH2019G001), and the financial assistance from Postdoctoral Scientific Research Developmental Fund of Heilongjiang Province (Grant No. LBH-Q16003). All the funds were obtained by MW. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: SZ was affiliated with Asia Intelligence Technology Pte Ltd during the research period. There are no patents, products in development or marketed products associated with this research to declare. This does not alter our adherence to PLOS ONE policies on sharing data and materials. The authors have declared that no other competing interests exist.
Introduction
The upsurge of public opinion in controversial events severely threatens the social order and endangers public security. If the critical time window before the outbreak of public opinion can be determined in a timely manner, it will help authorities in identifying the problems reflected by the public opinion to take appropriate measures that can reduce the adverse social impact due to the further development of public opinion. With the popularity of mobile internet, new media based on the internet and social network platforms has gradually replaced traditional media and has become a mainstream form of media [1]. As a type of new media, microblogs have been developed and popularized rapidly in China [2]. An increasing amount of news and opinions are posted on Sina Weibo by the media and public, thereby making it an important platform for Chinese netizens to release public opinions [3]. In a series of controversial events in recent years, the network public opinion due to new media platforms such as microblogs has played an important role [4, 5]. Thus, the analysis of public opinion data related to controversial events on new media platforms can help in better capturing the development trend of events and provide valuable auxiliary support to the authorities to cope with the impact of the development of an event.
Researchers have stated that the response of netizens toward the public opinion of an event is an important factor affecting the evolution of the event. They pointed out that the emotional strength of netizens changes during the evolution of an event [6], and argued that the formation of online public opinion is impacted by the change in netizens’ views [7–9]. They also pointed out that emotions aroused by the online participation of the public on social media will further affect their online behavior [10]. Consequently, based on the cognitive appraisal theory [11], a cognition-emotion behavior process was proposed to explain this phenomenon [12]. The process shows that the appraisal of events can trigger certain emotional reactions [13] that can affect the behavioral intentions of an individual [14]. To detect the emotions of netizens toward events on social media, researchers constructed various emotional dictionaries [15–18] and established different machine learning models [19–24].
In conclusion, emotion plays an important role in guiding the development of the public opinion toward events. Moreover, different emotional categories have different impacts on the development of public opinion. Some researchers have pointed out that in the process of public opinion dissemination, the impact of negative emotions is often greater than that of positive emotions. This conclusion can be drawn even through the case study of positive events [25]. However, there are few studies that examined the role of negative emotions in public opinion early warning. In other words, few studies have predicted the outbreak of public opinion by monitoring the evolution of negative emotions. This paper presents a quantitative and in-depth study on this issue. By identifying the mainstream negative emotions that affect the development of public opinion, combined with the characteristics of the text and regional coverage of mainstream negative emotions, this study discusses the value of emotions in early warning of public opinion toward an event.
The contribution of this study is threefold. First, based on the examination of specific emotions expressed by emoticons, the microblog texts related to an event are divided into multiple emotions to obtain the subtle emotions of netizens for the event. Second, based on the investigation of the characteristics of text coverage and geographical coverage of different emotions, the mainstream negative emotions that play a key role in guiding the development of the public opinion toward an event are determined. Third, the critical time window before the outbreak of public opinion is determined by tracking the mutation of the intensity characteristics of mainstream negative emotions. The mutation of the intensity of mainstream negative emotions is an important signal for public opinion warnings. This mutation shows that the social contradictions reflected by the mainstream negative emotions have reached the edge of intensification and outbreak. Therefore, the time window of the sudden change in the intensity of mainstream negative emotions is the key time window before the outbreak of public opinion. After this time window, if the authorities do not implement timely and reasonable policies to intervene and control the public opinion toward an event, the event will quickly evolve into a more controversial event, which will cause a more malignant impact on the society and lives of the people to a greater extent.
In the next section, we present a literature review of the related works before describing the detailed method for identifying the critical outbreak time window of controversial events. We then present the experimental results and discussion to verify the proposed method. Finally, we summarize our work and provide future research directions.
Literature review
When an event occurs, social media platforms are often used as a channel by people in different parts of the world to express their views, feelings, and comments [26]. Consequently, the social media content released during the evolution of an event is increasingly used by news media and researchers to describe the public response to these events [26]. In this section, we summarize related research.
Researchers believe that social media plays an important role during the upsurge of controversial events [27]. Through its inherent multi-channel communication channels and highly interactive characteristics, social media has a tremendous impact on the event as well as the understanding of netizens toward the event [28, 29]. During the evolution of an event, social media may potentially influence the thoughts, behaviors, and reactions of people [30–33]. According to a telephone survey in China, 64.7% of the respondents participated in discussions and communication related to controversial events, and more than 40% participated in social media communication [34]. Thus, it can be concluded that social media users are no longer passive recipients of information, but active participants in event communication [35].
Emotions play a special role in the evolution of events, leading to certain intentional behaviors in social media. A fine-grained emotional analysis was proposed for Twitter to better capture the emotions of netizens during event management [36]. Some researchers proposed that the emotion strength of Twitter users may change during an emergency. They found that in the process of public opinion dissemination, negative emotions usually play a greater role than positive emotions, even for positive events [25]. Researchers also noted that people’s emotions toward an event change over time [37]. To detect the response of netizens toward events, researchers have constructed different emotional dictionaries for different social media platforms [15–18]. Various techniques of machine learning [19–24], deep learning [38–40], and natural language processing [41, 42] are also widely used in the emotional analysis of event-related texts [43–46].
In addition, researchers have discussed the regional emotional bias in the evolution of public opinion toward an event [47]. They explored the emotional prejudices in different language regions on the same event and found some interesting observations. For example, in the 2016 terrorist attacks in Paris, 16% more negative comments were written in English than in French, even though the incident originated in France [48]. Researchers also studied the communication between citizens and local governments during 18 snowstorms in Maryland, USA, to potentially help local governments determine the informational requirements of citizens [49].
These studies show that different emotions of netizens on social media will resonate differently among the public, and thus have different impacts on the development of events. However, we still lack a quantitative perspective to test whether there is a dominant emotion that can guide the development of public opinion and if the evolution of such a dominant emotion can help in issuing early warnings before the outbreak of public opinion.
This study aims to identify the specific dominant emotions that play a major role in the development of events and explore the early warning effect of these emotions on the outbreak of public opinion. In contrast to previous studies, this study quantitatively analyzes the influence of emotion. By quantifying the text coverage and regional coverage characteristics of different emotions, we identify the mainstream emotions that play a leading role in the development of public opinion toward an event. By detecting the mutation of the impact intensity of mainstream emotions, we can find the time window of the emotional outbursts of netizens. The time window is the key time point before the outbreak of public opinion, which provides direct guidance for issuing early warnings related to public opinion.
Methods
This study mainly identifies the critical outbreak time window of the public opinion toward an event by monitoring the evolution of the emotions of netizens. Among them, identifying the mainstream negative emotions of netizens and monitoring the change in the impact intensity of these emotions are the key issues. This section provides a detailed process to solve the aforementioned problems. Taking event A as an example, Fig 1 shows a schematic diagram of the method proposed in this paper.
There may be some special symbols in the original microblog texts that are not valuable for our experiments. Thus, before performing the following experiments, we first removed all unnecessary symbols from the collected microblog texts.
- The web link URL and HTML format code that appear in the microblog texts were filtered out.
- The forwarding tag and username after the symbol “@” were deleted to avoid the possible influence of these tags on the following emotional analysis.
- The symbols “#” and the data between them were deleted because they are basically unrelated to the event.
- The Jieba package in Python was used to segment the microblog texts after completing the above filtering operations. Furthermore, there are still some words, such as modal words and function words, which are not helpful in analyzing the texts. They were removed from the word lists to improve the efficiency and quality of text expression. Table 1 shows examples of modal words and function words that were deleted from the microblog texts in this study.
Multiple emotional classification of event-related microblog texts
Netizens express and exchange opinions on social platforms around events, which is essentially an emotional release of public opinion [50, 51]. Emotional analysis can identify the attitudes, thoughts, judgments, and emotions of netizens toward an event through event-related text data [24, 52].
Researchers have constructed emotional dictionaries for Twitter, Weibo, and other platforms by considering the official language and linguistic expression habits of different social media platforms [15–18, 53]. The emotional dictionary used for a microblog platform mainly relies on open source emotional dictionaries, including HowNet [54], the Chinese emotional vocabulary ontology database of the Dalian University of Technology [55] and NTUSD of the National Taiwan University [56]. In addition to emotional words, netizens often express their emotions via other symbols, such as emoticons, slangs, and hashtags. Researchers have widely discussed the role of these features in identifying the emotions of texts on Twitter [57–63]. Among them, emoticons, which have clear emotional meanings, are often widely used by netizens in microblog environments to express their feelings [64–66]. They are important emotional signals for microblog sentimental analysis.
In previous study, by summarizing the emoticons used by microblog users to express their specific emotions, we extracted emotional words from the sentences covered by each emoticon [67]. The emotional words extracted based on a given emoticon were considered to have the same emotional tendency as the emoticon. They were finally used to expand the Chinese emotional vocabulary ontology database proposed by the Dalian University of Technology. The vocabulary includes seven emotional categories of “Happiness,” “Like,” “Sadness,” “Disgust,” “Astonishment,” “Anger,” and “Fear.” In comparison to the traditional two or three types of emotions, the seven emotional categories can better express the subtle feelings of netizens toward an event. In this study, the expanded emotional dictionary was introduced to determine the emotions of netizens toward events. Simultaneously, the deep learning model, Word2vec, was used to vectorize the emotional words, followed by the vectorization of the sentences where the emotional words were located. Because Word2vec can capture the contextual information of emotional words in a microblog corpus [68], the emotional word vector and sentence vector trained by Word2vec can better express the semantic information of texts, thereby improving the performance of identifying the emotions of netizens. Table 2 presents the algorithm for classifying the event-related microblog texts into seven-element emotions.
Identifying critical outbreak time window
In different stages of the public opinion dissemination of an event, netizens express their feelings by posting comments about the event. Different emotions have different guiding effects on the evolution of events. Generally, negative emotions are more likely to resonate with netizens, and it plays a more important role in guiding the development of events [51]. Therefore, in the study of evolutionary characteristics of the public opinion toward an event, researchers focused on the identification of negative emotions [69–71]. However, this type of exploration is more concerned with the number of texts covered by negative emotions, while ignoring the regional characteristics of negative texts. In addition to the text, the geographic characteristics of netizens are also a very important feature that can reflect the scope of negative emotions in a region [46]. If a negative emotion extensively and evenly covers a large amount of area, it will further lead to more resonance among the netizens, thereby further stimulating the development of the negative influence of events. Some researchers recognized the importance of regional characteristics and incorporated the region dimension into the discussion of the development trend of events [72–74]. However, most of these studies focus on the discussion of local events based on the regional characteristics, without the discussion of mainstream negative emotions and public opinion outbreak warning.
In this study, both the text and geographical coverage characteristics of different emotions were considered to identify the mainstream negative emotions affecting the evolution of events. Considering city as the basic geographical unit, the geographic data of netizens when posting comments to an event was collected. Because the netizen response to such events is dynamic during different periods of an event, the event-related microblog texts were divided into different time windows based on the day as the basic unit of time. By analyzing the text and geographical coverage characteristics of each emotion in different time windows, the negative emotions with higher coverage in maximum time windows were identified as the mainstream negative emotions. The changes in the coverage characteristics of the mainstream negative emotions reflect the changes in attitudes and opinions of netizens in different regions. If the coverage of mainstream negative emotions suddenly surges, it implies that the social contradictions reflected by these emotions have aroused more resonance among netizens, which, to a large extent, will lead to a concentrated outbreak of social contradictions reflected by events. Therefore, the evolutionary characteristics of mainstream negative emotions play a more important role in guiding and predicting the development trend of events. Based on the identification of mainstream negative emotions, this study focuses on the mutation point of these emotions on the time axis, and considers the time window of the mutation of mainstream negative emotions as the critical time window before the event outbreaks.
Several definitions are given before providing the algorithm for identifying the critical outbreak time window. In the following definitions, symbol t is used to represent the time window with a value ranging from 1 to n. n denotes the total number of time windows divided by the current event; that is, the duration of the event from germination to extinction. Symbol i denotes the emotion with a range of values ranging from 1 to 7.
Definition 1.
Text coverage ratio TCRti: Calculates the ratio of the number of event-related microblog texts, TCti, covered by emotion i to the total number of texts, TCt, in each time window.
(1)
The total number of texts, TCt, in one time window is the sum of texts covered by each emotion in that time window; that is
.
In Formula (1), a parameter β is added to the denominator to eliminate the data sparsity that may exist in a given time window. For example, if the netizens only post one microblog text in one time window, the text coverage ratio of the emotion category that the text is classified into would reach the maximum; that is, TCRti = 1. However, this situation has no statistical significance, and the value of TCRti calculated in this time window is not valuable for exploring the evolutionary characteristics of events. By observing the daily posts of the four events used in this experiment, we found that the maximum number of posts per day was just over 5000. Therefore, we added a sparsity parameter, β = 5000, to the denominator to eliminate the possible impact of data sparsity on the emotional analysis.
Definition 2.
Regional coverage ratio RCRti: it calculates the ratio of the number of regions, RCti, covered by each emotional category to the total number of regions, RCti, covered by all emotions in each time window.
(2)
The total number of regions, RCti, covered by different emotions in one time window is the sum of the number of regions covered by each emotion; that is,
.
The text coverage ratio, TCRti, and regional coverage ratio, RCRti, give the proportion of texts and regions covered by different emotions in different time windows. Evidently, higher values of TCRti and RCRti mean that a particular emotion was dominant among the netizens, thereby demonstrating that netizens from different regions express the same emotions about events by posting comments on social media platforms. However, the regional coverage ratio, RCRti, can only measure the geographical scope of emotional radiation and cannot express the uniformity of emotions in the geographic coverage. If a certain emotion has higher coverage and more uniform distribution in these areas, it can better explain that this emotion has been more widely recognized among netizens in different areas, and this emotion can be considered as the mainstream emotion of netizens in different areas.
To calculate the regional distribution uniformity of different emotions, we calculated the regional distribution index of emotions. The regional distribution index refers to the ratio of the number of texts covered by one emotion posted in each region to the total number of texts covered by the same emotion in each time window. Accordingly, the information entropy is calculated to measure the regional diffusion uniformity of different emotions.
Definition 3.
Regional distribution RDtij: In each time window, the number of event-related texts, TCtij, covered by emotion i in each different area accounts for the proportion of the total number of event-related texts, TCtij, covered by emotion i in the same time window.
(3)
TCtij represents the number of microblog texts covered by emotion i, posted in area j under the t-th time window. TCti represents the total number of microblog texts covered by emotion i that were posted under the t-th time window.
Based on the regional distribution, RDtij, of each emotion, the regional diffusion uniformity under different time windows was calculated by the information entropy.
Definition 4.
Regional coverage information entropy ERDti: Calculate the uniformity of the geographic coverage of each emotion in each time window.
(4)
The value range of ERDti is [0,1]. The larger the index value, the more evenly distributed the netizens are with this emotion; this implies that the emotion resonated among the netizens in different regions. The smaller the index value, the netizens are relatively concentrated in a few regions, indicating that such emotions have high regional concentration characteristics and will not cause large-scale diffusion. Comparatively, the larger the value of the index, the more significant the emotion will be in guiding the development of public opinion.
Combining the text coverage ratio, TCRti, regional coverage ratio, RCRti, and regional coverage information entropy, ERDti, of different emotions, this study proposes a comprehensive coverage index of emotions to measure the comprehensive impact intensity of emotions in different time windows. The values of, TCRti, RCRti, and ERDti are all between [0,1]. If these indicators are combined in an additive way, it may be difficult to clearly demonstrate the difference in the comprehensive coverage of each emotion. Therefore, in this study, these indicators were multiplied to highlight the differences in the comprehensive coverage of different emotions.
Definition 5.
Emotional coverage ECti: In each time window, the text coverage ratio, TCRti, regional coverage ratio, RCRti and regional coverage information entropy, ERDti of different emotions are synthesized to obtain the comprehensive coverage value of each emotion.
(5)
In a certain time window, if an emotion covers more event-related texts and has higher geographical coverage and higher geographical distribution uniformity, it implies that the emotion is resonated by more netizens from different regions. This type of emotion can be taken as the mainstream emotion of netizens in this time window and it will play an important guiding role in the evolution of public opinion.
Definition 6.
Mainstream emotion: The mainstream emotion is the emotion with the largest value of emotional coverage, ECti.
When the negative mainstream emotion mutates in a certain time window, its direct representation is that when the emotional coverage, ECti, of this emotion suddenly rises, the subsequent time window will be accompanied by an outbreak of events. The sudden increase in the emotional coverage, ECti, of negative mainstream emotion means that this type of emotion has increased sharply at the social level. This surge may induce the same emotional resonance among more netizens, which will lead to the evolution of events into a controversy and have a more adverse impact on the social life. To capture the time window of the surge in the comprehensive emotional coverage of mainstream emotion and obtain the critical outbreak time window before the outbreak of events, the concept of emotional trend ratio is proposed in this paper.
Definition 7.
Emotional trend ratio ETt: Calculate the ratio of the emotional coverage of mainstream emotion in the current time window to the average of the emotional coverage of the emotion in all previous time windows.
(6)
ECt indicates the comprehensive emotional coverage of the mainstream emotion in the current time window t and
represents the average emotional coverage of the mainstream emotion in all previous time windows.
By observing the value of the emotional trend ratio in the evolving process of events, we found that there exists a threshold that can monitor the outbreak of events. When the emotional trend ratio of the negative mainstream emotion is greater than the threshold in a certain time window, the public opinion toward an event breaks out. Based on this observation, we set the current time window when ETt begins to exceed the threshold as the critical outbreak time window before the outbreak of the events.
Definition 8.
Critical outbreak time window Tcritical: When ETt begins to exceed the threshold δ, that is ETt > δ, the current time window is defined as the critical outbreak time window of the events.
It is critical for governments to implement strategies to supervise events before their outbreak to reduce their negative consequences. Table 3 presents the algorithm for identifying the critical outbreak time window of events.
Experimental results and discussion
Dataset
Four typical controversial events were selected as examples for our experiments: “Wei Zexi,” “Nanny arson in Hangzhou,” “Mum who steals chicken leg,” and “Luo Yixiao.” Based on these four events, we explored the possibility of identifying the critical outbreak time window before the outbreak of events. All microblog texts related to the four events from the germination to the recession period were collected using a web crawler from the Sina Weibo platform. Data collection was conducted from April 28, 2016 to May 14, 2016; June 22, 2017 to June 30, 2017; May 31, 2016 to June 9, 2016; and November 28, 2016 to December 12, 2016. These four events reflect four different aspects of social life. Among them, the “Wei Zexi” incident aroused concern for “Putian Hospital”; the “Nanny arson in Hangzhou” incident reflects upon the human nature; the “Mum who steals chicken leg” incident reflects the social assistance mechanism; and the “Luo Yixiao” incident makes the netizens reflect on the existing donation mechanism. Considering the length of the revised manuscript, we describe the four incidents in detail in the “S1 File”
Results and discussion
According to the seven-element emotional classification process given by the algorithm in Table 2, the event-related microblog texts in each time window were divided into seven emotional categories. Considering the “Wei Zexi” incident as an example, we detected the emotional distribution of microblog texts related to the event in different time windows. The results are presented in Table 4. It shows that in different time windows, netizens present different and subtle emotional experiences to events. Under each time window, each emotion had different coverage characteristics on event-related texts.
Combining the text and regional coverage characteristics of each emotion, we examined the mainstream emotions for the four events. According to the algorithm proposed in Table 3, we calculated the text coverage ratio, TCRti, regional coverage ratio, RCRti, regional distribution, RDtij, and regional coverage information entropy, ERDti, for each emotion. Based on TCRti, RCRti, and RDtij, we calculated the emotional coverage, ECti, for each emotion. Figs 2, 3, 4, and 5 show the evolutionary trend of emotional coverage, ECti, for each emotion over time for each event. We adopted two y-axes to clearly demonstrate the changes in the coverage of each type of emotion in the process of event evolution. The left y-axis represents the number of event-related microblog texts and the right y-axis represents the value of emotional coverage, ECti, under each emotional category. We found that for the four events, the two negative emotions of “anger” and “sadness” always obtained higher values in ECti under each time window. These two emotions reflect the attitudes and opinions of most netizens towards an event, and they eventually lead to the outbreak of the event after being dominant among more netizens in a wider range. Therefore, these two negative emotions are the mainstream emotions dominating the development trend of public opinion in these four events. The results are consistent with the conclusions of previous research [25], which show that negative emotions outweigh positive emotions in the process of the public opinion dissemination of an event.
Based on the emotional coverage, ECti, of the two mainstream emotions of “anger” and “sadness,” we further calculated the emotional trend ratio, ETt, for these two emotions. Figs 6, 7, 8 and 9 show the changes in the ETt values of the two mainstream emotions during the evolution of the four events. We used two Y-axes to clearly demonstrate the changes in the ETt values of the two mainstream emotions in the process of event evolution. The left Y-axis represents the ETt value of “anger” and the right Y-axis represents that of “sadness”. The experimental results show that in the process of event evolution, the ETt value of the negative mainstream emotion has a significant pulse mutation in a specific time window. These specific time windows occur within 1–3 days before the event. This shows that before the outbreak of the event, the negative mainstream emotions of netizens undergo a significant mutation, specifically manifested such that more netizens from various regions resonated with this negative emotion and expressed their views on the event by publishing more posts. This mutation leads to the breakout of the event in a short period of time. The results show that identifying the mainstream negative emotions of netizens and monitoring the sudden changes in the emotional intensity can help in predicting the outbreak of events.
By observing the evolutionary trend of emotional coverage under the four events, we found that when the value of the emotional trend ratio, ETt, of mainstream emotions exceeded 150, all four events erupted and evolved into more controversial events in the subsequent time windows. Therefore, we set the value of the threshold δ to 150 in this study. When the value of the emotional trend ratio ETt of the mainstream emotions exceeds the threshold, δ, the event will erupt in the following time windows. The time window when ETt first exceeds the threshold δ is defined as the critical outbreak time window of the event. Considering the event of “Wei Zexi” as an example, it can be concluded that the critical outbreak time window of the event is May 1, 2016. The incident erupted on May 3, 2016 and reached its peak. For the other three events, i.e., “Nanny arson in Hangzhou,” “Mum who steals chicken leg,” and “Luo Yixiao,” the critical outbreak time windows identified by this method are June 23, 2016, June 1, 2016, and November 30, 2016, respectively. These three events broke out on June 26, 2016, June 3, 2016, and December 1, 2016, respectively, in the future time window near the identified critical time window.
According to the recognition results of the four representative events, the method proposed in this study could effectively identify the critical outbreak time window of the four events. In comparison to the method that uses the number of event-related microblog texts as an indicator of event outbreak, the proposed method can predict event outbreaks at least one day in advance, thereby accomplishing the purpose of prescreening event outbreaks. This prescreen can give more time to the authorities to adopt reasonable prevention, control, and supervision strategies before the event really breaks out with the aim of reducing the risks and hazards caused by the event.
Conclusions and future work
Conclusions
The effective identification of the critical time window before the outbreak of an event is of great significance for the relevant government departments to prevent and control the events in advance and reduce the losses caused by the events. During the evolution of the public opinion toward an event, the emotional response of netizens to the event plays an important role in forecasting and guiding the development trend of events.
Microblogs are the mainstream social platforms for Chinese netizens to express their views and exchange information, as well as the main platform for spreading event-related network public opinions in China. Taking four typical events that occurred in China in the recent years as an example, this study explored the possibility of identifying the critical outbreak time window before the occurrence of the events. Through the discussion of the subtle emotional tendencies of netizens toward the events, this study revealed that the different emotion categories of netizens have different impacts on the evolution of events. Some emotions, especially some negative emotions, play a direct guiding role in the development of an event, and can also predict the development trend of the public opinion toward the event. These emotions are the mainstream emotions reflecting the attitudes and views of most netizens. In this study, we propose several definitions to determine the mainstream emotions of an event by considering the text coverage and region coverage characteristics of each emotional category. By detecting the evolution of mainstream emotions, we found that the sudden increase in the emotional coverage characteristics of mainstream emotions can appropriately predict the outbreak of events in the future time window. The surge in the impact intensity of mainstream negative emotion shows that emotions induce more emotional resonance among netizens, which makes the event more controversial and has a more negative impact on the social life. Through the empirical analysis of the four events, it was found that the critical outbreak time window determined by the proposed method lies just before the real outbreak of the event, which shows the effectiveness of the proposed method. Simultaneously, it proves that emotional analysis is a valuable method for analyzing the development trend of events. The research work and conclusions of this study can help in issuing early warnings related to controversial events and help relevant management departments in implementing timely measures to monitor and intervene in the development of events. However, there are still some limitations to this work.
First, at present, the current research work can only help relevant management departments in obtaining information about the time of intervention; it cannot help in determining the appropriate object of intervention. The dissemination and development of events involve the participation of numerous netizens, but the roles and importance of different netizens are different. To help the government implement more targeted and effective intervention measures, it is necessary to identify the key disseminators affecting the dissemination of public opinion and focus on their supervision and intervention. Second, owing to the limitation of data access, we only selected four Chinese cases to complete the study. Due to regional/cultural differences, the conclusions of this study may not be directly extrapolated to case studies in other countries or regions. Third, this study selected 150 as the warning threshold, which is a conservative choice. If the emotional intensity of a certain time window reaches 150 times the previous average, it is considered a strong emotional mutation. For other cases, whether the threshold value determined in this study is appropriate is a problem worthy of discussion.
In addition, this study only selected four Chinese cases for analysis. Because microblogs are the most mainstream social media platform in China, all data included in this study were obtained from the Sina Weibo platform. Owing to the limitations of experimental conditions, this study does not discuss whether the particularity of social media platforms (such as Twitter) will affect the experimental results. However, many previous studies based on Twitter pointed out that the response of netizens is an important factor affecting the development of events. The research performed in this study is a further in-depth analysis of these conclusions. Our experimental results showed that the mutation of the negative mainstream emotion intensity of netizens indicates an outbreak of events. This conclusion further elucidates that the changes in the emotions of netizens play an important role in the development of events, and can be used as an important guiding factor for issuing early warning for events. This conclusion provides valuable information for researchers to explore the impact of emotions on the development of events.
Future work
In future work, we plan to further explore the applicability of the proposed method in more domestic and foreign cases. Through the analysis of these more extensive cases, we can explore whether regional/cultural differences will affect the early warning research of controversial events based on emotional analysis. Meanwhile, more case studies can also help in determining a more universal warning threshold, which can help in further promoting the application of the method in practice.
In addition to effectively determining the time factor of event warning, identifying the core communicators in the evolution process of events can also help to better monitor the development trend of events. Combined with the communication and reply relationship between microblog texts, a communication network of public opinion on events can be constructed. Through the analysis of the network structure, the key communicators in the process of public opinion dissemination can be identified. Combined with the multi-emotional analysis technology used in this study, we can identify the emotional tendencies of key communicators. In the process of event evolution, the emotional holders who are consistent with the mainstream negative emotions are considered as important personnel to be monitored. The combination of the identification of the outbreak time of an event and the identification of key communicators can help the relevant management departments to intervene in the events in a timely manner, restrain the further deterioration of events, and reduce the harmful impacts of events on social life.
References
- 1. Qi J. S., Liang X., Wang Y. & Cheng H. C. (2018). Discrete time information diffusion in online social networks: micro and macro perspectives. Scientific Reports, 8, 11872. pmid:30089814
- 2. Ding Z., Jia Y. & Zhou B. (2014). A survey of microblog data mining research. Computer Research & Development, 51(4), 691–706.
- 3. Zou X., Yang J., & Zhang J. (2018). Microblog sentiment analysis using social and topic context. PLoS ONE, 13(2), e0191163. pmid:29394258
- 4. Li L., Wu Y. B., Zhang Y. W. & Zhao T. Y. (2018). Time+User dual attention based sentiment prediction for multiple social network texts with time series. IEEE ACCESS, 7, 17644–17653.
- 5. Quattrociocchi W., Caldarelli G. & Scala A. (2014). Opinion dynamics on interacting networks: media competition and social influence. Scientific Reports, 4, 4938. pmid:24861995
- 6. Thewall M., Buckley K. & Paltoglou G. (2011). Sentiment in Twitter events. Journal of the American Society for Information Science and Technology, 62(2), 406–418.
- 7. Cambria E., Poria S., Gelbukh A. & Thelwall M. (2017). Sentiment analysis is a big suitcase. IEEE Intelligent Systems, 32(6), 74–80.
- 8. Jiang D. D., Luo X. F., Xuan J. Y. & Xu Z. (2017). Sentiment computing for the news event based on the social media big data. IEEE ACCESS, 5, 2373–2382.
- 9. Liu X. Y. & Liu C. (2018). Information and opinion leader mathematical modeling based on Microblog. IEEE ACCESS, 6, 34736–34745.
- 10. Luo Q. J. & Zhai X. T. (2017). “I will never go to Hong Kong again!” How the secondary crisis communication of “Occupy Central” on Weibo shifted to a tourism boycott. Tourism Management, 62, 159–172. pmid:32287749
- 11. Lazarus R. (1982). Thoughts on the relations between emotions and cognitive. American Psychologist, 37(9), 1019–1024.
- 12. Breitsohl J. & Garrod B. (2016). Assessing tourists’ cognitive, emotional and behavioural reactions to an unethical destination incident. Tourism Management, 54, 209–220.
- 13. Choi J. N., Sung S. Y., Lee K. & Cho D. (2011). Balancing cognition and emotion: Innovation implementation as a function of cognitive appraisal and emotional reactions toward innovation. Journal of Organizational Behavior, 32(1), 107–124.
- 14. Frijda N. H., Kuipers P. & ter Schure E. (1989). Relations among emotion, appraisal, and emotional action readiness. Journal of Personality and Social Psychology, 57(2), 212–228.
- 15.
Yang, X. Y. L., Xu, S. J., Wu, H. & Bie, R. F. (2018). Sentiment analysis of Weibo comment texts based on extended vocabulary and convolutional neural network. International Conference on Identification, Information and Knowledge in the Internet of Things, 147, 361–368.
- 16. Parlar T., Ozel S. A. & Song F. (2019). Analysis of data pre-processing methods for sentiment analysis of reviews. Computer Science-AGH, 20(1), 123–141.
- 17. Araque O., Zhu G. G. & Iglesias C. A. (2019). A semantic similarity-based perspective of affect lexicons for sentiment analysis. Knowledge-based Systems, 165, 346–359.
- 18. Yu C. M. Zhu X. Y., Feng B. L., Cai L. & An L. (2019). Sentiment analysis of Japanese tourism online reviews. Journal of Data & Information Science, 4(1), 89–113.
- 19. Zhang B. W., Xu X. F., Li X. T., Chen X. J., Ye Y. M. & Wang Z. J. (2019). Sentiment analysis through critic learning for optimizing convolutional neural networks with rules. Neurocomputing, 356, 21–30.
- 20. Chen R. C. & Hendry (2019). User rating classification via deep belief network learning and sentiment analysis. IEEE Transactions on Computational Social Systems, 6(3), 535–546.
- 21. Tang J. J., Hao S. L. & Qu W. Q. (2019). Sentiment analysis of online Chinese comments based on statistical learning combining with pattern matching. Concurrency and Computation-Practice & Experience, 31(10), e4765.
- 22. Alharbi A. S. M. & de Doncker E. (2019). Twitter sentiment analysis with a deep neural network: an enhanced approach using user behavioral information. Cognitive Systems Research, 54, 50–61.
- 23. Rani S. & Kumar P. (2019). Deep learning based sentiment analysis using convolution neural network. Arabian Journal for Science and Engineering, 44(4), 3305–3314.
- 24. Shayaa S., Jaafar N. I., Bahri S., Sulaiman A., Wai P. S., Chung Y. W., et al. (2018). Sentiment analysis of big data: Methods, applications, and open challenges. IEEE ACCESS, 6, 37807–37827.
- 25. Thelwall M., Buckley K. & Paltoglou G. (2011). Sentiment in twitter events. Journal of the American Society for Information Science and Technology, 62(2), 406–418.
- 26. Smith K. S., McCreadie R., Macdonald C. & Ounis I. (2018). Regional sentiment bias in social media reporting during crises. Information Systems Frontiers, 20(5), 1013–1025.
- 27. Schroeder A., Pennington-Gray L., Donohoe H. & Kiousis S. (2013). Using social media in times of crisis. Journal of Travel & Tourism Marketing, 30(1–2), 126–143.
- 28. Kietzmann J. H., Hermkens K., McCarthy I. P. & Silvestre B. S. (2011). Social media Get serious! Understanding the functional building blocks of social media. Business Horizons, 54(3), 241–251.
- 29. Lotan G., Graeff E., Ananny M., Gaffney D., Pearce I. & Boyd D. (2011). The revolutions were Tweeted: Information flows during the 2011 Tunisian and Egyptian revolutions. International Journal of Communications, 5(5), 1375–1405.
- 30. Schroeder A. & Pennington-Gray L. (2015). The role of social media in International tourist’s decision making. Journal of Travel Research, 54(5), 584–595.
- 31. Starbird K., Dailey D., Walker A. H., Leschine T. M., Pavia R. & Bostrom A. (2015). Social media, public participation, and the 2010 BP deepwater horizon oil spill. Human & Ecological Risk Assessment, 21(3), 605–630.
- 32.
Chen, R. & Sakamoto, Y. (2013). Perspective matters: sharing of crisis information in social media. In: Proceedings of the Annual Hawaii International Conference on System Sciences (HICSS), Waikoloa, HI, 2033–2041.
- 33. Dong T., Liang C. & He X. (2016). Social media and internet public events. Telematics and Informatics, 34(3), 726–739.
- 34. Xie Y., Qiao R., Shao G. & Chen H. (2016). Research on Chinese social media users’ communication behaviors during public controversial events. Telematics and Informatics, 34(3), 740–754.
- 35. Cheng J.W., Mitomo H., Otsuka T. & Jeon S. Y. (2016). Cultivation effects of mass and social media on perceptions and behavioural intentions in post-disaster recovery—the case of the 2011 Great East Japan Earthquake. Telematics and Informatics, 33(3), 753–772.
- 36.
Schulz, A., Thanh, T., Paulheim, H. & Schweizer, I. (2013). A fine-grained sentiment analysis approach for detecting crisis related microposts. In: Proceedings of the 10th International ISCRAM Conference, Baden-Baden,Germany, 846–851.
- 37.
Nagy, A. & Stamberger, J. (2012). Crowd sentiment detection during disasters and crises. In: Proceedings of the 9th International ISCRAM Conference, Vancouver, Canada, 1–9.
- 38.
Tang, D., Wei, F., Qin, B., Liu, T. & Zhou., M. (2014). Coooolll: a deep learning system for twitter sentiment classification. In: Proceedings of the 8th International Workshop on Semantic Evaluation, Dublin, Ireland, 208–212.
- 39.
Tang, D., Qin, B., Feng, X. & Liu, T. (2016). Effective LSTMs for targer-dependent sentiment classification. In: Proceedings of 26th International Conference on Computational Linguistics, Osaka, Japan, 3298–3307.
- 40. Tang D., Wei F., Qin B., Yang N., Liu T. & Zhou M. (2016). Sentiment embeddings with applications to sentiment analysis. IEEE Transactions on Knowledge and Data Engineering, 28(2), 496–509.
- 41.
Bontcheva, K., Derczynski, L., Funk, A., Greenwood, M. A., Maynard, D. & Aswani, N. (2013). Twitie: an open-source information extraction pipeline for microblog text. In: Proceeding of Recent Advances in Natural Language Processing, Hissar, Bulgaria, 83–90.
- 42. Wang M. Y., Leng D. T. & Ren J. J. (2019). Sentiment classification based on linguistic patterns in citation context. Current Science, 117(4), 606–616.
- 43. Cambria E. (2016). Affective computing and sentiment analysis, IEEE Intelligent Systems, 31(2), 102–107.
- 44. Li H. J., Fang W., & An H. Z. (2015). Words analysis of online Chinese news headlines about trending events: a complex network perspective. PLoS ONE, 10(3), e0122174. pmid:25807376
- 45. Han H., Zhang Y., Zhang J., Yang J., & Zou X. (2018). Improving the performance of lexicon-based review sentiment analysis method by reducing additional introduced sentiment bias. PLoS ONE, 13(8), e0202523. pmid:30142154
- 46. Al-Saffar A., Awang S., Tao H., Omar N., Al-Saiagh W., & Al-bared M. (2018). Malay sentiment analysis based on combined classification approaches and senti-lexicon algorithm. PLoS ONE, 13(4), e0194852. pmid:29684036
- 47. Ghosh S., Ghosh K., Ganguly D., Chakraborty T., Jones G. J. F., Moens M. F., et al. (2018). Exploitation of social media for emergency relief and preparedness: recent research and trends. Information Systems Frontiers, 20(5), 901–907.
- 48. Smith K. S., McCreadie R., Macdonald C. & Ounis I. (2018). Regional sentiment bias in social media reporting during crises. Information Systems Frontiers, 20(5), 1013–1025.
- 49. Hong L., Fu C., Wu J. & Frias-Martinez V. (2018). Information needs and communication gaps between citizens and local governments online during natural disasters. Information Systems Frontiers, 20(5), 2017–1039.
- 50. Wang X. F., Sheng S. & Lu Y. (2018). Analyzing public opinion from microblog with topic clustering and sentiment intensity. Data Analysis & Knowledge Discovery, 2(6), 37–47.
- 51. Wang G. H., Chi Y. X., Liu Y. J. & Wang Y. F. (2019). Studies on a multidimensional public opinion network model and its topic detection algorithm. Information Processing and Management, 56(3), 584–608.
- 52.
Yu, H. & Hatzivassiloglou, V. (2003). Towards answering opinion questions: Separating facts from opinions and identifying the polarity of opinion sentences. Proceeding of the 2003 Conference on Empirical Methods in Natural Language Processing, Sapporo, Japan, 129–136.
- 53. Hao X., An H., Zhang L., Li H., & Wei G. (2015). Sentiment diffusion of public opinions about hot events: based on complex network. PLoS ONE, 10(10), e0140027. pmid:26462230
- 54. Zhao Y. Y., Qin B. & Liu T. (2010). Sentiment analysis. Journal of Software, 21(8), 1834–1848.
- 55. Chen J. M., Lin H. F. & Yang Z. H. (2009). Automatic acquisition of emotional vocabulary based on syntax, CAAI Transactions on Intelligent Systems, 4(2), 100–106.
- 56. Li A. P., Di P. & Duan G. L. (2015). Document sentiment orientation analysis based on sentence weighted algorithm. Journal of Chinese Computer Systems, 36(10), 2252–2256.
- 57.
Davidov, D., Tsur, O. & Rappoport, A. (2010). Enhanced sentiment learning using twitter hashtags and smileys, Proceedings of the 23rd International Conference on Computational Linguistics, Beijing, China, 241–249.
- 58. Asghar M. Z., Kundi F. M., Ahmad S., Khan A. & Khan F. (2017). T-SAF: Twitter sentiment analysis framework using a hybrid classification scheme. Expert Systems, 35(1), e12233.
- 59.
Yamamoto, Y., Kumamoto, T. & Nadamoto, A. (2014). Role of emoticons for multidimensional sentiment analysis of Twitter. 16th International Conference on Information Integration and Web-based Applications & Services, Hanoi, Vietnam, 107–115.
- 60.
Schlichtkrull, M. S. (2015). Learning affective projections for emoticons on Twitter. 6th IEEE International Conference on Cognitive Infocommunications, Istvan Szechenyi University, Gyor, Hungary, 539–543.
- 61. Vidal L., Ares G. & Jaeger S. R. (2016). Use of emoticon and emoji in tweets for food-related emotional expression. Food Quality and Preference, 49, 119–128.
- 62.
Redmond, M., Salesi, S. & Cosma, G. (2017). A novel approach based on an extended cuckoo search algorithm for the classification of Tweets which contain emoticon and emoji. Proceeding of 2017 2nd International Conference on Knowledge Engineering and Applications, Imperial College, London, England, 13–19.
- 63. Samuel A., & Sharma D. K. (2018). A novel framework for sentiment and emoticon-based clustering and indexing of Tweets. Journal of Information & Knowledge Management, 17(2), 1850013.
- 64. Yang G., He H. B. & Chen Q. (2019). Emotion-semantic-enhanced neural network. IEEE-ACM Transactions on Audio Speech and Language Processing, 27(3), 531–543.
- 65. Hu Y., Zhao J., & Wu J. (2016), Emoticon-based ambivalent expression: a hidden indicator for unusual behaviors in Weibo. PLoS ONE, 11(1), 1–14. pmid:26800119
- 66. Dodds P. S., Harris K. D., Kloumann I. M., Bliss C. A., & Danforth C. M. (2011). Temporal patterns of happiness and information in a global social network: Hedonometrics and Twitter. PLoS ONE, 6(12), e26752. pmid:22163266
- 67. Wang M. Y., Wu H. & Jia X. T. (2019). Research on multi-emotional classification of weibo based on word2vec and extended emotional dictionary. Journal of Northeast Normal University (Natural Science Edition), 51(1), 55–62.
- 68. Hu K., Wu H. Y., Qi K. L., Yu J. M., Yang S. L., Yu T. X., et al. (2018). A domain keywork analysis approach extending Term Frequency-Keywork Active Index with Google Word2Vec model. Scientometrics, 114(3), 1031–1068.
- 69. Gaspar R., Pedro C., Panagiotopoulos P. & Seibt B. (2016). Beyond positive or negative: Qualitative sentiment analysis of social media reactions to unexpected stressful events. Computers in Human Behavior, 56, 179–191.
- 70. Wei-Dong H., Qian W. & Jie C. (2018). Tracing public opinion propagation and emotional evolution based on public emergencies in social networks. International Journal of Computers Communications & Control, 13(1), 129–142.
- 71. Bai H. & Yu G. (2016). A weibo-based approach to disaster informatics: incidents monitor in post-disaster situation via Weibo text negative sentiment analysis. Nat Hazards, 83(2), 1177–1196.
- 72. Mitchell L., Frank M. R., Harris K. D., Dodds P. S., & M D. C. (2013), The geography of happiness: connecting Twitter sentiment and expression, demographics, and objective characteristics of place. PLoS ONE, 8(5), e64417. pmid:23734200
- 73. Zhong Z. M., Guan Y., Li C. H. & Liu Z. T. (2018). Localized Top-k bursty event detection in microblog. Chinese Journal of Computers, 41(7), 1504–1516.
- 74. Zhang X., B., Lu X. A., Lian K. D., Liu J. & Liu Z. P. (2017). Method of microblog bursty events detection based on burst words regional analysis. Journal of Intelligence, 36(3), 98–103.