Emotion entrainment, which is generally defined as the synchronous convergence of human emotions, performs many important social functions. However, what the specific mechanisms of emotion entrainment are beyond in-person interactions, and how human emotions evolve under different entrainment patterns in large-scale social communities, are still unknown. In this paper, we aim to examine the massive emotion entrainment patterns and understand the underlying mechanisms in the context of social media. As modeling emotion dynamics on a large scale is often challenging, we elaborate a pragmatic framework to characterize and quantify the entrainment phenomenon. By applying this framework on the datasets from two large-scale social media platforms, we find that the emotions of online users entrain through social networks. We further uncover that online users often form their relations via dual entrainment, while maintain it through single entrainment. Remarkably, the emotions of online users are more convergent in nonreciprocal entrainment. Building on these findings, we develop an entrainment augmented model for emotion prediction. Experimental results suggest that entrainment patterns inform emotion proximity in dyads, and encoding their associations promotes emotion prediction. This work can further help us to understand the underlying dynamic process of large-scale online interactions and make more reasonable decisions regarding emergency situations, epidemic diseases, and political campaigns in cyberspace.
Citation: He S, Zheng X, Zeng D, Luo C, Zhang Z (2016) Exploring Entrainment Patterns of Human Emotion in Social Media. PLoS ONE 11(3): e0150630. https://doi.org/10.1371/journal.pone.0150630
Editor: Jose Javier Ramasco, Instituto de Fisica Interdisciplinar y Sistemas Complejos IFISC (CSIC-UIB), SPAIN
Received: January 20, 2015; Accepted: February 17, 2016; Published: March 8, 2016
Copyright: © 2016 He et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: IR05 dataset is available from ; CHI06 dataset is available from ; Sina Weibo dataset is available from ;  Mishne G (2005) Experiments with mood classification in blog posts. Proceedings of ACM SIGIR 2005 Workshop on Stylistic Analysis of Text for Information Access 19.  Leshed G, Kaye JJ (2006) Understanding how bloggers feel: recognizing affect in blog posts. CHI'06 extended abstracts on Human factors in computing systems: 1019-1024.  Zhang J, Liu B, Tang J, Chen T, Li J (2013) Social influence locality for modeling retweeting behaviors. Proceedings of the Twenty-Third international joint conference on Artificial Intelligence: 2761-2767.
Funding: This work was supported in part by the following grants: the National Natural Science Foundation of China under Grant Nos. 71402177, 71472175, 71025001, 61175040, and 71103180; the National Institutes of Health (NIH) of the United States of America under Grant No. 1R01DA037378-01; and the Ministry of Health under Grant No. 2013ZX10004218 and 2012ZX10004801. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Humans are emotional social beings from birth [1,2]. We transmit various emotional signals to communicate to and influence others. For instance, we usually unify our emotions to resist potential threats (e.g., unauthentic vaccination [3,4], illegal immigration , and bad customer experiences [6,7]) or to promote beneficial incidents (e.g., pro-social policies  and tobacco cessation ). In these scenarios, we always adjust our emotion states according to those of our friends via social interactions. These phenomena are typically conceptualized as entrainment, which was firstly identified by Huygens in 1665, and is generally defined as a tendency for two or more independent rhythmic processes to synchronize with each other [10–12].
Entrainment has been found to be particularly relevant to human emotions, and performs many important social functions. Firstly, emotion entrainment can promote more effective social communications by helping people “feel themselves into” another’s emotional episodes [13–15]. Through this communication process, humans both consciously and unconsciously transmit emotional signals that are essential for fostering social bonds and for maintaining good interpersonal relationships [16–18]. Secondly, emotion entrainment can help cultivate a kind of emotional culture . This functions as a social regulator that calibrates our practical comportment in socialization, and consequently leads to strong group commitments and solidarity. Furthermore, researchers recently uncover that empathy is often connected to entrainment in interpersonal interactions [20–22]. Therefore, the implications of emotion entrainment may promote a nuanced understanding of the processes underlying empathy.
Despite its importance, the principles or patterns of emotion entrainment, to date, are still poorly understood. Most of the existing studies merely explore the entrainment phenomenon in face-to-face interactions based on small-scale or controlled laboratory experiments [18,23,24]. How collective emotion entrains outside of in-person interactions in a large-scale, real world data setting is still unknown. Recently, the proliferation of various online social media platforms has provided entrainment investigation with huge amount of emotion-rich data. In addition, it has been uncovered that emotion cues can also be transferred through these avenues [25–27]. These two facts, together, have laid the groundwork for studying massive emotion entrainment beyond dyads.
However, there are still several other challenges for us to understand the principles of emotion entrainment. Firstly, emotion entrainment on a large scale involves a complex interplay, and often entails dealing with non-linear systems . Existing approaches, both in modeling and analysis, cannot deal with this problem well. Traditional approaches based on various entrainment models are usually computational complex, and the underlying assumptions often violate actual, real-world scenarios [28,29]. While, on the other hand, the more recent network analysis approaches inevitably lack enough detail about entrainment processes [30,31], and often do not distinguish entrainment directions. Secondly, though entrainment phenomena have been investigated from various dimensions (i.e., in-person interactions [32,33], cross-modality communication , and social norm calibration ), there lacks an effective model that can learn the principles governing entrainment processes well, and predict the future emotions of the targeting individuals or groups effectively.
To deal with the challenges presented above, in this paper, we elaborate a pragmatic framework that can characterize entrainment phenomenon and quantify its patterns on a large scale efficiently. Based on the datasets from large popular social media platforms, we primarily investigate (1) the rules and patterns of emotion entrainment outside of in-person interactions, and then evaluate (2) how different entrainment patterns benefit the prediction of individuals' future emotions. This work can provide significant insights into understanding the underlying dynamic process of large-scale online interactions and make more reasonable decisions regarding emergency situations, epidemic diseases, and political campaigns in cyberspace.
Community level entrainment
Previous research has uncovered that massive-scale emotional contagion occurs in online social networks . In this section we further attempt to clarify whether human emotions entrain in social media communities. For our study we use three large-scale, real world datasets, including two English datasets (IR05  and CHI06 ) from an emotion sharing blog platform named Livejournal (http://www.livejournal.com/), and one Chinese dataset (Sina Weibo ) from a microblogging platform called Sina Weibo (http://www.weibo.com/). The descriptive statistics of the three experimental datasets are summarized in S1 Table. To facilitate emotion entrainment analysis, we conduct polarity classification  on these datasets. The 1449 most commonly used mood labels in IR05 and CHI06 datasets are grouped into three emotion states, including positive (POS), neutral (NEU), and negative (NEG) (details are shown in S2 Table). While, for the Sina Weibo dataset, a Naive Bayes classifier  is trained to determine the three emotion states. During the computational process, we convert the emotion tags of online users into three discrete values [-1, 0, +1] that characterize negative, neural, and positive emotions respectively. However, an empirical study on emotion entrainment has proven difficult since the entrainment process in observational data is usually confounded by user heterogeneity (e.g., difference in user’s gender, age, and residence) [40,41]. Thus, to further make the results reliable, we have designed a randomized trial by randomly sampling 20k users who have labeled at least 3 emotion tags out of each dataset. As users have been randomly selected, their emotion states differ in expectation only through the entrainment process. With this manipulation, we can independently supervise emotion entrainment in social media communities, while simultaneously minimize user heterogeneity caused by selection bias and data incompleteness .
To understand the evolution patterns of online users’ emotions on the two platforms, we quantify the average emotion distance within a dyad as a function of the community life (Fig 1). Accordingly, we define a variable to measure the emotional distance between two users, where p(vit) and p(vjt) respectively represent the probability distribution of emotion states for user vi and user vj at timestamp t, while CE(*) calculates the cross-entropy  between them. Smaller 〈CE〉t value indicates shorter emotional distance between the two users. To supervise how emotional distance changes over entrainment process, we utilize Etr(vi → vj) = TE(vj → vi) to measure entrainment strength from user vi to user vj based on transfer entropy (TE) . Different from previous work, the proposed entrainment measure in this paper, which is asymmetric and allows differentiation in the direction of entrainment, can capture complex emotion dynamics without modeling the exact interactions within dyads. In addition, unlike existing studies that are almost concerned with aggregate measures for entrainment quantification [23,32], the measure presented in this paper allows more fine-grained characterization on the information flow among individuals. This key property enables us not only to study massive emotion entrainment at the community level, but also to examine different entrainment patterns at the peer level. For more details of the measurement, please refer to the Materials and Methods section.
Inset: entrainment evolution over time.
Fig 1 suggests that emotion entrainment occurs (drops in cross-entropy value) as the communities develop. At the inception of Livejournal, users in the community are emotionally dissimilar. As entrainment strength enhances (Insets of Fig 1(A) and 1(B)), emotion disparity among users decreases (blue curves in Fig 1(A) and 1(B)). In contrast, users in Sina Weibo experience a quite different entrainment process. Primarily, users are emotionally similar to each other. This anomalous phenomenon may be attributed to the time delay in data collection (about 5 months after the public release of Sina Weibo platform), which allows users on the platform to entrain for a certain period of time before our surpervision. Hereafter, collective emotions in Sina Weibo roughly undergo two entrainment cycles (Fig 1(C)). During this process, entrainment strength vibrates rhythmically (Inset of Fig 1(C)), which causes the entrainment phenomenon transient without entering a stable state.
As entrainment promotes rapport and social closeness , we subsequently turn to examine how emotions evolve as entrainment strength enhances. Specifically, we consider the emotional distance 〈CE〉T as a function of the minor value of reciprocal entrainment strength (Fig 2). Here, T is the total observation period in a dataset.
Regression lines are colored in blue, with coefficients of -0.08, -0.12 and -2.48 respectively for (a) (b) and (c).
We observe that as entrainment strengthens, the average emotional distance decreases (blue regression lines in Fig 2). This tendency indicates that users are more emotionally similar to each other under stronger entrainment process. Compared with that of Livejournal, the emotional distance in Sina Weibo drops more sharply as entrainment strength enhances (Fig 2(C), with the lowest regression coefficient of -2.48). Notably, this entrainment process develops in a moderate way, with the maximum entrainment strength value no more than 0.25. This observation is inline with previous contention that moderately rhythmic social interactions generally promote social closeness and positive experience .
Peer level entrainment
Understanding how social relationship and emotion interact may require further close scrutiny in peer level entrainment. Since whether emotion entrainment is directional or not is unclear yet, previous work simply eludes this issue by measuring entrainment in a symmetric way [23,47]. In this section, we primarily clarify this issue by probing the reciprocal entrainment strength within each user pair. If the strength of reciprocal entrainment differs significantly, then emotion entrainment can be considered as directional. Otherwise, it can be regarded as undirectional. Under this assumption, we introduce a variable EnDis to quantify entrainment difference for a given user pair vi and vj: (1) where, abs(*) and max(*) respectively calculate the absolute value and the maximum value of the function enclosed.
Given this definition, we calculate EnDis for each user pair and further show how it changes over entrainment strength (Fig 3). Experimental results indicate that there is significant disparity in reciprocal entrainment strength within dyads. The average EnDis values of IR05, CHI06, and Sina Weibo are 0.431, 0.439, and 0.828 respectively. These results demonstrate that emotion entrainment on these social media platforms is directional. Notably, this phenomenon is more significant in Sina Weibo, which indicates that single entrainment is more pervasive on this platform.
Here, EnDis is considered as a function over entrainment strength. Average EnDis in (a), (b), (c) are 0.431, 0.439, 0.828 respectively.
Further, from Fig 3, we find that there exists an uncannily correlation between EnDis and entrainment strength, i.e., EnDis decreases as entrainment strength increases (denoted by the blue regression lines in Fig 3). This observation implies a positive correlation in reciprocal entrainment, i.e., if an individual entrains strongly towards his interaction partner, then his partner is likely to entrain back in a similar way. The corresponding Pearson's Correlation Coefficients (PCCs) are 0.737, 0.728, and 0.192 respectively in Fig 3(A), 3(B) and 3(C) (p<0.0001). The lowest correlation coefficient in Sina Weibo implies that users on this platform more often entrain in single way. It has been revealed that social contacts are tied to intimate relationships that often assume the same or similar emotions . Thus, the positive correlation in dyadic entrainment may manifest that users bearing strong reciprocal entrainment relationship are more emotionally similar. Definitive answer for this issue necessitates delving deeper into peer level entrainment. To this end, at each timestamp t, we divided all user peers into three groups: (2) where, users in the group Dual and in the group Single respectively entrain in dual way and in single way, while users in the group None do not entrain at all. θt represents the average entrainment strength at timestamp t. This time-dependent division for entrainment patterns is imperative. Previous research suggests that tight entrainment process does not always guarantee good social experiences, while moderately rhythmic social interactions generally provide a more positive result . Therefore, it is unreasonable to simply use a unified threshold to distinguish different entrainment patterns. Comparatively moderate entrainment process at the current timestamp, though below a predefined threshold, may promote more efficient emotional interactions at other timestamps. As such, we examine entrainment patterns at the peer level with time-varying thresholds. In what follows, without ambiguity, we omit the superscript t (e.g., simplifying G(vit, vjt) to G(vi, vj)) to indicate a variable at timestamp t.
Given the group division according to Eq 2, we discern how users’ emotions evolve at different peer levels (top figures in Fig 4). Experimental results indicate that emotions are different for those who do not entrain to each other (p<<0.001 according to an independent two-tailed t-test). These results mean that users’ emotions tend to converge under the entrainment process. Also, we find that emotion distance in the group Dual is larger than that in the group Single at most of the time, though such difference diminishes gradually as entrainment process proceeds (after about two thirds of the total lifespan of the community). This finding appears to be contradictory since we previously reveal that reciprocal entrainment is associated with high emotion proximity.
Bottom figures correspond to the varying ratio of the number of Dual entrainment over the number of Single entrainment; purple dashed lines separate the two stages of relationship establishment: ‘Develop’ stage and ‘Maintain’ stage.
According to our analysis, the higher emotional difference in the Dual entrainment group can be explained by two competing hypotheses:
- Users mutually entrain when they are emotionally dissimilar, while switch to single entrainment once they become emotionally familiar.
- Users become emotionally similar through single entrainment, and afterwards they undergo mutual entrainment in a loose manner to sustain their relationships.
In order to tease these two hypotheses apart, we examine the probability that users experience dual entrainment ahead of single entrainment (Fig 5). Results suggest that users on Livejournal and Sina Weibo often entrain in single way initially (p = 60.6–75.1%). In Livejournal, single entrainment merely leads weakly ahead, and the time difference (leading or lagging) is relatively small for most of the whole process (as indicated by the sharp peak around zero week in Fig 5(A) and 5(B)). This means users in Livejournal experience relatively short-lived Single entrainment procedure, and quickly switches to Dual entrainment process. The pervasion of Dual entrainment in Livejournal can be considered as a reflection of its users' extroversion in online socialization, as has been found in other social media platforms [49–52]. In contrast, users in Sina Weibo more often experience Single entrainment initially (75.1%). Further, the leading time of this process over Dual entrainment is more smoothly distributed (Fig 5(C)). This compound fact indicates that emotion interactions on this platform necessitate more proactive engagement.
To gain further insights into how Dual entrainment and Single entrainment impact users’ emotions, we examine the relative ratio between these two processes and supervise how it changes along with emotion evolution. At each timestamp t, we define a variable , where #Dualt and #Singlet stand for the number of user pairs engaging in Dual entrainment and Single entrainment respectively (bottom figures in Fig 4).
From Fig 4, we can find that despite having different lifespans, peer entrainments on the two platforms all follow a determined two-stage process: an increase in dual entrainment followed by a decrease trend. As Rt approaches its peak, the emotional distance in dyads decreases continuously (in Sina Weibo, the emotional distance decreases after reaching the second crest). We conjecture that this process corresponds to the relationship establishment stage, where emotion proximity develops in dyads. After the peak, entrainment switches to proceed in single way and the emotional distance further decreases. This indicates that users begin to sustain their established relationships. During this process, we notice that entrainment strength in Dual entrainment process is weaker than that in Single manner (p<0.05 according to an independent one-tailed t-test). This finding implies that moderate entrainment is more beneficial in establishing social relationships, yet sustaining it warrants more endeavor. According to these findings, we surmise that the establishment of online social relationships typically undergoes three entrainment stages, i.e., Single-Dual-Single. Originally, relationship establishes with an individual proactively entrains towards another (Single entrainment); then the relationship develops via mutual adaption in users’ emotions (Dual entrainment); finally, it sustains mainly in one way entrainment (Single entrainment).
Another meaningful finding we obtained from Fig 4 is that the time of fulfilled Dual entrainment (i.e., the second stage) is not fixed on different platforms. In Livejournal, it takes about half of the community lifespan to undergo Dual entrainment (i.e., establishing social relationships). In contrast, users in Sina Weibo experienced a relative short period of Dual entrainment process (about one third of the community lifespan) before morphing into Single entrainment procedure. In addition, users in Sina Weibo are more emotionally similar at the end of the entrainment process. These facts together ascertain that the establishment of social relationship in Sina Weibo is more efficient.
The above observations suggest that entrainment patterns in dyads are possible to inform something about their emotion proximity. For instance, if a user pair enters Single entrainment after experiencing Dual entrainment process for long, these pairwise users are very likely to have established an intimate social relationship, and they may assume similar emotions in the near future. This knowledge permits a better surpervision and prediction on the emotion evolution of online communities. This issue, if well tackled, may help to prevent potential negative effects incurred by emergency situations [53,54], epidemic diseases , and political campaigns [8,55,56]. In what follows we will harness these insights in the setting of an emotion prediction task.
By analyzing emotion entrainment in social media both at the community level and the peer level, we have uncovered some key principles governing the interactions between entrainment and emotion dynamics. We turn now to showing that these principles are predictive of emotional proximity within dyads, and could be used to leverage emotion prediction. Since online communities manifest stronger emotions than face-to-face interactions , emotion prediction can help us to understand online communications and further take corresponding counteractions timely. Also, emotion prediction can promote social coherence [58,59] by identifying “isolated” users early and recruit them in the community.
Most traditional classifiers learn and make predictions on each sample in isolation. However, modeling proximity relationships between samples enables us to leverage coherence [60–63], i.e., similar samples may share the same status. Entrainment network delivers insights into understanding social relationships (Fig 6 Panel A), yet existing work is incapable of providing a flexible way to encode such information. Thus, we elaborate an entrainment augmented factor graph model (hereafter EnFG model) to encode entrainment information and all other features into a unified framework. This model establishes relationships between entrainment information and user emotions by defining various factor functions, and then makes tractable inference via factorizing the “global” probability as a product of such “local” functions ((Fig 6 Panel B)). By elaborating the EnFG model, we show how entrainment information informs users' future emotions. This modeling framework can also be readily generalized to other emotion related tasks, such as relationship identification [18,64], social stratification , and affiliative community detection [66,67].
Panel A: entrainment network constitutes Dual entrainment and Single entrainment patterns; Panel B: EnFG model based on entrainment network.
This model involves three factor functions, including the modality factor g(yi, mij), the entrainment association factor g(Etr(vi → vj)), and the entrainment pattern factor g(G(vi, vj)). g(yi, mij) captures the correlation between the user vi’s jth modality mij and his emotion state yi, which is defined as: (4) where, m is the modality matrix, whose element mij correspods to the jth attribute of user vi. Previous research reveals that human emotions have memories  and are partly reflected by user’s collective behaviors [69–71]. Thus, when define the modality factor function, we mainly utilize two kinds of modalities: historical emotions and activity level. For the sake of efficiency and simplicity, we employ the n-gram models  to encode historical emotions. In this modeling scheme, we use unigrams (i.e., encoding each emotion state separately) to depict user emotion at each timestamp, while use bigrams (i.e., encoding two adjacent emotion states as a whole) to capture emotion dependency across different timestamps. In measuring user's activity level, we count the total amount of messages one individual posted at each timestamp. Considering the feature sparsity problem [52,73], we discretize the activity level of online users into three intervals: low, medium, and high. Thresholds separating the intervals are set respectively as one time and two times of the average posting number. At the same time, the activity level is also encoded with unigrams and bigrams. Finally, we differentiate users’ emotional and behavioral impacts occurring at distinct timestamps by combining each tag with its relative temporal index. All these features and those defined below are summarized in S3 Table.
g(G(vi, vj)) characterizes the entrainment pattern between users vi ∈ V and vj ∈ V, and is defined as: (6) where, G(vi, vj) is given by Eq 2.
α, β, and γ are respectively weights of the three different factor functions; θ = (α, β, γ) is a parameter configuration estimated from the training data; and Z is a normalization factor to ensure that the distribution is normalized so that the sum of the probabilities equal to 1. An example of the EnFG model is illustrated in Fig 6 Panel B.
To evaluate the proposed EnFG model, in each dataset (IR05, CHI06, and Sina Weibo), we select the top 10K users who have posted the most emotion tags to constitute the evaluation pool. In our experiments, we choose users’ historic modalities (i.e., emotion tags and activity levels) as the training data and use the learned model to predict their respective emotion state (i.e., positive, neutral, and negative) in the final timestamp. In the following experiment, each day is considered as a timestamp. For evaluation metric, we use prediction accuracy. All the following experiments are the average four-fold cross-validation results.
To clarify how EnFG performs compared with traditional approaches, we also employ several alternative approaches, including Naïve Bayes (NB), Maximum Entropy (ME) , Support Vector Machines (SVMs) , and Radial Basis Function Networks (RBFN) . Essentially, these comparison approaches do not provide a flexible way to capture association information. As such, to characterize Dual entrainment information, we group users through a chain clustering procedure. We then consider two users belong to the same group according to: (7) where, En0 is a similarity threshold.
To encode Single entrainment relationship in the comparison approaches, for each user vi, we select out his top K (= 5) most solely entrained neighbors (vj) and average their emotion distributions (dj) by weighting with the corresponding entrainment strength: (8) where, each dimension in dj corresponds to one emotion state, i.e., positive, negative or neutral.
Considering the data sparsity problem, Eq 8 is further discretized as: (9)
This entrainment modulated distribution is assumed to suggest the tendency of emotion evolution for each user.
Fig 7 represents the evaluation results for all approaches on the evaluation datasets. Experimental results suggest that entrainment information generally benefits emotion prediction task, albeit with a few exceptions. The performance gain may be attributed to the associations established between emotional similarity and different entrainment patterns. Compared with other two datasets, Sina Weibo dataset is confirmed to be the most predictable (3.77–11.47% higher in performance). We conjecture this fact is due to the higher efficiency in entrainment process in Sina Weibo, which engenders more coherence information to be leveraged in prediction. While, Dual entrainment information has more significant prediction impacts on IR05 and CHI06 (respectively 66.04% and 418.87% higher improvement than that in Sina Weibo). This suggests that entrainment plays an important role in social networks where users' emotions are publicly sensible to each other.
Significance of performance improvement over modality features according to a paired two-tailed t-test is indicated using *-notation (* = "p<0.05",** = "p<0.01").
Another experimental results indicate that distinguishing different entrainment patterns (Dual entrainment and Single entrainment) can further benefit prediction performance. While Dual entrainment information benefits most approaches employed in this experiment, Single entrainment consistently boost performance in our proposed EnFG model. Though the EnFG model does not perform well with mere modality features, it does predict user emotion with high accuracy by characterizing different peer entrainment information. In terms of accuracy, EnFG achieves a 3.84–5.08% higher performance compared with the four alternative approaches encoding the same feature sets. By incorporating entrainment information, EnFG boosts the original model with only modalities by 4.66–7.91% in precision. These results validate that EnFG model is both efficient and effective in characterizing entrainment information.
In this paper, we explore emotion entrainment in the context of two large social media platforms, including Livejournal and Sina Weibo. We study the emotion entrainment phenomenon at both the community level and the peer level. When examining the evolution of massive emotions on the two platforms, we find that collective emotions entrain with the evolution of communities. Especially, users' emotions in Sina Weibo roughly undergoes two entrainment cycles. During this process, entrainment strength vibrates rhythmically, making the entrainment phenomenon transient without entering a stable state. Additionally, we find that users become emotionally similar as entrainment enhances. This tendency indicates that users are more emotionally similar to each other under stronger entrainment process.
To discern entrainment closely, we examine entrainment at the peer level. We find that difference in reciprocal entrainment strength is significant on both platforms, indicating that emotion entrainment is directional. Specifically, this difference is most significant in Sina Weibo, which means users are customary to entrain asymmetrically on this platform. Also, we obtain that there exists a positive correlation in reciprocal entrainment. Since social contact is closely tied to emotional proximity, we are motivated to discern how emotions entrain in different peer patterns, i.e., Dual entrainment, Single entrainment and None entrainment. By studying the emotion evolution under different peer levels, we uncover that emotion difference is significant in dyads with no entrainment. In addition, we reveal that emotion difference in Dual entrainment is larger than that in Single entrainment. These findings indicate that users entraining in single way are more emotionally similar. To explain this superficially contradictory phenomenon, we calculate the relative time lag between Dual entrainment and Single entrainment and surprise the temporal ratio between them. Experimental results imply that the establishment of social relationship undergoes three stages, i.e., it launches with one individual proactively entrain towards his interaction partner, then develops mutually, and finally maintains mainly in single way. This finding is remarkable since it informs users’ future emotions through the lens of entrainment patterns. Although randomized trials are adopted in these analyses, it is notable that online user emotions may also be influenced by exogenous factors such as cross-platform interferes and unobservable offiline interactions. In these scenarios, our entrainment measurement are able to work. The proposed measurement is constructed based on transfer entropy that is sensitive to all order correlations. Furthermore, it aims to capture the underlying causation between two users rather than correlation. Therefore, this measurement setting is likely to gurantee relatively relabible results in open social systems.
Based on previous empirical findings, we then construct an entrainment augmented computational model for predicting user emotions. Different from existing work, our model provides a flexible way to encode different entrainment patterns by defining various factor functions. Experimental results prove that the proposed model is both efficient and effective in characterizing entrainment information. Furthermore, this modeling framework has practical implications by suggesting a possible way to encode entrainment information, and can be readily generalized to other emotion related analysis. What’s more, this work can help us to understand the underlying dynamic process of large-scale online interactions and make more reasonable decisions regarding emergency situations, epidemic diseases, and political campaigns in cyberspace.
Materials and Methods
Livejournal is a social media platform where users share passions and opinions under various kinds of topics. Aside from posting messages, this platform also allow users to label their moods. We suppose these mood tags correspond to users’ temporary emotion states, which are sufficient proxies for us to analyze the entrainment patterns. On this platform, IR05 spans a period of 5 years—from 2000 until 2005, and consists of 33K users and 624K mood tags in total. Also, it is emotional rich, with each user has published 24 posts and labeled 18 mood tags in average. CHI06 is an alternative dataset collected from Livejournal, which contains approximately 18 million English blog posts written by about 1.6 million bloggers. In average, each user writes 10 posts and labels 6 emotion tags. This dataset covers a 48 months period from 1st May 2001 to 23rd Apr. 2005. On the other hand, Sina Weibo is a Twitter-like microblogging system in China. With more than 40 million active users spreading approximately 100 million messages each day, this system is generally considered as an ideal laboratory for studying Chinese content. The Sina Weibo dataset contains 1.7 million users who generate about 12 million posts. The time span in this dataset is from 15th Jan. 2010 to 27th Nov. 2012. These three long period emotional rich datasets present us with an unprecedented opportunity to study emotion entrainment over users’ entire lifespans.
For the two platforms, including Livejournal and Sina Weibo, we consider that a user joins the online community after publishing his first post, and abandons it if he does not contribute any post for at least six months. To enforce this policy, in depicting the change in user base, we ignore users that have posted within the last six months in each community. Fig 8 shows a breakdown of active users showing the number of users joining and abandoning their communities over time. We find that the growth of user number follows by the exponential function (blue curves in Fig 8). In Sina Weibo dataset, however, the assumption for exponential growth only holds exclusive of data samples in the last half year (i.e., 2012–1 in Fig 8(C)). This fact is due to the cessation in exponential growth in user numbers, as reflected by the large portion of users abandoning the community in the last half year. Fig 9 illustrates the change in posting number which are divided into five groups, i.e., posts with positive emotion tags (POS), neutral emotion tags (NEU), negative emotion tags (NEG), unknown emotion tags within our emotion classification scheme (UNKNONWN), and no emotion tags (NOTAG). The result shows that the total number of posts also grows exponentially (blue curves in Fig 9).
Breakdown of active users each year. From bottom up: users that joined the community that year (and did not abandon the same year), users that joined and abandoned the community that year, users that abandoned the community that year (and did not join the same year) and other active users. Blue curve corresponds to fitted exponential function: y = a*exp(b*(t-t0)). ‘*’ means only partial data samples are used in curve fitting.
Breakdown of posts each year. From bottom up: posts with positive emotion tags (POS), neutral emotion tags (NEU), negative emotion tags (NEG), unknown emotion tags within our emotion classification scheme (UNKNONWN), and no emotion tags (NOTAG). Blue curve corresponds to fitted exponential function: y = a*exp(b*(t-t0)).
To protect user privacy, in our study, all datasets are anonymized by substituting real user ids with random number references. In Sina Weibo dataset, there is no emotion label for each post. Therefore, we have trained a Naive Bayes classifier to determine positive, negative or neutral polarity for each post. The training dataset  contains over 3.5 million labeled messages written by approximately 7K users from Sina Weibo. In designing the classifier, we choose to use unigrams and bigrams features since they proves to be effective in this dataset. Also, we take into account negation rules by adding a prefix ‘neg-’ for all features modified by a negation indicator . To tackle with polarity shifting [79,80] and the emergences of neologisms [81,82], a self-learning scheme  is implemented in the classifier. This scheme gradually enhances the trained classifier by augmenting training dataset with testing samples assigned with high prediction confidence. Overall, this classifier achieves an empirical precision of 72.18% with four-fold cross-validation on the dataset.
To construct the measurement framework for emotion entrainment, we mainly take two aspects into consideration. First, to make the framework reasonable and scalable for large datasets, we should assume as few hypotheses as possible. Second, whether emotion entrainment is directional or not is unclear yet, we should design the measurement asymmetrically to distinguish entrainment direction, and then validate its rationality with corroborating experimental evidence. As such, we adopt transfer entropy to guide the design of entrainment measure since this approach is asymmetric and can capture arbitrary nonlinear interactions well. As human emotions are found to have certain narrative memories , we can make predictions for each individual’s future emotions based solely on part of his emotion history. This means online user emotions satisfy the Markov property , and we can represent the history of emotions for each user x by a Markov process X = xt (S1 Fig). According to Schreiber , the transfer entropy from users y to x is given by: (10) where, , , while m and n are the orders (memory) of the Markov process X and Y respectively. For the sake of simplicity, we take m = n = 3 from this point on. According to our empirical analysis, this order is high enough, since higher order merely increases computational cost without present much quantitative differences. H(*) calculates entropy over a given probability distribution.
Eq 10 amounts to the uncertainty reduction by using the historical emotions of user y to predict the next emotion state of user x. This capacity to predict user x due to the knowledge of user y reflects the potential that user x adopts his emotions towards that of user y. Thus, the entrainment strength from user x to user y at the emotion level can be quantified as: (11)
Estimating Eq 11 based on finite data is prone to incur biases and statistical errors [86,87]. As such, we choose to use Simpson’s rule  as the entropy estimator, which gives more accurate approximation for entrainment strength Etr(X → Y) via piecewise quadratic. In what follows, we present the complexity analysis for this estimation procedure. According to Simpson’s rule, Eq 11 can be approximated by calculating the following composite Simpson quadrature: (12) where, the integration interval [a, b] is divided into n even subintervals (or grids), xi = a + ih, 0 ≤ i ≤ n, , and Etr'(x) corresponds to the derivation of function Etr(x). This computation is efficient with a complexity of O(N log N), where N equals to the number of grid points. This cost is acceptable in our datasets.
S1 Fig. Illustration of entrainment quantification.
The authors wish to thank Bo Xu, Changliang Li, Guanhua Tian and Hongwei Hao for useful discussions. This work was supported in part by the following grants: the National Natural Science Foundation of China under Grant Nos. 71402177, 71472175, and 71103180; the National Institutes of Health (NIH) of USA under Grant No. 1R01DA037378-01; and the Ministry of Health under Grant No. 2013ZX10004218.
Conceived and designed the experiments: SH XZ DZ. Performed the experiments: SH XZ. Analyzed the data: CL ZZ. Contributed reagents/materials/analysis tools: CL ZZ. Wrote the paper: SH XZ DZ.
- 1. Leon S, Nikov A (2010) Emotion-oriented eCommerce systems. WSEAS TRANSACTIONS on SYSTEMS 9: 594–606.
- 2. Chmiel A, Sienkiewicz J, Thelwall M, Paltoglou G, Buckley K, Kappas A, et al. (2011) Collective emotions online and their influence on community life. PloS one 6: e22207. pmid:21818302
- 3. Salathé M, Vu DQ, Khandelwal S, Hunter DR (2013) The dynamics of health behavior sentiments on a large online social network. EPJ Data Science 2: 1–12.
- 4. Salathé M, Khandelwal S (2011) Assessing vaccination sentiments with online social media: implications for infectious disease dynamics and control. PLoS computational biology 7: e1002199. pmid:22022249
- 5. Darcy O (Feb. 4, 2015) GOP Congressman on Why He Thinks Illegal Immigrants Might be to Blame for Measles Outbreak.
- 6. Oh O, Agrawal M, Rao HR (2013) Community intelligence and social media services: A rumor theoretic analysis of tweets during social crises. Mis Quarterly 37: 407–426.
- 7. Wasserman T (Sep. 01, 2011) How Toyota Used Social Media To "Digg" Itself Out of a PR Nightmare. Mashable.
- 8. Choy M, Cheong M, Laik MN, Shung KP (2012) Us presidential election 2012 prediction using census corrected twitter model. arXiv preprint arXiv:12110938.
- 9. Liang Y, Zhou X, Zeng DD, Guo B, Zheng X, Yu Z (2014) An integrated approach of sensing tobacco-oriented activities in online participatory media. IEEE Systems Journal: 1–10.
- 10. Ancona D (1996) Entrainment. Wiley Encyclopedia of Management.
- 11. Bortz B, Salazar S, Jaimovich J, Knapp RB, Wang G (2012) ShEMP: A mobile framework for shared emotion, music, and physiology. Proc the 3rd Int Workshop on Social Behavior in Music.
- 12. Lee C-C, Katsamanis A, Black MP, Baucom BR, Georgiou PG, Narayanan S (2011) An Analysis of PCA-Based Vocal Entrainment Measures in Married Couples' Affective Spoken Interactions. Interspeech. pp. 3101–3104.
- 13. Hatfield E, Rapson RL, Le Y-CL (2011) Emotional Contagion and Empathy. In: Decety J, Ickes W, editors. The social neuroscience of empathy. Boston, MA: MIT Press. pp. 19–30.
- 14. Bastiaansen J, Thioux M, Keysers C (2011) Mirror mechanisms in emotion processing. Mirror images. pp. 21.
- 15. Lee C- C, Katsamanis A, Black MP, Baucom BR, Christensen A, Georgiou PG, et al. (2014) Computing vocal entrainment: A signal-derived PCA-based quantification scheme with application to affect analysis in married couple interactions. Computer Speech & Language 28: 518–539.
- 16. Hawk S (2010) Changing channels: flexibility in empathic emotion processes. Journal of Personality and Social Psychology 32: 145–153.
- 17. Reidsma D, Nijholt A, Tschacher W, Ramseyer F (2010) Measuring multimodal synchrony for human-computer interaction. Cyberworlds (CW), 2010 International Conference on: 67–71.
- 18. Hess U, Fischer A (2013) Emotional mimicry as social regulation. Personality and Social Psychology Review 17: 142–157. pmid:23348982
- 19. von Scheve C, Ismer S (2013) Towards a theory of collective emotions. Emotion Review 5: 406–413.
- 20. Eisenberg N, Shea CL, Carlo G, Knight GP (2014) Empathy-related responding and cognition: A “chicken and the egg” dilemma. Handbook of moral behavior and development: Research 2: 63–88.
- 21. Hoffman ML (1984) Interaction of affect and cognition in empathy. Emotions, cognition, and behavior: 103–131.
- 22. Hoffman ML (2008) Empathy and prosocial behavior. Handbook of emotions 3: 440–455.
- 23. Mariooryad S, Busso C (2013) Exploring Cross-Modality Affective Reactions for Audiovisual Emotion Recognition. IEEE Transactions on Affective Computing: 1.
- 24. Sato W, Fujimura T, Kochiyama T, Suzuki N (2013) Relationships among facial mimicry, emotional experience, and emotion recognition. PloS one 8: e57889. pmid:23536774
- 25. Kramer AD, Guillory JE, Hancock JT (2014) Experimental evidence of massive-scale emotional contagion through social networks. Proceedings of the National Academy of Sciences: 201320040.
- 26. Fowler JH, Christakis NA (2008) Dynamic spread of happiness in a large social network: longitudinal analysis over 20 years in the Framingham Heart Study. Bmj 337.
- 27. Rosenquist JN, Fowler JH, Christakis NA (2010) Social network determinants of depression. Molecular psychiatry 16: 273–281. pmid:20231839
- 28. Lee C-C, Busso C, Lee S, Narayanan SS (2009) Modeling mutual influence of interlocutor emotion states in dyadic spoken interactions. INTERSPEECH: 1983–1986.
- 29. Metallinou A, Katsamanis A, Narayanan S (2012) A hierarchical framework for modeling multimodality and emotional evolution in affective dialogs. Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on: 2401–2404.
- 30. Danescu-Niculescu-Mizil C, Gamon M, Dumais S (2011) Mark my words!: linguistic style accommodation in social media. Proceedings of the 20th international conference on World wide web: 745–754.
- 31. Danescu-Niculescu-Mizil C, Lee L, Pang B, Kleinberg J (2012) Echoes of power: Language effects and power differences in social interaction. Proceedings of the 21st international conference on World Wide Web: 699–708.
- 32. Nenkova A, Gravano A, Hirschberg J (2008) High frequency word entrainment in spoken dialogue. Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Short Papers: Association for Computational Linguistics. pp. 169–172.
- 33. Davis M (1982) Interaction rhythms: Periodicity in communicative behavior: Human Sciences Pr.
- 34. Danescu-Niculescu-Mizil C, West R, Jurafsky D, Leskovec J, Potts C (2013) No country for old members: User lifecycle and linguistic change in online communities. Proceedings of the 22nd international conference on World Wide Web: International World Wide Web Conferences Steering Committee. pp. 307–318.
- 35. Mishne G (2005) Experiments with mood classification in blog posts. Proceedings of ACM SIGIR 2005 Workshop on Stylistic Analysis of Text for Information Access.
- 36. Leshed G, Kaye JJ (2006) Understanding how bloggers feel: recognizing affect in blog posts. CHI'06 extended abstracts on Human factors in computing systems: ACM. pp. 1019–1024.
- 37. Zhang J, Liu B, Tang J, Chen T, Li J (2013) Social influence locality for modeling retweeting behaviors. Proceedings of the Twenty-Third international joint conference on Artificial Intelligence: AAAI Press. pp. 2761–2767.
- 38. Wilson T, Wiebe J, Hoffmann P (2005) Recognizing contextual polarity in phrase-level sentiment analysis. Proceedings of the conference on human language technology and empirical methods in natural language processing: Association for Computational Linguistics. pp. 347–354.
- 39. McCallum A, Nigam K (1998) A comparison of event models for naive bayes text classification. AAAI-98 workshop on learning for text categorization: Citeseer. pp. 41–48.
- 40. Van den Bulte C, Lilien GL (2001) Medical innovation revisited: Social contagion versus marketing effort1. American Journal of Sociology 106: 1409–1435.
- 41. Bemmaor AC (1994) Modeling the diffusion of new durable goods: Word-of-mouth effect versus consumer heterogeneity. Research traditions in marketing: 201–229.
- 42. He S, Zheng X, Zeng D (2015) A model-free scheme for meme ranking in social media. Decision Support Systems 81: 1–11. pmid:26823638
- 43. De Boer P-T, Kroese DP, Mannor S, Rubinstein RY (2005) A tutorial on the cross-entropy method. Annals of operations research 134: 19–67.
- 44. Vicente R, Wibral M, Lindner M, Pipa G (2011) Transfer entropy—a model-free measure of effective connectivity for the neurosciences. Journal of computational neuroscience 30: 45–67. pmid:20706781
- 45. Miles LK, Nind LK, Macrae CN (2009) The rhythm of rapport: Interpersonal synchrony and social perception. Journal of experimental social psychology 45: 585–589.
- 46. Warner RM, Malloy D, Schneider K, Knoth R, Wilder B (1987) Rhythmic organization of social interaction and observer ratings of positive affect and involvement. Journal of Nonverbal Behavior 11: 57–74.
- 47. Varni G, Camurri A, Coletta P, Volpe G (2008) Emotional entrainment in music performance. Automatic Face & Gesture Recognition, 2008 FG'08 8th IEEE International Conference on: IEEE. pp. 1–5.
- 48. Roberts SG, Dunbar RI (2011) Communication in social networks: Effects of kinship, network size, and emotional closeness. Personal Relationships 18: 439–452.
- 49. Lampe C, Ellison NB, Steinfield C (2008) Changes in use and perception of Facebook. Proceedings of the 2008 ACM conference on Computer supported cooperative work: 721–730.
- 50. Pempek TA, Yermolayeva YA, Calvert SL (2009) College students' social networking experiences on Facebook. Journal of Applied Developmental Psychology 30: 227–238.
- 51. Subrahmanyam K, Reich SM, Waechter N, Espinoza G (2008) Online and offline social networks: Use of social networking sites by emerging adults. Journal of Applied Developmental Psychology 29: 420–433.
- 52. Weisbuch M, Ivcevic Z, Ambady N (2009) On being liked on the web and in the “real world”: Consistency in first impressions across personal webpages and spontaneous behavior. Journal of Experimental Social Psychology 45: 573–576. pmid:20161314
- 53. De Choudhury M, Monroy-Hernandez A, Mark G (2014) Narco emotions: affect and desensitization in social media during the mexican drug war. Proceedings of the 32nd annual ACM conference on Human factors in computing systems: ACM. pp. 3563–3572.
- 54. Qu Y, Huang C, Zhang P, Zhang J (2011) Microblogging after a major disaster in China: a case study of the 2010 Yushu earthquake. Proceedings of the ACM 2011 conference on Computer supported cooperative work: ACM. pp. 25–34.
- 55. Pasek J, Kenski K, Romer D, Jamieson KH (2006) America's Youth and Community Engagement How Use of Mass Media Is Related to Civic Activity and Political Awareness in 14-to 22-Year-Olds. Communication Research 33: 115–135.
- 56. Tumasjan A, Sprenger TO, Sandner PG, Welpe IM (2010) Predicting Elections with Twitter: What 140 Characters Reveal about Political Sentiment. ICWSM 10: 178–185.
- 57. Sia C- L, Tan BC, Wei K- K (2002) Group polarization and computer-mediated communication: Effects of communication cues, social presence, and anonymity. Information Systems Research 13: 70–90.
- 58. Bjarnason T (1998) Parents, religion and perceived social coherence: A Durkheimian framework of adolescent anomie. Journal for the Scientific Study of Religion: 742–754.
- 59. Martínez E, Kwiatkowski I, Pasquier P (2011) Towards a model of social coherence in multi-agent organizations. Coordination, Organizations, Institutions, and Norms in Agent Systems VI: Springer. pp. 114–131.
- 60. Haralick RM, Shapiro LG (1979) The consistent labeling problem: Part I. Pattern Analysis and Machine Intelligence, IEEE Transactions on: 173–184.
- 61. Flach B, Schlesinger MI (2000) A class of solvable consistent labeling problems. Advances in Pattern Recognition: Springer. pp. 462–471.
- 62. Kohli P, Torr PH (2009) Robust higher order potentials for enforcing label consistency. International Journal of Computer Vision 82: 302–324.
- 63. Agarwal S, Roychowdhury JS (2008) Efficient Multiscale Simulations of Circadian Rhythms Using Automated Phase Macomodelling Techniques. Pacific Symposium on Biocomputing. pp. 402–413.
- 64. Mackie DM, Smith ER, Ray DG (2008) Intergroup emotions and intergroup relations. Social and Personality Psychology Compass 2: 1866–1880.
- 65. Thamm R (1992) Social structure and emotion. Sociological Perspectives 35: 649–671.
- 66. Vakali A, Kafetsios K (2012) Emotion aware clustering analysis as a tool for Web 2.0 communities detection: Implications for curriculum development. World Wide Web Conference, WWW: Citeseer.
- 67. Li X, Chen H, Li S (2010) Exploiting Emotions in Social Interactions to Detect Online Social Communities. PACIS. pp. 136.
- 68. Garas A, Garcia D, Skowron M, Schweitzer F (2012) Emotional persistence in online chatting communities. Scientific Reports 2.
- 69. Watson D (2000) Mood and temperament: Guilford Press.
- 70. Hume D Emotions and Moods. Organizational Behavior: 258–297.
- 71. Dezecache G, Conty L, Chadwick M, Philip L, Soussignan R, Sperber D, et al. (2013) Evidence for unintentional emotional contagion beyond dyads. PloS one 8: e67371. pmid:23840683
- 72. Brown PF, Desouza PV, Mercer RL, Pietra VJD, Lai JC (1992) Class-based n-gram models of natural language. Computational linguistics 18: 467–479.
- 73. Burke M, Marlow C, Lento T (2010) Social network activity and social well-being. Proceedings of the SIGCHI Conference on Human Factors in Computing Systems: 1909–1912.
- 74. Berger AL, Pietra VJD, Pietra SAD (1996) A maximum entropy approach to natural language processing. Computational linguistics 22: 39–71.
- 75. Hearst MA, Dumais ST, Osman E, Platt J, Scholkopf B (1998) Support vector machines. Intelligent Systems and their Applications, IEEE 13: 18–28.
- 76. Park J, Sandberg IW (1991) Universal approximation using radial-basis-function networks. Neural computation 3: 246–257.
- 77. Zhao J, Dong L, Wu J, Xu K (2012) Moodlens: an emoticon-based sentiment analysis system for chinese tweets. Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining: ACM. pp. 1528–1531.
- 78. Jiang L, Yu M, Zhou M, Liu X, Zhao T (2011) Target-dependent twitter sentiment classification. Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies-Volume 1: Association for Computational Linguistics. pp. 151–160.
- 79. Kessler W, Schütze H (2012) Classification of Inconsistent Sentiment Words using Syntactic Constructions. COLING (Posters). pp. 569–578.
- 80. Wang F, Wu Y, Qiu L (2012) Exploiting Discourse Relations for Sentiment Analysis. COLING (Posters). pp. 1311–1320.
- 81. Fu G, Wang X (2010) Chinese sentence-level sentiment classification based on fuzzy sets. Proceedings of the 23rd International Conference on Computational Linguistics: Posters: Association for Computational Linguistics. pp. 312–319.
- 82. Lou X, Chai Y, Zan H, Xu R, Han Y, Zhang K (2013) Research on Micro-blog Sentiment Analysis. Chinese Lexical Semantics: Springer. pp. 466–479.
- 83. He S, Zheng X, Zhang C, Wang L (2011) Topic-oriented information detection and scoring. Intelligence and Security Informatics: Springer. pp. 36–42.
- 84. Markov AA (1953) The theory of algorithms.
- 85. Schreiber T (2000) Measuring information transfer. Physical review letters 85: 461. pmid:10991308
- 86. Hlaváčková-Schindler K, Paluš M, Vejmelka M, Bhattacharya J (2007) Causality detection based on information-theoretic approaches in time series analysis. Physics Reports 441: 1–46.
- 87. Kraskov A, Stögbauer H, Grassberger P (2004) Estimating mutual information. Physical review E 69: 066138.
- 88. Hildebrand FB (1987) Introduction to numerical analysis: Courier Corporation.