Analyzing information sharing behaviors during stance formation on COVID-19 vaccination among Japanese Twitter users

Sho Cho; Shohei Hisamitsu; Hongshan Jin; Masashi Toyoda; Naoki Yoshinaga

doi:10.1371/journal.pone.0299935

Abstract

To prevent widespread epidemics such as influenza or measles, it is crucial to reach a broad acceptance of vaccinations while addressing vaccine hesitancy and refusal. To gain a deeper understanding of Japan’s sharp increase in COVID-19 vaccination coverage, we performed an analysis on the posts of Twitter users to investigate the formation of users’ stances toward COVID-19 vaccines and information-sharing actions through the formation. We constructed a dataset of all Japanese posts mentioning vaccines for five months since the beginning of the vaccination campaign in Japan and carried out a stance detection task for all the users who wrote the posts by training an original deep neural network. Investigating the users’ stance formations using this large dataset, it became clear that some neutral users became pro-vaccine, while almost no neutral users became anti-vaccine in Japan. Our examination of their information-sharing activities during a period prior to and subsequent to their stance formation clarified that users with certain types and specific types of websites were referred to. We hope that our results contribute to the increase in coverage of 2nd and further doses and following vaccinations in the future.

Citation: Cho S, Hisamitsu S, Jin H, Toyoda M, Yoshinaga N (2024) Analyzing information sharing behaviors during stance formation on COVID-19 vaccination among Japanese Twitter users. PLoS ONE 19(12): e0299935. https://doi.org/10.1371/journal.pone.0299935

Editor: Ankit Gupta, CCET: Chandigarh College of Engineering and Technology, INDIA

Received: July 27, 2023; Accepted: November 1, 2024; Published: December 31, 2024

Copyright: © 2024 Cho et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: The Twitter data cannot be shared publicly due to Twitter’s (X Corp.) restrictions on redistributing Twitter Content to third parties (https://developer.twitter.com/en/developer-terms/agreement-and-policy). The tweet contents are accessible via the Twitter API provided by X Corp (https://developer.twitter.com/en). Additionally, our stance annotation data cannot be shared publicly. Instead, researchers may request our models solely for non-commercial purposes, specifically to reproduce our experimental results in their research. We provide two model data: a BERT-based language model which has undergone additional training on the Twitter data, and a stance classification model built upon the language model. These include codes and trained parameters. For any inquiries, please reach out to our group representative at contact@tkl.iis.u-tokyo.ac.jp.

Funding: This research was conducted as part of "COVID-19 AI & Simulation Project" run by Mitsubishi Research Institute commissioned by Cabinet Secretariat, JAPAN. The methods for analysis were developed with support from JST CREST Grant Number JPMJCR19A4 and JSPS KAKENHI Grant Number JP21H03445. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests: The authors have declared that no competing interests exist.

Introduction

Vaccination is thought to be one of the most powerful measures for containing outbreaks of infectious diseases such as COVID-19. When infectious diseases spread, wide acceptance of vaccines is necessary to prevent the spread of epidemic diseases. Since the 2019 novel coronavirus disease (hereinafter called COVID-19) pandemic, many countries recognized low vaccination coverage as a significant challenge [1].

In Japan, a country that had previously ranked among the nations with the lowest vaccine confidence globally [2], significant concerns existed regarding vaccine uptake, particularly among young individuals. According to a national survey on the intent to vaccinate against COVID-19 in February 2021, 32.9% answered “uncertain,” and 11.0% answered “no” regarding vaccination. However, following the beginning of the vaccination campaign on February 17, 2021, the percentage of individuals who received a second dose of the vaccine (fully vaccinated at the time) in Japan saw a remarkable surge, rising from 3.4% in June to 75.3% in October of the same year. This rapid increase put Japan in first place among G7 nations (Japan, Canada, Italy, France, United Kingdom, Germany, and the United States), as depicted in Fig 1. The vaccination campaign in Japan thus proceeded smoothly, with many individuals who initially hesitated to get vaccinated consequently opting to receive the vaccine.

Download:

Fig 1. Full vaccination coverage (1st & 2nd doses) of G7 countries based on data provided by Mathieu et al. [3].

https://doi.org/10.1371/journal.pone.0299935.g001

Examining successful vaccination campaigns, such as those in Japan, is crucial for understanding common factors influencing acceptance or hesitancy toward vaccines, and recently much research has studied these factors using social media data [4–14] while conventional approaches often rely on questionnaires and surveys [1, 2, 15, 16]. These investigations found several common reasons for vaccine hesitancy: anxieties about vaccine safety [4] and a lack of trust in vaccine efficacy [5]. Other studies looked into people’s attitudes towards vaccination, exploring factors such as cross-country variations in positive and negative views on vaccination [9], spatiotemporal shifts in sentiment towards vaccines in the US [8], and the polarization among different vaccination communities in Japan [6]. A later study found that the majority of negative sentiment in Twitter predominantly mentioned the coercive policies or vaccine mandates, rather than safety or efficacy concerns [10]. Several studies reported that SNS users’ stances toward vaccination were characterized by the information they obtained from SNS, such as external links [11, 12] or posts from other users [13, 14], resulting in wrong medical stances.

While these studies have enriched our understanding of overall characteristics and trends of public sentiment regarding COVID-19 vaccination, they assume that the stances of SNS users remain fixed throughout the study period, neglecting the process of stance formation of each user. We consider the stance formation as the process where users, initially without any stances on COVID-19 vaccination, shape their perspectives. In this study, users are classified into three stances: Pro-vaccine (those accepting or planning/getting vaccinated), Anti-vaccine (those avoiding vaccination), and Neutral (those without a specific stance on vaccination). We mainly focused on SNS users who were initially neutral and eventually held Pro-vaccine or Anti-vaccine stances and examined the contents shared by these users, including tweets from other users and external websites during their stance formation.

To draw lessons from Japan’s success in improving COVID-19 vaccine coverage, we conducted an analysis of users’ vaccination stances on Twitter for the purpose of identifying information-sharing behaviors that might have impacted their stance formations. We collected all Japanese tweets including a reference to vaccination over a span of five months when vaccine coverage rapidly grew (from June to October in 2021), which constituted a dataset of vaccination-related tweets. Within this dataset, we annotated a portion of the tweets with the stances of the users who posted them. Using this annotated dataset, we trained a classifier for stance detection, which is an NLP task [17] for predicting a stance (typically favor, against, and none) of given text toward a certain target. We developed a deep neural network utilizing both content and network features to assign vaccination stances to each tweet in the dataset and determined each Twitter user’s stance toward COVID-19 vaccines. Using this classifier, we periodically aggregated the predicted stances for each user to investigate how users’ stances on vaccination were shaped over time. This paper represents an extended version of our previously published work [18], in which we have enhanced the performance of our deep neural model and incorporated several new results into our analysis.

Our analysis is conducted as follows. We first examined the shift in stance distribution, discovering that the pro-vaccine group significantly outnumbered the anti-vaccine group. Subsequently, we investigated polarization in the user reaction graphs, and as a result, there was a gradual increase in the degree of polarization between the two factions over time. This finding suggested that the impact of anti-vaccine activity in Japan was limited. Finally, we focused on the users who were neutral at the beginning of the period, became pro-vaccine or anti-vaccine, and kept the stances. Our deeper exploration into their information-sharing behaviors allowed us to find potential factors that influenced their stance formation. One such user group, i.e., users who moved from neutral to pro-vaccine, exhibited a preference for more reliable information sources. These users frequently engaged with accounts of medical doctors and mainstream media outlets and shared links to external sites of mass media and web news. On the contrary, users who moved from neutral to anti-vaccine showed a tendency for alternative information sources. They regularly interacted with accounts of unclear or unknown occupation, and commonly shared links to bulletin board systems (BBSes), weblogs, and video hosting sites.

Related work

Research into public attitudes towards vaccination, particularly in relation to COVID-19, has leveraged social media data [7]. Using data from social media has many merits compared with traditional survey-based methods. For example, they make it possible to timely observe public opinions, leading to better understanding of vaccination intentions and attitudes regarding ongoing immunization campaigns [19]. We divided vaccine-related research using social media data into three categories: thematic analyses of vaccine-related discussions, assessments of polarization in vaccine debates, and observations of stance formation in vaccination attitudes.

Thematic analyses have been used to identify prevalent topics concerning vaccines and to investigate the underlying causes of vaccination hesitancy. In such work, anxiety about vaccine safety [4] and doubt in vaccine efficacy [5] were commonly regarded as the most common reasons. Research focused on polarization within vaccine debates has quantified the ratio of positive to negative perspectives on vaccine-related content [9], examined the network structure [20], and assessed the degree of polarization among different vaccine communities on social networks [6]. In addition, some studies have analyzed sentiment changes in response to specific vaccine-related events to find the topics influencing vaccination intentions. Hu et al. [8] conducted a study on the spatiotemporal patterns of public sentiment and emotion, tracking these factors over time at both national and state levels in the United States. They identified three distinct phases during the pandemic period where sharp changes in public sentiment and emotion occurred. It is noteworthy, however, that these studies on online vaccine debates [5, 20] did not focus the formations of users’ stances on vaccinations.

A variety of studies incorporated various techniques such as sentiment analysis, stance detection, and graph analytics. The majority of these studies assigned sentiment labels to each tweet using classifiers that were trained on a small set of accurate labels while some studies preferred to use crowd-sourcing platforms like Amazon Web Services (AWS) for labeling tweets [21]. Yousefinaghani et al. [22] used the Valence Aware Dictionary and sEntiment Reasoner (VADER) [23], a lexicon and rule-based sentiment analysis tool written in Python. Cotfas et al. [24] used several machine learning and deep learning methods to classify users’ stances towards vaccination. They demonstrated that the use of Bidirectional Encoder Representations from Transformers (BERT) [25] showed state-of-the-art results. Alhuzali et al. [26] combined sentiment analysis with geographical analysis. They used tweets from various cities in the United Kingdom to predict sentiment labels with a deep learning model and performed city-wise analyses. Mønsted et al. [14] leveraged transfer learning to enhance their stance classifier’s performance. They studied the propagation process of misinformation about vaccination on a mutual mention/retweet network. Garcia et al. [27] compared the transition of COVID-19 related topics in Brazil with that of the USA in Twitter. They also combined deep learning models and embedding-based techniques. To measure the level of polarization, some techniques for graph analytics such as community detection like METIS [28] and stochastic simulation like Random Walk Controversy (RWC) [29] were used. For instance, Yuan et al. [20] monitored polarization in debates on vaccination using the Louvain method [30], while Miyazaki et al. [6] quantified the degree of polarization using RWC.

This paper explores the information-sharing behaviors of SNS users during their stance formations for a more profound understanding of such behaviors. In terms of methodology, our study differs from prior work, which typically relies solely on content for sentiment analysis and stance classification. Instead, we propose a novel deep learning model incorporating both content and network-based elements. Furthermore, we leverage established methods of social network analysis to quantify controversies surrounding vaccine debates. This allows us to observe the formation of stances within the context of polarization.

Dataset construction

Twitter is a social media platform widely used in Japan and has a broad range of age groups, particularly younger generations (https://www.humblebunny.com/japans-top-social-media-networks). This platform allows users to share their thoughts through tweets and engage with others by reacting to their posts. These reactions, comprising retweets, quotes, and replies, indicate the users’ interest in the content of the tweets. Consequently, we leveraged these tweets to monitor people’s attitudes towards vaccination and used the reactions to look into their information-sharing behaviors.

Vaccination for individuals aged 18 to 64 in Japan started on June 17, 2021. By the end of October, approximately 75% of the population had been fully vaccinated. From all Japanese tweets, we extracted tweets from June 1, 2021 to October 31, 2021 that contained the keyword “ワクチン” (wakuchin, vaccine in English), resulting in 19,502,448 tweets posted by 4,446,499 users. The data of all Japanese tweets was provided by the NTT Data Japan Corporation. We took the utmost care in handling personal data and conducted our research in strict compliance with Twitter’s “Twitter Developer Agreement”(https://developer.twitter.com/en/developer-terms/agreement-and-policy/source). We intentionally excluded tweets posted by the “share via Twitter” function on certain websites and those produced by an app named “shindanmaker,” as they lacked any valuable information for assessing user intent. Our investigation of user accounts revealed that tweets posted from 4 major clients (iPhone, Android, iPad, and Web) accounted for 93.9%, and tweets posted from client applications including “bot” in their name accounted for only about 1%, which meant that explicit bot accounts had only a small impact on our analysis. Consequently, we created a dataset of vaccine-related tweets, consisting of 18,462,168 tweets posted by 4,408,669 unique users. Fig 2 illustrates the trend in the number of collected tweets. There was a great increase in tweet volume from June, which began to decline in late August when the initial vaccination rate surpassed 58.22%.

Download:

Fig 2. Changes in first-dose vaccination coverage [3] and number of vaccine-related tweets in Japan.

https://doi.org/10.1371/journal.pone.0299935.g002

Vaccination stance classification

As the basis and key to subsequent analysis, we identified users’ stances towards vaccination. Stance detection was done by using a vaccination stance classifier trained on our annotation dataset. This dataset was constructed on vaccine-related tweets manually annotated by four annotators. We trained the classifier using this dataset to label the other tweets. Fig 3 shows our tweet-selection process.

Download:

Fig 3. Outline of our tweet-selection process.

https://doi.org/10.1371/journal.pone.0299935.g003

Previous studies on predicting vaccine stances from tweets [20, 24] relied only on textual information and failed to classify tweets that referred to posts with the opposite vaccination stance. We additionally used reaction graph information to classify tweets into users’ stances towards vaccination using a deep neural network.

Annotation of stance of tweets

In this study, users are classified into three stances based on their stance towards vaccination: Pro-vaccine (those accepting or planning/getting vaccinated), Anti-vaccine (those avoiding vaccination), and Neutral (those without a specific stance on vaccination). To develop a deep learning model for stance classification, we conducted manual annotation on a subset of vaccine-related tweets. This involved categorizing tweets into three classes: pro-vaccine, anti-vaccine, and neutral towards vaccines. Criteria for stance annotation were established (see Table 1) to ensure consistent annotation. Pro-vaccine tweets consisted of expressions of support for vaccines, personal vaccination plans or experiences, recommendations for vaccination, and criticism of anti-vaccine sentiments. Anti-vaccine tweets consisted of vaccine denial, discouragement of vaccination, and criticism of pro-vaccine advocates. Neutral tweets consisted of factual information, introductions to press releases from public institutions, and discussions unrelated to the merits and drawbacks of vaccination. In accordance with these annotation criteria, four annotators labeled the same 500 tweets to measure the inter-annotator agreement; Fleiss’ kappa coefficient [31] for this annotation task was 0.74, which confirms the stability of the annotations. Each annotator then labeled an average of 2313 randomly-chosen tweets, resulting in a total of 9250 labeled tweets. Fig 4 shows our annotation, learning, and classification process.

Download:

Fig 4. An outline of our annotation, learning, and classification process.

https://doi.org/10.1371/journal.pone.0299935.g004

Download:

Table 1. Annotation criteria and example tweets.

Example tweets are translated from Japanese.

https://doi.org/10.1371/journal.pone.0299935.t001

Text and graph-based stance classification

With the above annotated tweets, we next trained a deep neural network on the basis of the textual content and reaction to classify vaccine stances. The architecture of our model is illustrated in Fig 5. Our model includes three components: a text encoder, reaction encoder, and classifier.

Download:

Fig 5. Overview of our vaccine-stance classifier.

https://doi.org/10.1371/journal.pone.0299935.g005

The text encoder induces linguistic features from tweet text. We fine-tuned a pre-trained Bidirectional Encoder Representations from Transformers (BERT) [25] on our target task. For each tweet, we carried out basic preprocessing, such as full-width half-width character conversion, case conversion, and removal of various symbols in the input before inputting it to BERT. To gain better classification performance, we conducted domain-adaptive pre-training (DAPT) [32] that continues pre-training for the pre-trained BERT with a corpus of a target task domain. In our model, BERT was pre-trained with a masked language model (MLM) objective using our vaccine tweet dataset that we constructed in the previous section.

Motivated by the fact that a user’s stance can be influenced by whom that user interacts with, the reaction encoder extracts reactions (retweets, quotes, and replies) between users to generate a reaction vector (RA vector) for each user representing who reacted to that user and whom the user reacted to. To reduce the computational costs, we used reactions to the most influential users. Specifically, we divided each month into three periods, 1st day to 10th, 11th to 20th, and 21st to 30th (31st), and we collected the top-10K users who reacted to others (hereinafter, information spreaders) and the top-10K users who others reacted to (hereinafter, information senders) for each period. We then vectorized the number of reactions between each user and top information spreaders/senders in the last three periods. The obtained RA vector was input to a fully-connected layer and tanh function to reduce the number of dimensions. The classifier inputs a concatenation of the tweet-text and reaction-graph vectors to a fully-connected layer. It then passes the output to the softmax function to make a prediction.

Experiments on vaccination stance classification

Settings.

For learning our vaccination stance classifier, we prepared datasets for training, development, and testing. Table 2 displays the statistics of the dataset. We created training and development datasets using the 9250 annotated tweets. To address potential annotator bias resulting from variations in the number of labeled tweets, we extracted an equal number of 125 tweets from each annotator to construct the development dataset, while the remaining 8750 tweets constituted the training dataset. To construct a reliable test dataset, we used the 500 tweets labeled by the four annotators to measure the inter-annotator agreements. To arrive at the final labels for the test dataset, a majority voting approach was used. In cases where there were disagreements among the annotators, resolutions were reached through collaborative discussions by the annotators.

Download:

Table 2. Vaccination stance dataset.

https://doi.org/10.1371/journal.pone.0299935.t002

To implement the text encoder, we used the Japanese BERT pre-learning model released by NICT, Japan (https://alaginrc.nict.go.jp/nict-bert/index.html). This Japanese-version BERT was pre-trained on Japanese Wikipedia. We used NICT_BERT-base_JapaneseWikipedia_100K. We set the maximum number of tokens to 160. On this BERT model, we performed DAPT with our vaccine tweet dataset for two epochs because no improvement in the performance (macro-F1 on the development data) was obtained after three epochs. For the reaction encoder, we obtained RA vectors with 500 dimensions by feeding the original 95,016 dimensional vectors to two fully connected layers and the tanh function.

Results.

To confirm the performance improvement of our classifier, we first compared the BERT classifier with that using DAPT. Subsequently, we incorporated the reaction encoder into the model to assess its efficacy. Table 3 lists the results including the precision, recall, F₁ scores of each class, and macro-F₁ score. The BERT classifier with DAPT pre-training (+DAPT) consistently outperformed the original BERT classifier across all evaluation metrics. Notably, the recall of the anti-vaccine class exhibited significant improvement, highlighting the effectiveness of DAPT in enhancing the performance of the minority class. The classifier incorporating DAPT and the reaction encoder (+DAPT+RAvec) demonstrated the highest performance across most evaluation metrics. Similarly to DAPT, it significantly enhanced the performance of the anti-vaccine class, indicating its effective utilization of user interactions in the minority class.

Download:

Table 3. Comparison of performance of vaccination stance classifiers.

https://doi.org/10.1371/journal.pone.0299935.t003

Because the prediction performance of the anti-vaccine class was still not good due to the small number of tweets in the class, we set a probability threshold to obtain reliable labels when we applied it to our vaccine tweet dataset. When the class with the maximum output of the softmax function was the anti-vaccine class, we set a threshold of 0.7 to the output probability of the anti-vaccine class. Thus, the precision of the anti-vaccine class increased from 0.524 to 0.700, which is not that much worse than the other classes. Instead, the recall of the class decreased from 0.688 to 0.438.

Analysis

We applied our vaccination stance classifier to all tweets in our vaccine tweet dataset. Using these automatically labeled tweets, we examined how users’ stances on vaccination were formed and the information-sharing behaviors that potentially influenced this process. In all analyses, we have complied with the terms and conditions of Twitter (X Corp.) at https://developer.twitter.com/en/developer-terms/more-on-restricted-use-cases. The following results update our previous findings [18] by incorporating our improved vaccination stance classifier (BERT+DAPT+RAvec).

Distribution of users’ stances

Using all the tweets labeled with our vaccination stance classifier, we determined users’ stances. We assumed that users do not change their stances in a short period of time and divided each month into three periods to aggregate tweet labels by each user using majority votes. In the case of a tie, we determined the user’s stance with priority toward pro-, neutral, and anti-vaccine in alignment with the order of the prediction precision.

Fig 6 illustrates the distribution of users’ stances in the 15 time periods during the 5 months. The number of pro-vaccine users was comparative with the neutral users during the first two periods. As the vaccination campaign progressed, the number of pro-vaccine users gradually increased. Starting in September 2021, there was decline in the number of pro-vaccine users coinciding with the achievement of approximately 50% full vaccination coverage, suggesting that interest in vaccination may have waned among these users around that time. The number of anti-vaccine users remained consistently small compared to other stances across all periods, suggesting that their influence was minimal, but their interest in the vaccination campaign remained unchanged.

Download:

Fig 6. Changes in number of users with each stance.

https://doi.org/10.1371/journal.pone.0299935.g006

Transition in polarization between vaccination stances

To examine the polarization between pro- and anti-vaccine users that is reported in online debates on vaccines for other infectious diseases [20, 33, 34], we created a graph of interactions between users, depicting the distribution of user stances based on a prior study [20]. We used undirected retweet graphs, where each node represented a user, and each edge represented the presence of retweets between users at both ends. To observe the changes in polarization overtime, we constructed retweet graphs for the 15 time periods and visualized each graph using Gephi (https://gephi.org/), a graph visualization tool. We only depicted nodes with degrees of 30 or higher to focus on users actively sharing information.

Fig 7 shows the retweet graphs for the first ten days of each month. The node colors show users’ vaccination stances, and the numbers show how many users have each stance. We can see densely connected nodes comprising three groups in each period, which is associated with the three different stances. There were relatively sparser connections between the pro- and anti-vaccine stances than those between the neutral group and the other two. This observation suggests a persistent polarization between the pro- and anti-vaccine groups throughout the periods.

Download:

Fig 7. Evolution of polarization of reaction graphs, RWC, and number of users with each stance.

https://doi.org/10.1371/journal.pone.0299935.g007

We estimated the degree of polarization for each period and examined its transition. For this estimation, we used a modified version of the Random Walk Controversy (RWC) [29], one of the most common measures for estimating the degree of polarization between two communities in a graph. We extended this method to measure the polarization between pro- and anti-vaccine communities in the presence of a third, neutral community. Our modified RWC first identified densely connected nodes within the retweet graph as communities. The METIS algorithm, which was used in the original RWC, is designed to divide a given graph into k clusters with an equal number of vertices. However, in our case, pro- and anti-vaccine communities has different number of vertices, making the METIS algorithm less suitable. Therefore, we used the Louvain method [30], which produced a better partitioning result in terms of modularity better reflecting the structure of the retweet graph. In this method, the “resolution” parameter has a direct impact on the minimum size of the communities, which we set at 2. We then assigned one of three stances to each of the three largest communities through a majority vote among the community’s users. Subsequently, we computed the RWC between the pro- and anti- communities for each period. RWC was originally designed to measure polarization between two communities by calculating the ratio of random walks starting from either community that remain within the same community. Our modified version just ignored the neutral community by starting the random walk from a node of either the pro- or anti-vaccine community and finishing it at k highest-degree nodes within either of pro- or anti-vaccine community. We used k = 10 highest-degree nodes as the goal nodes.

As shown in Fig 7, the three groups were initially segregated by their stances. However, the neutral and pro-vaccine groups became densely connected over the first three months. After the beginning of September, these two groups rapidly decreased in size, coinciding with a rapid increase in the number of vaccinated people, while the size of the anti-vaccine group remained consistent. This trend aligns with the decreasing number of vaccine-related tweets since late August, when first-dose vaccination coverage in Japan exceeded 58%, as shown in Fig 2.

The increase in RWC of the retweet graphs can be attributed to two main factors: the diminishing number of direct edges between pro- and anti-vaccine users, and a notable decrease in the number of neutral users bridging these opposing stances over time. These observations indicate that neutral users primarily communicated with pro-vaccine users, and both groups lost interest or stopped posting after receiving their first dose. The influence of anti-vaccine users remained limited in Japan, although their activities continued. To determine whether these anti-vaccine users were bots or a special type of user, we investigated 50 randomly sampled anti-vaccine accounts. We did not find any common characteristics among these users, nor did we find any bot accounts.

Stance formations of users

We examined the stance formations of active users who consistently posted during the data collection period. Firstly, we identified 308,789 users who posted in at least 4 months out of 5 months of data collection window. As defined in the analyses above, each month is divided into three periods. A user’s stance during each period is determined by the majority vote of stance labels from their tweets posted within that period. If a user did not post during a particular period, their stance was assigned based on their last known stance from a previous period with posts or their first stance in subsequent periods if they had no prior posts. Finally, we conducted analyses on the formation of users’ stances and their information-sharing behaviors over time. Throughout and after the analyses, we ensured that no information identifying individual users was accessed.

We first illustrate the transition in users’ stance distribution over 5 months (15 periods) in Fig 8. In the initial period, the number of pro-vaccine and neutral users was nearly equal, with very few anti-vaccine users. The number of pro-vaccine users steadily increased over subsequent periods, while the number of anti-vaccine users remained consistently small.

Download:

Fig 8. Changes in user stance distribution.

https://doi.org/10.1371/journal.pone.0299935.g008

To further investigate the exchange of users between stance groups, we present the matrix showing the number of users transitioning between stances from the initial to the final periods in Table 4. As depicted, both the pro-vaccine and anti-vaccine groups predominantly exchanged members with the neutral group, and the exchanges between the pro-vaccine and anti-vaccine groups are notably smaller. The stance of pro-vaccine users was relatively stable, whereas that of anti-vaccine users was more vulnerable. Among the 153,410 initially pro-vaccine users, 110,817 (73%) maintained their stance, while nearly all of the remaining users shifted to a neutral stance. Among the 2,930 initially anti-vaccine users, only 471 (16%) maintained their stance, while 1,608 (55%) shifted to a neutral stance, and a relatively small number, 851 (29%), shifted to pro-vaccine. The results show that both the pro-vaccine and anti-vaccine groups primarily exchanged members with the neutral group, and less with each other. These findings align with the sparse connections observed between pro- and anti-vaccine users, as shown in Fig 7. Therefore, our primary focus is on the stance formation of the 152,449 initially neutral users. Of these, 82,204 (54%) shifted to pro-vaccine, which significantly outweighed the shift from pro-vaccine to neutral. In contrast, only 1,989 (1.3%) shifted to anti-vaccine, a shift nearly equal to the movement from anti-vaccine to neutral.

Download:

Table 4. Transition matrix between users’ initial stances and final stances.

https://doi.org/10.1371/journal.pone.0299935.t004

Information-sharing behaviors associated with stance formation changing stances

As shown above, since there were fewer transitions between the pro- and anti-vaccine groups, we focused on users who were initially neutral, changed their stance only once to either pro-vaccine (neutral-to-pro) or anti-vaccine (neutral-to-anti), and maintained that stance until the last period. For each month, we compared the information-sharing behaviors of users who changed their stance that month with those who remained neutral (remaining-neutral) during that month. In our dataset, among the 152,449 users who were initially neutral in the first period, 60,289 users fell into one of three categories: neutral-to-pro, neutral-to-anti, or remaining-neutral. Among these, 42,905 changed to pro-vaccine once, 603 changed to anti-vaccine once, and both groups maintained that stance until the last period. The remaining 16,781 users stayed neutral throughout all the periods. We analyzed their information-sharing behaviors, focusing specifically on interactions with other user accounts and external sites.

What kinds of users were referred to by users who formed their stances?

We investigated which user accounts were most referred to (replied, retweeted, or quoted) by either the neutral-to-pro or neutral-to-anti users during their stance formation. These referred accounts were then compared to those referred to by the remaining-neutral users. For each month, we classified users into three groups: neutral-to-pro, neutral-to-anti, and remaining-neutral, based on the formation of their stances during that month. For each group, we extracted the user accounts referred to by members of each group during the period when their stance changed, as well as during the three periods prior to the change. If one user referred to the same user account multiple times in the month, it was counted only once. After identifying the user accounts frequently referred to by each group, we classified these accounts by their attributes, such as their titles, jobs and belonging organizations. The authors assigned these attributes to each user accounts based on their profiles, several recent tweets. Table 5 provides the definition of each attribute.

Download:

Table 5. Attributes of referred user accounts.

https://doi.org/10.1371/journal.pone.0299935.t005

Fig 9 shows the attributes of the top 30 user accounts most referred to by the neutral-to-pro, remaining-neutral, and neutral-to-anti users in each month. The neutral-to-pro and remaining-neutral users consistently referred to a notable number of user accounts belonging to medical workers. In contrast, the neutral-to-anti users were unlikely to refer to such accounts. In Japan, many medical doctors voluntarily provided information on the effects and risks of vaccines to the public through their personal accounts, which may have played a important role in increasing vaccination coverage. We also observed that users referred to a wider variety of user accounts when they changed their stances. As shown in Fig 9, neutral-to-pro and neutral-to-anti users referred to a diverse range of user accounts, such as artists, business persons, professional writers and influencers, compared to remaining-neutral users.

Download:

Fig 9. Users who were most referred to by neutral-to-pro users (top), remaining-neutral users (middle), and neutral-to-anti users (bottom).

https://doi.org/10.1371/journal.pone.0299935.g009

To further examine user accounts exclusively referred to by either the neutral-to-pro or neutral-to-anti users in comparison with remaining-neutral users, we carried out a chi-squared test of independence on two user account groups at a significance level of 5%. Among the user accounts that passed the chi-squared test, we extracted the top 30 user accounts most frequently referred to by users in each group and assigned attributes in the same manner.

Fig 10 shows the attributes of the top 30 user accounts referred to by neutral-to-pro and neutral-to-anti users over time. The neutral-to-pro users tend to refer to the vaccination experiences of a diverse group of users. They notably referred to many artist accounts that shared their vaccination experiences through comics and illustrations. Tweets featuring comics or illustrations tend to attract more attention than text-only tweets, and such tweets may have played a significant role.

Download:

Fig 10. Users, who passed the chi-squared test of independence, referred to by neutral-to-pro users (top) and neutral-to-anti users (bottom).

https://doi.org/10.1371/journal.pone.0299935.g010

References to office workers and professional writers were consistently shown. They shared information about the vaccination procedure at venues and their personal experiences with vaccine side effects. In October, there was a sharp increase in references to medical workers. This was due to a combination of factors, including mentions of the downsizing of vaccination venues, information about vaccines provided by the Ministry of Health, Labour and Welfare and The Japanese Society for Vaccinology, and reports on studies concerning vaccinations for children.

It was found that the neutral-to-anti users tended to refer to user accounts who highlighted the negative aspects of vaccines. They mainly emphasized the dangers and ineffectiveness of vaccines, or criticize policies related to vaccines, such as opposing vaccine certificates. It is also noteworthy that we were unable to identify the occupations for about half of the accounts referred to by the neutral-to-anti users.

What types of external sites were shared by users who determined their stances?

We next investigated the external sites that were shared by the neutral-to-pro and neutral-to-anti users during their stance formation. Similarly to the analysis of referred user accounts, we classified users into three groups (neutral-to-pro, remaining-neutral, and neutral-to-anti) for each month. For each group, we extracted members’ tweets and retweets containing links to external sites during the period when their stance changed, as well as during the three periods prior to the change. If one user shared the same link multiple, it was counted only once. After identifying the external links frequently shared by each group, the authors assigned categories to each link based on the types of their web sites. Table 6 provides the definition of each category.

Download:

Table 6. Categories of shared external sites.

https://doi.org/10.1371/journal.pone.0299935.t006

Fig 11 shows the categories of the top 30 external sites referred to by the neutral-to-pro, remaining-neutral, and neutral-to-anti users for five months. The neutral-to-pro users and remaining-neutral users mainly shared links to mass media or web news sites, while the neutral-to-anti users shared alternative information sources. There was no significant difference between neutral-to-pro users and remaining-neutral users in this figure. Neutral-to-pro users tended to refer slightly more to web news sites, while there were no significant differences in the referenced articles because these sites predominantly reported articles already covered by mass media. This indicates a tendency for neutral-to-pro users to access such articles via web news sites rather than directly through mass media sites. The neutral-to-anti users frequently shared video hosting sites, BBS, and blog sites, which means they prefer these alternative information sources over mass media sites such as TV or newspapers. For example, in July, the number of video hosting sites was ten in Fig 11. Among these, the number of major video sharing sites was only four, while the rest six sites were relatively minor video sharing sites. These ten videos included titles such as “Vaccines Cause Tragedies” and others claiming that vaccination for children poses a high risk, indicating that video sharing platforms have become mediums for the spread of anti-vaccine sentiments. Similarly, blogs and BBSs criticizing the government’s stance on vaccines or detailing post-vaccination deaths were frequently referenced. Specifically, these included blogs claiming that it is healthier not to get vaccinated, BBSs introducing cases of people who died after vaccination, and blogs reporting that the government concealed the number of deaths caused by vaccines. Among these, blogs that particularly highlight the dangers of vaccines often lack evidence or rely on suspicious evidence, supporting the hypothesis that blogs and BBSs serve as a hotbed of anti-vaccine.

Download:

Fig 11. External sites which most shared by neutral-to-pro users (top), remaining-neutral users (middle), and neutral-to-anti users (bottom).

https://doi.org/10.1371/journal.pone.0299935.g011

To further examine external links exclusively referred to by either the neutral-to-pro or neutral-to-anti users in comparison with remaining-neutral users, we carried out a chi-squared test of independence on two sets of shared links at a significance level of 5%. Among the shared links that passed the chi-squared test, we extracted the top 30 links most frequently referred to by users in each group and assigned categories in the same manner.

Fig 12 shows the top30 frequently shared external websites by the neutral-to-pro and -anti users over five months. The top of Fig 12 shows sites shared by the neutral-to-pro users. The majority of these were web news sites, with a much smaller number being mass media sites. However, most of the shared news articles were also covered by mass media, and we did not find significant differences in content compared with Fig 11. The bottom of Fig 12 shows sites shared by the neutral-to-anti users. Similarly, we did not find significant differences compared with Fig 11.

Download:

Fig 12. External sites which passed the chi-squared test, shared by neutral-to-pro users (top) and neutral-to-anti users (bottom).

https://doi.org/10.1371/journal.pone.0299935.g012

Fig 13 illustrates word cloud created from the headlines of the links shared by the neutral-to-pro and -anti users over time. We obtained the headlines from the external sites which were significantly frequently shared by neutral-to-pro or -anti users respectively at a significance level of 5%. From the shared sites, we extracted nouns frequently used in the titles of the headlines using MeCab (https://taku910.github.io/mecab), one of the most popular Japanese tokenizers. We performed a chi-squared test of independence on two word groups used in the titles shared by the neutral-to-pro (or neutral-to-anti) users and the remaining-neutral users at a significance level of 5%. When one user shared several sites and there was an overlap of tokens between the titles of the sites, the token was counted only once. After identifying the typical words used in each group, we conducted a deeper analysis of the contents containing these keywords.

Download:

Fig 13. Changes in keywords in titles of external sites referred to by neutral-to-pro users (top) and neutral-to-anti users (bottom).

https://doi.org/10.1371/journal.pone.0299935.g013

For neutral-to-pro users, we observed a significant interest in websites related to vaccine reservations. Specifically, in all months except for July, we identified an increased usage of the word “reservation” as users checked and shared information on how to make a vaccine reservation or tweeted that they had successfully made one.

In the first few months, neutral-to-pro users displayed significant interest in the potential drawbacks of vaccination and tended to refer to authoritative sources such as government agencies and officials in charge of vaccinations. During this period, Taro Kono, the Cabinet minister in charge of vaccinations, issued warnings about vaccine-related false rumors on his official website, which led to the emergence of the term “false rumor” in June and July (https://www.taro.org/2021/06/%e3%83%af%e3%82%af%e3%83%81%e3%83%b3%e3%83%87%e3%83%9e%e3%81%ab%e3%81%a4%e3%81%84%e3%81%a6.php). Additionally, the Ministry of Health, Labour and Welfare reported cases of suspected vaccine side effects online (https://www.mhlw.go.jp/stf/seisakunitsuite/bunya/vaccine_hukuhannou-utagai-houkoku.html), leading to the appearance of words such as “suspicion,” “side effects,” and “report.”

Starting in August, the vaccination campaign for young people began and attracted significant attention from neutral-to-pro users for several months. In August, news reports covered the opening of vaccination venues for young people (https://news.livedoor.com/article/detail/20718403, https://news.livedoor.com/lite/article_detail/20769771), leading to the appearance of “young people,” “venue,” “coupons,” and “reservation.” Some of these words continued to appear in the following months.

Since August, there has been increased interest in the effectiveness and safety of vaccines, confirmed by authorities. The word “Moderna” appeared from August to October. In August, news reported that the Ministry of Health, Labour and Welfare stated that over 80% of people who received the Moderna vaccine experienced a fever and recommended resting the day after vaccination (https://news.livedoor.com/article/detail/20649919). Attention to Moderna dropped in September but rose again in October when it was reported that some Moderna vaccines contained stainless steel contaminants (https://www.tokyo-np.co.jp/article/130189). In October, the word “Pfizer” also appeared. This was due to news that 80% of the antibodies from the Pfizer vaccine decreased within six months (https://times.abema.tv/news-article/8673379) and that a booster shot had an effectiveness of 96% (https://news.yahoo.co.jp/articles/04f1fe37b1a5f7a66ecee6081de50018c102c559).

As shown in bottom of Fig 13, the word cloud of the neutral-to-anti users contained words exaggerating the negative aspects of the vaccine, such as “death,” “side effects”, and “danger.” This suggests that the neutral-to-anti users were particularly anxious about the vaccine’s safety. Furthermore, BBSs and blog articles that report foreign information related to vaccines, particularly highlighting their drawbacks, attracted significant attention among neutral-to-anti vaccine users. For instance, in August, the term “Israel” began to appear, due to two blog articles: one claiming that Pfizer and Israel agreed to conceal the side effects of the vaccine (https://tocana.jp/2021/08/post_218206_entry.html), and another warning that the number of positive COVID-19 cases in Israel did not decrease despite booster shots, contrary to reports of their effectiveness (https://johosokuhou.com/2021/09/29/51798/). Since Israel was one of the first countries to start vaccinations globally, negative information about vaccines related to Israel attracted considerable attention from the neutral-to-anti users. Similarly, since September, the term “US” has also appeared, as the country was another early adopter of vaccination. This term emerged from blog articles claiming that 60% of American doctors refused to get vaccinated (http://www.rui.jp/ruinet.html?i=200&c=600&t=6&k=0&m=368469&g=131203) and introducing research from the US suggesting that vaccines posed a high risk are dangerous for young people (https://note.com/you3_jp/n/n463d19aeaf03). This appearance is considered to be for similar reasons as that of Israel.

It was also found that doctors and medical experts might be influencing neutral-to-anti vaccine users towards anti-vaccine sentiments. The results also suggest that anti-vaccine doctors and experts may be pushing people towards anti-vaccine sentiments. In August, the words “doctors” appeared, referring to the news about a group of 450 doctors and legislators submitting a petition to suspend vaccinations (https://www.sanspo.com/article/20210624-IOQJULJCVRMBXMZXIDJG6SDUHA). Additionally, in June and July, the word “developer” appeared, due to a blog article about a person claiming to be a vaccine developer who warned that the vaccine is poison (http://blog.nihon-syakai.net/blog/2021/06/12371.html?g=132207). This was actually an opinion by a Canadian vaccine researcher who had seen internal Pfizer documents. The appearance of the word “developer” over two months suggests that its influence was significant. Such news, based on the views of doctors and experts, has also garnered considerable attention, suggesting that even users with anti-vaccine tendencies find statements from individuals in authoritative medical positions to be effective. This is consistent with neutral-to-anti users tending to refer to accounts from medical workers and researchers shown in Figs 9 and 10.

Conclusion

To draw a lesson from the successful COVID-19 vaccination campaign in Japan, we analyzed stance formations of Twitter users towards COVID-19 vaccination. We developed a BERT-based stance classifier with reaction information and applied it to all vaccine-related tweets posted from June to October 2021.

Analysis of the distribution of stances and the polarization of the pro-vaccine and anti-vaccine users revealed that the number of pro-vaccine users greatly exceeds the number of anti-vaccine users, and interactions among them are relatively sparse. The number of pro-vaccine users increased from June when the vaccination campaign for individuals aged 18 to 64 started, but it decreased from September when the vaccination coverage reached around 50%. The impact of anti-vaccine users was relatively insignificant, as their numbers remained consistently low throughout the analyzed period. However, it is noteworthy that their level of interest in the vaccination campaign remained unchanged. We found that polarization between the pro-vaccine and anti-vaccine users increased over time. Additionally, we observed that neutral users tended to react more frequently to the pro-vaccine users rather than the anti-vaccine users. These findings may explain why the majority of neutral users who became other stances shifted towards being pro-vaccine. It underscores the significance of providing reliable and timely information to neutral users to effectively improve the vaccination coverage.

Users who transitioned to the pro-vaccine stance often relied on traditional and authoritative information sources such as medical doctors, mass media, governments, and politicians. This indicates that the information provided by these sources played a crucial role in influencing individuals’ decision to get vaccinated. Notably, many medical doctors took the initiative to share updated vaccine information through their personal accounts, highlighting the significant impact of personal activities in improving vaccination coverage.

Such users were also found to frequently refer to the vaccination experiences of a diverse group of user accounts. This suggests that users considering vaccination are preparing by looking at these experiences to smoothly undergo vaccination. Notably, they referred to many artist accounts that shared their vaccination experiences through comics and illustrations.

In contrast, users who transitioned to the anti-vaccine stance often relied on alternative information sources such as BBSs, blogs, video sharing sites, and accounts with unknown occupations. A word cloud analysis of the neutral-to-anti users revealed a predominant interest in vaccine safety. While it is natural for individuals to have concerns about vaccine risks, these users exhibited an elevated level of worry that led them to trust suspicious information, including rumors, gossip, and fake news. The information found on these platforms often lacked moderation and contributed to the dissemination of misinformation.

These findings highlight the importance of disseminating information through diverse channels to prevent neutral users from encountering misinformation in alternative information sources. By ensuring the availability of accurate and reliable information across various platforms, we can help neutral users make informed decisions and avoid being influenced by misleading content.

One limitation of our study is that we categorized both ‘vaccine acceptance’ and ‘vaccine uptake’ under pro-vaccine, despite their distinct nature. Expressing acceptance of COVID-19 vaccination does not necessarily mean actual uptake, and vice versa. A systematic review and meta-analysis revealed a significant difference between global acceptance rate of COVID-19 vaccination (67.8%) and its uptake rate (42.3%) [35]. In our annotation dataset, almost all of pro-vaccine posts reported uptake (95.7%), posing a challenge for analyzing these two concepts separately.

In future studies, interpretable models could reveal the key features determining the classification of tweets into different stances. For instance, these models could identify common words or influencers associated with each stance. For example, employing such models could provide popular words or popular influencers associated with each stance. The application of NLP techniques could deepen our understanding. A thematic analysis such as LDA could unveil prevalent topics among the neutral-to-pro/anti users. Emotion fusion could offer insight into emotional features associated with each stance, and their effect on stance formation.

References

1. Sallam M. COVID-19 vaccine hesitancy worldwide: a concise systematic review of vaccine acceptance rates. Vaccines. 2021;9(2). pmid:33669441
- View Article
- PubMed/NCBI
- Google Scholar
2. De Figueiredo A, Simas C, Karafillakis E, Paterson P, Larson HJ. Mapping global trends in vaccine confidence and investigating barriers to vaccine uptake: a large-scale retrospective temporal modelling study. The Lancet. 2020;396(10255):898–908. pmid:32919524
- View Article
- PubMed/NCBI
- Google Scholar
3. Mathieu E, Ritchie H, Ortiz-Ospina E, Roser M, Hasell J, Appel C, et al. A global database of COVID-19 vaccinations. Nature human behaviour. 2021;5(7):947–953. pmid:33972767
- View Article
- PubMed/NCBI
- Google Scholar
4. Thelwall M, Kousha K, Thelwall S. COVID-19 vaccine hesitancy on English-language Twitter. Prof Inf. 2021;30(2).
- View Article
- Google Scholar
5. Liu S, Liu J, et al. Understanding behavioral intentions toward COVID-19 vaccines: theory-based content analysis of tweets. J Med Internet Res. 2021;23(5). pmid:33939625
- View Article
- PubMed/NCBI
- Google Scholar
6. Miyazaki K, Uchiba T, Toriumi F, Tanaka K, Sakaki T. Retrospective analysis of controversial topics on COVID-19 in Japan. In: Proc. ASONAM; 2021. p. 510–517.
7. Cascini F, Pantovic A, Al-Ajlouni YA, Failla G, Puleo V, Melnyk A, et al. Social media and attitudes towards a COVID-19 vaccination: A systematic review of the literature. EClinicalMedicine. 2022;48. pmid:35611343
- View Article
- PubMed/NCBI
- Google Scholar
8. Hu T, Wang S, Luo W, Zhang M, Huang X, Yan Y, et al. Revealing public opinion towards COVID-19 vaccines with Twitter data in the United States: spatiotemporal perspective. J Med Internet Res. 2021;23(9). pmid:34346888
- View Article
- PubMed/NCBI
- Google Scholar
9. Ansari MTJ, Khan NA. Worldwide COVID-19 Vaccines Sentiment Analysis Through Twitter Content. Electron J Gen Med. 2021;18(6).
- View Article
- Google Scholar
10. Ng QX, Lim SR, Yau CE, Liew TM. Examining the prevailing negative sentiments related to COVID-19 vaccination: Unsupervised deep learning of Twitter posts over a 16 month period. Vaccines. 2022;10(9):1457. pmid:36146535
- View Article
- PubMed/NCBI
- Google Scholar
11. Balakrishnan V, Ng WZ, Soo MC, Han GJ, Lee CJ. Infodemic and fake news–A comprehensive overview of its global magnitude during the COVID-19 pandemic in 2021: A scoping review. International Journal of Disaster Risk Reduction. 2022;78:103144. pmid:35791376
- View Article
- PubMed/NCBI
- Google Scholar
12. Liew TM, Lee CS. Examining the utility of social media in COVID-19 vaccination: unsupervised learning of 672,133 twitter posts. JMIR public health and surveillance. 2021;7(11):e29789. pmid:34583316
- View Article
- PubMed/NCBI
- Google Scholar
13. Piedrahita-Valdés H, Piedrahita-Castillo D, Bermejo-Higuera J, Guillem-Saiz P, Bermejo-Higuera JR, Guillem-Saiz J, et al. Vaccine Hesitancy on Social Media: Sentiment Analysis from June 2011 to April 2019. Vaccines. 2021;9(1). pmid:33430428
- View Article
- PubMed/NCBI
- Google Scholar
14. Mønsted B, Lehmann S. Characterizing polarization in online vaccine discourse—A large-scale study. PLOS ONE. 2022;17(2):1–19. pmid:35139121
- View Article
- PubMed/NCBI
- Google Scholar
15. Dubé E, Vivion M, MacDonald NE. Vaccine hesitancy, vaccine refusal and the anti-vaccine movement: influence, impact and implications. Expert Rev of Vaccines. 2015;14(1):99–117.
- View Article
- Google Scholar
16. Nomura S, Eguchi A, Yoneoka D, Kawashima T, Tanoue Y, Murakami M, et al. Reasons for being unsure or unwilling regarding intention to take COVID-19 vaccine among Japanese people: A large cross-sectional national survey. Lancet Reg Health West Pac. 2021;14. pmid:34368797
- View Article
- PubMed/NCBI
- Google Scholar
17. Gera P, Neal T. A Comparative Analysis of Stance Detection Approaches and Datasets. In: Proceedings of the 3rd Workshop on Evaluation and Comparison of NLP Systems. Online: Association for Computational Linguistics; 2022. p. 58–69. Available from: https://aclanthology.org/2022.eval4nlp-1.7.
18. Shohei H, Sho C, Hongshan J, Masashi T, Naoki Y. Diachronic Analysis of Users’ Stances on COVID-19 Vaccination in Japan using Twitter. In: The 2022 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2022). Istanbul, Turkey: IEEE; 2022.
19. Brownstein JS, Freifeld CC, Reis BY, Mandl KD. Surveillance Sans Frontieres: Internet-based emerging infectious disease intelligence and the HealthMap project. PLoS medicine. 2008;5(7):e151. pmid:18613747
- View Article
- PubMed/NCBI
- Google Scholar
20. Yuan X, Schuchard RJ, Crooks AT. Examining emergent communities and social bots within the polarized online vaccination debate in Twitter. Soc Media Soc. 2019;5(3).
- View Article
- Google Scholar
21. Niu Q, Liu J, Kato M, Shinohara Y, Matsumura N, Aoyama T, et al. Public Opinion and Sentiment Before and at the Beginning of COVID-19 Vaccinations in Japan: Twitter Analysis. JMIR Infodemiology. 2022;2(1):e32335. pmid:35578643
- View Article
- PubMed/NCBI
- Google Scholar
22. Yousefinaghani S, Dara R, Mubareka S, Papadopoulos A, Sharif S. An analysis of COVID-19 vaccine sentiments and opinions on Twitter. International Journal of Infectious Diseases. 2021;108:256–262. pmid:34052407
- View Article
- PubMed/NCBI
- Google Scholar
23. Hutto C, Gilbert E. Vader: A parsimonious rule-based model for sentiment analysis of social media text. In: Proceedings of the international AAAI conference on web and social media. vol. 8; 2014. p. 216–225.
24. Cotfas LA, Delcea C, Roxin I, Ioanăş C, Gherai DS, Tajariol F. The longest month: analyzing COVID-19 vaccination opinions dynamics from tweets in the month following the first vaccine announcement. IEEE Access. 2021;9:33203–33223. pmid:34786309
- View Article
- PubMed/NCBI
- Google Scholar
25. Devlin J, Chang MW, Lee K, Toutanova K. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In: Proc. NAACL-HLT; 2019. p. 4171–4186.
26. Alhuzali H, Zhang T, Ananiadou S. Emotions and topics expressed on Twitter during the COVID-19 pandemic in the United Kingdom: Comparative geolocation and text mining analysis. Journal of Medical Internet Research. 2022;24(10):e40323. pmid:36150046
- View Article
- PubMed/NCBI
- Google Scholar
27. Garcia K, Berton L. Topic detection and sentiment analysis in Twitter content related to COVID-19 from Brazil and the USA. Applied Soft Computing. 2021;101:107057. pmid:33519326
- View Article
- PubMed/NCBI
- Google Scholar
28. Karypis G, Kumar V. A fast and high quality multilevel scheme for partitioning irregular graphs. SIAM Journal on scientific Computing. 1998;20(1):359–392.
- View Article
- Google Scholar
29. Garimella K, Morales GDF, Gionis A, Mathioudakis M. Quantifying controversy on social media. ACM Trans Social Comput. 2018;1(1):1–27.
- View Article
- Google Scholar
30. Blondel VD, Guillaume JL, Lambiotte R, Lefebvre E. Fast unfolding of communities in large networks. J Stat Mech: Theory Exp. 2008;2008(10).
- View Article
- Google Scholar
31. Landis JR, Koch GG. The measurement of observer agreement for categorical data. Biometrics. 1977;33(1):159–174. pmid:843571
- View Article
- PubMed/NCBI
- Google Scholar
32. Gururangan S, Marasović A, Swayamdipta S, Lo K, Beltagy I, Downey D, et al. Don’t Stop Pretraining: Adapt Language Models to Domains and Tasks. In: Proceedings of ACL; 2020.
33. Schmidt AL, Zollo F, Scala A, Betsch C, Quattrociocchi W. Polarization of the vaccination debate on Facebook. Vaccine. 2018;36(25):3606–3612. pmid:29773322
- View Article
- PubMed/NCBI
- Google Scholar
34. Cossard A, Morales GDF, Kalimeri K, Mejova Y, Paolotti D, Starnini M. Falling into the echo chamber: the Italian vaccination debate on Twitter. In: Proc. ICWSM; 2020. p. 130–140.
35. Wang Q, Hu S, Du F, Zang S, Xing Y, Qu Z, et al. Mapping global acceptance and uptake of COVID-19 vaccination: A systematic review and meta-analysis. Communications medicine. 2022;2(1):113. pmid:36101704
- View Article
- PubMed/NCBI
- Google Scholar

[ref1] 1. Sallam M. COVID-19 vaccine hesitancy worldwide: a concise systematic review of vaccine acceptance rates. Vaccines. 2021;9(2). pmid:33669441
View Article
PubMed/NCBI
Google Scholar

[2] View Article

[3] PubMed/NCBI

[4] Google Scholar

[ref2] 2. De Figueiredo A, Simas C, Karafillakis E, Paterson P, Larson HJ. Mapping global trends in vaccine confidence and investigating barriers to vaccine uptake: a large-scale retrospective temporal modelling study. The Lancet. 2020;396(10255):898–908. pmid:32919524
View Article
PubMed/NCBI
Google Scholar

[6] View Article

[7] PubMed/NCBI

[8] Google Scholar

[ref3] 3. Mathieu E, Ritchie H, Ortiz-Ospina E, Roser M, Hasell J, Appel C, et al. A global database of COVID-19 vaccinations. Nature human behaviour. 2021;5(7):947–953. pmid:33972767
View Article
PubMed/NCBI
Google Scholar

[10] View Article

[11] PubMed/NCBI

[12] Google Scholar

[ref4] 4. Thelwall M, Kousha K, Thelwall S. COVID-19 vaccine hesitancy on English-language Twitter. Prof Inf. 2021;30(2).
View Article
Google Scholar

[14] View Article

[15] Google Scholar

[ref5] 5. Liu S, Liu J, et al. Understanding behavioral intentions toward COVID-19 vaccines: theory-based content analysis of tweets. J Med Internet Res. 2021;23(5). pmid:33939625
View Article
PubMed/NCBI
Google Scholar

[17] View Article

[18] PubMed/NCBI

[19] Google Scholar

[ref6] 6. Miyazaki K, Uchiba T, Toriumi F, Tanaka K, Sakaki T. Retrospective analysis of controversial topics on COVID-19 in Japan. In: Proc. ASONAM; 2021. p. 510–517.

[ref7] 7. Cascini F, Pantovic A, Al-Ajlouni YA, Failla G, Puleo V, Melnyk A, et al. Social media and attitudes towards a COVID-19 vaccination: A systematic review of the literature. EClinicalMedicine. 2022;48. pmid:35611343
View Article
PubMed/NCBI
Google Scholar

[22] View Article

[23] PubMed/NCBI

[24] Google Scholar

[ref8] 8. Hu T, Wang S, Luo W, Zhang M, Huang X, Yan Y, et al. Revealing public opinion towards COVID-19 vaccines with Twitter data in the United States: spatiotemporal perspective. J Med Internet Res. 2021;23(9). pmid:34346888
View Article
PubMed/NCBI
Google Scholar

[26] View Article

[27] PubMed/NCBI

[28] Google Scholar

[ref9] 9. Ansari MTJ, Khan NA. Worldwide COVID-19 Vaccines Sentiment Analysis Through Twitter Content. Electron J Gen Med. 2021;18(6).
View Article
Google Scholar

[30] View Article

[31] Google Scholar

[ref10] 10. Ng QX, Lim SR, Yau CE, Liew TM. Examining the prevailing negative sentiments related to COVID-19 vaccination: Unsupervised deep learning of Twitter posts over a 16 month period. Vaccines. 2022;10(9):1457. pmid:36146535
View Article
PubMed/NCBI
Google Scholar

[33] View Article

[34] PubMed/NCBI

[35] Google Scholar

[ref11] 11. Balakrishnan V, Ng WZ, Soo MC, Han GJ, Lee CJ. Infodemic and fake news–A comprehensive overview of its global magnitude during the COVID-19 pandemic in 2021: A scoping review. International Journal of Disaster Risk Reduction. 2022;78:103144. pmid:35791376
View Article
PubMed/NCBI
Google Scholar

[37] View Article

[38] PubMed/NCBI

[39] Google Scholar

[ref12] 12. Liew TM, Lee CS. Examining the utility of social media in COVID-19 vaccination: unsupervised learning of 672,133 twitter posts. JMIR public health and surveillance. 2021;7(11):e29789. pmid:34583316
View Article
PubMed/NCBI
Google Scholar

[41] View Article

[42] PubMed/NCBI

[43] Google Scholar

[ref13] 13. Piedrahita-Valdés H, Piedrahita-Castillo D, Bermejo-Higuera J, Guillem-Saiz P, Bermejo-Higuera JR, Guillem-Saiz J, et al. Vaccine Hesitancy on Social Media: Sentiment Analysis from June 2011 to April 2019. Vaccines. 2021;9(1). pmid:33430428
View Article
PubMed/NCBI
Google Scholar

[45] View Article

[46] PubMed/NCBI

[47] Google Scholar

[ref14] 14. Mønsted B, Lehmann S. Characterizing polarization in online vaccine discourse—A large-scale study. PLOS ONE. 2022;17(2):1–19. pmid:35139121
View Article
PubMed/NCBI
Google Scholar

[49] View Article

[50] PubMed/NCBI

[51] Google Scholar

[ref15] 15. Dubé E, Vivion M, MacDonald NE. Vaccine hesitancy, vaccine refusal and the anti-vaccine movement: influence, impact and implications. Expert Rev of Vaccines. 2015;14(1):99–117.
View Article
Google Scholar

[53] View Article

[54] Google Scholar

[ref16] 16. Nomura S, Eguchi A, Yoneoka D, Kawashima T, Tanoue Y, Murakami M, et al. Reasons for being unsure or unwilling regarding intention to take COVID-19 vaccine among Japanese people: A large cross-sectional national survey. Lancet Reg Health West Pac. 2021;14. pmid:34368797
View Article
PubMed/NCBI
Google Scholar

[56] View Article

[57] PubMed/NCBI

[58] Google Scholar

[ref17] 17. Gera P, Neal T. A Comparative Analysis of Stance Detection Approaches and Datasets. In: Proceedings of the 3rd Workshop on Evaluation and Comparison of NLP Systems. Online: Association for Computational Linguistics; 2022. p. 58–69. Available from: https://aclanthology.org/2022.eval4nlp-1.7.

[ref18] 18. Shohei H, Sho C, Hongshan J, Masashi T, Naoki Y. Diachronic Analysis of Users’ Stances on COVID-19 Vaccination in Japan using Twitter. In: The 2022 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2022). Istanbul, Turkey: IEEE; 2022.

[ref19] 19. Brownstein JS, Freifeld CC, Reis BY, Mandl KD. Surveillance Sans Frontieres: Internet-based emerging infectious disease intelligence and the HealthMap project. PLoS medicine. 2008;5(7):e151. pmid:18613747
View Article
PubMed/NCBI
Google Scholar

[62] View Article

[63] PubMed/NCBI

[64] Google Scholar

[ref20] 20. Yuan X, Schuchard RJ, Crooks AT. Examining emergent communities and social bots within the polarized online vaccination debate in Twitter. Soc Media Soc. 2019;5(3).
View Article
Google Scholar

[66] View Article

[67] Google Scholar

[ref21] 21. Niu Q, Liu J, Kato M, Shinohara Y, Matsumura N, Aoyama T, et al. Public Opinion and Sentiment Before and at the Beginning of COVID-19 Vaccinations in Japan: Twitter Analysis. JMIR Infodemiology. 2022;2(1):e32335. pmid:35578643
View Article
PubMed/NCBI
Google Scholar

[69] View Article

[70] PubMed/NCBI

[71] Google Scholar

[ref22] 22. Yousefinaghani S, Dara R, Mubareka S, Papadopoulos A, Sharif S. An analysis of COVID-19 vaccine sentiments and opinions on Twitter. International Journal of Infectious Diseases. 2021;108:256–262. pmid:34052407
View Article
PubMed/NCBI
Google Scholar

[73] View Article

[74] PubMed/NCBI

[75] Google Scholar

[ref23] 23. Hutto C, Gilbert E. Vader: A parsimonious rule-based model for sentiment analysis of social media text. In: Proceedings of the international AAAI conference on web and social media. vol. 8; 2014. p. 216–225.

[ref24] 24. Cotfas LA, Delcea C, Roxin I, Ioanăş C, Gherai DS, Tajariol F. The longest month: analyzing COVID-19 vaccination opinions dynamics from tweets in the month following the first vaccine announcement. IEEE Access. 2021;9:33203–33223. pmid:34786309
View Article
PubMed/NCBI
Google Scholar

[78] View Article

[79] PubMed/NCBI

[80] Google Scholar

[ref25] 25. Devlin J, Chang MW, Lee K, Toutanova K. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In: Proc. NAACL-HLT; 2019. p. 4171–4186.

[ref26] 26. Alhuzali H, Zhang T, Ananiadou S. Emotions and topics expressed on Twitter during the COVID-19 pandemic in the United Kingdom: Comparative geolocation and text mining analysis. Journal of Medical Internet Research. 2022;24(10):e40323. pmid:36150046
View Article
PubMed/NCBI
Google Scholar

[83] View Article

[84] PubMed/NCBI

[85] Google Scholar

[ref27] 27. Garcia K, Berton L. Topic detection and sentiment analysis in Twitter content related to COVID-19 from Brazil and the USA. Applied Soft Computing. 2021;101:107057. pmid:33519326
View Article
PubMed/NCBI
Google Scholar

[87] View Article

[88] PubMed/NCBI

[89] Google Scholar

[ref28] 28. Karypis G, Kumar V. A fast and high quality multilevel scheme for partitioning irregular graphs. SIAM Journal on scientific Computing. 1998;20(1):359–392.
View Article
Google Scholar

[91] View Article

[92] Google Scholar

[ref29] 29. Garimella K, Morales GDF, Gionis A, Mathioudakis M. Quantifying controversy on social media. ACM Trans Social Comput. 2018;1(1):1–27.
View Article
Google Scholar

[94] View Article

[95] Google Scholar

[ref30] 30. Blondel VD, Guillaume JL, Lambiotte R, Lefebvre E. Fast unfolding of communities in large networks. J Stat Mech: Theory Exp. 2008;2008(10).
View Article
Google Scholar

[97] View Article

[98] Google Scholar

[ref31] 31. Landis JR, Koch GG. The measurement of observer agreement for categorical data. Biometrics. 1977;33(1):159–174. pmid:843571
View Article
PubMed/NCBI
Google Scholar

[100] View Article

[101] PubMed/NCBI

[102] Google Scholar

[ref32] 32. Gururangan S, Marasović A, Swayamdipta S, Lo K, Beltagy I, Downey D, et al. Don’t Stop Pretraining: Adapt Language Models to Domains and Tasks. In: Proceedings of ACL; 2020.

[ref33] 33. Schmidt AL, Zollo F, Scala A, Betsch C, Quattrociocchi W. Polarization of the vaccination debate on Facebook. Vaccine. 2018;36(25):3606–3612. pmid:29773322
View Article
PubMed/NCBI
Google Scholar

[105] View Article

[106] PubMed/NCBI

[107] Google Scholar

[ref34] 34. Cossard A, Morales GDF, Kalimeri K, Mejova Y, Paolotti D, Starnini M. Falling into the echo chamber: the Italian vaccination debate on Twitter. In: Proc. ICWSM; 2020. p. 130–140.

[ref35] 35. Wang Q, Hu S, Du F, Zang S, Xing Y, Qu Z, et al. Mapping global acceptance and uptake of COVID-19 vaccination: A systematic review and meta-analysis. Communications medicine. 2022;2(1):113. pmid:36101704
View Article
PubMed/NCBI
Google Scholar

[110] View Article

[111] PubMed/NCBI

[112] Google Scholar

Figures

Abstract

Introduction

Related work

Dataset construction

Vaccination stance classification

Annotation of stance of tweets

Text and graph-based stance classification

Experiments on vaccination stance classification

Settings.

Results.

Analysis

Distribution of users’ stances

Transition in polarization between vaccination stances

Stance formations of users

Information-sharing behaviors associated with stance formation changing stances

What kinds of users were referred to by users who formed their stances?

What types of external sites were shared by users who determined their stances?

Conclusion

References