Identifying and characterizing superspreaders of low-credibility content on Twitter

doi:10.1371/journal.pone.0302201

Table 1.

Classification scheme utilized during the process of manually annotating superspreader accounts.

An account’s political affiliation was recorded if an annotator classified that account as political. The same was done for hyperpartisan accounts in certain other categories, such as media and journalists.

More »

Expand

Fig 1.

Top: The effect of removing accounts that created low-credibility posts during January and February 2020 (observation period) on the proportion of untrustworthy content present during the following eight months (evaluation period). Nodes (accounts) are removed one by one from a retweet network in order of ascending rank, based on the metrics indicated in the legend. The remaining proportion of retweets of low-credibility posts is plotted versus the number of nodes removed. The lowest value for all curves is not zero, reflecting the fact that approximately 13% of the low-credibility retweets in the evaluation network are by accounts who did not create low-credibility posts during the observation period. Bottom: Likelihood that the difference between the performance of h-index and Influence happened by random chance. The most prolific superspreaders according to these two metrics remove a similar amount of low-credibility content. To compare them for any given number of removed accounts, we conduct Cramer von Mises two-sample tests with increasingly larger samples and plot each test’s P-value on the y-axis. After removing more than 50 accounts (gray area) the Influence metric performs significantly better (P < 0.05). The difference is not significant if fewer accounts are removed.

More »

Expand

Fig 2.

Classification of superspreader accounts.

A large portion (55.1%) of accounts are no longer active. For each class annotated with political affiliations, colors indicate the ideological split. The last group aggregates all accounts with political affiliations.

More »

Expand

Fig 3.

Low-credibility content sharing behavior of superspreaders (points) as captured by the boxplot distribution of the ratio r_m.

Users identified via the h-index share a significantly higher ratio of untrustworthy sources than those identified with the Influence metric.

More »

Expand

Fig 4.

Distributions of language toxicity scores for superspreaders vs. all accounts in the low-credibility content ecosystem.

More »

Expand

Fig 5.

Relationship between suspension, verified status, and popularity of top 250 superspreaders.

Top: Percentage of suspended superspreader accounts that are verified. Bottom: Percentage of suspended superspreader accounts based on numbers of followers.

More »

Expand