Fig 1.
Emojis are ordered by the number of occurrences N. The average position ranges from 0 (the beginning of the tweets) to 1 (the end of the tweets). pc, c ∈ {−1, 0, +1}, are the negativity, neutrality, and positivity, respectively. is the sentiment score.
Table 1.
Correlations are between the occurrences of emojis in the Emoji Sentiment Ranking and Emojitracker, for two minimum occurrence thresholds. The numbers in parenthesis are the emojis that are common in both sets. The correlation values, significant at the 1% level, are indicated by *.
Fig 2.
Sentiment map of the 751 emojis.
Left: negative (red), right: positive (green), top: neutral (yellow). Bubble size is proportional to log10 of the emoji occurrences in the Emoji Sentiment Ranking. Sections A, B, and C are references to the zoomed-in panels in Fig 3.
Fig 3.
Emojis in sections A, B, and C of Fig 2.
Shown are emojis that occur at least 100 times in the Emoji Sentiment Ranking. Panel A: negative emojis, panel B: neutral (top) and bipolar (bottom) emojis, panel C: positive emojis.
Table 2.
Sentiment of tweets with and without emojis.
For each set, the mean, sd and sem are computed from the distribution of negative, neutral, and positive tweets.
Table 3.
Inter-annotator agreement on tweets with and without emojis.
The agreement is computed in terms of three measures over a subset of tweets that were labeled by two different annotators.
Fig 4.
Distribution of emojis by sentiment score.
The mean sentiment score of the 751 emojis (in bins of 0.05) is +0.305.
Fig 5.
Distribution of occurrences and sentiment of the 751 emojis.
The emojis are ranked by their occurrence (log scale). The column color indicates the sentiment score. The partitioning into two equally weighted halfs is indicated by a line at R1/2. The first 33 emojis are zoomed-in in Fig 6.
Fig 6.
Column color represents the emoji sentiment score.
Table 4.
Comparison of the more-frequent with the less-frequent emojis.
The emojis (r) ranked by occurrence N(r) are partitioned into two halves with approximately the same cumulative number of occurrences.
Fig 7.
Average positions of the 751 emojis in tweets.
Bubble size is proportional to log10 of the emoji occurrences in the Emoji Sentiment Ranking. Left: the beginning of tweets, right: the end of tweets, bottom: negative (red), top: positive (green).
Fig 8.
Negativity, neutrality, and positivity regressed with position (from left to right).
The trendlines are functions pc(d) of the distance d from the beginning of the tweets.
Table 5.
Emoji sentiment in different languages.
The languages are ordered by the number of different emojis used. Correlations are between the sentiment scores of emojis in the 13 languages and the Emoji Sentiment Ranking. The correlation values, significant at the 1% level, are indicated by *.
Table 6.
Tweets annotated for sentiment in different languages.
Languages are in alphabetical order, Ser/Cro/Bos denotes a union of tweets in Serbian, Croatian and Bosnian.
Table 7.
Types and numbers of emoji symbols.
[So] is an abbreviation for the Unicode category ‘Symbol, Other’.
Table 8.
Overlaps and differences for emojis from the three data sources.
A table entry is the number of emojis in (∈), or missing (∉) from a data source. N(Single, Double) denotes the total number N of emoji symbols, partitioned into the Single- and Double-character symbols, respectively.
Fig 9.
Sentiment bars of the ‘thumbs down sign’, ‘flushed face’, and ‘chocolate bar’ emojis.
The colored bar extends from −1 to +1, the range of the sentiment score . The grey bar is centered at
and extended for
, but never beyond the range of
. Colored parts are proportional to negativity (p−, red), neutrality (p0, yellow), and positivity (p+, green).
Table 9.
Coincidence matrix for tweets with emojis.
Table 10.
Coincidence matrix for tweets without emojis.