Figures
Abstract
As short videos dominate digital communication, this study examines audience emotional expression in this evolving media landscape. Focusing on the popular Chinese video “My Second Uncle”, we collected bullet subtitles—user comments displayed on-screen—and related comments. Using Latent Dirichlet Allocation (LDA), we identified key themes, while SnowNLP and ChatGPT facilitated an in-depth emotional analysis. Findings show a nuanced audience “gaze” toward the protagonist, with a strong trend of embracing his inspirational story, which generated predominantly positive emotional responses. Unlike traditional “gaze” research, engagement here was active and voluntary, with viewers using innovative online language. The study also reveals that influencer-driven short videos can quickly foster positive group sentiments, underscoring their potential for emotional healing and mental well-being.
Citation: Xiao R, Guo X, Xie X (2026) How audiences express emotions towards influencer short videos: Sentiment analysis of ‘my second uncle’. PLoS One 21(1): e0341797. https://doi.org/10.1371/journal.pone.0341797
Editor: Alejandro Botero Carvajal, Universidad Santiago de Cali, COLOMBIA
Received: April 7, 2025; Accepted: January 13, 2026; Published: January 27, 2026
Copyright: © 2026 Xiao et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: Data are available from Kaggle (URL: https://www.kaggle.com/datasets/xyxyxxxy/myseconduncledata; DOI:https://doi.org/10.34740/kaggle/dsv/12422728).
Funding: This work was supported by Major Humanities and Social Sciences Research Projects in Zhejiang Higher Education Institutions (Grant No. 2023QN031); Research Start-up Funding Project in Ningbo University of Technology (Grant No.2022KQ43).The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
In the rapidly evolving digital media landscape, internet celebrities play a pivotal role in shaping online culture and user engagement, particularly on short-video platforms. By June 2022, China’s short-video user base had surged to 962 million, underscoring the medium’s extensive reach and influence [1]. Internet celebrities have become central to this phenomenon, with their content profoundly shaping Chinese users’ digital experiences and preferences.
However, the essence of their influence lies not merely in broad dissemination but in their capacity to trigger deep emotional resonance and collective gaze. A prime example is the viral short video “Three Days Back in the Village, My Second Uncle Cured My Mental Exhaustion”, which spread widely across major social platforms in mid-2022 and evoked intense emotional responses. Such moments of viral transmission demonstrate that influencer videos have become focal points for public emotional expression and convergence.
Therefore, this study aims to transcend superficial observations of the viral phenomenon and delve into the emotional interaction mechanisms between its content and audience. Specifically, using the “My Second Uncle” video as a case study, it explores how audiences emotionally engage with viral short videos, focusing on expressions of empathy and spectator behaviors. By analyzing audience sentiment data on Bilibili, this research not only aims to reveal how viewers establish emotional connections with content and how these emotions shape online interactions but also seeks to examine the evolving dynamics of ’gaze’ and empathy within the new media context of short videos. While existing research has examined emotional expressions within videos, deeper analysis is needed on the dynamic processes of audience sentiment over time and across topics, as well as their implications for public discourse. The findings offer insights for media practitioners, policymakers, and mental health professionals seeking to understand the nuanced emotional narratives of digital audiences.
This paper proceeds as follows: Section 2 provides a literature review, Section 3 outlines the methodology, Sections 4 and 5 present the results of LDA and sentiment analysis, and Section 6 discusses “gaze” and empathy in short videos. Section 7 concludes the study.
Literature review
Research on public emotional expression and analysis in the network
With the rise of social media and short-video platforms, online emotional expression has become more dynamic and complex. Research primarily examines the features and factors influencing emotional expression on the internet, revealing the diversity and interactivity of audience emotions in short-video contexts. On user-generated content (UGC) platforms, real-time engagement through bullet subtitles, posts, and replies allows audiences to express emotions through symbols, numbers, emojis, and graphics [2], forming an “emotional atmosphere” with rapid dissemination and collective resonance [3]. This emotional resonance, often quick to ferment public opinion, strengthens community sentiment and cohesion around specific events [4]. Emotional expression, therefore, functions as an exploration of an event’s meaning, creating a collective response from the “bottom up” [3,5,6].
The social context of each platform also shapes emotional expression: on closed platforms like WeChat, expression tends to be conservative, while on open platforms like Weibo, it is more direct and intense [2]. Additionally, emotional diffusion varies by emotion type, with anger spreading more rapidly than others [7]. Economic factors also play a role, with higher economic development correlating with reduced positive emotional expressions and mitigation of fear [8]. Emotional expression, while beneficial for consensus-building, can also amplify extreme emotions, posing challenges for public opinion governance [9].
Building on existing studies, this paper uniquely examines the emotional expressions of audiences within the context of influencer short videos, providing a novel analytical perspective and research value. While previous literature has largely focused on the general characteristics of emotional expression on UGC platforms, including the use of symbols, emojis, and varied modes of expression in open-platform environments, this study emphasizes the unique influence and emotional interaction mechanisms specific to influencer content creators. Influencer short videos, which are often more closely aligned with the everyday lives of ordinary people, tend to generate deeper emotional resonance and stronger emotional transmission chains—a specificity that has not yet been widely addressed in research.
Theoretical perspective and reflection of gaze
Gaze theory builds upon the Western concept of “visual centrism” and explores “seeing” from diverse perspectives. It examines how people form deep, often unconscious, connections with others through the act of gazing, positioning oneself in the viewpoint of the other, and projecting feelings such as self-regard, fear, or desire onto them [10]. Hegel was among the first to discuss gaze in Phenomenology of Spirit, considering how human beings initially construct a subject-object relationship through mutual “seeing” [11].
Sartre, Lacan, and Foucault each contributed foundational insights into gaze theory from the standpoint of vision. Sartre argued that the self and others confirm their existence through gaze, emphasizing, from an existentialist view, that meaning is established in the act of seeing, which, in turn, affirms the essential role of visual experience in human subjectivity [12]. Lacan, from a psychoanalytic perspective, proposed a nuanced relationship between the gaze and the human eye, where the eyes signify the subject and the gaze represents the object. He saw the gaze as a projection of the subject’s desires, a phenomenon that can lead the viewer to experience lust and fixation during the act of gazing [13,14]. Foucault further developed the theory by linking power with gaze, creating a framework of vision-knowledge-power, exemplified by his concept of the “medical gaze” [15].
Starobinski posited that gaze functions as a social behavior and an organizing force in social relationships, whereby the innate human impulse to see and desire to explore opens a “second vision.” This expanded vision reveals a field of infinite potential,metaphorically similar to lifting a veil, which Starobinski argues can lead to a power dynamic akin to the right of interrogation or control [16]. Vision, subjectivity, and power are therefore central to gaze theory. Danny also viewed gaze as a visual behavior rooted in social power dynamics, integral to understanding “social identity” [17].
At the same time, some scholars have critiqued the traditional understanding of gaze by seeking to dismantle its inherent power dynamics. Their work focuses on examining gaze as it relates to women and racial identities. Hooks suggested that “seeing” challenges established authority and can be a form of confrontation [18]. Gurnham (2018) has extended this critique by encouraging subjects to engage in a reflective, oppositional gaze, reshaping the viewer-object dynamic. This opposition reveals the inter play of rights, desires, and resistance that exists between the viewer and the viewed, framing a visual relationship that touches upon issues of gender, race, and culture and contributes to a broader theoretical approach in postmodern cultural studies [19].
In the digital era, Xu argues that gaze theory has evolved with the rise of live streaming, blurring traditional gender boundaries in the dynamics of looking and being looked at. Now, men, too, are increasingly subjects of the gaze [20]. This shift reflects a broader transformation in the logic of visibility under digital capitalism. As Schröter [21] proposes, the proliferation of visual content and algorithmic circulation has given rise to what he terms the “digital capitalist gaze,” in which subjects are constituted through their visibility in media networks. Chang [22] further deepens this analysis by showing how motion capture technologies render the human body as dynamic data, destabilizing traditional boundaries between the observer and the observed, and evoking a sense of algorithmic uncanny. Liang [23] introduces the notion of “co-gazing” to describe how the traditional disciplinary surveillance of the “Panopticon” is reconfigured in digital environments: individuals no longer simply resist being seen, but increasingly internalize and even pursue visibility as a form of social value. This framework resonates with broader discussions on affective labor and self-surveillance on digital platforms. These changes also complicate the role of empathy in mediated environments. Liu [24] points out that digital technology is not inherently an enabler of empathetic communication, but rather the structural condition in which such communication occurs. While digital affordances such as comment systems and emotional tagging may shape the expression and reception of emotion, their actual impact on empathy remains ambiguous.
Research on empathy
Empathy is a fundamental ability inherent in human nature [25]. It is distinct from sympathy, which is often a form of pity lacking objectivity. In contrast, empathy involves a rational emotional response—experiencing and understanding what an other person thinks and feels [25]. Raboteg-Saric identified two types of empathy: vicarious emotional responses to others and the self-perception of others’ inner states. The first, known as “Emotional Empathy,” refers to an empathetic emotional response [26].
Empathy is not a singular emotion but a complex psychological process. Neuroscientific research has shown that empathy is grounded in physiological mechanisms. Rizzolatti and colleagues discovered the existence of “mirror neurons” in the human nervous system, which play a crucial role in empathetic responses [27]. These neurons allow individuals to immediately feel the emotions conveyed by others when confronted with intense emotional stimuli. This neural mechanism helps form emotional connections between individuals, enabling them to synchronize their emotional responses [28]. The neural network involved in empathy matures with age [29].
However, some researchers have raised concerns about the role of mirror imitation in empathy. While mirror imitation may help humans understand others’ intentions, they argue that simply imitating actions does not necessarily imply a full understanding of those behaviors or a mental alignment with the emotions of others. If mirror imitation is central to empathy, how do mirror neurons determine which behaviors to “imitate” to facilitate empathetic connections [30]. In response, Wu [28] proposed that “love”underpins empathy, with its expression varying according to the objects of empathy and the situations in which it arises. He also emphasized that communication is key to cultivating empathy, which in turn plays a critical role in fostering a global community with a shared future.
While much existing research has focused on the “panopticon” framework, which typically views audiences as passive objects of surveillance, new trends in short videos featuring internet celebrities suggest a shift in the dynamics of “gaze.” In these videos, audiences are no longer just passive observers but actively express emotions and engage in self-disclosure. This creates a “super-panopticon,” where traditional subject-object roles are blurred. This research explores two key questions: (1) How do audiences “gaze”at figures like the “second uncle” in influencer videos? (2) How do audiences generate empathy through comments and bullet subtitles? These inquiries aim to shed light on the nature of audience “gaze” and emotional expression—an area that has been largely unexplored in existing literature.
Methodology
Framework
The research framework of this paper, as illustrated in Fig 1, begins with the collection of comments or bullet screen data from the Bilibili platform. The collected text undergoes preprocessing: segmentation using the jieba library, removal of stop words, and elimination of meaningless symbols. Subsequently, the processed text is subjected to topic clustering using the Latent Dirichlet Allocation model [31] for generating topic probability distributions. LDA, based on the bag-of-words model, probabilistically selects topics and words within topics. By clustering words, LDA produces two distributions: topic-word probability distribution and comment-topic probability distribution. For the comment-topic probability distribution, the sentiment value of each comment is computed using the SnowNLP, thereby calculating the percentage of positive and negative sentiments under each topic. Comments are sorted by time, and SnowNLP is utilized to study changes in sentiment over time. Bullet screen data is sorted by progress bar, investigating changes in sentiment along the progress bar.
Experimental data and preprocessing
The study gathered primary comments and bullet subtitles from “My Second Uncle” on Bilibili, spanning from July 25 to August 25, 2022. A Python crawler program was developed for this purpose, resulting in the collection of 46,436 comments and 3,000 bullet subtitles. The data used in this study were collected from publicly available content on the Bilibili platform. All personally identifiable information was removed during data processing. Fig 2 illustrates the fluctuation in the number of comments overtime, showing a sustained low level after the 29th. The significant peak in comments observed around July 27–28 corresponds to the period when the “My Second Uncle”video achieved widespread virality on Bilibili. This surge in engagement was likely fueled by its powerful narrative resonating deeply with public sentiment, combined with active platform recommendations. The sharp decline in comments after July 29th and the subsequent sustained low levels are characteristic of the life cycle of viral content.This trajectory is generally attributable to several factors: the natural dissipation of the initial emotional impact, the shifting of public attention towards broader societal discussions or new media controversies, and the dynamics of platform algorithms, which often reduce the visibility of peak-performing content to promote the exposure of newer uploads.
Fig 3 displays the variation of bullet subtitle quantity along with the progress bar,grouped in intervals of 10 seconds.
Before the experiment commenced, data preprocessing was conducted. The comment texts often contain novel emoticons, which are rich in emotional polarity. Bilibili’s back end stores emoticons in text format. To prevent the emoticons from being segmented into multiple words with different emotional directions, all emoticon words from the comments were added to the segmentation dictionary. Additionally, besides removing stop words, some meaningless special symbols were eliminated. Because the“@” symbol is typically followed by a Bilibili user’s ID, which contributes little to sentiment recognition tasks, regular expressions were used to remove the “@” symbol and its subsequent ID names. Using the Gensim library in Python, an LDA model was constructed. Preprocessed words were fed into the LDA model for training, and the model’s effectiveness was evaluated using coherence metrics [32].
LDA topic clustering analysis of comments and bullet subtitles in ‘my second uncle’
Topic clustering of comment data
LDA clustering of comment data from the first five days
The LDA model was implemented using the Gensim library. We adopted the default hyperparameters provided by Gensim, where the Dirichlet priors were set to α = 1.0/num topics and β = 1.0/num topics. Using the comment data from the first five days for LDA topic clustering, Fig 4 illustrates the variation of coherence scores with the number of topics. From the graph, it is evident that when the number of topics is set to 6, the LDA model achieves the highest coherence score. Subsequently, as the number of topics continues to increase, the change in coherence score becomes minimal and does not surpass the maximum coherence value. Therefore, the optimal number of topics is determined to be 6. Under the scenario of 6 topics, the topic-word probability distributions are generated, providing the vocabulary distribution for each topic. Table 1 displays the topic-word distributions for the first five days of data, wherein the words under each topic are sorted in descending order of probability. The top 10 most relevant words for each topic are selected and used to label the topics. Retrieve the topic number for each comment data and tally the comment count for each of the six topics, as depicted in Fig 5.
LDA clustering of comment data from the first month
Using the first month of comment data for LDA topic clustering, Fig 6 displays the variation of coherence scores with the number of topics. From the graph, it is evident that when the number of topics is set to 8, the LDA model achieves the highest coherence score. Subsequently, as the number of topics increases, the change incoherence is minimal and does not surpass the maximum coherence value. Therefore,the optimal number of topics is determined to be 8. Under the scenario of 8 topics, the topic-word probability distributions are generated, providing the vocabulary distribution for each topic. Table II presents the topic-word distributions for all comments, wherein the words under each topic are sorted in descending order of probability. The top 10most relevant words for each topic are selected and used to label the topics. By comparing and analyzing Tables 1 and 2, it can be observed that with the passage of time, the number of topics that the audience is interested in also increases.
Retrieve the topic to which each comment belongs and obtain the count of comments for each of the eight topics, as shown in Fig 7.
Topic clustering of bullet subtitles
LDA is applied in the topic clustering of bullet subtitles data. Fig 8 shows how the coherence varies with the number of topics. The coherence of the LDA model is the largest when the number of topics is set to 3. The coherence gradually decreases as the number of topics continues to increase. Hence the optimal number of topics is 3. The topic-term probability distribution is generated when the number of topics is 3. Then the distributed vocabularies of each topic are obtained.
Table 3 shows the topic-term distribution of the data. The words of each topic are sorted from large probability to small one. The top 10 most relevant words for each topic are selected and the topics are labeled concerning them.
Each bullet subtitle is classified into the corresponding topic. Fig 9 shows the bullet subtitle number of the three topics.
Sentiment interpretation
Sentiment analysis of comments
SnowNLP was utilized to compute the sentiment value of the comments using its built-in sentiment analysis module, which is trained on user-generated content.Comments surpassing 0.5 were classified as positive, while those below 0.5 were considered negative. We also adopted a more nuanced approach to sentiment analysis by incorporating a broader spectrum of complex emotions. As shown in Table 4, we designed prompt instructions for ChatGPT to classify user comments into ten fine-grained emotion categories: nostalgia, sadness, praise, like, anger, surprise, fear, motivation, neutral, and reflection.
Sentiment analysis on the thematic dimension
Calculating the sentiment values of a total of 46,048 comments from July 25th to July29th, we obtained 37,754 positive comments and 8,294 negative comments, accounting for 81.99% and 18.01% respectively. Thus, the proportion of comments with positive sentiment significantly outweighs those with negative sentiment. Table 6 presents the distribution of emotional categories in user comments over five days. While neutral comments are the most common, the emotion of motivation stands out among the meaningful categories, indicating that many viewers felt inspired or encouraged by the video.
Clustering the comments from the first five days resulted in six topics, with the percentage of positive and negative sentiment for each topic shown in Fig 10. It can be observed from the figure that the proportion of positive sentiment comments exceeds that of negative sentiment comments for all topics. The highest proportion of positive sentiment comments is for Topic 5, while the lowest is for Topic 2. This indicates that audiences express predominantly positive sentiment in their comments; however, compared to other topics, there is a higher prevalence of negative sentiment projected onto Topic 2.
Sentiment analysis on the temporal dimension
To illustrate the temporal evolution of sentiment in comments, a bar chart was plotted with dates on the horizontal axis and the number of comments on the vertical axis, as shown in Fig 11. From the graph, it can be observed that both the number of positive sentiment comments and negative sentiment comments exhibit an increasing followed by decreasing trend over time, although the number of negative sentiment comments each day is significantly lower than that of positive sentiment comments. Fig 12 displays the percentage of positive and negative sentiment comments over the five-day period. It can be seen from the graph that the proportion of positive and negative comments remains relatively stable for the first three days, while the proportion of negative comments gradually increases on July 28th and 29th.
Sentiment analysis of bullet subtitles
For the bullet subtitles, we set the sentiment classification threshold of the SnowNLP to 0.44 to distinguish between positive and negative sentiment orientations. This threshold was determined based on an analysis of representative samples. Specifically, manual examination revealed that a sentiment score of 0.440219 corresponded to a clearly positive textual expression, whereas a score of 0.4390476 reflected an evidently negative sentiment. Given the minimal difference between these two values and their consistent semantic polarity, a threshold of 0.44 was selected as a conservative cutoff point to ensure robust classification performance.
Emotion analysis based on topics
Emotional analysis of bullet subtitles reveals 2,751 positive and 249 negative comments, accounting for 91.70% and 8.30% respectively. Therefore, the proportion of positive bullet subtitles far exceeds that of negative bullet subtitles. The complex emotions distribution is shown in Fig 13. We can see that the majority of bullet comments are categorized as nostalgia, reflecting a strong collective memory and emotional resonance triggered by the video. In contrast, other emotions such as sadness, motivation, and reflection appear much less frequently.
By clustering bullet subtitles, three topics have been identified, with the proportion positive and negative sentiment for each theme shown in the following figure. As depicted in Fig 14, the proportion of positive bullet subtitles exceeds that of negative bullet subtitles for all topics. The highest proportion of positive bullet subtitles is observed in Topic 1, while the lowest is in Topic 3. This suggests that audiences predominantly express positive emotions in bullet subtitles; furthermore, compared to other topics, audiences project more positive emotions towards Topic 1.
Emotion analysis based on the progress bar of the video
At intervals of every 10 seconds, calculate the average and cumulative values of emotion within each 10-second period, and plot the trend graphs as shown in Figs 15 and 16,respectively. From Fig 15, it can be observed that bullet subtitles mostly fluctuate within the positive sentiment range, with only a small number of time intervals exhibiting negative bullet subtitles, among which the sentiment of bullet subtitles in the[290,300) interval is the most negative, while the sentiment of bullet subtitles in the[380,390) interval is the most positive. The [290,300) interval and preceding scenes(barrages have a certain lag) describe the emotional life of the protagonist’s uncle,involving financial issues. The [380,390) interval and preceding scenes describe the protagonist’s grandmother’s life, involving perspectives on life and death (‘Illness and aging are necessary performances between life and death’), as well as the uncle’s attitude and actions towards ‘elderly care’.
From Fig 16, it can be observed that the cumulative sentiment remains stable within the interval [10, 580), reaching its peak in the interval [590, 600). As seen in Fig 3 earlier, the interval [590, 600) has the highest number of bullet subtitles, which contradicts our intuitive understanding. Through observation, it is evident that many users watch the video multiple times and leave more bullet subtitles at the beginning and end of the video. Therefore, the number of bullet subtitles is generally highest at the beginning and end of a video. Looking at the interval [590,600) and the preceding content, we can see: “He is such a genius, just born in the wrong era. He should have been nurtured by the country, and his fame should have spread far and wide. It’s such a pity, too regrettable.” However, my second uncle is not regretful, “The second happiest person is the one who never looks back.”
“Gaze” practice and empathetic communication in internet celebrity short videos
The practice and pattern of audience “gazed” at the second uncle
The research found that there is a certain degree of bias in the audience’s gaze towards the short video “My second uncle”. This bias is mainly reflected in the greater attention given to abnormal emotional life, traditional elderly care, and my second uncle’s optimistic attitude towards life. Audiences also tend to express emotions, forming a specific “gaze” practice and pattern.
This paper, through an empirical study of the short video “My second uncle,”combined with thematic classification and sentiment analysis of comments and bullet screen messages, summarizes the patterns of audience gaze on the protagonist, my second uncle, as shown in Fig 16. Bentham used the “panopticon” to describe modern power. Poster proposed the concept of the “Super Panopticon,” suggesting that,compared to the early “panopticon,” the “Super Panopticon” is based on digital technology. Its surveillance scope is broader, and its methods are more flexible.Individuals are monitored and disciplined unconsciously, willingly accepting this surveillance. Each individual participates in a self-construction process, transforming themselves into the subjects of surveillance in the Super Panopticon. Bauman believes that surveillance in modern society is a fluid, liquid form of monitoring. The mode of discipline has shifted from “rational” obedience, rule-following, and conformity to“irrational” proactivity, risk-taking, experimentation, and self-assertion, as well as the pursuit of emotion, pleasure, and entertainment [33].
Irrational emotional factors are an important component of this type of fluid, liquid surveillance. Particularly in digital communication practices, highly visual and interactive gazes become a new environment for emotional expression. In light of this,the gaze patterns in the short video “My Second Uncle” can be summarized as components, communication environment, expression methods, and emotional focus, as shown in Fig 17. Meanwhile, the far right end of this mode points to emotional governance, involving the need for platforms and regulatory authorities to strengthen the management of grotesque and harmful emotional expressions in comments and bullet subtitles. Additionally, a large amount of positive emotion can serve as an important resource for empathetic communication, applicable to specific communication fields.
From the perspective of its components, the short video platform provides a liquid surveillance scenario. The audience, by watching the story of “My Second Uncle”through the short video, plays the role of the “observer”; My Second Uncle, positioned at the center of the “Super Panopticon,” assumes the role of the “observed.” Comments and bullet subtitles are the specific ways in which the audience monitors the second uncle, consciously or unconsciously projecting their own experiences, transforming them into emotional expressions.
From the perspective of the communication environment, sophisticated video editing techniques allow the audience to quickly browse through the second uncle’s life experiences, compressing time and space. The application of intelligent communication technology enables each viewer to see others’ comments and understand their emotional inclinations, creating a subjective environment for emotional expression. To a certain extent, when viewers post comments or bullet subtitles, they are influenced by both the content they are viewing and the emotional expressions of other viewers. Additionally,sentiment analysis of the comments and bullet subtitles reveals that viewers have a significantly positive emotional response to the second uncle. This positive emotion flows within and outside the video and comments, permeating the overall communicative atmosphere. It is evident that communication technology greatly constructs the environment in which viewers “gaze” at the second uncle, making him the subject of the audience’s “liquid surveillance.” Simultaneously, the analysis of viewers’ positive emotional values indicates that viewers are also being “positively disciplined” by the second uncle.
From the perspective of expression methods, the audience mainly conveys their emotions and responds to the narrative through comments and bullet subtitles. For example, in Topic 5, keywords are “bull, living, victory, movie, doge, strive, overcome all difficulties, OK, determined, poison.” Other projective expressions include terms like“hexagonal warrior” (meaning an all-rounder). Additionally, there are innovative non-verbal expressions, such as the “Protect Carrot Crying” emoji on the Bilibili platform. These methods are used by the audience to express positive emotions. This mode of expression demonstrates a willingness to accept and internalize the narrative.On one hand, the audience monitors the second uncle’s story, while on the other hand,they willingly engage in self-reflection, sharing their own experiences and feelings, thus transforming into the observed.
From the perspective of emotional focus, the audience pays attention to four types of gaze: physical symbols, life experiences, life trajectory, and talent abilities. The audience naturally sympathizes with the second uncle’s physical disability, and coupled with his exceptional talent, it creates a strong contrast that elicits intense emotional investment from the audience. The second uncle’s legendary life experiences and tumultuous life trajectory, narrated in the short video, evoke a sense of historical vicissitudes, striking a chord with the audience’s deeply buried emotional pain points,and triggering emotional resonance.
Expanding applications of empathetic communication driven by‘gaze’ practices
Through posting and reviewing comments and bullet subtitles, the audience engages in the practice of gaze, which is a collective and emotional process. In the interaction within short videos, like-minded individuals gather to discuss, with comments and bullet subtitles reinforcing each other and flowing in a unified direction. However, this group of strangers, influenced by group thinking, may lead many to rely less on their own information and more on discussions with others or others’ viewpoints. This phenomenon, known as group polarization as described by Sunstein [34], results in a“cascade effect” that amplifies and disseminates empathetic communication. Based on the findings of this study, the second uncle is at the center of the group’s gaze in the short video. His physical symbols, life experiences, life trajectory, and talent abilities receive intense attention from the audience. Centered around the second uncle’s “inspirational story of suffering,” the audience invests positive emotions. Simultaneously,the audience completes self-gazing and mutual gazing through others’ comments and bullet subtitles. Through extensive discussion, a “cascade effect” is formed, resulting in empathy.
In the practice of “gaze” in short videos, the second uncle is the fixed object of the“gaze,” with his character image and storytelling being the focal points. Compared to traditional “gaze,” the practice of “gazing” at the second uncle reflects a relatively balanced power structure. The audience does not surveil from a superior position but engages in a mutual “gaze” as fellow ordinary people, displaying a positive motivational effect. Comments and bullet subtitles in the practice of gaze are crucial bridges guiding the audience towards empathy.
As suggested by the title of the short video “Three Days Back in the Village, My Second Uncle Cured My Mental Exhaustion,” the story of the second uncle shared through short video platforms has sparked public participation and viewership. Viewers compare and reflect on their own experiences, contemplating their mental exhaustion and seeking balance and emotional support from the second uncle’s story. In this practice of gaze, the boundaries between the “self” and the “other” are blurred because the audience often gazes at the second uncle from their own perspectives and experiences. This reflective process, which extends one’s self to others, incorporates both sympathy and empathy, thus facilitating empathetic communication. The phenomenon of short videos attracting large groups of viewers with positive emotions in a short time has become a significant resource for empathetic communication. This offers new samples and insights for related theoretical and practical research.
Conclusion
This study analyzes audience emotional data from the Bilibili platform to reveal how viewers establish emotional connections with short video content,and how such emotions shape online interactions and to explore the dynamic evolution of ’gaze’ and empathy practices within the new media context of short videos.
Findings reveal that audiences engage asynchronously with the ordinary human stories in My Second Uncle through screens. Though not expecting responses, they persistently express emotions, forming a practice of cross-spatial emotional expression rooted in connection. Against the specific social backdrop of the COVID-19 pandemic, this story elicited collective ’gazing’. Audiences engage in self-reflection through gazing and express emotions via comments and bullet screens. This interaction sparks discussions, mutual influence, and a ‘chain reaction effect’,triggering collective emotional release and empathy within a short time frame.
Compared to previous studies on online ‘gazing’, this research demonstrates that audience gazing practices and theories in viral short videos have evolved: Audiences at the center of the ‘panopticon’ exhibit proactivity and voluntariness, actively expressing emotions through comments and bullet screens—even engaging in self-disclosure—adding new dimensions to the gazing structure. The boundaries between’surveillance’ and ‘discipline’ in gaze practices are increasingly blurred, while the rigid distinction between gaze subjects and objects gradually dissolves. New characteristics emerge collectively through social context, technology (intelligent algorithms, mobile internet), expression methods (comments and bullet screens), surveillance direction (bidirectional), and disciplinary approaches (shifting from passive to active).
Simultaneously, this study experimentally assessed the emotional value of audience comments and bullet screens, revealing the dissemination effects of influencer short videos from a viewer perspective. This provides valuable insights for understanding audience focus, expression methods, emotional trends, and optimizing platform algorithm recommendations. Findings indicate that the overwhelming majority of viewers hold positive emotions toward ‘Second Uncle’ videos and experience positive motivation, demonstrating the content’s significant positive guiding value. This suggests that evaluating short video content should not rely solely on interaction metrics but should also focus on its emotional and social significance; ‘emotional value’ can serve as an effective indicator for guiding content evaluation and platform governance.
Based on these findings, we recommend that platforms optimize algorithms to identify and promote content that offers emotional comfort and healing despite its melancholic undertones, rather than simply demoting content containing sadness or sensitive elements. Simultaneously, policymakers should establish regulatory frameworks and incentive mechanisms to encourage the production and dissemination of digitally created content that is emotionally constructive and socially positive, fostering a healthy and inclusive online media environment.
This study has the following limitations: First, this study employs a single-case analysis approach, focusing on the short video ‘My Second Uncle’ and its dissemination on the Bilibili platform during a specific period. This methodology facilitates an in-depth examination of audience interaction mechanisms within a particular cultural and platform context, with findings offering valuable reference points for similar emotionally resonant short videos. However, inherent limitations of the research method restrict the broad applicability and generalizability of the conclusions. Therefore, future research could further explore commonalities and differences in short video dissemination across platforms and sociocultural contexts through cross-cultural comparisons. Alternatively, expanding to multi-case comparisons or large-scale quantitative studies could enhance the universality and explanatory power of research conclusions.
Second, due to technical limitations, this study could not determine whether the comments and bullet subtitles were influenced by capital or platform manipulation. Future research could collaborate with multiple platforms to obtain more comprehensive user information and conduct comparative analyses of audience emotional expression data across platforms. Additionally, subsequent research could integrate effective AI-based comment detection tools to refine data screening and mitigate potential biases in the data set.
Third, this study did not address the possibility that Bilibili’s algorithm may influence the visibility of user comments. Specifically, the platform may prioritize displaying certain comments or suppressing others based on factors such as interaction metrics, posting time, or semantic content, leading to a bias in the comment data set toward specific expression types. Future research should focus on eliminating the impact of Bilibili’s algorithm on data visibility to enhance the effectiveness of sentiment analysis.
Finally, future research could also focus on gender as a significant social dynamic factor to explore its role in audience reception of such videos and emotional engagement processes, thereby deepening our understanding of audience interaction mechanisms in digital spaces.
References
- 1. The Cyberspace Administration of China PRC. The 50th statistical report on internet development in China. https://www.cnnic.cn/n4/2022/0916/c38-10594.html
- 2. Zhang LJ, Zhang B. A study on the practice of public reflective emotion in the reversal of internet public opinion. New Media Res. 2022;8(12):12–5.
- 3. Zeng LH. The emotional spread function of cultural TV programs from the perspective of affordances. China Telev. 2021;12(11):53–7.
- 4. Cui X, Xu Q. The roles of identity and emotion in media events’ social integration mechanism: a case study of the 2017 U.S. presidential inauguration. Atlantic J Commun. 2019;28(2):138–52.
- 5.
Tooby J., Cosmides L. The evolutionary psychology of the emotions and their relationship to internal regulatory variables. Guilford Press; 2008.
- 6. Medugorac 6 V, Schuitema G. Why is bottom-up more acceptable than top-down? A study on collective psychological ownership and place-technology fit in the Irish Midlands. Energy Res Soc Sci. 2023;96(31).
- 7. Zeng R, Zhu D. A model and simulation of the emotional contagion of netizens in the process of rumor refutation. Sci Rep. 2019;9(1):14164. pmid:31578405
- 8. Bian HX, Xu T. Evolution of public sentiment during COVID-19 pandemic. Data Anal Knowled Dis. 2022;12(7):128–40.
- 9. Guo 9 X A. The utilization and reflection of emotional resources in public opinion guidance. J Mass Commun Month. 2019;3(12):27–37.
- 10.
Burke P. History of image evidence. Peking University Press; 2008.
- 11.
Wang MA. Cultural studies keywords. Nanjing: Jiangsu People’s Publishing House; 2007. 77–82.
- 12.
Sartre K. Existence and nothingness. SDX Joint Publishing Company; 1997.
- 13.
Lacan T. The gaze as object petit a’ contained in the spectacle of visual culture. Renmin University of China Press; 2005. 47–53.
- 14.
Dai JH. Film theory and criticism. Peking University Press; 2007. 103–12.
- 15.
Foucault PF. Power and subjectivity. Culture and Art Publishing House; 2001. 59–61.
- 16.
Starobinski J. Pope’s veil. Social Science Literature Publishing House; 1995. 336–55.
- 17.
Danny C. Cultural theory key words. Jiangsu People Press; 2006. 78–101.
- 18.
Hooks B. The gaze of resistance: the black female beholder. Peking University Press; 2009. 98–101.
- 19. Gurnham D. Ched Evans, rape myths and Medusa’s gaze: a story of mirrors and windows. Inter J Law Context. 2018;14(3):454–68.
- 20. Xu JH, Zhang RK, Guo JY, Gong JK. Bullet screen and discipline: Gender gaze in webcast based on Douyu big data. Chinese J Journal Commun. 2022;7(4):146–67.
- 21. Schr̈oter J. The digital, capitalist gaze. Digital Cult Soc. 2021;7(2):245–64.
- 22. Chang V. Catching the ghost: the digital gaze of motion capture. J Visual Cult. 2019;18(3):305–26.
- 23. Liang JJ. Selfie, surveillance, and the gaze: viewing practices and social implications in the digital era. Dongyue Tribune. 2025;46(2):35–43.
- 24.
Liu YJ. Empathy communication in the digital age: theoretical framework, impact, and mechanisms. In: Humanities and social sciences edition. Journal of Southwest Minzu University; 2024;45(6):167–78.
- 25.
Joe M. Empathy: you are stressed because you have no empathy. Beijing: Beijing United Publishing Company. 2017.
- 26. Raboteg-Saric Z, Hoffman ML. Empathy and Moral Development: Implications for Caring and Justice. Contemp Sociol. 2001;30(5):487.
- 27. Rizzolatti G, Craighero L. The mirror-neuron system. Annu Rev Neurosci. 2004;27:169–92. pmid:15217330
- 28. Wu F. An analysis on the theoretical foundation and practical approach of empathetic communication. Journalism & Communication. 2019;26(5):59–76.
- 29. Pan YG, Liu YL, Ma JL, Ran GM, Lei H. Neurological underpinnings of empathy. Adv Psychol Sci. 2012;20(12):2011–21.
- 30.
Gregory H. Mysterious mirror neurons. Zhejiang People’s Publishing House; 2016.
- 31.
Krestel R, Fankhauser P, Nejdl W. Latent dirichlet allocation for tag recommendation. In: Proceedings of the third ACM conference on Recommender systems, 2009. 61–8. https://doi.org/10.1145/1639714.1639726
- 32.
R̈oder M, Both A, Hinneburg A. Exploring the space of topic coherence measures. In: Proceedings of the eighth ACM international conference on Web search and data mining. 2015. 399–408.
- 33.
Bauman Z, Lyon D. Liquid surveillance: a conversation. John Wiley and Sons; 2013.
- 34.
Sunstein F. Extreme crowds: the psychology of group behavior. Beijing: Xinhua Publishing House; 2010.