Small means immoral? The impact of spatial size metaphor on moral judgment

Weirui Xiong; Jiaxin Wang; Jiayi Li

doi:10.1371/journal.pone.0324991

Abstract

This study aims to explore the unconscious relationship between moral concepts and the spatial dimension of size, as well as to examine whether the unknown size of a room influences participants’ moral cognitive judgments within the framework of embodied cognition. Study 1 and Study 2 investigate participants’ unconscious biases. Specifically, participants exhibited faster response times when judging moral concepts presented in large fonts and sizes and immoral concepts presented in small fonts and sizes, compared to when moral concepts were presented in small fonts and sizes and immoral concepts in large fonts and sizes. Study 3 employed a moral dilemma task, revealing that participants placed in a large room evaluated characters in a story more morally under the embodiment effect than those in a small room. Collectively, these three studies demonstrate that the unconscious psychological relationship between moral concepts and the spatial dimension of size influences individuals’ abstract moral judgments under embodied cognition.

Citation: Xiong W, Wang J, Li J (2025) Small means immoral? The impact of spatial size metaphor on moral judgment. PLoS One 20(5): e0324991. https://doi.org/10.1371/journal.pone.0324991

Editor: Kevin Schilbrack, Appalachian State University, UNITED STATES OF AMERICA

Received: January 17, 2025; Accepted: April 30, 2025; Published: May 27, 2025

Copyright: © 2025 Xiong et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: All relevant data are within the paper and its Supporting information files.

Funding: The author(s) received no specific funding for this work.

Competing interests: The authors have declared that no competing interests exist.

Introduction

In the process of social development, morality serves as a core element of human behavior and values. Its significance lies in regulating individual conduct and maintaining social order. But how do we comprehend the abstract concept of morality in our daily lives? In what ways do we associate moral concepts with tangible aspects of life?

Within the framework of embodied cognition, Lakoff and Johnson [1] proposed the Conceptual Metaphor Theory (CMT), which suggests that the conceptual systems underlying human thought, cognition, and behavior are structured by metaphors. This theory posits that understanding occurs through a process of mapping one domain onto another. Specifically, two key domains—the source concept and the target concept—are mapped onto each other to facilitate comprehension. A source concept originates from a direct experience in a person’s daily life, whereas a target concept is an abstract and vague idea, entity, or emotion [1]. According to CMT, understanding target concepts is facilitated by drawing on well-known and clearly represented prior experiences (source concepts).

In the Chinese linguistic landscape, the use of size-related words is closely associated with emotional valence. The adjectives “big” (da) and “small” (xiao) frequently denote individual moral attributes. For instance, in Chinese, phrases such as da du (generous), xiao qi (petty), da xiong huai (large-heartedness), and xiao xin yan (pettiness) are commonly used to describe an individual’s moral disposition. Similarly, in English, certain words such as large, wide, and broad are used to describe moral behavior, while words such as small, narrow, and thin are often associated with immoral behavior. People comprehend abstract concepts not only through concrete notions of size but also by recognizing the correlation between size and emotional valence.

Meier, Robinson, and Caven [2] investigated the correlation between the conceptualization of size and affective valence. Their study found that participants could understand abstract concepts through concrete representations of size. Specifically, the results showed that larger stimuli (as opposed to smaller stimuli) were associated with more positive moral evaluations. For example, in studies 1 and 2, when positive words were presented, participants judged words in larger fonts with faster reaction times, quicker evaluations, and higher accuracy compared to words in smaller fonts. The opposite trend was observed for negative words. Building on these findings, Study 3 demonstrated that the larger the font, the more positively the word was evaluated, indicating that the size effect influences lexical valence judgments. The larger font of the words conveyed a more positive meaning, illustrating the metaphorical association between ‘big and positive’ and ‘small and negative“. Martijn, Henk, and Ruud [3] examined the desire to achieve the goal and found that participants would increase the perceived size of an object, which would help achieve the goal. For instance, the researchers hypothesized that when a participant was deprived of the freedom to drink water, they would perceive a cup as larger. When the behavioral goal concept was paired with positive emotion, the motivation to engage in the action goal increased, resulting in the target tool used (e.g., shovel) being perceived to have a bigger size.

With the development of experiential philosophy and phenomenology, Merleau-Ponty [4] emphasized that the importance of the body and senses arises within organizational behavior. Furthermore, conceptual representation, as a fundamental and primary cognitive ability of humans, is inherently linked to the individual’s perceptual-motor system and cannot exist independently of it [5]. Embodied cognition refers to the necessity of using somatic experience to comprehend complex problems and abstract concepts. Moral judgment is fundamentally a process of abstract value-based cognitive assessment. Research on conceptual metaphor has demonstrated that an individual’s somatic perceptions influence cognitive judgments through metaphorical mediation. Witt [6] found that the size of paddles used to block balls can influence how quickly users perceive the speed of the balls. Lakens, Semin, and Garrido [7] identified a metaphorical correlation between individuals’ conceptualization of time and horizontal space, where the “left” is associated with the past and the “right” with the future. This finding suggests that embodied bodily perceptions influence abstract value judgments. Regarding the experimental procedure, participants were required to provide an optimal rating for each word. Within the framework of the purification metaphor, Ding and Wang [8] observed that participants primed with self-dirtiness exhibited heightened reactivity to negative moral emotion words compared to those primed with self-cleansing. However, no significant differences were found in participants’ judgment responses to non-moral emotional words following self-cleansing or self-dirtiness priming. This suggests that the concept of self-dirtiness, influenced by bodily experiences, facilitates the processing of negative moral emotion words. Additionally, in the experimental procedure, participants were not given a time limit to respond to the words, suggesting that they had ample time to process them thoroughly. Lu, Guo, and Jiang [9] examined the existence of a consistent psychological phenomenon linking moral concepts with the dimension of size. The first two experiments in their study revealed that participants responded more quickly when moral terms were presented in a large font. Conversely, categorical assessments were inhibited when moral words appeared in a small font. Additionally, participants exhibited faster judgment responses to immoral words displayed in a small font. In their research procedure, participants were given 2500 ms to respond to each word. Most studies are conducted at the conscious level. Therefore, the existence of an unconscious mental association with moral conceptual size-space metaphors has yet to be empirically substantiated, nor has their metaphorical consistency been rigorously examined within the context of complex, embodied moral judgment tasks. Consequently, Study 1 examined the psychological reality of metaphors linking moral concepts to large and small spaces at an unconscious level using a variant of the Stroop paradigm. In an unconscious context, participants named matching patches more quickly than non-matching ones, demonstrating the typical Stroop effect [10]. Study 2 used the variation of the Implicit Association Test (IAT) paradigm to explore further at an unconscious level whether the processing of moral concepts links to the participant’s perception of big and small spaces. The purpose of IAT is to establish the intrinsic connection between the target concept and the source concept so as to evaluate people’s implicit attitude towards a specific object Participants are required to classify pairs of stimuli into mutually exclusive categories. Without prior evaluations of the target concept or source concept, performance on this task is expected to be equal for both conditions. However, if participants have some pre-existing associations of the target concept with the source concept, their pre-existing associations can interfere with the classification task [11]. Both experiments controlled reaction time to prompt quick responses from participants, allowing for the inference of unconscious reactions.

In metaphorical studies of embodied moral concepts, prior research has primarily focused on weight metaphors [12–16], color metaphors [17–19], and temperature metaphors [20–22], among others. However, to date, few studies have systematically examined the role of size-space metaphors in the context of embodied moral concepts. Embodied cognition theory holds that cognitive activities are not only inseparable from the brain but are also closely linked to the interaction between the body and the environment. The brain is embedded in the body, and the body is embedded in the environment, forming an integrated cognitive system [23]. Research has shown that individuals represent and process conceptual knowledge through simulations of sensorimotor experiences [24]. For instance, when processing the concept of “apple,” the brain activates multimodal sensory information, including visual (red color), tactile (smooth texture), and gustatory (sweet taste) inputs, to construct an understanding of the concept. Consequently, Study 3 delves into the exploration of how size-space perceptions link to an individual’s moral judgments by unconsciously activating the participants’ perceptual information.

In conclusion, the linguistic manifestation of moral concepts through size-space metaphors is well-documented. Scholars, including Lu et al. [9], have substantiated the psychological validity of size metaphors at the conscious level. Consequently, the present study aims to explore the unconscious realm to investigate the psychological foundations of the metaphorical representation of large and small spaces. Furthermore, it seeks to determine whether the uncertainty of room size influences participants’ moral cognitive judgments within the framework of embodied cognition. These studies aim to expand knowledge on how humans make moral judgments and how these judgments are embodied. The anticipated contributions lie in bridging gaps between unconscious cognitive processes and embodied metaphor theory, potentially informing interdisciplinary discussions in moral psychology, linguistics, and environmental design.

Study 1: The psychological reality of spatial metaphors of size for moral concepts through Stroop

Participants

According to calculation using G*power 3.1 [25], for the two-way repeated measures analysis of variance applicable to this study, with a significance level of α = 0.05, a large effect size (f = 0.4), and a predicted statistical power (1-β) of 0.8, the required total number of research subjects was 22. We recruited 43 university students, including 25 women. All participants provided informed consent before the experiment. All participants were right-handed, had normal or corrected-to-normal visual acuity (>1.0), met the experimental criteria, and did not have dyslexia. All studies were approved by the Ethics Committee of Chongqing Normal University. All informed consent forms were signed before participants completed the questionnaire for evaluation and began the studies. All experimental procedures adhered to the ethical guidelines outlined in the Declaration of Helsinki.

Materials and methods

The experimental materials consisted of moral and immoral words selected from the Dictionary of Modern Chinese Frequency. A separate group of 34 participants who did not take part in the formal experiment assessed the familiarity and moral valence of these words using a 7-point Likert scale (1 = very immoral to 7 = very moral).

The morality scores of the moral words were significantly higher than the midpoint of the scale (4) (M = 6.23, SD = 0.56), while the morality scores of the immoral words were significantly lower than the midpoint (M = 1.78, SD = 0.45), with a statistically significant difference, t(33) = 36.05, p < 0.001. The mean familiarity scores for both moral and immoral words were above 5, and the difference in familiarity scores between moral words (M = 6.03, SD = 0.81) and immoral words (M = 5.70, SD = 0.75) was not statistically significant, t(33) = 1.76, p = 0.082. Based on these evaluations, 20 moral and 20 immoral words were ultimately selected as experimental materials.

Research design

The experiment employed the spatial Stroop paradigm within a two-factor, within-subjects design, structured as a 2 (lexical type: moral words, immoral words) × 2 (font size: large font, small font) factorial arrangement.

Procedure

The experiment was programmed using E-Prime 2.0, consisting of both practice and formal experimental sessions.

To initiate the task, a red “+” attention cue appeared at the center of the screen for 500 milliseconds. Subsequently, 40 words were presented randomly, one at a time, in the center of the screen. Each word appeared twice, varying in font size and sequential position. Participants categorized words as moral by pressing the “F” key and as immoral by pressing the “J” key. The assignment of response keys for moral and immoral classifications was counterbalanced across participants. If a response was not made within 1500 milliseconds, the word automatically disappeared, followed by the reappearance of the red “+” cue for another 500 milliseconds to signal the next trial. Prior to commencing the formal experiment, participants underwent an 8-trial practice phase to acclimatize themselves to the experimental procedures and operational protocols. They indicated their preparedness to initiate the formal experiment by pressing the “P” key. Participants unfamiliar with the task could return to the practice phase at any point by pressing the “Q” key to review the process before beginning the formal experiment. The experimental system automatically recorded all relevant data.

Data analysis

Data screening and processing.

After the experiment was completed, data were processed following the standard requirements of the Stroop paradigm [10] using SPSS 20.0. Participants whose accuracy rate fell below 80% were excluded from the dataset. Additionally, response times that deviated by more than 2.5 standard deviations from the mean were removed. The exclusion process ensured that no more than 3% of the total data were discarded, preventing any significant loss of data. The remaining valid dataset, consisting of 35 participants, was analyzed using repeated measures ANOVA.

Descriptive statistical analysis.

Descriptive statistics of participants’ response times and accuracy rates for different lexical judgment conditions are presented in Table 1.

Download:

Table 1. Different vocabulary type judgments with words.

https://doi.org/10.1371/journal.pone.0324991.t001

As shown in Table 1, when the font size was consistent, the judgment reaction time for moral materials (567 ± 53.2 ms) was significantly quicker than that for immoral materials (620 ± 85.6 ms). Conversely, when the type of vocabulary materials was consistent, varying font sizes influenced participants’ judgments, with moral words in a large font being processed significantly faster (567 ± 53.2 ms) than those in a small font (587 ± 67.1 ms). Additionally, the reaction time for immoral words in a small font (599 ± 66.9 ms) was significantly faster than that for immoral words in a large font (620 ± 85.6 ms).

When the font size was consistent, moral words were judged correctly more often (0.98 ± 0.04) than immoral words (0.93 ± 0.03). When the type of vocabulary materials was the same, moral words in a large font (0.98 ± 0.04) were judged correctly more often than those in a small font (0.96 ± 0.04). Similarly, immoral words in a small font (0.96 ± 0.03) were judged correctly more often than those in a large font (0.93 ± 0.03).

Analysis of variance for repeated measurements of reaction time.

The dependent variables of the experiment were the response time and accuracy of participants’ judgments of the lexical material. ANOVA results are presented in Table 2. The main effect of lexical word type was significant: in the task involving different lexical types of material, participants judged moral words (577 ms) faster than immoral words (609 ms), F(1,34) = 24.622, p = 0.000 < 0.01, η²_p=0.420.

Download:

Table 2. Response times in different task conditions.

https://doi.org/10.1371/journal.pone.0324991.t002

The interaction between vocabulary type and font size was significant, F(1,34) = 19.858, p = 0.000 < 0.01, η²_p = 0.369. For moral words, participants’ categorical judgment of words presented in a large font was significantly faster than for those presented in a small font, F(1,34) = 9.708, p = 0.004 < 0.01, η²_p = 0.222. For immoral words, participants’ categorical judgment of words presented in a small font was significantly faster than for those presented in a large font, F(1,34) = 10.179, p = 0.003 < 0.01, η²_p = 0.230.

ANOVA for repeated measures of accuracy.

A repeated measures ANOVA was conducted on participants’ accuracy; the results are presented in Table 3. The main effect of vocabulary type was significant: participants were more accurate when judging moral words (97%) than when judging immoral words (94%), F(1,34)=12.320, p = 0.001 < 0.01, η²_p = 0.266.

Download:

Table 3. Accuracy in different task conditions.

https://doi.org/10.1371/journal.pone.0324991.t003

The interaction between vocabulary type and font size was significant, F(1,34) = 12.803, p = 0.001 < 0.01, η²_p = 0.274. The results indicate that when the experimental task involved immoral words, participants made correct judgments more frequently with a small font than with a large font, F(1, 34) = 13.222, p = 0.001 < 0.01, η²_p = 0.280.

In summary, there is a psychological association of “morality with big and immorality with small.” Study 2, building on these findings, will further investigate the psychological validity of spatial metaphors of size for moral concepts using the IAT.