Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

The German translation of the Oxford utilitarianism scale: Validation and the impact of the Covid-19 pandemic on the observations

  • Aistė Ambrasė ,

    Roles Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Validation, Visualization, Writing – original draft, Writing – review & editing

    aiste.ambrase@med.uni-tuebingen.de, aiste.ambrase@gmail.com

    Affiliation Department of Psychiatry and Psychotherapy, Women’s Mental Health and Brain Function, Tübingen Center for Mental Health (TüCMH), University of Tübingen, Tübingen, Germany

  • Malte Hendrickx,

    Roles Conceptualization, Formal analysis, Methodology, Resources, Writing – original draft, Writing – review & editing

    Affiliation Philosophy Department, University of Michigan, Ann Arbor, United States of America

  • Melina Grahlow,

    Roles Data curation, Formal analysis, Investigation, Methodology, Project administration, Writing – original draft, Writing – review & editing

    Affiliation Department of Psychiatry and Psychotherapy, Women’s Mental Health and Brain Function, Tübingen Center for Mental Health (TüCMH), University of Tübingen, Tübingen, Germany

  • Hong Yu Wong,

    Roles Conceptualization, Supervision, Writing – review & editing

    Affiliations Werner Reichardt Centre for Integrative Neuroscience, University of Tübingen, Tübingen, Germany, Department of Philosophy, University of Tübingen, Tübingen, Germany

  • Birgit Derntl

    Roles Conceptualization, Funding acquisition, Methodology, Resources, Supervision, Writing – original draft, Writing – review & editing

    Affiliations Department of Psychiatry and Psychotherapy, Women’s Mental Health and Brain Function, Tübingen Center for Mental Health (TüCMH), University of Tübingen, Tübingen, Germany, German Center for Mental Health (DZPG), Partner Site Tübingen, Tübingen, Germany, LEAD Research School and Graduate Network, University of Tübingen, Tübingen, Germany

Abstract

The study of utilitarian inclinations is probably the most experimentally investigated aspect of morality. The Oxford Utilitarianism Scale has been developed to provide a self-report tool for reliable measurement of utilitarian views while addressing serious methodological issues with previous measures. In this study, we have translated and validated a German version of the Oxford Utilitarianism Scale (OUS-DE). The scale consists of two subscales: Impartial Beneficence (IB-DE) and Instrumental Harm (IH-DE). We conducted a procedure in a general German sample (NS1 = 378, 243 women, Mage = 25.37) before the Covid-19 pandemic. A confirmatory factor analysis demonstrated a good fit of a two-factor model for OUS-DE, while internal consistency and construct reliability were acceptable. Both in the pre-pandemic and the post-pandemic sample (NS2 = 348, 206 women, Mage = 24.61) we found a sex/gender difference, with women scoring significantly higher in the IB-DE subscale than men. We also found that the mean agreement with the IB-DE subscale decreased after the pandemic. In a separate third sample (NS3 = 39, 19 women, Mage = 23.72), we observed an inverse U-shape relationship between moral behavior related to quarantine requirements and the IH-DE subscale, as measured during the peak pandemic restrictions in late 2020. Repeated OUS-DE measurement in this sample showed stability in responders’ utilitarian beliefs post-pandemic. In sum, OUS-DE is the first available measurement of utilitarian inclinations in German. The scale will enable further research on how utilitarian preconceptions affect behavior in German-speaking populations.

Introduction

Utilitarianism is a moral theory that holds that the morally correct action is the one that brings about the most good for the greatest amount of people [13]. As a consequentialist theory, it is often distinguished from other moral theories by its focus on the consequences of actions (as opposed to, e.g., intentions, rights, or virtues). Due to Utilitarianism’s commitment to intriguing departures from commonsense morality, agreement with utilitarian principles in the general population has been extensively studied in psychology and neuroscience [46].

Utilitarianism deviates from the commonsense morality in two important ways. First, in contrast to the requirement of non-interference in commonsense morality, e.g., “not to harm”, “not to steal” [711], Utilitarianism controversially instructs one to use harm in an instrumental way in situations where the aggregate outcome would outweigh the cost. For example, it may be morally required to sacrifice a single bystander to divert a trolley racing toward a group of five [12].

Utilitarianism, like commonsense morality, posits a moral obligation to act beneficially towards those in need [13]. However, in a second radical departure from commonsense morality, Utilitarianism advocates for a radically impartial stance in beneficence. It rejects preferential consideration for individuals based on their relation to or distance from the moral agent [14]. For example, Utilitarianism would require choosing a charity based on objective needs rather than national preferences. The departures from commonsense morality with regard to instrumental harm (IH) and radically impartial beneficence (IB) suggest a 2-D model of utilitarian psychology [15].

While Utilitarianism has received more attention in psychology and neuroscience than any other moral theory, its measurement has typically relied on a caricaturized view of Utilitarianism. Agreement with Utilitarianism was operationalized only in terms of agreement of subjects to sacrifice the wellbeing and life of others in moral dilemmas. This led to a situation where both true moral utilitarians and antisocial individuals are classified as utilitarian [1620]. Utilitarianism is not well captured by an exclusive focus on IH [21]. In fact, IB is considered a distinctive feature of all consequentialist theories, of which Utilitarianism is the most prominent [22].

As a self-report measure, the Oxford Utilitarianism Scale (OUS; [23]), aims to address this problem by representing both IB and IH, thus assessing personal agreement with the main principles of utilitarian moral theory. The questionnaire comprises two subscales: Impartial Beneficence (IB, 5 items) and Instrumental Harm (IH, 4 items) representing the two tenets that distinguish Utilitarianism from other moral views. The two subscales are independent and dissociable as they show no consistent association with each other. Agreement to both is necessary to infer that the moral agent is indeed guided by Utilitarianism and not another moral theory. The two subscales can also be used separately when investigating associations between the two constructs and individual differences as well as other moral inclinations [23].

In the original OUS validation study [23] and further studies on utilitarian morality [2426] positive weak to moderate correlations between agreement with the original OUS or either of its subscales and agreement with utilitarian choice in moral dilemmas emerged, indicating content validity of the scale. This evidence was reported in sacrificial dilemmas [24,25], indicating that the measurement generally tracks utilitarian attitudes towards IH, and in impartiality dilemmas [26], showing that the whole IB subscale can capture the impartiality principle well.

Since its publication, OUS has proven to be a valuable research tool in assessing utilitarian inclinations in the general population as well as specific specialist groups such as healthcare professionals [27,28]. OUS has been translated and validated into Turkish [29], French [30], and Spanish [31]. Its structural validity was assessed in Chinese, French, German, Greek, Hungarian, Italian, Polish, Portuguese, Romanian, Russian, Serbian, Slovak, and Turkish languages [32]. Most translated and validated versions of OUS confirm the two-factor solution. However, in some languages the translations resulted in a different factor structure [32], indicating either that the translations should be revised and further validation procedures carried out to confirm the two-factor construct of Utilitarianism or that cultural influences should be considered.

Associations between utilitarian inclinations and other moral dispositional traits.

It’s focus on beneficence and minimization of the suffering when harm is involved places Utilitarianism in the moral domain of Harm/Care. Moral domains are sets of morally relevant issues, such as care and harm, fairness, loyalty, authority, sanctity and purity, and they can be measured by the Moral Foundations Questionnaire [33,34]. In a US study, it has been shown that extraordinary altruists highly endorse IB and are more concerned with the Harm/Care than regular controls [35]. In a Polish study, agreement to Harm/Care mediated the significant relationship between religiosity and IB, showing that in different countries cultural and individual differences might be uniquely affecting people’s utilitarian inclinations.

Concern for care also links Utilitarianism to empathy. In moral philosophy, it has been assumed that empathy, the understanding and reception of otherness, is necessary for care behaviors [36,37]. It has been shown that individuals who exhibit higher empathic concern endorse IB to a greater extent and IH to a lesser extent [15,23,30,38]. This pattern of associations between the two components of Utilitarianism has been also demonstrated in other-benefiting altruistic tasks and moral dilemmas where Utilitarianism requires IH [3941]. However, the relationship between OUS and its subscale scores with empathy were reversed in the Turkish validation of the OUS [29], indicating again that more research is required. As religiosity fosters beneficent behaviors, a positive association between frequency of religious practice and the IB subscale has been found [23,42], while no associations were established between religiosity and behavior during sacrificial harm moral dilemmas [43].

Some authors have claimed that utilitarians are 1) more morally competent because utilitarian choices are based on deliberation processes rather than intuition or emotion [4446] and that 2) their choices sometimes are driven by higher action-taking tendencies or lower aversion to harm, rather than a true endorsement of utilitarian principles [39,4751]. OUS has not yet been compared to such constructs. Doing so might provide novel insight into utilitarian psychology.

Associations between utilitarian inclinations and sociodemographic variables.

The current state of research on OUS indicates that sociodemographic differences are associated with consistent individual differences in agreement with utilitarian views. Some authors claim that socialization, culture, and biological sex differences lead to different moral inclinations [50,52,53]. For example, some report that women more strongly internalize their moral beliefs, affecting self-evaluations and moral behavior [54,55]. In the original OUS validation study [23], authors found a trend that men endorsed IH to a greater extent than women, which accords with previous findings in sacrificial moral dilemmas [50,51,5659] and translation studies of OUS [31]. A higher endorsement of IB has been also found in women [31]. This is similar to studies assessing a related construct, altruism, with women more inclined on average to choose altruistic options than men [6062].

Moral inclinations also differ between different age groups, even in adults [6367]. Agreement with IB was found to increase with age in China, Spain, and Chile [31,68]. This pattern is consistent with age effects in altruism [69]. However, in a Romanian sample, younger adolescents scored higher on IB than their older underaged counterparts, indicating that the age effect might not be linear [70]. Regarding IH, agreement to this subscale increased with age in China [68] but decreased in Spain and Chile [31]. A consistent age effect has been demonstrated in utilitarian choices during moral dilemmas, where older adults endorse IH to a lower degree than younger adults [66,71]. The heterogeneity of results on age effects in assessing utilitarian inclinations with OUS indicates that country-related age effects should be investigated.

Effects of Covid-19 pandemic on utilitarian inclinations.

During the Covid-19 pandemic, which started in March 2020 and ended in May 2023 [72], many countries adopted public pandemic policies based on utilitarian principles [7375]. Germany’s strict pandemic regulations provide ample examples: group/society’s interests were prioritized over interests of individuals, cost/benefit analyses were based on the principle of the greater good in political decisions, and impartiality principles were applied in medical care [76,77]. In daily life, individuals were exposed to personal moral dilemmas, in which the utilitarian option was both personally and socially beneficial [78,79].

This increased exposure to situations where utilitarian choices brought about the largest benefit to society might have altered people’s perceptions of Utilitarianism. Previous research has shown that individuals revise their moral beliefs when their judgments run counter to majority opinion [80] or when imagining a situation in which a proposed contradictory action would be moral [81]. Furthermore, when moral situations are presented in a practical context, as was often the case during the pandemic, individuals tend to be more accepting of moral solutions, even when these solutions contradict their moral views [82]. Taking them together, investigating the effects of Covid-19 pandemic on agreement with OUS and its subscales would provide a valuable addition to research on stability of moral beliefs.

Associations between utilitarian inclinations and utilitarian behavior.

As OUS measures individuals’ self-reported moral beliefs, a question arises whether the measurement can predict real-life utilitarian behavior. Previous research reported that general moral values and beliefs are only weakly associated with actual moral behavior [83]. It has been shown that the association between moral inclinations and the corresponding moral behavior is mediated by co-occurrent moral emotions, moral courage, and surrounding context [8487]. However, the OUS items are highly specific and criterion-matched to target the distinctive features of Utilitarianism, and methodological research shows that such measures have higher predictive validity than measurements targeting broader constructs [88]. In line with the specificity criterion, the Covid-19 pandemic presented a prime opportunity to study the degree to which OUS predicts moral behavior. During the pandemic, individuals were expected to follow quarantine rules, designed to minimize suffering and benefit a greater number of people while demanding personal sacrifices [78]. Therefore, it is possible that the OUS would have had stronger associations with behavior during the pandemic, as the pandemic provided a conceptually corresponding context.

Current studies

Here we report three studies which measured utilitarian inclinations using OUS. The primary aim of Study 1 was to provide a validated German translation of OUS questionnaire (OUS-DE). Our second aim was to investigate the potential influence of demographic variables, such as age, sex/gender, and religiosity, as well as effects of personality and character differences on agreement with OUS-DE, reported in Study 2. Additionally, as the circumstances allowed, in Study 2 we also intended to investigate the effects of the Covid-19 pandemic on the agreement with OUS-DE, comparing observations from two samples collected before and after the pandemic. Finally, in Study 3 we sought to examine the stability of utilitarian inclinations in the same sample with repeated measures during and after the pandemic as well as studying the relationship between utilitarian moral beliefs and self-reported moral behavior and judgement during the pandemic.

Study 1: Translation and validation of the German Oxford Utilitarianism Scale

The aim of this study was to validate the German translation of the OUS (OUS-DE). We have implemented a validation procedure similar to that done for the original OUS study [23]: we examined the factor structure with confirmatory factor analyses (CFA), assessed the internal consistency, and the split-half reliability of OUS-DE. To establish criterion-related validity of the scale, we implemented the same procedure as the original study by using moral dilemma scenarios related to sacrificial harm and greater good, and self-reported agreement with Utilitarianism.

We hypothesized that a two-factor OUS-DE would reach comparable goodness-of-fit as the original English OUS, and that each of the subscales will show good to excellent model fit in CFA. We expected positive but weak correlations between the two subscales and a strong correlation between the score of each of the subscales and the overall OUS-DE score. OUS has already been translated and validated for use in the Turkish [29], French [30] and Spanish [31] languages, and the CFAs in those studies indicated an excellent two-factor model fit. However, structural validity of a different German translation of OUS was not comparable to the original scale in English language and the scale did not meet configural invariance, indicating that the factorial structure of that German translation was not stable across multiple groups [32]. This prompts the need for another translation and validation study.

Materials and methods

Sample description.

Five hundred five participants completed an online validation survey from 2019-05-14 to 2019-07-14. One hundred twenty-seven participants were excluded from further analyses: 21 participants reported a mother-tongue other than German, two participants failed two attention checks included in the survey, 20 participants completed the survey faster than the predetermined minimal time limit (< 20 minutes), one participant indicated non-binary gender, 70 participants self-reported a mental disorder diagnosis, and 13 participants were identified as outliers in univariate and multivariate outlier analyses (for more details, please see Results). Therefore, the final sample consisted of 378 participants (243 women, Mage = 25.37, SD = 6.52, for sociodemographic characteristics, please see Table 1).

thumbnail
Table 1. Sociodemographic Characteristics of the final sample.

https://doi.org/10.1371/journal.pone.0335215.t001

Materials.

Oxford utilitarianism scale.

The Oxford Utilitarianism Scale (OUS) consists of two subscales, i.e., Impartial Beneficence (IB, 5 items) and Instrumental Harm (IH, 4 items), comprised of 9 items in total [23]. The scale is scored on a Likert scale from 1 (strongly disagree) to 7 (strongly agree). Greater scores reflect a greater utilitarian inclination, that is, greater implicit agreement with main principles of utilitarian moral theory. The IB subscale measures the positive dimension of Utilitarianism concerning with duties of beneficence (3 items), treating immoral acts and failures to act morally (omissions) as equally morally wrong (1 item), and, importantly, moral impartiality (1 item). Moral impartiality is an important principle in utilitarian moral theory requires an individual to treat other moral agents equally, despite their social closeness [23]. The IH subscale assesses moral appropriateness of instrumental harm, i.e., harm to somebody used as a collateral to save or benefit a greater number of people (3 items) and short-term political oppression to ensure well-being of the citizens (1 item). Together, the two subscales measure general agreement with a utilitarian moral view and an overall agreement score can be calculated. On the other hand, the two subscales can be measured and their relationships with other measurements assessed separately, i.e., they are dissociable as measurements.

Moral dilemma scenarios.

Six moral dilemma scenarios – three sacrificial harm moral dilemmas that capture participants’ behavior in applying instrumental harm for the greater good, and three greater good moral dilemmas that capture participants’ self-sacrificial and impartial behavior – originally used in the validation study by Kahane [23] were used to measure utilitarian inclinations during moral decision-making. Study participants had to evaluate the moral appropriateness of the proposed solution to the sacrificial harm dilemmas by using a scale from 1 (not at all wrong) to 7 (absolutely wrong), with 1 indicating fully utilitarian judgment and 7 indicating fully non-utilitarian judgment. In greater good dilemmas the scoring was reversed, using a scale from 1 (absolutely wrong) to 7 (not at all wrong), with 1 indicating fully non-utilitarian judgment and 7 indicating fully utilitarian judgment.

Self-reported utilitarianism.

Participants self-reported whether they considered their view to be utilitarian after reading a paragraph, explaining the theoretical moral commitments of utilitarian and deontological moral theories on a scale from 1 to 10 (low score for deontological view, high score for utilitarian view). The paragraph on utilitarian moral theory was provided by the authors of the original OUS validation [23].

Procedure.

The Ethics commission of the Medical Faculty at the University of Tübingen approved the online survey (“Validation of German translation of Oxford Utilitarianism Scale”, project number 839/2018BO2).

The OUS and the validation dilemma scenarios as well as Utilitarianism description for self-reported agreement were translated to German by one of the authors, who specializes in moral philosophy and is a native speaker of German. Two independent English native speakers (one from the field of philosophy and one with non-philosophical background) with a very good knowledge of the German language performed back-translation. Any inconsistencies in translation were assessed by the authors of the study. Overall, the German translation of the OUS (OUS-DE) provided a good understanding of the questionnaire items as only minor inconsistencies in wording in the back-translations were observed. Original and translated versions of the questionnaire are depicted in Table 2.

The online study on the SoSciSurvey platform ([89], available at https://www.soscisurvey.de) was set up for the validation of the OUS-DE. Announcements for the online study were distributed by mailing lists on the University of Tübingen’s server as well as printed posters and flyers in the town of Tübingen. The online study ran from May to July 2019. The study was not preregistered.

In the online survey, respondents were first provided with information about the aims of the study, inclusion and exclusion criteria and had to provide written consent via digital button click after reading this information. They were also provided with data protection information for the anonymous survey format. After these procedures, the respondents filled out measurements in the following order: OUS-DE, six moral dilemma scenarios, self-reported agreement with Utilitarianism, Moral Competence Test (MCT, [90]), Moral Foundations Questionnaire (MFQ, [91]), the Saarbrücker Personality Questionnaire (SPF; [92]), the Harm Avoidance Scale (TCI-HA) from the Temperament and Character Inventory (TCI; [93]), and the Action Regulation Emotion Systems Scale (ARES; [94]). In this study only results from OUS-DE, moral dilemma scenarios and self-reported agreement with Utilitarianism will be reported. Original instructions from the questionnaires were included to let the respondents know how to fill out the measurements. Finally, they provided the following demographic information: sex/gender, age, education level, religious affiliation, religiosity, political ideology, mother tongue, existence of diagnosed mental disorders.

After completing the survey, respondents were also provided with an opportunity to win a retail voucher and could enter their e-Mail address into a competition. A voucher competition was used as part of the survey advertisement. Entries from the competition were recorded separately from the survey into a different repository and saved as an independent data file. No identification between survey answers and competition entries was possible as no other information than e-Mail addresses was recorded. The e-Mail addresses were used only for the purpose of voucher competition.

Data analysis.

Data analyses were performed with IBM SPSS Statistics 28.0.1.1, IBM SPSS Amos 28.0.0 (IBM, Chicago, USA), and R programming language [95] using the lavaan package [96].

Preliminary checks.

Data distribution for the OUS-DE was first checked for normality using descriptive statistics (skewness and kurtosis). Univariate outliers in OUS-DE responses were identified by Median Absolute Deviations (MAD) test (Tukey’s fence), calculated on the total OUS-DE score, while multivariate outliers were detected by the Mahalanobis Distance in linear regression.

Factorial validity.

Confirmatory factor analysis (CFA) was performed to evaluate the factorial structure of the OUS-DE using a maximum likelihood estimation. Both one-factor and two-factor models were tested, with the two latent factors allowed to intercorrelate as in the original OUS validation.

Model fit evaluation.

The goodness-of-fit of OUS-DE was assessed by several fit indices, selected based on the original OUS validation report [23] and methodological recommendations [97,98]. Model Chi-square (χ2) tests assess the difference between the observed and expected covariance matrixes, where goodness of fit is indicated by a non-significant test result at a p > .05. However, the χ2 test is sensitive to sample size, and in our case, it will possibly indicate a poor model fit due to a large sample size [99]. Therefore, we prioritized two incremental fit indices, the non-normed fit index (also called Tucker-Lewis’s index, TLI) and the comparative fit index (CFI) as they adjust for sample size issues (for both: ≥ .90 = acceptable; ≥ .95 = excellent). Additionally, we used the root mean square error of approximation (RMSEA) to measure the discrepancy between the observed covariance matrix and the hypothesized covariance matrix (≤.05 = good;.05–.08 = acceptable; [100]). Lastly, we used the standardized root mean square residual (SRMR) to measure the average of the standardized fitted residuals (≤.08 = acceptable; [100,101]). Parsimony of the measurement models was assessed by Akaike Information Criterion (AIC; [102]) and Bayesian Information Criterion (BIC; [103]). Lower values in both indices show better parsimony in the measurement [97].

Reliability.

Internal reliability of the measurement was assessed by split-half method (items split by odd and even positions), with Spearman-Brown and the Guttman split-half coefficients [104,105]. Internal consistency was evaluated using Cronbach’s alpha, where higher values (closer to 1) indicate stronger consistency, accounting for questionnaire length effects [106108].

Construct validity.

Pearson’s two-tailed correlations were calculated between the score of each OUS-DE subscale and the mean score of three corresponding moral scenarios (for more information see Materials of Study 1), as well as between the overall OUS-DE score and self-reported agreement with Utilitarianism.

Results

Data screening and outlier analysis.

Descriptive statistics were first used to assess data normality distribution in overall mean OUS-DE score, scores for each of the two subscales and every item in the scale separately. The mean OUS-DE score was slightly negatively skewed (S = −0.111). IB-DE scores were negatively skewed (S = −0.261), and IH-DE scores were positively skewed (S = 0.292). When each item was considered separately, skewness ranged from negative −0.402 to positive 0.45. Kurtosis statistic showed positive kurtosis for the overall mean OUS-DE score (K = 0.503). Kurtosis negatively but weakly affected IB-DE (K = −0.177) and IH-DE (K = −0.390). When each item was considered separately, kurtosis ranged from −0.999 to −0.399. All reported skewness and kurtosis statistics fall into the acceptable range for large samples [109].

The data was then assessed for univariate and multivariate outliers (see Data analysis section of Study 1). MAD test was performed on the sum of OUS-DE item scores for each participant and the analysis identified one univariate outlier. Mahalanobis Distance in linear regression identified 12 multivariate outliers. These univariate and multivariate outliers were removed from further analyses. After the procedure, final sample consisted of 378 individuals (243 women, Mage = 25.37). Mean responses to OUS-DE and its subscales are depicted in Table 3.

thumbnail
Table 3. Mean responses to the Oxford Utilitarianism Scale and its subscales.

https://doi.org/10.1371/journal.pone.0335215.t003

Psychometric properties of OUS-DE.

Confirmatory factor analysis (CFA).

CFA was performed on a dataset with 378 cases; no data was missing. Two different models were assessed and compared: 1) a one factor solution and 2) a two fixed factor solution based on the original questionnaire subscales, with covariance drawn between the two factors. Analyses used maximum likelihood estimation (10 iterations). The results are shown in Table 4 and Table 5.

thumbnail
Table 4. Results of Confirmatory Factor Analyses (N = 378).

https://doi.org/10.1371/journal.pone.0335215.t004

thumbnail
Table 5. Standardized factor loadings for OUS-DE in the CFA (N = 378).

https://doi.org/10.1371/journal.pone.0335215.t005

The default one-factor model clearly showed an inadequate model fit on all fit indices: χ2 = 564.0, df = 27, p < .001, TLI = 0.176, CFI = 0.382, SRMR = 0.186, RMSEA = 0.229, AIC = 12726.737, BIC = 12797.565. Similarly, standardized factor loadings indicated that IH-DE items fit poorly with the IB-DE items if one-factor solution is analyzed (see Table 5).

The two-factor model fit was better, though marginally below recommended cut-offs: χ2 = 92.317 df = 26, p < .001, TLI = 0.894, CFI = 0.923, RMSEA = 0.082, SRMR = 0.0627. Modification indices suggested that the model could be improved if residual covariances were modelled between questionnaire items IB-2 and IB-3 (MI = 32.453, Standardized EPC = 0.54), as well as IB-1 and IB-5 (MI = 32.111, Standardized EPC = 0.408). Incorporating these adjustments yielded a modified two-factor model with adequate fit: χ2 = 56.509 df = 24, p < .001, TLI = 0.944, CFI = 0.962, RMSEA = 0.06, SRMR = 0.0569. Lower AIC and BIC indices of the modified two-factor model also supported this modified model as the preferable solution. Standardized factor loadings (see Table 5) indicated that the two-factor solution for OUS-DE is appropriate (all loadings > .4).

Results of CFA for the two-factor model remained robust when the sample was split according to sex/gender (for analysis descriptions and CFA results in tables see Supporting Information S1 File).

Split-half reliability.

Split-half reliability testing was performed on all nine items in the scale. The method of odd-even trials was selected. It estimated a Spearman-Brown Coefficient of 0.71 and a Guttman Split-Half Coefficient of 0.7.

Internal consistency.

Internal consistency estimations resulted in Cronbach’s alpha for the whole questionnaire α = 0.671; for IB-DE α = 0.760; and for IH-DE α = 0.716. It is important to note that short questionnaires result in lower alpha values [108], therefore, overall internal consistency could be assumed as acceptable to good.

Construct validity.

Construct validity for this measurement was assessed by calculating Pearson’s correlations (two-tailed) between the overall OUS-DE score, IB-DE score, IH-DE score and self-reported Utilitarianism, as well as mean scores of two types of moral dilemmas: a set of dilemmas concerning with greater good and a set concerning with sacrificial harm. As in the original OUS validation study [23], we have assumed that if the OUS-DE truly measures utilitarian inclinations, moral appropriateness ratings in greater good dilemmas will be directly associated with the measures in IB-DE and behavioral results in moral appropriateness ratings in sacrificial harm dilemmas will be directly associated with the measures in IH-DE, and behavioral results in moral decision-making.

The correlation between IB-DE and OUS-DE was strong (r = 0.778, p < .001). The correlation between IH-DE and OUS-DE was also strong (r = 0.662, p < .001).

Greater good dilemmas.

Mean score of the greater good dilemmas positively correlated with OUS-DE score (r = 0.446, p < .001), IB-DE score (r = 0.544, p < .001) as well as with self-reported Utilitarianism (r = 0.392, p < .001), indicating consistent construct validity. Fig 1 and Table 6 illustrate these correlations.

thumbnail
Table 6. Correlations between OUS-DE, its subscales, moral dilemmas, and self-reported Utilitarianism.

https://doi.org/10.1371/journal.pone.0335215.t006

thumbnail
Fig 1. Construct validity analysis.

Correlations between responses to IB-DE subscale and greater good dilemmas (A) and between responses to IH-DE subscale and sacrificial harm dilemmas (B).

https://doi.org/10.1371/journal.pone.0335215.g001

Sacrificial harm dilemmas.

Mean score of the sacrificial harm dilemmas, as expected due to scoring difference in these dilemmas with lower scores meaning greater agreement with utilitarian solution, moderately and negatively correlated with OUS-DE score (r = −0.501, p < .001), IH-DE score (r = −0.64, p < .001), as well as with self-reported Utilitarianism (r = −0.204, p < .001), indicating consistent construct validity. Please see Fig 1 and Table 6 for illustration of the correlation.

Self-reported utilitarianism.

Self-reported Utilitarianism positively correlated with overall OUS-DE: (r = 0.399, p < .001) as well as IB-DE (r = 0.338, p< .001) and IH-DE (r = 0.231, p < .001). Please see Table 6 for these correlations.

Discussion: study 1.

In this study, we translated and validated the German version of the Oxford Utilitarianism Scale (OUS-DE). Our results indicate that the OUS-DE is a reliable measure of utilitarian inclinations in young healthy adults, making it an adequate research tool to address utilitarian moral inclinations.

Using a confirmatory factor analysis, a two-factor solution of OUS-DE achieved appropriate model fit. This supports previous findings from the original OUS validation study [23] as well as more recent validation studies of the Turkish [29], Spanish [31], and French [30] OUS versions. A study assessing structural validity for OUS in 15 languages found that a different German OUS translation required multiple modifications to achieve a two-factor solution [32]. In our study, two modifications for the two-factor model were also required to achieve an adequate model fit: in IB-DE subscale, we needed to draw covariances between items IB-2 and IB-3 as well as IB-1 and IB-5. IB-2 (kidney donation) and IB-3 (leg sacrifice) items measure the principle of beneficence, and in particular, one’s agreement with a positive duty to sacrifice one’s body part to save another person’s life in an urgent situation. Due to the similarity of the two items, covariance between these items could have been expected. Covariation between items IB-1 (impartiality) and IB-5 (donation) could also be expected as both items involve referencing of a positive duty to help and care for others. These covariations have not been discovered in the original OUS validation study [23] or other OUS translations. Further studies should investigate possible conceptual overlaps between these items.

It has been recently suggested that the IB subscale should be revised or separated into more fine-grained subscales to represent the general beneficence, self-sacrifice, and action and omission distinction [110]. While covariations between items in this subscale indicated possible conceptual similarity among them, based on our results, we do not recommend a complete revision of the subscale. It is a central commitment of Utilitarianism to consider omissions to help and active harm as commensurable wrongs, as utilitarians consider consequences to be the sole arbiter of wrongness [2,3,22]. Divorcing it from other aspects of Utilitarianism, including impartiality, would lead to a skewed representation of Utilitarianism in the resulting measurement.

Finally, internal consistency of the scale indicated that the translated version of the questionnaire performs comparably well as the original English version and other OUS translations. Construct validity of the translated version was supported by moderate bivariate correlations between OUS-DE and self-reported Utilitarianism, as well as moderate-to-strong correlations between OUS-DE subscales and thematically corresponding moral dilemmas. The latter correlations are comparable with the results of the original OUS validation (see Study 2 in [23]. Moderate-to-strong associations between observations of OUS-DE and moral dilemmas indicate that they are measuring the same construct reliably and might be immune to response style or social desirability effects [111,112].

Study 2: effects of sociodemographic and personality variables on agreement with OUS-DE

The aim of this study was to investigate the stability of utilitarian inclinations in the general population and the origins of individual differences in self-reported Utilitarianism. To assess the stability of utilitarian inclinations, we have collected responses to OUS-DE and its subscales in two different samples at two different time points: in 2019 and in 2023 when the last protective measures of the Covid-19 pandemic were phased out. Based on previous research [8082], we assumed that increased exposure to moral situations which require utilitarian solutions would increase agreement with OUS-DE, indicating that external circumstances can have an influence on moral beliefs in a general population.

Furthermore, we investigated individual differences in utilitarian inclinations among individuals due to their sex/gender, age, religiosity, and psychometric measures which conceptually are related to one’s moral dispositions such as moral competence, moral foundations, empathy, harm aversion and approach/avoidance trait. We expected to find a significant sex/gender difference with men reporting higher acceptance of IH-DE than women [50,51,5659]. We also expected to find a positive relationship between age and IB-DE as well as a negative relationship between age and agreement with IH-DE [31,68]. Furthermore, we anticipated to find a positive relationship between religiosity and IB-DE [23,42]. Finally, positive relationships were expected between agreement with Utilitarianism and moral competence [44], empathy and Harm/Care foundation [15,23,30,35,38], while weak negative relationships were expected between agreement with Utilitarianism and harm aversion as well as behavioral approach trait [39,4751].

Materials and methods

Sample description.

Sample 1.

A sample of N = 378 individuals from Study 1 was included in the analyses in Study 2. (Please see Sample description in Study 1 and Table 1).

Sample 2.

Five hundred twenty-six participants completed an online survey from 2022-10-04 to 2023-01-16. The survey consisted of the same measurements as the survey in Sample 1. One hundred seventy eight participants were excluded from further analyses: 32 participants due to a different mother-tongue, nine for failing the two attention checks, 27 for completing the survey below the predetermined minimal time limit (< 20 minutes), five due to non-binary gender identification, 96 due to a self-reported mental disorder diagnosis, three due to an error recording their answers in one of the questionnaire sections. Univariate and multivariate outlier analyses indicated another six participants to be excluded (for more details, please see Data analysis section in Study 1). The final sample consisted of 348 participants (206 women, Mage = 24.61, SD = 6.59). From this sample, 12 participants indicated that they had participated in our first validation survey in 2019, 29 were not sure, and the rest did not participate in the survey in 2019. Please see Table 7 for the sociodemographic characteristics of this sample.

Materials

Oxford Utilitarianism Scale – German translation.

For this study we used the German version of Oxford Utilitarianism Scale, validated in Study 1. The description and the properties of the scale remain exactly as for the OUS described in the Materials section in Study 1.

Moral Competence Test, German version.

The Moral Competence Test (MCT; [90,113,114]) is a behavioral test that measures cognitive ability to make consistent judgments about reasons for and acceptability of moral behavior in different moral situations even when the moral behavior opposes subject’s own moral views. The measurement consists of two moral dilemma scenarios and 12 judgment questions for each scenario. The degree of moral competence is measured by the C-score which ranges from 0 to 100, with higher scores indicating higher moral competence. A validated German version of the test was used in this study [113].

Moral Foundations Questionnaire, German version.

The Moral Foundations Questionnaire (MFQ-30; [115]) is a self-report measure that assesses the agreement with five foundational moral domains: Harm/Care: approval of harm prevention/relief and disapproval of cruelty and aggression, Fairness/Reciprocity: approval of reciprocal interaction, Ingroup/Loyalty: approval of group solidarity and distrust for outgroup members, Authority/Respect: approval of respect, duty and obedience to authority, and Purity/Sanctity: approval of spirit over body and disapproval of carnal passions [33]. The distinction of the five foundational domains resulted from ethnographic research and evolutionary psychology, and was further developed in the Moral Foundations Theory [116]. Scores range from 0 to 30 in each foundation, with higher scores indicating higher relevance of that foundation to the respondent’s moral intuitions and decision-making. Finally, the MFQ assesses moral progressivism. Progressivism is a social, moral, and political view, which aims to improve society by means of social reform and scientific as well as technological advancement [117]. Subject’s moral progressivism can be measured by subtracting the mean score of Ingroup/Loyalty, Authority/Respect, and Purity/Sanctity foundations from the mean score of Harm/Care and Fairness/Reciprocity foundations. A German version of this questionnaire provided by the official websites of creators and researchers of the MFQ and MFT at www.moralfoundations.org [115] was used in the study.

The Saarbrücker Personality Questionnaire.

The Saarbrücker Personality Questionnaire (SPF; [92]) is a German version of the Interpersonality Reactivity Index (IRI; [118]), a common measure of empathy. It consists of four subscales, namely, Perspective Taking, Fantasy, Empathic Concern, and Personal Distress, designed to represent different empathic faculties. The overall empathy score is calculated as a sum of the scores of the three subscales Perspective Taking, Fantasy, and Empathic Concern. The Personal Distress subscale is excluded from the empathy score as it measures emotion regulation in empathic situations. The Personal Distress score (negatively scored) can be subtracted from the overall empathy score to calculate the overall score of the questionnaire.

Harm Avoidance Scale from Temperament and Character Inventory, German version.

The Harm Avoidance Scale (TCI-HA) from the Temperament and Character Inventory (TCI; [93]) consists of 35 items and measures temperament traits related to harm-avoidant and anxiety-expressing behaviors, like action inhibition, avoidance of punishment, novel stimuli, or non-rewarding actions. Four subscales constitute the TCI-HA: anticipatory worry vs uninhibited optimism (HA1), fear of uncertainty vs confidence (HA2), shyness with strangers vs gregariousness (HA3), and fatigability and asthenia vs vigor (HA4). Overall, the TCI-HA score is calculated as a sum of the four subscales. Higher scores in the whole scale indicate higher harm aversion and higher anticipatory worry, higher fear of uncertainty, more pronounced shyness with strangers, as well as lower mental and physical energy.

Action Regulation Emotion Systems Scale.

The Action Regulation Emotion Systems Scale (ARES; 94) is a measure developed in Germany to measure the Behavioral Inhibition System (BIS) and the Behavioral Approach System (BAS), and is based on the BIS/BAS-Scale by Carver and White [119]. In ARES, the BIS scale assesses the subject’s proneness to inhibit behavior that might lead to negative outcomes or feelings. Similarly, BAS scale measures proneness to engage in behaviors that bring about positive (rewarding) outcomes and feelings. Additionally, ARES assesses proneness to anger as a reaction to failure of a goal-directed action in a separate subscale [94]. The long version of the questionnaire was implemented in this study (58 items), which consists of three scales: 1) BIS, which is further split into two subscales BIS I “Anxiety” and BIS II “Frustration”, 2) Anger, and 3) BAS, which is divided into two subscales BAS I “Drive” and BAS II “Gratification”.

Procedure.

The Ethics commission of the Medical Faculty at the University of Tübingen approved the online survey (“Validation of German translation of Oxford Utilitarianism Scale”, project number 839/2018BO2).

The online study was sept up on the SoSciSurvey platform (89, available at https://www.soscisurvey.de) and was identical to Study 1. Please see description of the study’s procedure in Study 1. Only an additional final question was included in this study, asking the respondents whether they believe that they have already participated in the identical study in 2019. The online study ran from January to April 2022. The study was not preregistered.

Data analysis.

Data analyses were performed with IBM SPSS Statistics 28.0.1.1 and R programming language [95].

Preliminary checks.

The distribution of OUS-DE scores was examined for normality using descriptive statistics, including assessments of skewness and kurtosis. Univariate and multivariate outlier analysis was performed using the same procedures as in Study 1 to ensure consistency.

Multivariate group comparisons.

To investigate the joint effects of sex/gender and sample (measurement time) on the dependent variables (i.e., OUS-DE, IB-DE, IH-DE) we conducted a Multivariate Analysis of Variance (MANOVA). This approach was chosen as it accounts for conceptual and statistical relationships among dependent variables while controlling for Type I error inflation. In this study, sex/gender and measurement time (sample) served as fixed factors, with the three OUS-DE measures as dependent variables. Prior to analysis, assumptions of normality, homogeneity of variance-covariance matrices, and absence of multicollinearity were checked. In cases where assumptions for parametric testing were violated, robust test statistics were used (e.g., Pillai’s Trace for MANOVA; Welch’s ANOVA for univariate follow-ups).”

Linear associations.

Pearson’s two-tailed correlations were computed to examine associations between OUS-DE scores and personality measures, age, and religiosity. The strength of correlations was interpreted as very strong: r ≥ .8; strong:.8 > r ≥ .6; moderate:.60 > r ≥ .4; weak:.4 > r ≥ .2; very weak: r < .2 [120]. The Benjamini-Hochberg procedure [121] was applied to control the false discovery rate (FDR) as correction for multiple comparisons within logically defined correlation families. Applied to meaningful hypothesis groups, this procedure provides a sensitive correction, while reliably controlling false discoveries and family-wise error. It also accommodates dependency between tests and preserves statistical power when a large number of tests are performed [122].

Results

Associations between OUS-DE and sociodemographic measures

Sex/gender and sample effects.

A two-way MANOVA was conducted to examine the effects of sex/gender and sample on subscales of OUS-DE. The analysis revealed significant multivariate main effects of sex/gender (Pillai’s Trace = 0.026, F(2, 721) = 9.5, p < .001), and sample (Pillai’s Trace = 0.01, F(2, 721) = 3.7, p = .025), but no significant interaction between the two factors (Pillai’s Trace = 0.002, F(2, 721) = 0.73, p = .48). This indicates that sex/gender and sample each independently contributed to differences in the dependent variables. For group means of IB-DE, IH-DE, and each of the IB-DE items by sample and sex/gender, please see Table 8.

thumbnail
Table 8. Mean scores of IB-DE, IH-DE, and IB-DE items in Sample 1 and Sample 2 as well as women and men across the samples.

https://doi.org/10.1371/journal.pone.0335215.t008

Given the significant multivariate main effects, follow-up univariate ANOVAs were conducted separately for sex/gender and sample. Interaction effects were not examined further at the univariate level because they were non-significant in the MANOVA.

IB-DE.

For sample, a one-way ANOVA also revealed a significant effect, F(1, 724) = 8.23, p = .004, η² = 0.011, with higher values in Sample 1 (please see Fig 2A). For sex/gender, the Welch’s ANOVA revealed a significant difference in IB-DE mean scores, F(1, 513) = 17.13, p < .001, η² = 0.025, with higher values in women (please see Fig 2C).

thumbnail
Fig 2. Violin plots showing the distribution of IB-DE and IB-4 across samples and sexes.

A and B display group differences between Sample 1 and Sample 2 in IB-DE and IB-4 (Y axis depicts scores of A. IB-DE and B. IB-4), respectively, while C illustrates sex/gender differences in IB-DE (Y axis depicts scores of IB-DE). Each violin represents the probability density of the data, with the point range indicating the mean ± standard deviation.

https://doi.org/10.1371/journal.pone.0335215.g002

IH-DE.

For sample, a one-way ANOVA yielded no significant effect, F(1, 724) = 1.69, p = .194, η² = 0.002. Similar for sex/gender, the Welch’s ANOVA showed no significant difference in IH-DE mean scores, F(1, 509) = 1.13, p = .289, η² = 0.002.

IB-DE items.

To analyze which IB-DE items were influenced by multivariate effects of sex/gender and sample, another two-way MANOVA was performed. As expected, multivariate main effects were significant for sex/gender (Wilks’ Lambda = 0.954, F(5, 718) = 7.0, p < .001) and sample (Wilks’ Lambda = 0.971, F(5, 718) = 4.31, p < .001) but not their interaction (Wilks’ Lambda = 0.999, F(5, 718) = 0.14, p = .983), again indicating independent contribution of sex/gender and sample on dependent variables.

For sample, separate one-way ANOVAs with Bonferroni correction of α = 0.01 showed one significant effect of sample on the IB-4 item, F(1, 724) = 38.01, p < .001, η² = 0.05 (all other p’s > .99), with higher values in Sample 1 (please see Fig 2B). For sex/gender, separate Welch’s ANOVAs with Bonferroni correction of α = 0.01 indicated significant sex/gender effects for three items – IB1, F(1, 519.5) = 8.21, p = .02, η² = 0.012, IB4, F(1, 539.5) = 11.94, p < .001, η² = 0.02, and IB5, F(1, 540.6) = 8.47, p = .02, η² = 0.012 (all other p’s > 0.1), with higher values in women.

Age.

Weak negative correlations between age and OUS-DE (rS1 = −0.152, pS1 = .003; rS2 = −0.15, pS2 = .005) and IH-DE (rS1 = −0.15, pS1 = .004; rS2 = −0.12, pS2 = .026) subscale were found in both samples. No correlation between age and IB-DE subscale was found in both samples (all p’s > .065).

Religiosity.

A weak positive correlation between frequency of one’s religious practice and IB-DE was found in Sample 1 only (rS1 = 0.124, pS1 = .016). No correlations between religious practice and OUS-DE or IH-DE were found in both samples (all p’s > .079), while religious practice and IB-DE did not correlate in Sample 2 (p = .19).

Associations between OUS-DE and psychometric measures

Correlations with psychometric measures were assessed by correlation analyses (Pearson’s r, two-tailed significance test) in Sample 1 (N = 378) and Sample 2 (N = 348). Correction for multiple comparisons was applied to logical test families according to Benjamini-Hochberg procedure. Results of associations between OUS-DE and its subscales with personality measures are depicted in Table 9 for Sample 1 and Table 10 for Sample 2.

thumbnail
Table 9. Correlations between OUS-DE, its subscales and personality measurements in Sample 1.

https://doi.org/10.1371/journal.pone.0335215.t009

thumbnail
Table 10. Correlations between OUS-DE, its subscales and personality measurements in Sample 2.

https://doi.org/10.1371/journal.pone.0335215.t010

Moral competence.

Both in Sample 1 and Sample 2, no correlations between moral competence score and individual OUS-DE scores or scores of the IB-DE and IH-DE reached significance (all p’s > .09).

Moral foundations.

Individual OUS-DE scores positively and weakly correlated with Harm/Care (rS1 = 0.21, pS1 < .001, rS2 = 0.16, pS2 < .001), Fairness/Reciprocity (rS1 = 0.31, pS1 < .001, rS2 = 0.15, pS2 = .01), and Progressivism (rS1 = 0.15, pS1 = .01, rS2 = 0.23, pS2 < .001) in both samples. In Sample 2, OUS-DE scores also negatively and weakly correlated with Authority/Respect (rS2 = −0.14, pS2 = 0.01) and Purity/Sanctity (rS2 = −0.13, pS2 = 0.025). No correlation was found between the mean OUS-DE score and Ingroup/Loyalty in both samples (p > .1).

IB-DE scores positively correlated with Harm/Care (rS1 = 0.39, pS1 < .001, rS2 = 34, pS2 < .001) and Fairness/Reciprocity (rS1 = 0.40, pS1 < .001, rS2 = 0.27, pS2 < .001) as well as Progressivism (rS1 = 0.33, pS1 < .001, rS2 = 0.42, pS2 < .001) in both samples. It negatively and weakly correlated with Authority/Respect (rS1 = −0.14, pS1 = 0.01, rS2 = −.31, pS2 < .001) in both samples, and with Ingroup/Loyalty (rS2 = −.13, pS2 = .025) as well as Purity/Sanctity (rS2 = −.15, pS2 = .01) in Sample 2.

IH-DE negatively and weakly correlated with Harm/Care (rS1 = −0.14, pS1 = 0.01, rS2 = −0.13, pS2 = .026) in both samples and Progressivism (rS1 = −0.16, pS1 < .001) in Sample 1. It was also positively associated with Authority/Respect (rS1 = 0.13, pS1 = 0.02) in Sample 1. Correlations between IH-DE and Fairness/Reciprocity, Ingroup/Loyalty, and Purity/Sanctity did not reach statistical significance in both samples, as well as Authority/Respect and Progressivism in Sample 2 (all p’s > 0.05).

Empathy.

Individual OUS-DE scores positively but weakly correlated with the Empathic Concern subscale (rS1 = 0.18, pS1 = .002, rS2 = 0.19, pS2 = .002) and the overall Empathy score (rS1 = 0.12, pS1 = 0.04, rS2 = 0.15, pS2 = .02) in both samples. In Sample 1 they also positively and weakly correlated with the Personal Distress subscale (rS1 = 0.17, pS1 = .004). In Sample 2 they positively and weakly correlated with the SPF total score (rS2 = 0.14, pS2 = 0.03).

IB-DE positively correlated with Perspective Taking (rS1 = 0.20, pS1 < .001, rS2 = 0.13, pS2 = .04), Empathic concern (rS1 = 0.30, pS1 < .001, rS2 = 0.33, pS2 < .001), Empathy score (rS1 = 0.21, pS1 < .001, rS2 = 0.26, pS2 < .001), and SPF total score (rS1 = 0.15, pS1 = .008, rS2 = 0.23, pS2 < .001) in both samples. In Sample 1 it also positively correlated with Personal Distress (rS1 = 0.12, pS1 = 0.04), while in Sample 2 it correlated with Fantasy (rS2 = 0.14, pS2 = .02).

IH-DE negatively correlated with Perspective Taking (rS1 = −0.11, pS1 = 0.046) as well as the SPF total score (rS1 = −0.12, pS1 = 0.04) and was associated positively with Personal Distress (rS1 = 0.12, pS1 = 0.04) in Sample 1. In Sample 2, no significant correlations were found (all p’s > 0.3).

Harm avoidance.

The individual OUS-DE and IB-DE scores showed no correlations with TCI-HA or its subscales in both samples (all p’s > .89).

Behavioral inhibition and activation.

The individual OUS-DE, IH-DE, and IB-DE scores did not correlate with ARES scores in both samples (all p’s > .05).

Discussion: Study 2

Differences in utilitarian inclinations before and after the Covid-19 pandemic.

During the Covid-19 pandemic individuals worldwide were confronted with an unusually high number of moral dilemmas involving both instrumental harm and impartial beneficence [7173]. Therefore, we hypothesized that prolonged exposure to such situations would increase agreement with utilitarian principles. To examine this, we compared scores of the OUS-DE and its subscales in two independent samples collected before and after the pandemic.

We expected to see a change in IH-DE ratings, with higher acceptance of instrumental harm after the pandemic, as the pandemic uniquely exposed society to sacrificial dilemmas [78]. Surprisingly, no differences were found for IH-DE or combined OUS-DE scores, suggesting that acceptance of instrumental harm – and general utilitarian outlook – remained stable. This aligns with prior research showing either no change [79,123] or mixed effects [28,124] in sacrificial moral dilemmas over time, although some studies report increases in acceptance of utilitarian solutions in pandemic-specific triage contexts [25,123125]. Taken together, this supports the idea that general attitudes toward instrumental harm are temporally stable, while their application and more nuanced formulations may shift depending on situational salience and personal proximity to the dilemma.

In contrast, IB-DE ratings significantly decreased in the post-pandemic sample, driven primarily by lower agreement with IB-4 (“failing to help”). This change may reflect “moral fatigue”: during the pandemic, individuals – regardless of personal risk – were repeatedly urged to take burdensome protective actions for the benefit of anonymous others [126128]. While such appeals were effective early on, prolonged exposure to invisible or diffuse harms may have eroded motivation, leading to reduced endorsement of impartial beneficence [129131]. This hypothesis fits well as many countries, including Germany, saw significant pushback against regulations, both in the form of nonviolent protest and violent escalation (for example, see news reports [132,133]).

An alternative, more speculative explanation relates to the timing of data collection in Sample 2 (from October 2022 to January 2023), during Russia’s invasion of Ukraine. Media coverage during this period heavily featured debates over the extent on how much financial, humanitarian, and military aid the German government should provide to the Ukraine [134]. Public opinion polls in Germany have shown a change in attitudes towards military actions and beneficence in the general population, indicating that after the invasion of Ukraine, the German population more strongly condemned offensive military actions, yet supported diplomatic pressure and military aid, and strongly supported carrying additional personal costs due to sanctions to the aggressor, as well as engaged personally in civil helping behaviors [135,136]. Under these conditions, acts of omission (failure to help) may have been perceived differently, as societal focus shifted toward collective, high-cost assistance and visible political action.

Overall, the findings suggest that while instrumental harm attitudes remained robust to major societal disruptions, impartial beneficence, particularly regarding obligations to help strangers, may be more susceptible to contextual and temporal influences.

Sex/gender differences in utilitarian inclinations.

Our results indicate a significant sex/gender difference in the mean rating of the IB-DE subscale. On average, women agreed with the IB-DE subscale more than men. Subsequent analyses indicated that the sex/gender difference in IB-DE is driven by independent sex/gender effects on items IB-1 (impartiality), IB-4 (failing to help), and IB-5 (donation). Sex/gender differences in charitable giving and altruism, concepts that are close to beneficence, have been observed consistently in behavioral paradigms (e.g., [60,61] for meta-analysis). Similar results were recently obtained in a German adult sample with women scoring higher in altruistic, emotional, and compliant prosocial behavior than men [137].

Gender roles and gender identification, rather than biological sex, have been suggested to underly these differences. Research shows that women and feminine-identifying individuals tend to view moral dilemmas as more important than men and masculine-identifying individuals [138], suggesting a greater motivation to engage in prosocial behavior. Moreover, gender role orientation has been found to relate more strongly to care-based moral reasoning than biological sex [139]. Women’s generally higher empathic abilities are often linked to their greater prosocial tendencies [138,140,141]. In addition, previous research indicates that women place greater emphasis on fairness, a concept closely related to impartiality, than men, particularly in WEIRD societies [53]. Previous literature has also identified a social desirability bias in ethical responding, with women more likely than men to answer in ways deemed socially acceptable, potentially inflating observed sex/gender differences [142]. In this study, we did not correct for this bias. Taken together, sex/gender differences in agreement with IB-DE may reflect a combination of differences in other moral dispositions.

Contrary to our expectations and previous behavioral findings in sacrificial dilemmas, where men were more accepting of direct sacrificial harm [50,51,5658,140] and self-serving harm [143], we found no sex/gender differences on IH-DE subscale. One possible explanation might be that women and men do not differ in sacrificial harm-related moral beliefs per se, but diverge in behavior when applying these beliefs, influenced by factors such as gender role expectations in care-based morality [144,145] or the experimental framing of harm-related dilemmas [51].

Relationships between OUS-DE and other psychometric measures.

In our study we examined the psychometric properties of the OUS-DE, with a particular focus on the associations between personality, character, and utilitarian inclinations.

Moral foundations.

Across both samples, we consistently found that agreement with OUS-DE and IB-DE was positively associated with Harm/Care, whereas IH-DE was negatively associated with Harm/Care. This suggests that IB-DE scores were driving the correlation between overall OUS-DE scores and the Harm/Care foundation. In line with our findings, Amormino [35] reported that extraordinary altruists scoring high on IB also show a greater concern for the Harm/Care foundation. We also found associations between IB-DE and the Fairness/Reciprocity foundation in both samples. This is consistent with the view that impartial treatment of individuals is typically seen as a core component of fairness and justice, as argued by many philosophers (e.g., [3,146149]). Finally, IB-DE was negatively associated with Authority/Respect in both samples. Previous research shows that individuals concerned with the Authority/Respect foundation prefer benefiting their in-group members and have a low concern for impartiality or beneficence towards out-group members [150152]. Together, these results show that the OUS-DE subscales are distinctly — and in the case of Harm/Care, oppositely — related to broader moral concerns, highlighting the importance of examining subscale-level relationships rather than relying solely on overall Utilitarianism scores.

Empathy.

In this study, we found that in both samples OUS-DE scores were positively associated with perspective taking, empathic concern and overall empathy. The association between OUS-DE and empathic concern replicates the findings of the original study [23] and the French validation (OUS-Fr, 30) but contrasts with the Turkish validation [29], despite all three using the same empathy questionnaire. This pattern suggests that correlations between utilitarian inclinations and empathy might be shaped by cultural differences, particularly differences between WEIRD and non-Western contexts.

Across both samples, IH-DE showed no correlation with empathic concern. In Sample 1, IH-DE was negatively correlated with perspective taking and consequently with the overall scale score. This finding is consistent with prior research that empathy, both as a personality trait and as a cognitive ability [153] is generally positively associated with prosocial moral beliefs [154156] but negatively associated with endorsement of instrumental harm [157,158]. However, other studies indicate that the relationship between empathy, prosocial tendencies, and acceptance of sacrificial harm is dependent on gender or gender role orientations [138,141,159].

Similarly to the divergent associations observed between moral foundations and the OUS-DE subscales, we also found an opposite association between the subscales and perspective taking. Taken together, these results support the notion that the two OUS-DE subscales capture distinct, and in some respect opposing psychological dispositions, consistent with the two-dimensional model of utilitarian psychology [15,23]. However, future research is needed to better characterize individuals who endorse both tenets of Utilitarianism simultaneously. Such a profile may reflect a relatively rare moral phenotype, defined by a distinctive constellation of cognitive and emotional traits. In the context of perspective taking, these individuals may uniquely combine a strong ability to adopt others’ viewpoints with a willingness to endorse instrumental harm when it serves impartial benefit.

Moral competence, harm avoidance, and approach/avoidance.

In this study, we found no significant correlations in both samples between Utilitarianism, moral competence, harm avoidance, or approach/avoidance traits. While we cannot conclude from our results any causal explanations on whether highly utilitarian individuals base their moral inclinations on more cognitive approaches, our analysis showed that in both samples, there was a mix of individuals scoring low, medium, and high on MCT regardless of their agreement with Utilitarianism. Furthermore, it is important to note that in our study, avoidance of self-harm as opposed to avoidance of harming others was measured. It is possible that these two types of harm avoidance share little overlap [160]. While there is no validated self-report measure of avoidance of harming others, future research should investigate this aspect and its relation to Utilitarianism.

Previous behavioral studies have linked moral competence, harm avoidance, and approach/avoidance traits to Utilitarianism [47,161,162]. However, while these traits relate to utilitarian behavior through cognitive and affective mechanisms, they might not be tied to moral inclinations per se. It has been suggested that moral beliefs do not directly translate to moral behavior [83] but instead are applied flexibly depending on moral context, state of sobriety, stress level and other external factors. These factors influence mental states, cognitive control, emotion processing, value coding, moral norm internalization and their interactions [163170]. Therefore, our results add to the growing body of evidence suggesting that moral inclinations do not necessarily translate directly into moral behavior.

Study 3: Relationships between utilitarian inclinations and real-life moral judgment and behavior

Our results in Study 2, along with previous research, indicate that moral inclinations are not directly associated with moral behavior [83]. We aimed to investigate whether utilitarian inclinations are more strongly related to moral behavior in contexts where such behavior is explicitly expected. Specifically, we compared OUS-DE scores with self-reported agreement to (moral judgment) and compliance with (moral behavior) quarantine rules that required individuals to forgo certain personal needs for the benefit of society.

We expected that individuals scoring high on the OUS-DE would report greater agreement with, and higher compliance to, government quarantine regulations, both in general and during the Christmas and New Year period. We also hypothesized that these regulations could appeal to two distinct groups: those who prioritize adherence to moral norms (e.g., high deontological, low utilitarian agreement) and those who aim to maximize the greater good (high utilitarian agreement), due to their rule-based structure and utilitarian content, respectively [78,83]. Accordingly, we tested both linear and quadratic (U-shaped) associations between OUS-DE scores, self-reported moral judgement, and moral behavior regarding adherence to quarantine rules.

Our second aim was to extend the findings of Study 2 by examining changes in utilitarian inclinations in women and men over the course of the Covid-19 pandemic. For this purpose, we performed repeated measurement of OUS-DE at two time points: during the first year of the pandemic (October-December 2020) and towards its end (March 2023). As in Study 2, we expected higher agreement with Utilitarianism, especially with the IH dimension.

Materials and methods

Sample description.

Fifty-eight participants took part in a separate laboratory study on moral decision-making and the influence of sex hormones between October – December 2020. Exclusion criteria were neurological or mental disorders, severe hypertension, diabetes, thyroid diseases, congestive heart failure, vasectomy, use of hormonal contraceptives, hormonal treatment, premenstrual dysphoric disorder, current pregnancy and birth or breastfeeding within the last year. By the end of December 2020, participants of this study were invited to complete a supplemental battery of questions regarding Covid-19 regulations. Of all 58 participants invited, 41 participants completed the online survey during the period between 2020-12-15 and 2021-01-10. Two participants were excluded as they did not fulfil the criterion of spending Christmas or New Year’s Eve in Germany. The final sample comprised 39 participants, of which 19 were women (Mage = 22.89, SD = 3.05) and 20 were men (Mage = 24.15, SD = 3.07), more details can be found in Table 11.

thumbnail
Table 11. Sociodemographic Characteristics of Participants after Exclusions in Study 2.

https://doi.org/10.1371/journal.pone.0335215.t011

For repeated OUS-DE measurement we invited all 39 participants from the final sample in early 2023 when the last Covid-19 measures (compulsory face mask wearing in public transportation and hospitals) were being phased out from daily life. Between 2023-03-06 and 2023-03-21 30 participants (Nwomen = 14) completed repeated measurements of the online OUS-DE and questions regarding their and their immediate family’s history of Covid-19 infection. Twenty-five of these responders were infected with Covid-19, one suffered from “Long Covid Syndrome”, three had a close family member hospitalized because of Covid-19 infection, four had a close family member suffering from “Long Covid Syndrome”, and none had lost a close family member to Covid-19 infection.

Materials.

The same questionnaires were used in Study 1 and 2 (for descriptions see Materials sections in Study 1 and Study 2) with one alteration concerning the Temperament and Character Inventory (TCI). In Study 2, the whole TCI (240 items) was applied. TCI consists of two dimensions – Temperament and Character [93]. Temperament dimension involves 4 scales: Novelty Seeking (NS), Harm Avoidance (HA), Reward Dependence (RD), and Persistence (P); while Character dimension includes 3 scales: Self-directedness (S), Cooperativeness (C), and Self-transcendence (ST). All scales except for the Persistence scale are further separated into several related subscales. Higher scores on the whole scale indicate higher novelty seeking, harm avoidance, reward dependence, persistence, self-directedness, cooperativeness, and self-transcendence.

Moral judgment and moral behavior during Covid-19 pandemic

To better understand the attitudes and motivation to comply with Covid-19 regulations in December 2020, we asked participants to provide information on whether they had tested positive for Covid-19 before, which federal German state they currently reside in, whether they live alone or with their partner/parents/in a shared flat (differentiating between < 5 and > 5 residents) and in which federal state they plan to spend/had spent Christmas (December 24th-26th) and New Year’s Eve (December 31st-January 1st), as government regulations differed slightly between states.

Participants’ attitudes – i.e., their moral judgment – towards the mandatory wearing of face masks and the enforced curfew were then assessed (all questions on Visual Analogue Scales from 0 – not strict enough or not at all, depending on the question to 100 – too strict or totally). Participants were also instructed to indicate how satisfied they are with the government’s current regulations and to what extent they are planning to/have complied with these regulations, to assess their moral behavior. Questions were based on the official regulations of Baden-Württemberg state in Germany [171]. Summary of regulations in English can be found in S2 Appendix.

For analysis purposes, responses to questions assessing personal attitudes to face mask wearing, curfew, and meeting restrictions, as well as satisfaction with general and celebration-specific regulations to prevent spread of Covid-19 (a total of 8 questions), were combined into a mean score measuring Moral Judgment (MJ). Responses to questions to what extent the participants comply with the regulations regarding face mask wearing, curfew, and celebration-specific meeting restrictions (in total 5 questions) were combined into a mean score measuring general Moral Behavior (MB) during the pandemic.

Further, it was assessed how reasonable participants consider the government’s regulations for social gatherings on Christmas and New Year’s Eve (see S1 Appendix) and to what extent they comply/complied with the regulations, to assess their moral judgment and moral behavior during Christmas and New Year.

Subsequently, participants stated the amount of people they plan to spend/have spent Christmas and New Year’s Eve with, how many of those were particularly vulnerable to infections, how many households this corresponded with, and if they plan or have taken any precautionary measures to protect the others around them, to assess their moral behavior during Christmas and New Year. Finally, participants stated whether and when they plan to get vaccinated against Covid-19.

Questions regarding precautionary measures the person was planning or had taken during the festivities were combined into moral behavior scores separately for Christmas and New Year celebrations (MB during Christmas – 21 items, or MB during New Year – 12 items). For each response indicating that the participant was planning to/ had complied with regulations or took a precautionary measure, 1 point was added. For each incompliance, 1 point was subtracted from the final score. In the end, the sum scores for MB during Christmas ranged between −13–15 points. The scores for MB during New Year’s ranged between −7–9 points.

Repeated measurement.

OUS-DE was assessed again in March 2023. Additionally, at this time point we asked participants to self-report whether they believe that their moral inclinations and moral behavior have changed during the pandemic (1 question each) and provide other Covid-19 related demographic information, such as whether the participant was ever infected with Covid-19, hospitalized because of Covid-19, suffered or still suffers from “Long Covid Syndrome” and whether the participant had a close family member who hospitalized because of Covid-19, suffered or still suffers from “Long Covid Syndrome”, or died because of Covid-19.

Procedure.

The Ethics commission of the Medical Faculty at the University of Tübingen approved the study (“Modulatory effects of sex hormones on moral decision-making”, project number 707/2019BO2). By the end of December 2020, all participants that had taken part in the separate laboratory study on moral decision-making until this point were contacted directly via E-Mail whether they would like to participate in a subsequent survey on moral decision-making in times of Covid-19.

To assess participants’ attitude towards strict Covid-19 regulations that were in place in Germany between in December 2020 and January 2021, questions described in Materials section of this study were administered online via SoSciSurvey [89]. Participants were instructed to answer all questions truthfully, even if their true answer should not adhere to the government’s regulations. To further lower concerns, they were reminded that all collected data was pseudonymized and could not be traced back. Participants were compensated for taking part in a separate laboratory study on moral decision-making.

All participants, which had answered the subsequent questions on their attitude and compliance with Covid-19 restrictions in December 2020 – January 2021 and gave consent to be contacted again for further research purposes, were again invited by E-Mail in 2023.03.07–2023.03.20 to fill out the repeated measurement of OUS-DE via SoSciSurvey, the time period when the last German pandemic measures were being phased out. Written informed consent was obtained from the participants. Participants were compensated 5 Euros for participation in this survey. The data was stored in a pseudonymized form. This study was not preregistered.

Data analysis

Predictive associations.

To examine predictive relationships, linear regression models were constructed with moral judgment (MJ), moral behavior (MB), MB during Christmas, and MB during New Year as dependent variables and individual OUS-DE, IB-DE, and IH-DE scores as predictors. To test for potential nonlinear relationships, nonlinear regression models were additionally applied.

Linear associations.

Linear associations between MJ, MB, MB during Christmas, and MB during New Year were investigated by using bivariate Pearson’s two-tailed correlations. Correlation strength was interpreted as in Study 2 (please refer to Data analysis section for more information).

Repeated measures.

Two-way repeated measures ANOVA was applied to investigate variance of OUS-DE and its subscales over time (2020, 2023) as a within-subject factor and sex/gender as a between-subjects factor as sex/gender was found to have an independent effect on the changes in agreement with IB-DE in Study 2. This model allowed us to examine both main effects (time, sex/gender) and their interaction effects on individual OUS-DE scores and subscale scores.

Results: Study 3

OUS-DE measurement during the Covid pandemic.

In this sample (N = 39), mean agreement with the whole OUS-DE measurement was highly comparable to the mean responses in both samples from Study 1 (see Table 12). Correlation analysis indicated that the OUS-DE score strongly correlated with the scores in the two subscales – IB-DE (r = 0.788, p < .001) and IH-DE (r = 0.534, p < .001), similarly to the results in Study 1.

thumbnail
Table 12. Mean responses to the OUS-DE, its subscales, quarantine moral judgement, quarantine moral behavior, and moral behavior during Christmas and New Year in Sample 3.

https://doi.org/10.1371/journal.pone.0335215.t012

Relationship between OUS-DE and moral judgment and behavior during Covid-19 pandemic.

We found no predictive linear relationship between OUS-DE and its subscales and MJ or MB (all p’s > .09). Nonlinear regression analysis indicated that there was an inverse U-shape quadratic relationship between general MB and IH-DE subscale (R2 = 0.23; β = −3.05, p = .003; F(2, 430.65) = 5.377, p = .009, as also depicted in Fig 3). No linear or nonlinear relationships with OUS-DE or its subscales were found for MJ, MB during Christmas, and MB during New Year (all p’s > .11).

thumbnail
Fig 3. Quadratic inverse U-shape relationship between Moral Behavior during the pandemic and Instrumental Harm subscale.

https://doi.org/10.1371/journal.pone.0335215.g003

Further, MB and MJ scores correlated positively and moderately strongly with each other (r = 0.55, p < .001). MB during Christmas positively and moderately correlated with MB during New Year (r = 0.5, p = .001). MJ and MB did not correlate with MB during either Christmas or New Year. For these correlations, please see Table 13.

thumbnail
Table 13. Correlations between the OUS-DE, its subscales, quarantine moral judgement, quarantine moral behavior, and moral behavior during Christmas and New Year.

https://doi.org/10.1371/journal.pone.0335215.t013

We have additionally investigated sex/gender differences for MJ and MB measurements in this study. No sex/gender differences were observed for any of the scores (all p’s > .39, see S1 Table).

Peri- and post-pandemic OUS-DE responses.

A two-way repeated measures ANOVA was used to assess whether responses to OUS-DE and its subscales changed with time (within-subjects factor) in women and men (between-subjects factor). We observed no significant time effect on OUS-DE (F(1, 28) = 0.212, p = .65), IB-DE subscale (F(1, 28) = 0.414, p = .525), and IH-DE subscale (F(1, 28) = 0.032, p = .86). Time and sex/gender interaction was, too, not significant in the OUS-DE and its two subscales when analyzed separately (all p’s > .073). Assessment of between subject effects did not find sex/gender differences for OUS-DE (F(1, 28) = 3.23, p = .083) or IH-DE (F(1, 28) = 0.031, p = .86) ratings. Sex/gender effect in ratings of IB-DE reached a trend level towards significance (F(1, 28) = 4.16, p = .051).

Additionally, we asked study participants during repeated measurement of OUS-DE to self-report whether their moral views or moral behavior have changed after the pandemic. 6.7% indicated that their moral views have not changed at all, 53.3% indicated that they have changed a little, 26.7% indicated that they have changed a lot, and 13.3% indicated that they have changed significantly. In a separate question, 23.3% reported that their moral behavior had not changed at all, 30% reported that it had changed a little, 40% reported that it had changed a lot, and 6.7% reported that it had changed significantly.

Discussion: Study 3

Relationship between utilitarian inclinations and pandemic-related moral judgment and behavior.

The primary goal of Study 3 was to investigate whether utilitarian inclinations were associated with self-reported moral judgment of pandemic safety measures and moral behavior as expressed by adherence to these measures during the first winter of the Covid-19 pandemic. Contrary to our expectations, we found an inverted U-shape relationship between agreement with the IH-DE subscale and conformity to the quarantine rules (MB score), indicating that those scoring the lowest and the highest were least conforming to the quarantine rules, such as mask wearing, distance keeping, and limitation of social contacts. We also found that agreement with quarantine rules and conformity to those rules positively correlated to each other but not to conformity to quarantine rules during Christmas and New Years celebrations. Finally, we observed that behavior scores during the festivities positively correlated. Overall, our results indicate that participants in this study based their judgments and behavior towards conformity to the quarantine rules on circumstantial principles (i.e., important traditions) or their specific agreement with the quarantine rules but not their general utilitarian inclinations.

Our results in this study are in line with some recent findings from the general population as well as healthcare staff, which show that not moral inclinations but rather sociodemographic and circumstantial variables are more closely related to moral behavior during the pandemic. Lachowicz-Tabaczek and Kozłowska [172] assessed social distancing behavior in 505 Polish participants from the general public and found that it was predicted by concern for collective health and personal values of social responsibility, although they did not assess utilitarian inclinations in this sample. In a sample of 49 Iranian women medical professionals, age, number of encountered deaths from Covid-19, perceived self-risk of infection, and stress level but not utilitarian inclinations as measured by sacrificial moral dilemmas positively correlated with social distancing behavior [27]. In an online survey with 197 participants from the general public, Díaz [86] showed that actual past and future compliance to Covid-19 regulations was better predicted by moral emotions that arise in response to moral statements involving harm than factual moral beliefs’ statements involving harm. Similarly, in a Dutch general public sample (N = 1396), moral values as measured by the MFQ were weak predictors of conformity to Covid-19 regulations, while other sociodemographic variables, such as gender and age had better predictive power [84].

There is also some evidence that utilitarian inclinations might have been associated with decisions in experimental pandemic-related triage dilemmas. Kneer and Hannikainen [123] relied on an US sample of around 1100 participants from the general public and reported that the IB but not the IH subscale scores predicted answers to realistic pandemic-related triage dilemmas. In a large sample of over 15000 individuals from countries in Latin America (mostly Argentina) and a separate sample from the US (N = 1300), Navajas [25] observed that commitment to Utilitarianism as measured by sacrificial harm dilemmas and both OUS subscales in an online survey modulated moral decisions regarding the Covid-19 pandemic in triage dilemmas. In this study, prioritized treatment for younger over older patients was positively associated with IH subscale scores, while priority to equitable public health was positively associated with IB subscale scores, and both scales were positively associated with preference to public health over data-privacy, economic activity, and friendship.

Taken together, results from our study as well as previous research indicate that when assessing actual pandemic-related behavior, other personal and situational variables, such as social orientation, emotional response to harm, age, stress, or number of encountered deaths from Covid-19 are better predictors of the behavior than general utilitarian inclinations. Associations between the two dimensions of Utilitarianism could only be found in experimental studies assessing behavior in triage situations but not in conformity to the quarantine rules directed at the general public. This might suggest that either a methodological effect exists in assessing experimental scenarios even if they are realistic, or that utilitarian inclinations play a role in triage situations only, as both assessments deal with extreme sacrifice.

General discussion

The main aim of the paper was to validate the German translation of the Oxford Utilitarianism Scale (OUS-DE). The studies in this paper show psychometric robustness and construct validity of OUS-DE and provide evidence of reliability of the translated version for future research. In Study 1, we demonstrated moderate to strong linear associations between utilitarian inclinations as measured by OUS-DE subscales and choice preferences in thematically corresponding moral dilemmas. In Study 2, we confirmed conceptual associations between Impartial Beneficence (IB) and empathy as well as concern for Harm/Care foundation. Study 3, maybe surprisingly, revealed that association of utilitarian inclinations regarding instrumental harm or behavioral beneficence are not linearly related to actual moral behavior during the Covid-19 pandemic. Nevertheless, our analyses of OUS-DE responses before, during, and after the Covid-19 pandemic in Studies 2 and 3 indicate that the questionnaire captures stable utilitarian inclinations in a young German sample. In two studies we also found a consistent sex/gender difference regarding responses to IB-DE, with women accepting IB to a higher degree, independently from the measurement time point.

Across the studies, we have consistently observed a context-dependent role of Utilitarianism. As measured by the OUS-DE, similarly to measurements by high-stakes sacrificial scenarios, utilitarian beliefs remained stable. However, when the public discourse and policy repeatedly invoked requirements for utilitarian solutions, endorsement of IB shifted. Finally, the weak translation of utilitarian inclinations into everyday compliance behaviors further strengthened the assumption of the context dependence. These findings highlight the importance of separating moral belief endorsement from moral behavior in future research, the so-called “moral judgment-behavior gap” [173]. Increasing evidence is provided by studies testing the existence of this gap in other contexts rather than Covid-19 pandemic, for example, impartial resource allocation [173,174] or political voting behavior [127].

Our findings also suggest a need for longitudinal designs to track the changes in individual moral inclinations. Current turbulent geopolitical and societal events might be uniquely reshaping moral beliefs in the general population as well as specific professional groups such as healthcare workers, public servants, or military personnel. Future investigations could see how changes or stability in personal moral beliefs relate to the changing values around national sovereignty, identity, and community [175].

On a more methodological note, our studies showed that there might be potential for conceptual overlap between items in the OUS-DE. While no other validation studies have encountered this issue, culturally sensitive adaptations of OUS could be considered. Furthermore, our studies also show that the conceptual associations between the two dimensions of Utilitarianism should be further clarified as they might operate through different cognitive-affective pathways yet culminate into one coherent moral outlook. Exploration of a “dual-high” utilitarian profile – high on IB and high on IH – and its cognitive-affective correlates would aid in this pursuit. Finally, future studies should further investigate in which contexts the OUS – and self-reported moral inclinations in general – has a high predictive validity and where the moral judgment-behavior gap persists.

Taken together, OUS-DE is a valid and reliable tool to assess utilitarian moral inclinations in German-speaking populations. The degree to which an individual’s moral judgements cohere with Utilitarianism is of interest to moral psychology and beyond. The recent Covid-19 pandemic has brought to the forefront the inevitability of triage and other cost-benefit analyses in healthcare. Some reported experiencing a ‘moral injury’ during the pandemic, having to behave in ways that conflict with their moral beliefs, such as when denying help to incoming patients in overcrowded hospitals during the most intense Covid-19 periods [176178]. Knowing the folk attitude to Utilitarianism can help in understanding individual responses to such dilemmas, in gauging the expected agreement to utilitarian policies, in informing public decision-making, and in designing interventions for moral resilience in healthcare and policy.

Limitations

Some limitations of our validation study should be noted. First, our samples mainly consist of young adults, with the majority studying at university. The surveys were advertised in the University town of Tübingen, a city with one of the youngest average populations in Germany [179]. Therefore, our results might not generalize to the general German population and future studies should assess OUS-DE in older populations, as age effects had been shown in moral inclinations [71]. Our validation study was also performed online, which might hinder reliability of responses to the survey. We have implemented a minimum time threshold for completion of the survey and rigorous univariate and multivariate outlier analyses to identify any sporadic responses to the survey; however, we are not able to check the sincerity of the survey answers, a problem that any self-report measure would face. Finally, in Study 1 we did not perform a repeated measures survey pre- and post-pandemically, as it was impossible due to the anonymous nature of our survey. Notwithstanding these limitations, our validation and cross-validation studies demonstrate robust validity and reliability properties of the OUS-DE, comparable to the original English version.

In Study 2, we have compared measurements of OUS-DE between two different samples when investigating the effects of Covid-19 pandemic. While this was necessary due to the anonymous nature of our online surveys, the observed sample effect in the difference of agreement to IB-DE subscale might be due to other reasons rather than the pandemic, for example, different but unaccounted sociodemographic constitution of the samples or a societal shift in moral attitudes due to other historical events. As we are aware of this limitation, we have added an alternative explanation for sample effects in the discussion of Study 2.

The sample size in Study 3 might be considered relatively small and not generalizable to the general population, as all participants in the study were young adults. As this study was part of a larger in-person experiment, due to the Covid-19 pandemic, collection of data was limited and resulted in a small sample size.

Notwithstanding these limitations, sample sizes and their sociodemographic constitution is comparable to many studies in the field of psychology, while our validation study demonstrates robust validity and reliability properties of the OUS-DE, comparable to the original English version.

Conclusion

The present research provides a comprehensive validation of the German translation of the Oxford Utilitarianism Scale (OUS-DE). Across three studies, we have demonstrated this translation’s psychometric robustness, construct validity, and reliability and supported its conceptual consistency with the original version. This establishes OUS-DE as a sound tool for assessing utilitarian moral inclinations in German-speaking populations.

Our research also highlighted important new findings such as a consistent sex/gender difference in endorsement of impartial beneficence and possible influence of historical events on agreement with individual items in the subscales. Moreover, our results support the findings of other studies underscoring that self-reported moral inclinations do not directly translate into moral behavior. This empathizes with the need to further investigate a well-documented moral judgment-behavior gap in the context of utilitarian morality. This has significant implications for applied fields such as healthcare, policy-making, and moral education, particularly in contexts where individuals are required to make high-stakes moral decisions, as seen during the Covid-19 pandemic.

Future studies might utilize the OUS-DE in assessing general utilitarian beliefs in the general population and in specialized populations. OUS-DE could be also used as an experimental or as a diagnostic tool to identify vulnerable populations when the circumstances require strict behavior in accordance with a particular moral theory. Finally, it could also be assessed in sociological investigations and opinion polls related to agreement with various policies or political support. Sex/gender differences in impartial beneficence should be considered when the measurement is applied.

In sum, the OUS-DE is a reliable and valid instrument that not only advances research in moral psychology but also provides a valuable foundation for applied studies aiming to understand and support moral reasoning in times of societal, political, or global crises.

Supporting information

S1 File. OUS-DE confirmatory factor analysis (CFA) in women and men (Sample 1) separately.

Data analysis description, factor loadings for OUS-DE in women and in men separately, CFA results in women and in men separately.

https://doi.org/10.1371/journal.pone.0335215.s001

(DOCX)

S1 Appendix. Pen and paper style German Oxford Utilitarianism Scale.

https://doi.org/10.1371/journal.pone.0335215.s002

(PDF)

S2 Appendix. English summary of Covid-19 regulations in Baden-Württemberg state, Germany, where the survey was conducted.

Link to the original regulations in German and a summary translation to English.

https://doi.org/10.1371/journal.pone.0335215.s003

(DOCX)

S1 Table. Sex differences in quarantine related moral judgment (MJ), moral behavior (MB), and moral behavior (MB) during Christmas and New Year in Sample 3.

Results of t-test analyses in a table.

https://doi.org/10.1371/journal.pone.0335215.s004

(DOCX)

References

  1. 1. Bentham J. An Introduction to the Principles of Morals and Legislation. Oxford: Clarendon Press. 1907.
  2. 2. Mill JS. Utilitarianism. Oxford: Oxford University Press. 1988.
  3. 3. Singer P. Practical ethics. Cambridge: Cambridge University Press. 2011.
  4. 4. Conway P, Goldstein-Greenwood J, Polacek D, Greene JD. Sacrificial utilitarian judgments do reflect concern for the greater good: Clarification via process dissociation and the judgments of philosophers. Cognition. 2018;179:241–65. pmid:30064654
  5. 5. Greene JD, Sommerville RB, Nystrom LE, Darley JM, Cohen JD. An fMRI investigation of emotional engagement in moral judgment. Science. 2001;293(5537):2105–8. pmid:11557895
  6. 6. McGuire J, Langdon R, Coltheart M, Mackenzie C. A reanalysis of the personal/impersonal distinction in moral psychology research. J Exp Soc Psychol. 2009;45(3):577–80.
  7. 7. Buckland L, Lindauer M, Rodríguez-Arias D, Véliz C. Testing the Motivational Strength of Positive and Negative Duty Arguments Regarding Global Poverty. Rev Philos Psychol. 2022;13(3):699–717.
  8. 8. Tomasello M. The moral psychology of obligation. Behav Brain Sci. 2019;43:e56. pmid:31133086
  9. 9. Holyoak KJ, Powell D. Deontological coherence: A framework for commonsense moral reasoning. Psychol Bull. 2016;142(11):1179–203. pmid:27709981
  10. 10. Ellemers N, van der Toorn J, Paunov Y, van Leeuwen T. The Psychology of Morality: A Review and Analysis of Empirical Studies Published From 1940 Through 2017. Pers Soc Psychol Rev. 2019;23(4):332–66. pmid:30658545
  11. 11. Buchtel EE. Moral disciplining provides a satisfying explanation for Chinese lay concepts of immorality. Behav Brain Sci. 2023;46:e298. pmid:37789554
  12. 12. Foot P. The problem of abortion and negative and positive duty: a reply to James LeRoy Smith. J Med Philos. 1978;3(3):253–5. pmid:702006
  13. 13. Singer PF. Famine, affluence, and morality. In: Alexander LA, editor. International ethics: A philosophy and public affairs reader. Princeton: Princeton University Press. 1972. p. 229–43.
  14. 14. Sidgwick H. The Methods of Ethics. 7 ed. London: Palgrave Macmillan. 1962.
  15. 15. Everett JAC, Kahane G. Switching Tracks? Towards a Multidimensional Model of Utilitarian Psychology. Trends Cogn Sci. 2020;24(2):124–34. pmid:31911126
  16. 16. Young L, Koenigs M, Kruepke M, Newman JP. Psychopathy increases perceived moral permissibility of accidents. J Abnorm Psychol. 2012;121(3):659–67. pmid:22390288
  17. 17. Koenigs M, Kruepke M, Zeier J, Newman JP. Utilitarian moral judgment in psychopathy. Soc Cogn Affect Neurosci. 2012;7(6):708–14. pmid:21768207
  18. 18. Bartels DM, Pizarro DA. The mismeasure of morals: antisocial personality traits predict utilitarian responses to moral dilemmas. Cognition. 2011;121(1):154–61. pmid:21757191
  19. 19. Vyas K, Jameel L, Bellesi G, Crawford S, Channon S. Derailing the trolley: Everyday utilitarian judgments in groups high versus low in psychopathic traits or autistic traits. Psychiatry Res. 2017;250:84–91. pmid:28152398
  20. 20. Glenn AL. How can studying psychopaths help us understand the neural mechanisms of moral judgment?. Cellscience. 2010;6(4):30–5. pmid:26413135
  21. 21. Kahane G. Sidetracked by trolleys: Why sacrificial moral dilemmas tell us little (or nothing) about utilitarian judgment. Soc Neurosci. 2015;10(5):551–60. pmid:25791902
  22. 22. Brown C. Consequentialize This. Ethics. 2011;121(4):749–71.
  23. 23. Kahane G, Everett JAC, Earp BD, Caviola L, Faber NS, Crockett MJ, et al. Beyond sacrificial harm: A two-dimensional model of utilitarian psychology. Psychol Rev. 2018;125(2):131–64. pmid:29265854
  24. 24. Caviola L, Kahane G, Everett JAC, Teperman E, Savulescu J, Faber NS. Utilitarianism for animals, Kantianism for people? Harming animals and humans for the greater good. J Exp Psychol Gen. 2021;150(5):1008–39. pmid:33074696
  25. 25. Navajas J, Heduan FÁ, Garbulsky G, Tagliazucchi E, Ariely D, Sigman M. Moral responses to the COVID-19 crisis. R Soc Open Sci. 2021;8(9):210096. pmid:34527267
  26. 26. Fowler Z, Law KF, Gaesser B. Against Empathy Bias: The Moral Value of Equitable Empathy. Psychological Science. 2021;32(5):766–79.
  27. 27. Borhany H, Golbabaei S, Jameie M, Borhani K. Moral Decision-Making in Healthcare and Medical Professions During the COVID-19 Pandemic. Trends Psychol. 2023;31(1):210–30. pmid:40477450
  28. 28. Antoniou R, Romero-Kornblum H, Young JC, You M, Kramer JH, Chiong W. Reduced utilitarian willingness to violate personal rights during the COVID-19 pandemic. PLoS One. 2021;16(10):e0259110. pmid:34679124
  29. 29. ERZİ S. Psychometric Properties of Adaptation of The Oxford Utilitarianism Scale to Turkish. HUMANITAS - Uluslararası Sosyal Bilimler Dergisi. 2019;7(13):132–47.
  30. 30. Carron R, Blanc N, Anders R, Brigaud E. The Oxford Utilitarianism Scale: Psychometric properties of a French adaptation (OUS-Fr). Behav Res Methods. 2024;56(5):5116–27. pmid:37794207
  31. 31. Pascual-Soler M, Berrios-Riquelme J, Gomez-Frias R, Caamaño-Rocha L, Frias-Navarro D. Oxford Utilitarianism Scale: Psychometric Properties of a Spanish Adaptation (OUS-Spa). SAGE Open. 2025;15(1).
  32. 32. Oshiro B, McAuliffe WHB, Luong R, Santos AC, Findor A, Kuzminska AO. Structural Validity Evidence for the Oxford Utilitarianism Scale Across 15 Languages. Psychol Test Adapt Dev. 2024;5:175–91.
  33. 33. Graham J, Nosek BA, Haidt J, Iyer R, Koleva S, Ditto PH. Mapping the moral domain. J Pers Soc Psychol. 2011;101(2):366–85. pmid:21244182
  34. 34. Graham J, Nosek BA, Haidt J, Iyer R, Spassena K, Ditto PH. Moral foundations questionnaire (MFQ). 2011.
  35. 35. Amormino P, Ploe ML, Marsh AA. Moral foundations, values, and judgments in extraordinary altruists. Sci Rep. 2022;12(1):22111. pmid:36543878
  36. 36. van Dijke J, van Nistelrooij I, Bos P, Duyndam J. Engaging otherness: care ethics radical perspectives on empathy. Med Health Care Philos. 2023;26(3):385–99. pmid:37171745
  37. 37. Hare RM. Moral thinking: its levels, method, and point. New York: Oxford University Press. 1981.
  38. 38. Andrade G, Abdelmonem KYA, Teir HJ, Alqaderi N, Elamin ABA, Bedewy D. Utilitarian psychology and influenza vaccine acceptance in the United Arab Emirates: implications for moral education and public policy. BMC Psychol. 2025;13(1):138. pmid:39972498
  39. 39. Körner A, Deutsch R, Gawronski B. Using the CNI Model to Investigate Individual Differences in Moral Dilemma Judgments. Pers Soc Psychol Bull. 2020;46(9):1392–407. pmid:32111135
  40. 40. FeldmanHall O, Dalgleish T, Evans D, Mobbs D. Empathic concern drives costly altruism. Neuroimage. 2015;105:347–56. pmid:25462694
  41. 41. Dovidio JF, Allen JL, Schroeder DA. Specificity of empathy-induced helping: Evidence for altruistic motivation. J Pers Soc Psychol. 1990;59(2):249–60.
  42. 42. Paruzel-Czachura M, Charzyńska E. Investigating the Relationship between Centrality of Religiosity, Instrumental Harm, and Impartial Beneficence through the Lens of Moral Foundations. Religions. 2022;13(12):1215.
  43. 43. Mueller D, Halfmann K. Dopamine, religiosity, and utilitarian moral judgment. Soc Neurosci. 2021;16(6):627–38. pmid:34461025
  44. 44. Singer P. Ethics and Intuitions. J Ethics. 2005;9(3–4):331–52.
  45. 45. Greene JD, Nystrom LE, Engell AD, Darley JM, Cohen JD. The Neural Bases of Cognitive Conflict and Control in Moral Judgment. Neuron. 2004;44(2):389–400.
  46. 46. Hutcherson CA, Montaser-Kouhsari L, Woodward J, Rangel A. Emotional and Utilitarian Appraisals of Moral Dilemmas Are Encoded in Separate Areas and Integrated in Ventromedial Prefrontal Cortex. J Neurosci. 2015;35(36):12593–605. pmid:26354924
  47. 47. Gawronski B, Armstrong J, Conway P, Friesdorf R, Hütter M. Consequences, norms, and generalized inaction in moral dilemmas: The CNI model of moral decision-making. J Pers Soc Psychol. 2017;113(3):343–76. pmid:28816493
  48. 48. Gawronski B, Beer JS. What makes moral dilemma judgments “utilitarian” or “deontological”?. Soc Neurosci. 2017;12(6):626–32. pmid:27745112
  49. 49. Gawronski B, Conway P, Armstrong JB, Friesdorf R, Hütter M. Understanding responses to moral dilemmas: Deontological inclinations, utilitarian inclinations, and general action tendencies. The social psychology of morality. Routledge. 2016. p. 91–110.
  50. 50. Armstrong J, Friesdorf R, Conway P. Clarifying gender differences in moral dilemma judgments: The complementary roles of harm aversion and action aversion. Soc Psychol Personal Sci. 2019;10(3):353–63.
  51. 51. Friesdorf R, Conway P, Gawronski B. Gender differences in responses to moral dilemmas: a process dissociation analysis. Pers Soc Psychol Bull. 2015;41(5):696–713. pmid:25840987
  52. 52. Qian Y, Takimoto Y, Wang L, Yasumura A. Exploring cultural and gender differences in moral judgment: A cross-cultural study based on the CNI model. Current Psychology. 2023;:1–11.
  53. 53. Atari M, Lai MHC, Dehghani M. Sex differences in moral judgements across 67 countries. Proc Biol Sci. 2020;287(1937):20201201. pmid:33081618
  54. 54. Kennedy JA, Kray LJ, Ku G. A social-cognitive approach to understanding gender differences in negotiator ethics: The role of moral identity. Organ Behav Hum Decis Process. 2017;138:28–44.
  55. 55. Jordan J, Leliveld MC, Tenbrunsel AE. The Moral Self-Image Scale: Measuring and Understanding the Malleability of the Moral Self. Front Psychol. 2015;6:1878. pmid:26696941
  56. 56. Hauser M, Cushman F, Young L, Kang‐Xing Jin R, Mikhail J. A Dissociation Between Moral Judgments and Justifications. Mind & Language. 2007;22(1):1–21.
  57. 57. Fumagalli M, Ferrucci R, Mameli F, Marceglia S, Mrakic-Sposta S, Zago S, et al. Gender-related differences in moral judgments. Cogn Process. 2010;11(3):219–26. pmid:19727878
  58. 58. Rota G, Palumbo S, Lattanzi N, Manfrinati A, Sarlo M, Lotto L, et al. Harm aversion explains utilitarian choices in moral decision-making in males but not in females. Arch Ital Biol. 2016;154(2–3):50–8. pmid:27918062
  59. 59. Armbruster D, Kirschbaum C, Strobel A. Androgenic morality? Associations of sex, oral contraceptive use and basal testosterone levels with moral decision making. Behav Brain Res. 2021;408:113196. pmid:33621608
  60. 60. Bilén D, Dreber A, Johannesson M. Are women more generous than men? A meta-analysis. J Econ Sci Assoc. 2021;7(1):1–18.
  61. 61. Doñate-Buendía A, García-Gallego A, Petrović M. Gender and other moderators of giving in the dictator game: A meta-analysis. J Econ Behav Organ. 2022;198:280–301.
  62. 62. Andreoni J, Vesterlund L. Which is the Fair Sex? Gender Differences in Altruism. The Quarterly Journal of Economics. 2001;116(1):293–312.
  63. 63. Caravita SCS, De Silva LN, Pagani V, Colombo B, Antonietti A. Age-Related Differences in Contribution of Rule-Based Thinking toward Moral Evaluations. Front Psychol. 2017;8:597. pmid:28473788
  64. 64. Fernandes C, Pasion R, Gonçalves AR, Almeida R, Garcez H, Ferreira-Santos F, et al. Awareness to utilitarian responses in later life: An ERP study with moral dilemmas. Neurosci Lett. 2022;787:136824. pmid:35917838
  65. 65. Hannikainen IR, Machery E, Cushman FA. Is utilitarian sacrifice becoming more morally permissible?. Cognition. 2018;170:95–101. pmid:28963983
  66. 66. Huang S, Faul L, Sevinc G, Mwilambwe-Tshilobo L, Setton R, Lockrow AW, et al. Age differences in intuitive moral decision-making: Associations with inter-network neural connectivity. Psychol Aging. 2021;36(8):902–16. pmid:34472915
  67. 67. Eisenberg N, Cumberland A, Guthrie IK, Murphy BC, Shepard SA. Age Changes in Prosocial Responding and Moral Reasoning in Adolescence and Early Adulthood. J Res Adolesc. 2005;15(3):235–60. pmid:20592955
  68. 68. Lin X, Yixuan W, Lei D, Lin Y, Yuan B. Age differences in utilitarian and deontological moral judgments. Ethics Behav. 2024;:1–17.
  69. 69. Sparrow EP, Swirsky LT, Kudus F, Spaniol J. Aging and altruism: A meta-analysis. Psychol Aging. 2021;36(1):49–56. pmid:33705185
  70. 70. Maftei A. Gender and age differences in the two-dimension model of utilitarianism. Studia Universitatis Moldaviae. 2018;9(119):166–72.
  71. 71. McNair S, Okan Y, Hadjichristidis C, de Bruin BW. Age differences in moral judgment: Older adults are more deontological than younger adults. J Behav Decis Mak. 2019;32(1):47–60.
  72. 72. Ryan JM. Timeline of COVID-19. London, UK: Routledge. 2021.
  73. 73. Chuang E, Grand-Clement J, Chen J-T, Chan CW, Goyal V, Gong MN. Quantifying Utilitarian Outcomes to Inform Triage Ethics: Simulated Performance of a Ventilator Triage Protocol under Sars-CoV-2 Pandemic Surge Conditions. AJOB Empir Bioeth. 2022;13(3):196–204. pmid:35435803
  74. 74. Herron TL, Manuel T. Ethics of U.S. government policy responses to the COVID-19 pandemic: A utilitarianism perspective. BSR. 2022;127(S1):343–67.
  75. 75. Alfano V, Ercolano S, Pinto M. Fighting the COVID pandemic: National policy choices in non-pharmaceutical interventions. J Policy Model. 2022;44(1):22–40. pmid:35034999
  76. 76. Hübner J, Schewe DM, Katalinic A, Frielitz FS. Rechtsfragen der Ressourcenzuteilung in der COVID-19-Pandemie – Zwischen Utilitarismus und Lebenswertindifferenz. Dtsch Med Wochenschr. 2020;145(10):687–92.
  77. 77. Kröll W, Platzer J, Ruckenbauer HW, Schaupp W. Die Corona-Pandemie: Ethische, gesellschaftliche und theologische Reflexionen einer Krise. 1 ed. Baden-Baden: Nomos Verlagsgesellschaft mbH & Co. KG. 2020.
  78. 78. Savulescu J, Persson I, Wilkinson D. Utilitarianism and the pandemic. Bioethics. 2020;34(6):620–32. pmid:32433782
  79. 79. Francis KB, McNabb CB. Moral decision-making during COVID-19: moral judgements, moralisation, and everyday behaviour. Frontiers in Psychology. 2022;12.
  80. 80. Horne Z, Powell D, Hummel J. A Single Counterexample Leads to Moral Belief Revision. Cogn Sci. 2015;39(8):1950–64. pmid:25810137
  81. 81. Tepe B, Byrne RMJ. Cognitive processes in imaginative moral shifts: How judgments of morally unacceptable actions change. Mem Cognit. 2022;50(5):1103–23. pmid:35532831
  82. 82. Zhang L, Tong S, Peng K. Moral Believer or Moral Problem-Solver? Moral Pragmatism Fosters Tolerance Without Impeding Moral Behavior. Behav Sci (Basel). 2024;14(11):984. pmid:39594284
  83. 83. Hertz SG, Krettenauer T. Does moral identity effectively predict moral behavior?: A meta-analysis. Rev Gen Psychol. 2016;20(2):129–40.
  84. 84. van den Berg TGC, Kroesen M, Chorus CG. Why Are General Moral Values Poor Predictors of Concrete Moral Behavior in Everyday Life? A Conceptual Analysis and Empirical Study. Front Psychol. 2022;13:817860. pmid:35978767
  85. 85. Myyry L, Helkama K, Silfver-Kuhalampi M, Petkova K, Valentim JP, Liik K. Explorations in Reported Moral Behaviors, Values, and Moral Emotions in Four Countries. Front Psychol. 2021;12:661172. pmid:34017292
  86. 86. Díaz R. Do moral beliefs motivate action?. Ethical Theory Moral Pract. 2023;26(3):377–95.
  87. 87. Asadullah MA, Iqbal T, Khalifa AH, Haider S. Mere moral beliefs aren’t sufficient to determine pro-social behaviors! A moderated mediation framework tested in healthcare settings based on Belief in Self-Determinism (BSD) Theory. Acta Psychol (Amst). 2025;253:104763. pmid:39914110
  88. 88. Smits N, van der Ark LA, Conijn JM. Measurement versus prediction in the construction of patient-reported outcome questionnaires: can we have our cake and eat it?. Qual Life Res. 2018;27(7):1673–82. pmid:29098607
  89. 89. Leiner DJ. SoSci Survey (Version 3.1.06). 2019.
  90. 90. Lind G. The meaning and measurement of moral judgment competence: A dual-aspect model. In: Fasko JD, Willis W, editors. Contemporary philosophical and psychological perspectives on moral development and education. Hampton Press. 2008. p. 185–220.
  91. 91. Zhang H, Hook JN, Johnson KA. Moral Foundations Questionnaire. Encyclopedia of Personality and Individual Differences. Springer International Publishing. 2016. p. 1–3.
  92. 92. Paulus C. Der Saarbrücker Persönlichkeitsfragebogen SPF (IRI) zur Messung von Empathie: psychometrische Evaluation der deutschen Version des Interpersonal Reactivity Index. 2009.
  93. 93. Cloninger CR, Svrakic DM, Przybeck TR. A psychobiological model of temperament and character. Arch Gen Psychiatry. 1993;50(12):975–90. pmid:8250684
  94. 94. Hartig J, Moosbrugger H. Die “ARES-Skalen” zur Erfassung der individuellen BIS- und BAS-Sensitivität. Zeitschrift für Differentielle und Diagnostische Psychologie. 2003;24(4):293–310.
  95. 95. Team RC. R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing. 2025.
  96. 96. Rosseel Y. lavaan: An R package for structural equation modeling. J Stat Softw. 2012;48(2):1–36.
  97. 97. Hooper D, Coughlan J, Mullen MR. Structural equation modelling: guidelines for determining model fit. Electron J Bus Res Methods. 2008;6(1):53–60.
  98. 98. Kline RB. Principles and practice of structural equation modeling. 3rd ed. New York: Guilford Publications. 2011.
  99. 99. Bearden WO, Sharma S, Teel JE. Sample size effects on chi square and other statistics used in evaluating causal models. J Marketing Res. 1982;19(4):425–30.
  100. 100. Hu L, Bentler PM. Cutoff criteria for fit indexes in covariance structure analysis: Conventional criteria versus new alternatives. Structural Equation Modeling: A Multidisciplinary Journal. 1999;6(1):1–55.
  101. 101. Hooper D, Coughlan J, Mullen MR. Evaluating model fit: a synthesis of the structural equation modelling literature. In: Regent’s College, London, UK, 2008. 195–200.
  102. 102. Akaike H. A new look at the statistical model identification. In: Parzen E, Tanabe K, Kitagawa G, editors. Selected Papers of Hirotugu Akaike. New York, NY: Springer. 1974.
  103. 103. Schwarz G. Estimating the dimension of a model. Ann Stat. 1978;:461–4.
  104. 104. Jackson PH. A Note on the Relation between Coefficient Alpha and Guttman’s “split-half” Lower Bounds. Psychometrika. 1979;44(2):251–2.
  105. 105. Jordan C, Hoefer R. Reliability and validity in quantitative measurement. In: Thyer B, editor. The handbook of social work research methods. Thousand Oaks, California: SAGE Publications, Inc. 2001. p. 53–68.
  106. 106. Cronbach LJ. Coefficient Alpha and the Internal Structure of Tests. Psychometrika. 1951;16(3):297–334.
  107. 107. Symonds PM. Factors influencing test reliability. J Educ Psychol. 1928;19:73–87.
  108. 108. Tavakol M, Dennick R. Making sense of Cronbach’s alpha. Int J Med Educ. 2011;2:53–5. pmid:28029643
  109. 109. Kim H-Y. Statistical notes for clinical researchers: assessing normal distribution (2) using skewness and kurtosis. Restor Dent Endod. 2013;38(1):52–4. pmid:23495371
  110. 110. Mihailov E. Measuring impartial beneficence: A Kantian perspective on the Oxford utilitarianism scale. Rev Philos Psychol. 2022;14:989–1004.
  111. 111. Dang J, King KM, Inzlicht M. Why are self-report and behavioral measures weakly correlated?. Trends Cogn Sci. 2020;24(4):267–9.
  112. 112. Van de Mortel TF. Faking it: social desirability response bias in self-report research. Aust J Adv Nurs. 2008;25(4):40–8.
  113. 113. Framhein G, Langer HJ. Theorie und Validität des ‘Moralisches-Urteil-Tests’–Zur Erfassung kognitiv-struktureller Effekte der Sozialisation. Student und Studium im internationalen Vergleich. Klagenfurt: Kärntner Druck- u. Verlagsgesellschaft m.b.h. 1984. p. 166–87.
  114. 114. Lind G. How to teach moral competence. Logos Verlag Berlin GmbH. 2019.
  115. 115. Jockel S, Dogruel L, Arendt K, Stahl H, Bowman N. German MFQ translation and back-translation. https://moralfoundations.org/questionnaires/. 2010.
  116. 116. Haidt J, Graham J. When Morality Opposes Justice: Conservatives Have Moral Intuitions that Liberals may not Recognize. Soc Just Res. 2007;20(1):98–116.
  117. 117. Nugent W. Progressivism: A Very Short Introduction. Oxford: Oxford University Press. 2010.
  118. 118. Davis MH. Interpersonal reactivity index. 1980.
  119. 119. Carver CS, White TL. Behavioral inhibition, behavioral activation, and affective responses to impending reward and punishment: the BIS/BAS scales. J Pers Soc Psychol. 1994;67(2):319–33.
  120. 120. Evans JD. Straightforward statistics for the behavioral sciences. Belmont, CA, US: Thomson Brooks/Cole Publishing Co. 1996.
  121. 121. Benjamini Y, Hochberg Y. Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. Journal of the Royal Statistical Society Series B: Statistical Methodology. 1995;57(1):289–300.
  122. 122. Benjamini Y, Hochberg Y. On the adaptive control of the false discovery rate in multiple testing with independent statistics. J Educ Behav Stat. 2000;25(1):60–83.
  123. 123. Kneer M, Hannikainen IR. Trolleys, triage and Covid-19: the role of psychological realism in sacrificial dilemmas. Cogn Emot. 2022;36(1):137–53.
  124. 124. Mazza M, Attanasio M, Pino MC, Masedu F, Tiberti S, Sarlo M, et al. Moral decision-making, stress, and social cognition in frontline workers vs. population groups during the COVID-19 pandemic: An explorative study. Frontiers in Psychology. 2020;11.
  125. 125. Wilkinson D, Zohny H, Kappes A, Sinnott-Armstrong W, Savulescu J. Which factors should be included in triage? An online survey of the attitudes of the UK general public to pandemic triage dilemmas. BMJ Open. 2020;10(12):e045593. pmid:33293401
  126. 126. Merkel A. An address to the nation by Federal Chancellor Merkel on March 18, 2020. 2020.
  127. 127. Everett JAC, Colombatto C, Awad E, Boggio P, Bos B, Brady WJ, et al. Moral dilemmas and trust in leaders during a global health crisis. Nat Hum Behav. 2021;5(8):1074–88. pmid:34211151
  128. 128. Everett JAC, Colombatto C, Chituc V, Brady WJ, Crockett M. The effectiveness of moral messages on public health behavioral intentions during the COVID-19 pandemic. PsyArXiv. 2020.
  129. 129. Figley CR. Compassion fatigue: Toward a new understanding of the costs of caring. In: Stamm B, editor. Secondary traumatic stress: Self-care issues for clinicians, researchers, and educators. Baltimore, MD, US: The Sidran Press; 1995. p. 3–28.
  130. 130. Bride BE, Radey M, Figley CR. Measuring Compassion Fatigue. Clin Soc Work J. 2007;35(3):155–63.
  131. 131. Taylor S. Moral fatigue--a nursing perspective. Bioethics Forum. 2002;18(1–2):37–43. pmid:12956173
  132. 132. Biermann K. Mit weiterer Gewalt ist zu rechnen. Zeit Online. 2020.
  133. 133. Zeit. Bei Corona-Protesten in Dresden zwölf Polizisten verletzt. Zeit Online. 2021.
  134. 134. Trebesch C, Antezza A, Bushnell K, Frank A, Frank P, Franz L. The Ukraine Support Tracker: Which countries help Ukraine and how?. Kiel Institute for the World Economy (IfW Kiel). 2023.
  135. 135. Höltmann G, Hutter S, Rößler-Prokhorenko C. Solidarität und Protest in der Zeitenwende: Reaktionen der Zivilgesellschaft auf den Ukraine-Krieg. ZZ 2022–601. 2022.
  136. 136. Mayer SJ, Lietz A, Dollmann J, Siegel M, Köhler J. Reaktionen auf den Ukraine-Krieg. Eine Schnellbefragung des DeZIM.panels. SocArXiv. 2022.
  137. 137. Rodrigues J, Ulrich N, Mussel P, Carlo G, Hewig J. Measuring Prosocial Tendencies in Germany: Sources of Validity and Reliablity of the Revised Prosocial Tendency Measure. Front Psychol. 2017;8:2119. pmid:29270144
  138. 138. Skoe EEA, Cumberland A, Eisenberg N, Hansen K, Perry J. The Influences of Sex and Gender-Role Identity on Moral Cognition and Prosocial Personality Traits. Sex Roles. 2002;46(9–10):295–309.
  139. 139. Sochting I, Skoe EE, Marcia JE. Care-oriented moral reasoning and prosocial behavior: A question of gender or sex role orientation. Sex Roles. 1994;31(3–4):131–47.
  140. 140. Cordellieri P, Boccia M, Piccardi L, Kormakova D, Stoica LV, Ferlazzo F. Gender differences in solving moral dilemmas: emotional engagement, care and utilitarian orientation. Psychol Stud. 2020;65(4):360–9.
  141. 141. Eisenberg N, Hofer C, Sulik MJ, Liew J. The development of prosocial moral reasoning and a prosocial orientation in young adulthood: concurrent and longitudinal correlates. Dev Psychol. 2014;50(1):58–70. pmid:23731289
  142. 142. Dalton D, Ortegren M. Gender Differences in Ethics Research: The Importance of Controlling for the Social Desirability Response Bias. J Bus Ethics. 2011;103(1):73–93.
  143. 143. Christensen JF, Flexas A, Calabrese M, Gut NK, Gomila A. Moral judgment reloaded: a moral dilemma validation study. Front Psychol. 2014;5:607. pmid:25071621
  144. 144. Ford MR, Lowery CR. Gender differences in moral reasoning: A comparison of the use of justice and care orientations. J Pers Soc Psychol. 1986;50(4):777–83.
  145. 145. Gilligan C. In a different voice: Psychological theory and women’s development. Cambridge, MA, US: Harvard University Press. 1982.
  146. 146. Hooker B. Fairness. Ethics Theory Moral Practice. 2005;8(4):329–52.
  147. 147. Cullity G. Public goods and fairness. Australasian Journal of Philosophy. 2008;86(1):1–21.
  148. 148. Brown B. Impartiality and fair play revisited. J Political Philosophy. 2023;31(3):315–36.
  149. 149. Rawls J. A Theory of Justice: Original Edition. Harvard University Press. 1971.
  150. 150. Hadarics M, Kende A. Moral foundations of positive and negative intergroup behavior: Moral exclusion fills the gap. Int J Intercult Relat. 2018;64:67–76.
  151. 151. Nilsson A, Erlandsson A, Västfjäll D. The congruency between moral foundations and intentions to donate, self-reported donations, and actual donations to charity. J Res Pers. 2016;65:22–9.
  152. 152. Smith IH, Aquino K, Koleva S, Graham J. The moral ties that bind . . . even to out-groups: the interactive effect of moral identity and the binding moral foundations. Psychol Sci. 2014;25(8):1554–62.
  153. 153. Cuff BMP, Brown SJ, Taylor L, Howat DJ. Empathy: A Review of the Concept. Emot Rev. 2016;8(2):144–53.
  154. 154. Tusche A, Böckler A, Kanske P, Trautwein F-M, Singer T. Decoding the Charitable Brain: Empathy, Perspective Taking, and Attention Shifts Differentially Predict Altruistic Giving. J Neurosci. 2016;36(17):4719–32. pmid:27122031
  155. 155. Caprara GV, Alessandri G, Eisenberg N. Prosociality: the contribution of traits, values, and self-efficacy beliefs. J Pers Soc Psychol. 2012;102(6):1289–303. pmid:21942280
  156. 156. Eisenberg N. Empathy-related responding and prosocial behaviour. In: Novartis Foundation symposium, 2007. 71–88.
  157. 157. Patil I, Silani G. Reduced empathic concern leads to utilitarian moral judgments in trait alexithymia. Front Psychol. 2014;5:501. pmid:24904510
  158. 158. Decety J, Cowell JM. Empathy, justice, and moral behavior. AJOB Neurosci. 2015;6(3):3–14.
  159. 159. Takamatsu R. Turning off the empathy switch: Lower empathic concern for the victim leads to utilitarian choices of action. PLoS One. 2018;13(9):e0203826. pmid:30212541
  160. 160. MJ Crockett, Z. Kurth-Nelson, JZ Siegel, P. Dayan. Harm to others outweighs harm to self in moral decision making. PNAS USA. 2014;111(48):17320–17325.
  161. 161. Antoniou R, Romero-Kornblum H, Young JC, You M, Kramer JH, Rankin KP, et al. Contrasting two models of utilitarian reasoning. Heliyon. 2023;9(7):e17498. pmid:37424598
  162. 162. Conway P, Gawronski B. Deontological and utilitarian inclinations in moral decision making: a process dissociation approach. J Pers Soc Psychol. 2013;104(2):216–35. pmid:23276267
  163. 163. Ugazio G, Grueschow M, Polania R, Lamm C, Tobler P, Ruff C. Neuro-computational foundations of moral preferences. Soc Cogn Affect Neurosci. 2022;17(3):253–65.
  164. 164. Paruzel-Czachura M, Pypno K, Everett JAC, Białek M, Gawronski B. The drunk utilitarian revisited: does alcohol really increase utilitarianism in moral judgment?. Pers Soc Psychol Bull. 2023;49(1):20–31.
  165. 165. Singer N, Sommer M, Döhnel K, Zänkert S, Wüst S, Kudielka BM. Acute psychosocial stress and everyday moral decision-making in young healthy men: The impact of cortisol. Horm Behav. 2017;93:72–81. pmid:28495558
  166. 166. Koppang H, Wenstøp S, Pineda JA. Neural perspectives on morality due to beguiling mechanisms. Front Psychol. 2024;14:1151155. pmid:38362524
  167. 167. Qu C, Bénistant J, Dreher J-C. Neurocomputational mechanisms engaged in moral choices and moral learning. Neurosci Biobehav Rev. 2022;132:50–60. pmid:34826508
  168. 168. Qu C, Météreau E, Butera L, Villeval MC, Dreher J-C. Neurocomputational mechanisms at play when weighing concerns for extrinsic rewards, moral values, and social image. PLoS Biol. 2019;17(6):e3000283. pmid:31170138
  169. 169. Cummins DD, Cummins RC. Emotion and deliberative reasoning in moral judgment. Front Psychol. 2012;3:328. pmid:22973255
  170. 170. Bago B, Kovacs M, Protzko J, Nagy T, Kekecs Z, Palfi B, et al. Situational factors shape moral judgements in the trolley dilemma in Eastern, Southern and Western countries in a culturally diverse sample. Nat Hum Behav. 2022;6(6):880–95. pmid:35422529
  171. 171. Landesregierung Baden-Württemberg. Verordnung der Landesregierung über infektionsschützende Maßnahmen gegen die Ausbreitung des Virus SARS-CoV-2 (Corona-Verordnung – CoronaVO) [Ordinance of the state government on infection-protective measures against the spread of the SARS-CoV-2 virus (Corona Ordinance – CoronaVO)]. In: Baden-Württemberg L, editor. 2020.11.30.
  172. 172. Lachowicz-Tabaczek K, Kozłowska MA. Being others-oriented during the pandemic: Individual differences in the sense of responsibility for collective health as a robust predictor of compliance with the COVID-19 containing measures. Pers Individ Dif. 2021;183:111138. pmid:34511682
  173. 173. Amormino P, Kagel A, Li JL, Marsh AA. Close relationship partners of impartial altruists do not report diminished relationship quality and are similarly altruistic. Commun Psychol. 2024;2(1):128. pmid:39730983
  174. 174. Roets A, Bostyn DH, De Keersmaecker J, Haesevoets T, Van Assche J, Van Hiel A. Utilitarianism in minimal-group decision making is less common than equality-based morality, mostly harm-oriented, and rarely impartial. Sci Rep. 2020;10(1):13373. pmid:32770106
  175. 175. Guyer J, Robinson L, Cassier E, Miller R. Ruptures and new realities: The transatlantic relationship after Trump. Institute for Global Affairs. 2025.
  176. 176. Svoboda E. Moral Injury is an Invisible Epidemic That Affects Millions. Sci Am. 2022.09.19.
  177. 177. Akram F. Moral injury and the COVID-19 pandemic: A philosophical viewpoint. Ethics Med Public Health. 2021;18:100661. pmid:33778145
  178. 178. Hegarty S, Lamb D, Stevelink SAM, Bhundia R, Raine R, Doherty MJ, et al. “It hurts your heart”: frontline healthcare worker experiences of moral injury during the COVID-19 pandemic. Eur J Psychotraumatol. 2022;13(2):2128028. pmid:36276556
  179. 179. Die jüngste Gemeinde in Deutschland hat einen Altersdurchschnitt von 27,7 Jahren [press release]. 2021.05.27.