Multifaceted Empathy Test (MET): Validity evidence for the Brazilian population concerning the computer-based (face-to-face) and online versions

Felipe Fernandes Vieira de Lima; Giordano Rossi; Rafael Guimarães dos Santos; Flávia de Lima Osório

doi:10.1371/journal.pone.0284524

Abstract

Background

The lack of empathy is associated with several psychological and behavioral disorders, and it is important to assess this construct broadly, through multi-methods.

Objective

To conduct a psychometric analysis of the Brazilian version of the Multifaceted Empathy Test (MET), a computerized task that assesses emotional and cognitive empathy.

Methods

The samples were recruited from the community using the snowball method (phase 1: face-to-face; N = 142) and through social media (phase 2: online; N = 519). The participants completed the MET and the Interpersonal Reactivity Index (IRI) to assess the convergent validity between the instruments. To assess validity with correlated constructs (resilient coping and stress), the Brief Resilient Coping Scale and Perceived Stress Scale were used. A task was also implemented in the face-to-face application to assess facial emotions. The retest was applied 25 days later to a portion of the sample (face-to-face: N = 31; online: N = 102).

Results

Adequate test-retest reliability was observed for most items (ICC = 0.49–0.98), along with satisfactory infit and outfit indexes, discriminatory ability between sexes, and weak convergent validity with empathy measures (r = 0.17–0.36) and correlated constructs (r = 0.12–0.46). The MET presented good psychometric indicators, supporting its use in face-to-face/computer-based and online formats in clinical and research contexts. However, weaknesses were found regarding the cognitive subscale, and future studies with larger samples are needed to enable more robust conclusions concerning its adequacy. Further research on the instrument’s internal structure can also contribute to its improvement.

Citation: Fernandes Vieira de Lima F, Rossi G, dos Santos RG, de Lima Osório F (2023) Multifaceted Empathy Test (MET): Validity evidence for the Brazilian population concerning the computer-based (face-to-face) and online versions. PLoS ONE 18(7): e0284524. https://doi.org/10.1371/journal.pone.0284524

Editor: Yuan-Pang Wang, University of Sao Paulo, School of Medicine, BRAZIL

Received: January 6, 2023; Accepted: March 31, 2023; Published: July 13, 2023

Copyright: © 2023 Fernandes Vieira de Lima et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: Data cannot be shared publicly due to ethical restrictions (authors do not have authorization from the local ethics committee for public sharing, only upon identification of the requester). Thus data can be requested from the author by e-mail (flaliosorio@gmail.com) or from the Medical School of Ribeirão Preto; São Paulo University (pg-saudemental@fmrp.usp.br).

Funding: Coordination for the Improvement of Higher Education Personnel (Capes); National Council for Scientific and Technological Development (CNPq –Productivity Research Fellows - Process No. 10 302601/2019-8). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests: The authors declare no conflict of interest.

Introduction

Empathy is a fundamental element of human experience [1], considered a domain of social cognition. It concerns the ability to understand and feel what someone else thinks and feels in affectively demanding situations and properly communicate such an understanding and feeling [2]. According to Baron-Cohen et al. [3], it is a multidimensional construct comprising a cognitive and an emotional component.

Lack of empathy is associated with various psychological and behavioral disorders, such as autism [4], personality disorders [5–7], social anxiety [8], and emotion regulation [9], among others. Additionally, Bordin et al. [10] point out relationships between anxiety and depression symptoms and declined empathy among health professionals. On the other hand, Spilg et al. [11] note that empathy is associated with favorable outcomes such as resilience among medical students.

Thus, systematically assessing empathy in clinical and research contexts is increasingly relevant. According to Lima & Osório [12], many different methods are used for this purpose, such as physiological measurements (e.g., skin conductivity and temperature, heart and respiratory rate), analysis of facial expressions and gestures, standardized instruments based on interviews, and self-assessment scales. The authors also note that in recent years, self-report measures have been the most frequently studied and used; however, some limitations associated with these measures are currently highlighted, such as social desirability bias. Thus, different resources based on multi-methods are increasingly needed to encompass the entire construct [13]. Emphasis has also been given to instruments with high ecological validity, such as computer-based instruments, which assess psychological phenomena more realistically because they provide multimedia resources such as images and videos that portray everyday situations [14].

From this perspective, Dziobek et al. [15] developed the Multifaceted Empathy Test (MET), an instrument using photorealistic stimuli to measure cognitive and emotional empathy simultaneously, which is one of its advantages, in addition to favoring greater ecological validity and less bias in the application and emission of responses. Furthermore, as it is a computerized instrument, it favors faster and more standardized application and correction. The MET was originally published in German, but has also been translated and cross-culturally adapted into Chinese [16] and English [17], demonstrating adequate psychometric properties (e.g., reliability, internal consistency). Recently, it was culturally adapted to Brazilian Portuguese, and its content validity was assessed [18]. Despite being widely adopted by scientific studies [19–21], other types of validity and reliability remain to be assessed, in order to ensure the adequacy of the measure.

It is also noteworthy that in the current scenario, the use of advanced technologies that allow remote/online assessment of skills and other psychological and behavioral characteristics is increasingly necessary/desired. This fact encourages the development and/or adaptation of instruments to meet this demand, as well as the development of specific psychometric studies. For use in the Brazilian population, as pointed out by a review conducted by Lima and Osório [12], few instruments for assessing empathy are available (Empathy Inventory, Interpersonal Reactivity Index and Online Empathy Questionnaire), all of which are self-reported, based on items scored on Likert-type scales.

Therefore, this study’s objective is to present a psychometric assessment of the MET Brazilian version in the computer-based (face-to-face) and online versions, verify reliability indicators (i.e., internal consistency and test-retest), validity (based on external measures: known-group and convergent validity), and item response (difficulty and performance deviations based on the Rasch model). With this, we intend to add new psychometric evidence of the MET, assess its suitability for use in an online format and expand the resources available for the assessment of empathy in the Brazilian context.

Materials and methods

This observational study has a psychometric approach. The psychometric properties of the MET were assessed at two points in time. First, the psychometric indicators of the computer-based version, applied face-to-face, were tested; then, the instrument was adapted to be applied online using the REDCap software. Data collection took place between July 2020 and October 2021 (for more information on data collection see S1 Table).

The study was approved by the Research Ethics Committee of the Hospital das Clínicas da Faculdade de Medicina de Ribeirão Preto da Universidade de São Paulo (CAAE n° 05370818.9.0000.5440) and the subjects gave their consent in writing or by providing their digital signature on the REDCap platform.

Participants and procedures

The sample size was based on the recommendations proposed by MacCallum et al. [22]. For these authors, when the communalities are high on average and the factors are well determined, sample sizes in the range of 100 to 200 should be sufficient to adequately estimate the population parameters. Inclusion and exclusion criteria were adopted to recruit two community samples, one for each phase.

Inclusion criteria were: adult individuals (≥18 years old), both genders, literate, capable of reading and understanding texts, and voluntarily agreeing to participate in the study. Not completing any instrument or not finishing data collection were the criteria for excluding participants.

S1 Fig includes the flowchart presenting each sample’s composition. The final convenience samples comprised 142 participants (computer-based/face-to-face) and 519 participants (online).

The snowball sampling technique [23] was used to recruit the participants for the computer-based (face-to-face) collection phase. It is a technique that uses chains of reference and indications. Briefly, key informants help the researcher to initiate recruitment. Then, the people indicated by these informants are asked to indicate new potential participants, with the desired characteristics, from their own personal network, and so on, until the desired sample is reached. First, data were collected using the following instruments (paper-and-pencil versions):

Sociodemographic and clinical questionnaire, composed of 11 items and developed for this study to collect complementary data concerning the participants’ sociodemographic and clinical information (gender, age, education, number of children, marital status, live together, religion, job, health problem, medication use and psychiatric diagnosis).

Interpersonal Reactivity Index (IRI) is an instrument that assesses emotional and cognitive empathy. It was proposed by Davis [24] and translated, adapted and psychometrically evaluated for Brazilian Portuguese by Koller et al. [25] (α = 0.75; convergent validity = 0.54). Only three of the original scales compose the Brazilian version (Empathic Concern, Personal Distress, and Perspective Taking).

Brief Resilient Coping Scale (BRCS) is an instrument proposed by Sinclair and Wallston [26] to assess the use of coping strategies to solve problems in stressful situations. The version translated and adapted by Ribeiro and Morais [27] was used here, and demonstrated internal consistency (α = 0.53) and unifactorial structure.

Perceived Stress Scale (PSS) was proposed by Cohen et al. [28] and translated and adapted by Luft et al. [29]. It was designed to assess the individuals’ perception of how unpredictable and uncontrollable were the life events they experienced in the last month, providing a measure of stress perception.

In sequence, the computerized tasks were administered:

a) Multifaceted Empathy Test, developed by Dziobek et al. [15] and adapted by Foell et al. [17] to assess empathy’s cognitive and emotional aspects. It is composed of 40 photographs of people experiencing positive or negative situations; the participants are asked to assess each photograph and answer two specific questions. In the first question, “What emotion is this person feeling?”, the respondent chooses the alternative (out of four) that best represents the feeling elicited. The second question, “How much do you empathize with this person?”, requires the participant to assess how much s/he feels empathically mobilized by the feeling the photograph expresses, using a Likert scale ranging from 1 (not at all) to 9 (extremely). The stimuli are presented in eight blocks with ten photographs each.

b) Facial Emotion Recognition Task (FERT) [30]: to assess the recognition of dynamic facial emotions. A series of 24 stimuli composed of photographs of actors representing characteristics typical of six basic emotions are presented; the participants’ hit rate is assessed.

The order in which the instruments were applied was random to avoid bias in data collection. In the online collection phase, the participants were recruited via social media (e.g., Facebook, Instagram, WhatsApp) and email. The MET was adapted to be applied online on a desktop or mobile device using the REDCap platform. Given REDCap’s limitations in randomizing items, we created two versions of the instrument in which the order of item presentation varied. All the instruments described above were applied, except the FERT.

The instrument (MET) was reapplied 25 days after the first application (both phases), using the same procedures adopted in the first collection to assess its test-retest reliability.

Data analysis

The descriptive statistical analyses, normality tests, comparisons between the groups, and correlation analyses were conducted using the Statistical Package for Social Sciences (SPSS, version 23.0). The instrument’s reliability was assessed using Cronbach’s alpha and McDonald’s omega via the Jamovi software (version 1.6.23). Furthermore, we verified whether the instrument fitted the Rasch model and checked for differential item functioning (DIF) according to gender and between MET versions (computer-based (face-to-face) and online) using the WINSTEPS software (version 5.2.3).

Sociodemographic and clinical data were analyzed with descriptive statistics. The following analysis/criteria were adopted for the psychometric study:

a) The instrument’s reliability was assessed using Cronbach’s alpha and McDonald’s omega (values above 0.60 are satisfactory [31,32]). Test-retest reliability was verified using the Intraclass Correlation Coefficient (ICC) and item-total correlation. The correlations’ magnitude was classified according to criteria suggested by Streiner et al. [33]: weak (between 0–0.25), moderate (between 0.26–0.50), strong (between 0.51–0.70), and very strong (above 0.71);

b) The data were fitted to the dichotomous [34] and polytomous [35] Rasch models. Reliability, Person Separation Index and Item Separation Index were assessed. Reliability is expected to be above 0.70, while Separation Indexes above 1.53 indicate the existence of at least two groups of respondents or two different skill levels [36]. Performance deviations were assessed using the infit and outfit indexes. According to Wright [37], values between 0.5 and 1.5 are acceptable for infit and outfit. Additionally, the difficulty of the items was estimated using a logit scale [38], with its mean value established at zero, with easier items presenting negative values and more challenging items presenting positive values.

c) The scores were compared between genders (validity based on known groups) using Student’s t-test. Bootstrapping was performed using the bias-corrected and accelerated method with 1000 resamplings for the results to present greater reliability, correct deviations from the normal distribution, and the differences between the groups’ sizes; a 95% confidence interval was established for the differences between means [39]. The variance homogeneity assumption was assessed using the Levene test.

d) The convergent validity study between the MET, its subscales, and the IRI was performed using Spearman’s correlation. Considering that both instruments assess the empathy construct, the hypothesis is that the correlation between them is direct and of strong magnitude (between 0.51–0.70). Likewise, the study of convergent validity with correlated constructs (coping (BRCS), stress (PSS), and recognition of facial emotions) was conducted, for which correlations of moderate magnitude are expected (between 0.26–0.50). The correlations’ magnitudes were classified according to the guidelines proposed by Streiner et al. [33].
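The correlation criterion in item d) can be illustrated with a minimal sketch. It computes Spearman's coefficient as Pearson's correlation on average ranks and labels the result with the magnitude bands attributed above to Streiner et al. [33]. The function names are illustrative, the study itself ran these analyses in SPSS, and the treatment of values that fall between the published bands (e.g., 0.255 or 0.705) is an assumption made here.

```python
from math import sqrt
from statistics import mean


def average_ranks(values):
    """Return 1-based ranks, giving tied values the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    num = sum((a - mx) * (b - my) for a, b in zip(x, y))
    den = sqrt(sum((a - mx) ** 2 for a in x) * sum((b - my) ** 2 for b in y))
    return num / den


def spearman(x, y):
    """Spearman's rho: Pearson's correlation computed on the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


def classify_magnitude(r):
    """Label |r| with the bands cited in the text (boundary handling assumed)."""
    size = abs(r)
    if size <= 0.25:
        return "weak"
    if size <= 0.50:
        return "moderate"
    if size <= 0.70:
        return "strong"
    return "very strong"
```

Under these bands, a coefficient of 0.36 would be labelled moderate and one of 0.17 weak, which is how the convergent validity range reported in the Abstract would be read.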

Results

Characterization of the samples

The samples’ main sociodemographic and clinical characteristics are presented in Table 1. Note that their profiles are similar regarding the variables highlighted here, which enable comparisons (i.e., most were women, single, aged 32 on average, with a high educational level; approximately 30% reported health problems, and 16% to 27% reported a psychiatric diagnosis).

Table 1. Sociodemographic and clinical characterization of the samples.

https://doi.org/10.1371/journal.pone.0284524.t001

Reliability indicators and item analyses

Regarding item-total correlations, the cognitive subscale items showed weak correlations in both versions (computer-based: 0.23 to 0.29, p<0.05; online: -0.08 to 0.27, p<0.05). However, the opposite was found for the emotional subscale in both versions (face-to-face: 0.31 to 0.75, p<0.05; online: 0.28 to 0.76, p<0.05).

Regarding the reliability indicators, Cronbach’s alpha and McDonald’s omega values were below the expected for the cognitive subscale (face-to-face: α = 0.50, ω = 0.53; online: α = 0.42, ω = 0.46). On the other hand, the results for the emotional subscale were excellent (face-to-face: α = 0.96, ω = 0.96; online: α = 0.96, ω = 0.97). The cognitive subscale obtained moderate reliability coefficients in the test-retest (face-to-face: ICC (95% CI) = 0.49 (0.20–0.72), for the items: ICC = 0.22–0.57; online: ICC (95% CI) = 0.59 (0.47–0.69), for the items: ICC = 0.26–0.82). As for the emotional subscale, the coefficients’ magnitudes varied between strong and very strong (face-to-face: ICC (95% CI) = 0.97 (0.95–0.98), for the items: ICC = 0.34–0.85; online: ICC (95% CI) = 0.98 (0.97–0.99), for the items: ICC = 0.61–0.87). Further details are provided in S2 Table.
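For readers less familiar with the internal-consistency coefficient reported above, the following is a minimal sketch of the standard Cronbach's alpha formula, alpha = k/(k-1) * (1 - sum of item variances / variance of total scores). The function name and the respondents-by-items input layout are assumptions for illustration; the study computed alpha and omega in Jamovi, and omega is not covered here.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a respondents-by-items table.

    scores: list of rows, one per respondent, each a list of k item scores
    (0/1 for dichotomous items, or ratings for polytomous items).
    """
    k = len(scores[0])
    # Sample variance of each item across respondents.
    item_vars = [variance([row[j] for row in scores]) for j in range(k)]
    # Sample variance of the respondents' total scores.
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)
```

When items barely covary, the variance of the total score approaches the sum of the item variances and alpha approaches zero, which is the pattern the weak item-total correlations of the cognitive subscale would produce.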

The average percentage of correct answers provided to the cognitive subscale was 66.12% for the computer-based (face-to-face) version and 67.73% for the online version, while the mean score obtained in the emotional empathy subscale was 6.19 (SD = 0.53) for the computer-based (face-to-face) version and 6.46 (SD = 0.83) for the online version. The mean scores of these indices were similar between the two formats. S3 Table presents this information in detail.
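The two summary indices above (percentage of correct answers for the cognitive subscale, mean rating for the emotional subscale) can be sketched as follows. This is an illustration under stated assumptions rather than the scoring routine used in the study: the dictionary keys `chosen`, `correct`, and `rating` are hypothetical names, and the only facts taken from the text are the four-alternative emotion question and the 1 to 9 rating scale.

```python
from statistics import mean


def score_met(trials):
    """Summarize one respondent's MET answers.

    trials: list of dicts with hypothetical keys
      'chosen'  - emotion label the respondent selected
      'correct' - emotion label keyed as correct for the photograph
      'rating'  - empathy rating on the 1 (not at all) to 9 (extremely) scale
    """
    for trial in trials:
        if not 1 <= trial["rating"] <= 9:
            raise ValueError("ratings must lie on the 1-9 scale")
    hits = sum(1 for trial in trials if trial["chosen"] == trial["correct"])
    return {
        "cognitive_pct_correct": 100.0 * hits / len(trials),
        "emotional_mean_rating": mean(trial["rating"] for trial in trials),
    }
```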

Person-Item Maps are presented in S2–S5 Figs. Tables 2 and 3 present the reliability, Person/Item Separation Index, difficulty estimates, and performance deviations (infit and outfit) for the cognitive and emotional subscales of the computer-based (face-to-face) and online versions.

Table 2. Estimates of difficulty, performance deviations, reliability and Person/Item Separation Index for the MET cognitive subscale.

https://doi.org/10.1371/journal.pone.0284524.t002

Table 3. Difficulty estimates, performance deviations, reliability and Person/Item Separation Index for the MET emotional subscale.

https://doi.org/10.1371/journal.pone.0284524.t003

Table 2 shows that persons’ reliability was low in both cognitive subscales, indicating low data reproducibility. Consequently, the Person Separation Index was also below the expected, indicating the samples were homogeneous regarding the skills estimated. The reliability of the cognitive items in both versions showed a good data range, indicating that the items covered different difficulty levels.
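The link between low person reliability and a low Person Separation Index follows from the standard Rasch relation between the two quantities, G = sqrt(R / (1 - R)), or equivalently R = G^2 / (1 + G^2). The sketch below, with illustrative function names, shows that the two cut-offs given in the Methods (reliability above 0.70, separation above 1.53) are the same criterion expressed on two scales.

```python
from math import sqrt


def separation_from_reliability(reliability):
    """Separation index G implied by a reliability R, with 0 <= R < 1."""
    if not 0.0 <= reliability < 1.0:
        raise ValueError("reliability must be in [0, 1)")
    return sqrt(reliability / (1.0 - reliability))


def reliability_from_separation(separation):
    """Reliability R implied by a separation index G >= 0."""
    return separation ** 2 / (1.0 + separation ** 2)


print(round(separation_from_reliability(0.70), 2))  # -> 1.53
```

Because the relation is monotonic, any subscale whose person reliability falls below 0.70 necessarily has a person separation below 1.53.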

Regarding the response pattern to the cognitive items, the computer-based (face-to-face) version did not identify items with a response pattern different than expected. As for the online version, only item 6 (Crestfallen) presented performance deviation (outfit = 2.06).
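To make the outfit statistic concrete for the dichotomous cognitive items, the following minimal sketch computes infit and outfit mean squares under the dichotomous Rasch model. It assumes that person abilities and the item difficulty (in logits) have already been estimated, as WINSTEPS did in the study; the function names are illustrative and the estimation step itself is not reproduced.

```python
from math import exp


def rasch_probability(ability, difficulty):
    """Probability of a correct answer under the dichotomous Rasch model."""
    return 1.0 / (1.0 + exp(-(ability - difficulty)))


def item_fit(responses, abilities, difficulty):
    """Return (infit, outfit) mean squares for one dichotomous item.

    responses: 0/1 answers, one per person
    abilities: person ability estimates in logits, same order as responses
    difficulty: item difficulty estimate in logits
    """
    squared_residuals, variances, squared_z = [], [], []
    for x, theta in zip(responses, abilities):
        p = rasch_probability(theta, difficulty)
        var = p * (1.0 - p)  # model variance of a 0/1 response
        squared_residuals.append((x - p) ** 2)
        variances.append(var)
        squared_z.append((x - p) ** 2 / var)
    # Outfit: unweighted mean of squared standardized residuals.
    outfit = sum(squared_z) / len(squared_z)
    # Infit: information-weighted version of the same quantity.
    infit = sum(squared_residuals) / sum(variances)
    return infit, outfit
```

Because outfit is unweighted, a few surprising answers from people far from the item's difficulty (for example, an able respondent missing an easy item) inflate it quickly, which is one way a value such as 2.06 can arise.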

Table 3 shows high-reliability rates for the emotional subscale in both versions, indicating good data reproducibility. However, the analysis of performance deviations regarding the answers to the emotional items revealed three items with outfits outside the parameters (items 4, 12, and 29) in the computer-based (face-to-face) version and two in the online version (items 28 and 40). These findings suggest unexpected response patterns among people with latent trait levels different from these items’ difficulty levels.

Validity indicators

As expected, the known-group validity analysis revealed that women obtained statistically higher means in the computer-based (face-to-face) version’s cognitive and emotional subscales. However, the results obtained in the online version indicate that women obtained higher means only in the emotional subscale. Further details are available in S4 Table.
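The known-group comparison relied on a bootstrapped 95% confidence interval for the difference between means. The sketch below illustrates the general idea with 1000 resamples, but it uses the simpler percentile interval rather than the bias-corrected and accelerated (BCa) method applied in the study, so it is not a reproduction of the reported analysis. The function name, the fixed seed, and the example scores are all hypothetical.

```python
import random
from statistics import mean


def bootstrap_mean_difference(group_a, group_b, n_resamples=1000,
                              alpha=0.05, seed=0):
    """Percentile bootstrap CI for mean(group_a) - mean(group_b).

    Returns (observed_difference, lower_bound, upper_bound).
    """
    rng = random.Random(seed)  # fixed seed so the sketch is repeatable
    differences = []
    for _ in range(n_resamples):
        # Resample each group with replacement, keeping its original size.
        resample_a = rng.choices(group_a, k=len(group_a))
        resample_b = rng.choices(group_b, k=len(group_b))
        differences.append(mean(resample_a) - mean(resample_b))
    differences.sort()
    lower = differences[int((alpha / 2) * n_resamples)]
    upper = differences[int((1 - alpha / 2) * n_resamples) - 1]
    return mean(group_a) - mean(group_b), lower, upper


# Invented emotional-subscale means, for illustration only.
women = [7.0, 6.5, 7.2, 6.8, 7.1, 6.9]
men = [5.0, 5.5, 5.2, 4.8, 5.1, 5.3]
observed, low, high = bootstrap_mean_difference(women, men)
print(observed, low, high)
```

An interval that excludes zero is read as a difference between the groups. For a BCa interval, an established routine such as SciPy's `scipy.stats.bootstrap` with `method="BCa"` is a more dependable choice than extending this sketch by hand.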

Table 4 shows the MET’s convergent validity indicators of both the computer-based (face-to-face) and online versions. Only low/moderate magnitude correlations were found between the IRI and its subscales (except Personal Distress) with the emotional subscales of the MET in the online version. In the computer-based (face-to-face) version, these correlations were found only for the Perspective Taking subscale.

Table 4. MET convergent/correlated construct validity indicators.

https://doi.org/10.1371/journal.pone.0284524.t004

A low magnitude correlation was found between resilient coping and the total/positive emotional subscale in both versions of the MET. As for perceived stress, low and moderate magnitude correlations were found, especially with the emotional subscale in the computer-based (face-to-face) version. Regarding the FERT, low and moderate correlations were found between the percentage of correct answers concerning happiness, fear, anger, surprise, and some of the MET’s specific subscales.

Regarding the correlations involving the MET subscales, a strong association was observed between the positive and negative subscales and the total (cognitive: r > 0.73; emotional: r > 0.77), in both versions. The correlation between the positive and negative subscales was less expressive (r = 0.21–0.25) and not significant when considering the emotional and cognitive subscales.

Discussion

This study presents psychometric indicators concerning the reliability and validity of the MET’s computer-based (face-to-face) and online versions adapted to Brazil. To minimize bias, two convenience samples with similar sociodemographic and clinical profiles were addressed.

Regarding reliability, satisfactory internal consistency was found for the emotional subscale, corroborating the moderate to strong correlations between most items and the instrument’s total score. Nonetheless, most of the item-total correlations of the cognitive subscale were weak, harming its internal consistency; the set of items presented low covariance with the total score, with a limited predictive ability [40].

Different psychometric studies involving the MET have recurrently presented reliability indicators lower than expected for the cognitive subscale [17,21]. Müller [21] notes that this may be related to the dichotomous characteristic of this set of items. However, this hypothesis is weakened by the appropriate indicators verified in studies of other dichotomously scored instruments, i.e., the Early Trauma Inventory Self Report-Short Form [41] and the Haj-Yahia’s Questionnaire [42].

In agreement with our findings, the Cronbach’s alphas reported for the American version [17] were 0.49 for the cognitive subscale and 0.94 for the emotional subscale. Therefore, in line with the literature [32,43], these authors proposed excluding the items presenting a low correlation with the total. Hence, a version with 19 of the 40 initial items remained; this shorter version did not substantially change alpha (0.51). The authors considered that a potential explanation is related to some items’ ceiling effect, possibly restricting the test’s inter-individual accuracy range.

Items presenting low correlations with the total score were not excluded in this study because most items presented weak correlations, and analyses indicated that none of the items would increase alpha values if they were excluded. As for the hypothesis proposed by Foell et al. [17] regarding the ceiling effect (items with a percentage of correct answers above 80%), a large number of items with a ceiling effect were found (MET’s computer-based version: 10 items– 25.0% of the total; MET’s online version: 15 items– 37.5% of the total). However, this hypothesis cannot be fully confirmed since items that did not show a ceiling effect also presented a weak item-total correlation. Additionally, at an exploratory level, internal consistency was analyzed by excluding these items, and the indices remained inadequate (<0.43).
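The ceiling-effect screen described above (items with more than 80% correct answers) amounts to a simple filter over per-item hit rates. The sketch below uses hypothetical item labels and function names; only the 80% threshold, the 40-item total, and the counts of 10 and 15 flagged items come from the text.

```python
def ceiling_items(pct_correct_by_item, threshold=80.0):
    """Return the items whose percentage of correct answers exceeds threshold."""
    return [item for item, pct in pct_correct_by_item.items() if pct > threshold]


def share_of_items(n_flagged, n_total=40):
    """Percentage of the instrument's items that were flagged."""
    return 100.0 * n_flagged / n_total


print(share_of_items(10))  # computer-based version -> 25.0
print(share_of_items(15))  # online version -> 37.5
```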

Therefore, these findings suggest that more global aspects related to the empathy construct and how it is measured, from the cognitive perspective, may be associated with the cognitive subscale’s poor psychometric performance. Studies adopting other empathy instruments, such as the Empathy Assessment Index [44] and The Basic Empathy Scale [45], also found that the cognitive subscales’ indicators are below those for the emotional subscale, which seems to support this hypothesis. As discussed below, this subscale also presents other weak psychometric indicators (e.g., convergent validity). In contrast, indicators resulting from specific analyses of items (infit and outfit) are satisfactory and do not justify the removal of any item.

With regard to test-retest reliability, temporal stability of the emotional subscale is observed. For the cognitive subscale, less robust values were observed for some individual items (six items with values below 0.30) and for the subscale as a whole (0.49–0.59). Thus, although these general indices are acceptable, there is a certain weakness in relation to this psychometric quality. In the study by Yu [16] carried out with the Chinese version, in an interval of 28 days, the test-retest reliability was adequate for both subscales: 0.84 for the cognitive and 0.80 for the emotional.

Rasch analysis of the cognitive and emotional items indicated high reliability in both versions (>0.92). This finding suggests that the item difficulty hierarchy tends to be reproduced in future surveys [36]. On the other hand, person reliability concerning skill levels was adequate only for the emotional items (>0.91). It was below the expected for the cognitive items, suggesting that we cannot affirm that the participants’ skill estimates concerning the cognitive items will be reproduced in future surveys.

This fact is possibly associated with the small skill range found in the sample for these items; it was concentrated between 0 and 2 logits for both versions. According to Linacre [36], the lack of a sufficiently large skill range (ideally between -3 and 3 logits) negatively impacts reliability indicators, considering that not all possible skill levels are represented in the sample and considered in the analysis. It is noteworthy, however, that the reliability obtained via Rasch analysis does not refer to data quality but rather to the reproducibility or non-reproducibility of the parameters found here [36].

Regarding the distribution of the items’ difficulty level, the analyses indicated that, in general, the skill interval between -1 and 1 was the best represented by the test, indicating that people with a latent trait level within this range are possibly more accurately evaluated than people with a higher and/or lower latent trait level [36]. Such a fact may negatively impact the instrument’s ability to discriminate between individuals, especially those with extremely high or low skill levels.

Differences were also found in the scores obtained by men and women in both subscales, indicating validity evidence. Thus, MET discriminated between these known groups, reinforcing its clinical validity, considering the vast literature indicating that women present higher levels of empathy than men [12,46–48]. According to Brody [49], these differences between genders are due to sociocultural variables, especially regarding how girls and boys are socialized. While girls are encouraged to express affection and care, boys are taught to inhibit pro-social behavior. Eisenberg and Lennon [50] consider that social desirability may have a potential impact, leading the female group to endorse items more frequently/intensively. Differences at the biological level, such as in hormone concentrations (e.g., testosterone [51]) and some brain regions’ functioning patterns [52], also possibly explain such differences.

Potential relationships regarding the convergent validity parameters between the MET and the IRI were investigated. The IRI is also an instrument with an internal structure composed of domains linked to cognitive empathy (Perspective Taking) and emotional empathy (Personal Distress and Empathic Concern). Contrary to expectations, the cognitive subscale was not associated with any IRI subscale; Dziobek et al. [15] and Foell et al. [17] had already reported this finding. Yu [16] found significant correlations, but of very low magnitude (0.14). Correlations between the MET and IRI emotional subscales were significant but low in magnitude and only for the Empathic Concern and Personal Distress subscales. These findings reinforce the results reported by previous studies [15–17], in which weak associations were predominant (0.63, 0.22, and 0.15, respectively).

These results reinforce an intriguing question in the field related to the low convergent validity of instruments addressing empathy. Lima and Osório [12] consider that the lack of consensus on how the construct can be represented and assessed impacts psychometric indicators. For instance, the cognitive aspects of empathy in the IRI are assessed with sentences such as “I try to look at everybody’s side of a disagreement before I make a decision.” On the other hand, identifying emotions in the MET’s cognitive subscale [15] is more related to recognizing facial expressions and to the Theory of Mind, which involves recognizing other peoples’ mental states [53].

Lima and Osório [12] reviewed the factor structure of more than a dozen empathy instruments and verified a lack of a common theoretical model to adequately explain the instruments’ internal structure. They consider that it probably contributes to this lack of convergence among the instruments, requiring a deeper discussion about the construct.

Ding et al. [13] note that the fact that the relationships between self-report empathy scales and empathic behavior tasks are weak to moderate should not be considered a problem because these are not redundant but potentially complementary measures. These observations conceivably explain the indicators found in this study, reinforcing the use of multi-method assessments to minimize disadvantages between different methodologies [14].

Regarding the results involving the recognition of facial emotions, this construct is considered one of the domains that compose social cognition, sharing common characteristics with empathic skills, such as the ability to identify details in other people’s behaviors and signify experienced phenomena [53]. Therefore, the low magnitude correlations evidenced by MET suggest convergence between these constructs that represent domains of correlated social cognition. Such a fact had previously been reported by Parreira [54] when studying the IRI and Empathy Quotient [55], which showed correlations between 0.16 and 0.28 in a facial recognition task of basic emotions.

On the other hand, associations between empathy measures addressing clinically relevant correlates, such as aggressiveness, alexithymia, and impulsiveness, among others [12], are well documented. Associations with a resilience measure and a measure of perceived stress were tested in this study. Unlike the study conducted by Vinayak and Judge [56], in which the correlations between empathy (Toronto Empathy Questionnaire [57]) and resilience (Connor-Davidson Resilience Scale [58]) were strong (0.67), in this study the correlations with resilience were significant but of low magnitude. The same pattern of correlations was found for the relationship with perceived stress, in line with Gupta and Kiran [59], in which only weak/moderate correlations (r = 0.28) were found between the emotional empathy scores of the Basic Empathy Scale [60] and perceived stress.

In general, the MET showed adequate psychometric indicators, in line with previous studies, which supports its use in research and clinical practice. Its innovative format stands out for its greater ecological validity and the possibility of use in different formats. The emotional subscale performed better, whereas the cognitive subscale showed some weaknesses, especially from the perspective of classical test theory. These weaknesses need to be explored in larger samples to allow more robust conclusions about the subscale’s adequacy. In addition, future studies of the instrument’s internal structure can contribute to this understanding.

Further studies testing the invariance of MET responses as a function of the type of application (computer-based/face-to-face and online) are also warranted. According to Luxton et al. [61], even when empirical evidence shows that a given measure can be applied using technology, there is no guarantee that it will remain comparable across formats. Factors such as taking the test outside the experimental setting and changes in the layout of the instrument’s presentation can influence the response pattern, which needs to be analyzed using specific statistical techniques.

Finally, one limitation of this study concerns sampling bias arising from convenience sampling and from the sociodemographic characteristics of the samples: most participants were young adults with high education and skills (restricted range), which must be considered when interpreting and generalizing the data. Future studies should include samples that are more heterogeneous in these respects and should expand the sources and forms of recruitment (e.g., randomized) to minimize the impact of selection bias. Furthermore, although the retest interval followed the recommendations in the literature, the effect of a possible memory bias cannot be ruled out.
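To illustrate why a restricted range matters for interpretation, the minimal Python sketch below simulates two correlated standard-normal variables and compares the correlation in the full simulated sample with the correlation among cases above a cutoff on the first variable. All names and parameter values (`true_r`, `cutoff`, `n`, `seed`) are hypothetical and chosen only for illustration; the simulation does not use or reproduce this study's data.

```python
import math
import random


def pearson_r(xs, ys):
    """Pearson correlation computed from raw sums (standard library only)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def simulate_range_restriction(true_r=0.5, n=20000, cutoff=0.5, seed=0):
    """Return (full-sample r, r among cases with x above the cutoff)."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    xs, ys = [], []
    for _ in range(n):
        x = rng.gauss(0.0, 1.0)
        # y shares variance with x so that corr(x, y) is about true_r
        y = true_r * x + math.sqrt(1.0 - true_r ** 2) * rng.gauss(0.0, 1.0)
        xs.append(x)
        ys.append(y)
    full_r = pearson_r(xs, ys)
    # Keep only cases above the cutoff on x, mimicking a restricted range
    kept = [(x, y) for x, y in zip(xs, ys) if x > cutoff]
    restricted_r = pearson_r([x for x, _ in kept], [y for _, y in kept])
    return full_r, restricted_r
```

Under these assumptions the restricted-sample correlation is smaller than the full-sample one, which suggests that coefficients obtained in a homogeneous sample may understate associations in a more heterogeneous population.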

Conclusions

In general, the MET presented favorable indices of temporal stability, convergent validity with measures of empathy (albeit of low magnitude), clinical validity, and adequate discriminative capacity. Its items have different levels of difficulty, and this is the first study to demonstrate these indicators. In addition to the well-known computerized version, the version developed for online use also showed good psychometric indicators and can be used as an alternative, expanding the instrument’s applicability. The better performance of the emotional subscale and the need for further studies on the cognitive subscale are highlighted.

Supporting information

S1 Fig. Flowchart regarding the inclusion/exclusion of participants in the computer-based (face-to-face) and online version samples.

https://doi.org/10.1371/journal.pone.0284524.s001

(DOCX)

S2 Fig. Item-person map of the MET cognitive subscale–computer-based (face-to-face) version.

https://doi.org/10.1371/journal.pone.0284524.s002

(DOCX)

S3 Fig. Item-person map of the MET emotional subscale–computer-based (face-to-face) version.

https://doi.org/10.1371/journal.pone.0284524.s003

(DOCX)

S4 Fig. Item-person map of the MET cognitive subscale–online version.

https://doi.org/10.1371/journal.pone.0284524.s004

(DOCX)

S5 Fig. Item-person map of the MET emotional subscale–online version.

https://doi.org/10.1371/journal.pone.0284524.s005

(DOCX)

S1 Table. Additional information on data collection.

https://doi.org/10.1371/journal.pone.0284524.s006

(DOCX)

S2 Table. Item-total correlation, internal consistency and test-retest reliability of the MET–computer-based (face-to-face) and online version.

* When applying the retest, this stimulus showed zero variance and therefore the coefficient was not obtained; P = Positive; N = Negative.

https://doi.org/10.1371/journal.pone.0284524.s007

(DOCX)

S3 Table. Indicators related to the items of the cognitive and emotional subscales of the MET–computer-based (face-to-face) and online version, depending on emotional valence.

P = Positive; N = Negative.

https://doi.org/10.1371/journal.pone.0284524.s008

(DOCX)

S4 Table. Indicators of validity based on external measures—known groups—MET cognitive and emotional subscales–computer-based (face-to-face) and online version.

CI = Confidence interval; df = degrees of freedom; SD = Standard deviation.

https://doi.org/10.1371/journal.pone.0284524.s009

(DOCX)

S5 Table. Analysis of metric invariance between the computer-based (face-to-face) and online versions of the MET considering the release of factor loadings for each item.

P = Positive; N = Negative.

https://doi.org/10.1371/journal.pone.0284524.s010

(DOCX)

Acknowledgments

The authors thank Lívia Manara, Júlia Bonagamba, Mayara Souza, and Thaís Ribas for their fundamental contribution in the computer-based (face-to-face) collection of data.

References

1. Melloni M, Lopez V, Ibanez A. Empathy and contextual social cognition. Cogn Affect Behav Neurosci. 2014; 14: 407–425. pmid:23955101
2. Shamay-Tsoory SG. The neural bases for empathy. Neuroscientist. 2011;17(1):18–24. pmid:21071616
3. Baron-Cohen S, Knickmeyer RC, Belmonte MK. Sex differences in the brain: implications for explaining autism. Science. 2005; 310(5749): 819–823. pmid:16272115
4. Harmsen IE. Empathy in autism spectrum disorder. J Autism Dev Disord. 2019; 49(10): 3939–3955. pmid:31147901
5. Ritter K, Dziobek I, Preissler S, Rüter A, Vater A, Fydrich T, et al. Lack of empathy in patients with narcissistic personality disorder. Psychiatry Research. 2011; 187(1–2): 241–247.
6. Domes G, Ower N, von Dawans B, Spengler FB, Dziobek I, Bohus M, et al. Effects of intranasal oxytocin administration on empathy and approach motivation in women with borderline personality disorder: a randomized controlled trial. Transl Psychiatry. 2019; 9(1): 328. pmid:31801937
7. Salgado RM, Pedrosa R, Bastos-Leite AJ. Dysfunction of empathy and related processes in borderline personality disorder: a systematic review. Harv Rev Psychiatry. 2020; 28(4): 238–254. pmid:32692088
8. Pittelkow MM, Aan Het Rot M, Seidel LJ, Feyel N, Roest AM. Social anxiety and empathy: a systematic review and meta-analysis. J Anxiety Disord. 2021; 78: 102357. pmid:33588287
9. Thompson NM, Uusberg A, Gross JJ, Chakrabarti B. Empathy and emotion regulation: An integrative account. Prog Brain Res. 2019; 247: 273–304. pmid:31196438
10. Bordin D, Vascoski VC, Pereira ARG, Santos CBD, Zanesco C, Fadel CB. Relationship between empathy and quality of life: a study with primary health care professionals. Rev Min Enferm. 2019; 23: 1–8.
11. Spilg EG, McNeill K, Sabri E, Duffy MC, Ananny L, Graham ID, et al. A cross-sectional study of the interrelationship between burnout, empathy and resilience in academic physicians. Psychol Health Med. 2021; 19: 1–8. pmid:34281438
12. Lima FF, de Lima Osório F. Empathy: assessment instruments and psychometric quality–a systematic literature review with a meta-analysis of the past ten years. Front Psychol. 2021; 12: 781346. pmid:34899531
13. Ding YJ, de Paula Ferreira R, Pereira ART, Osório AAC. Multimethod Assessment of Empathic abilities in university students. Cad. de Pós-Graduação em Distúrbios do Desenv. 2015; 15(2): 8–17
14. Carvalho S, Leite J, Galdo-Álvarez S, Gonçalves OF. The emotional movie database (EMDB): A self-report and psychophysiological study. Appl psychophysiol biofeedback. 2012; 37(4): 279–294. pmid:22767079
15. Dziobek I, Rogers K, Fleck S, Bahnemann M, Heekeren HR, Wolf OT, et al. Dissociation of cognitive and emotional empathy in adults with Asperger syndrome using the Multifaceted Empathy Test (MET). J Autism Dev Disord. 2008; 38(3): 464–473. pmid:17990089
16. Yu Z. The multifaceted empathy test scale: development, reliability and validity. Thesis, Anhui Medical University. 2018. Available from: https://www.cnki.net/kcms/doi/10.19405/j.cnki.issn1000-1492.2018.07.023.html.
17. Foell J, Brislin SJ, Drislane LE, Dziobek I, Patrick CJ. Creation and validation of an english-language version of the multifaceted empathy test (MET). J Psychopathol Behav Assess. 2018; 40(3): 431–439.
18. Lima FF. Translation, cross-cultural adaptation and psychometric study of the Multifaceted Test of Empathy (MET). Thesis, Ribeirão Preto Medical School—University of São Paulo. 2022. Available from: https://www.teses.usp.br/teses/disponiveis/17/17148/tde-01122022-123031/pt-br.php.
19. Kuypers KPC, Dolder PC, Ramaekers JG, Liechti ME. Multifaceted empathy of healthy volunteers after single doses of MDMA: A pooled sample of placebo-controlled studies. J Psychopharmacol. 2017 May;31(5):589–598. pmid:28372480
20. Ze O, Thoma P, Suchan B. Cognitive and affective empathy in younger and older individuals. Aging & mental health. 2014; 18(7): 929–935. pmid:24827596
21. Müller KD. Erstellung und Erprobung zweier Parallelversionen des Multidimensionalen Empathietests (MET) zur longitudinalen Erfassung des Empathievermögens. Doctoral dissertation. Charité-Universitätsmedizin of Berlin. 2021. Available from: https://refubium.fu-berlin.de/handle/fub188/29764.
22. MacCallum RC, Widaman KF, Zhang S, Hong S. Sample size in factor analysis. Psychological Methods. 1999; 4(1): 84–99.
23. Flick U. Introdução à pesquisa qualitativa. 3rd. Porto Alegre: Artmed. 2008; pp. 207–217.
24. Davis MH. Measuring individual differences in empathy: Evidence for a multidimensional approach. J Pers Soc Psychol. 1983; 44(1): 113.
25. Koller SH, Camino C, Ribeiro J. Adaptation and internal validation of two empathy scales for use in Brazil. Estud Psicol (Campinas). 2001; 18(3): 43–53.
26. Sinclair VG, Wallston KA. The development and psychometric evaluation of the Brief Resilient Coping Scale. Assessment. 2004; 11(1): 94–101. pmid:14994958
27. Ribeiro JLP, Morais R. Portuguese adaptation of the Brief Resilient Coping Scale. Psicologia: Saúde e Doenças 2010; 11(1): 5–13.
28. Cohen S, Kamarck T, Mermelstein R. A global measure of perceived stress. J Health Soc Behav. 1983; 24(4):385–96. pmid:6668417
29. Luft CDB, Sanches SDO, Mazo GZ, Andrade A. Brazilian version of the Perceived Stress Scale: translation and validation for the elderly. Rev Saúde Pública. 2007; 41(4): 606–615.
30. Arrais KC, Machado-de-Sousa JP, Trzesniak C, Santos Filho A, Ferrari MCF, Osório FL, et al. Social anxiety disorder women easily recognize fearfull, sad and happy faces: the influence of gender. Journal of Psychiatric Research. 2010; 44(8): 535–540. pmid:19962717
31. Cortina JM. What is coefficient alpha? An examination of theory and applications. J Appl Psychol. 1993; 78(1): 98.
32. Hair JF, Black WC, Babin BJ, Anderson RE, Tatham RL. Análise multivariada de dados. 6th ed. Bookman editora. 2009.
33. Streiner DL, Norman GR, Cairney J. Health measurement scales: a practical guide to their development and use. 5th ed. Oxford University Press, USA. 2014.
34. Rasch G. Studies in mathematical psychology: Probabilistic models for some intelligence and attainment tests. 1st ed. Denmark: Danmarks Paedagogiske Institut. 1960.
35. Andrich D. Application of a psychometric rating model to ordered categories which are scored with successive integers. Applied Psychological Measurement. 1978; 2(4): 581–594.
36. Linacre JM. Winsteps® Rasch measurement computer program User’s Guide. Version 5.2.3. Portland, Oregon: Winsteps.com. 2022.
37. Wright BD, Linacre JM. Reasonable mean-square fit values. Rasch Measurement Transactions. 1996; 8: 370–371.
38. Bond TG, Fox CM. Applying the Rasch model: Fundamental Measurement in the Human Sciences, 2rd ed. New Jersey: Lawrence Erlbaum Associates. 2015.
39. Haukoos JS, Lewis RJ. Advanced statistics: bootstrapping confidence intervals for statistics with “difficult” distributions. Acad Emerg Med. 2005; 12(4): 360–365. pmid:15805329
40. Raykov T, Marcoulides GA. Introduction to psychometric theory. 1st ed. New York, Routledge: 2010.
41. Osório FL, Salum GA, Donadon MF, Forni-dos-Santos L, Loureiro SR, Crippa JAS. Psychometrics properties of early trauma inventory self report–short form (ETISR-SR) for the Brazilian context. PLoS One. 2013; 8(10): e76337. pmid:24098478
42. Sotoodeh Ghorbani S, Ghaffari M, Hashemi Nazari SS. Psychometric properties of Haj-Yahia’s questionnaire of violence against women in a sample of married women in Tehran, Iran. BMC Public Health. 2022; 22(1): 448. pmid:35255890
43. Cristobal E, Flavián C, Guinalíu M. Perceived e-service quality (PeSQ): Measurement validation and effects on consumer satisfaction and web site loyalty. Manag Serv Qual. 2007; 17(3): 317–340.
44. Gerdes KE, Lietz CA, Segal EA. Measuring empathy in the 21st century: Development of an empathy index rooted in social cognitive neuroscience and social justice. Soc Work Res. 2011; 35(2): 83–93.
45. Carré A, Stefaniak N, d’Ambrosio F, Bensalah L, Besche-Richard C. The Basic Empathy Scale in adults (BES-A): factor structure of a revised form. Psychol Assess. 2013; 25(3): 679.
46. Alcorta-Garza A, González-Guerrero JF, Tavitas-Herrera SE, Rodríguez-Lara FJ, Hojat M. Validación de la escala de empatía médica de Jefferson en estudiantes de medicina mexicanos. Salud Mental. 2005; 28(5): 57–63.
47. Sherman JJ, Cramer A. Measurement of changes in empathy during dental school. J Dent Educ. 2005; 69(3): 338–345. pmid:15749944
48. Chen D, Lew R, Hershman W, Orlander J. A cross-sectional measurement of medical student empathy. J Gen Intern Med. 2007; 22(10): 1434–1438. pmid:17653807
49. Brody L. Gender, emotion, and the family. In Gender, Emotion, and the Family. Cambridge: Harvard University Press; 1999
50. Eisenberg N, Lennon R. Sex differences in empathy and related capacities. Psychological Bulletin. 1983; 94(1): 100.
51. Knickmeyer R, Baron-Cohen S, Raggatt P, Taylor K, Hackett G. Fetal testosterone and empathy. Horm Behav. 2006; 49(3): 282–292. pmid:16226265
52. Fukushima H, Hiraki K. Perceiving an opponent’s loss: gender-related differences in the medial-frontal negativity. Soc Cogn Affect Neurosci. 2006 Sep;1(2):149–57. pmid:18985125
53. Santos CMTM. Emotion recognition training in the development of Theory of Mind skills: a study with children with Autism Spectrum Disorder. Thesis. Universidade Lusófona do Porto. 2014. Available from: https://recil.ensinolusofona.pt/handle/10437/5878.
54. Parreira TAP. Empathy and Recognition of Emotions: The Effect of Gender and the Relation of Empathy to the Target. Thesis. University of Algarve. 2013. Available from: http://hdl.handle.net/10400.1/6120
55. Wakabayashi A, Baron-Cohen S, Wheelwright S, Goldenfeld N, Delaney J, Fine D, et al. Development of short forms of the Empathy Quotient (EQ-Short) and the Systemizing Quotient (SQ-Short). Pers Individ Dif. 2006; 41(5): 929–940.
56. Vinayak S, Judge J. Resilience and empathy as predictors of psychological wellbeing among adolescents. International Journal of Health Sciences and Research. 2018; 8(4): 192–200.
57. Spreng RN, McKinnon MC, Mar RA, Levine B. The Toronto Empathy Questionnaire: Scale development and initial validation of a factor-analytic solution to multiple empathy measures. J Pers Assess. 2009; 91(1): 62–71.
58. Campbell-Sills L, Stein MB. Psychometric analysis and refinement of the Connor–Davidson Resilience Scale (CD-RISC): Validation of a 10-item measure of resilience. J Trauma Stress. 2007; 20(6): 1019–1028. pmid:18157881
59. Gupta K, Kiran NC. Empathy and Perceived Stress among College Students. Int J Indian Psychol 2021; 9(2): 220–231.
60. Jolliffe D, Farrington DP. Development and validation of the Basic Empathy Scale. J Adolesc. 2006; 29(4): 589–611. pmid:16198409
61. Luxton DD, Pruitt LD, Osenbach JE. Best practices for remote psychological assessment via telehealth technologies. Prof Psychol Res Pr. 2014; 45(1): 27–35.

[ref1] 1. Melloni M, Lopez V, Ibanez A. Empathy and contextual social cognition. Cogn Affect Behav Neurosci. 2014; 14: 407–425. pmid:23955101
View Article
PubMed/NCBI
Google Scholar

[2] View Article

[3] PubMed/NCBI

[4] Google Scholar

[ref2] 2. Shamay-Tsoory SG. The neural bases for empathy. Neuroscientist. 2011;17(1):18–24. pmid:21071616
View Article
PubMed/NCBI
Google Scholar

[6] View Article

[7] PubMed/NCBI

[8] Google Scholar

[ref3] 3. Baron-Cohen S, Knickmeyer RC, Belmonte MK. Sex differences in the brain: implications for explaining autism. Science. 2005; 310(5749): 819–823. pmid:16272115
View Article
PubMed/NCBI
Google Scholar

[10] View Article

[11] PubMed/NCBI

[12] Google Scholar

[ref4] 4. Harmsen IE. Empathy in autism spectrum disorder. J Autism Dev Disord. 2019; 49(10): 3939–3955. pmid:31147901
View Article
PubMed/NCBI
Google Scholar

[14] View Article

[15] PubMed/NCBI

[16] Google Scholar

[ref5] 5. Ritter K, Dziobek I, Preissler S, Rüter A, Vater A, Fydrich T, et al. Lack of empathy in patients with narcissistic prsonality disorder. Psychiatry research. 2011; 187(1–2): 241–247.
View Article
Google Scholar

[18] View Article

[19] Google Scholar

[ref6] 6. Domes G, Ower N, von Dawans B, Spengler FB, Dziobek I, Bohus M, et al. Effects of intranasal oxytocin administration on empathy and approach motivation in women with borderline personality disorder: a randomized controlled trial. Transl Psychiatry. 2019; 9(1): 328. pmid:31801937
View Article
PubMed/NCBI
Google Scholar

[21] View Article

[22] PubMed/NCBI

[23] Google Scholar

[ref7] 7. Salgado RM, Pedrosa R, Bastos-Leite AJ. Dysfunction of empathy and related processes in borderline personality disorder: a systematic review. Harv Rev Psychiatry. 2020; 28(4): 238–254. pmid:32692088
View Article
PubMed/NCBI
Google Scholar

[25] View Article

[26] PubMed/NCBI

[27] Google Scholar

[ref8] 8. Pittelkow MM, Aan Het Rot M, Seidel LJ, Feyel N, Roest AM. Social anxiety and empathy: a systematic review and meta-analysis. J Anxiety Disord. 2021 78: 102357. pmid:33588287
View Article
PubMed/NCBI
Google Scholar

[29] View Article

[30] PubMed/NCBI

[31] Google Scholar

[ref9] 9. Thompson NM, Uusberg A, Gross JJ, Chakrabarti B. Empathy and emotion regulation: An integrative account. Prog Brain Res. 2019; 247: 273–304. pmid:31196438
View Article
PubMed/NCBI
Google Scholar

[33] View Article

[34] PubMed/NCBI

[35] Google Scholar

[ref10] 10. Bordin D, Vascoski VC, Pereira ARG, Santos CBD, Zanesco C, Fadel CB. Relationship between empathy and quality of life: a study with primary health care professionals. Rev Min Enferm. 2019; 23: 1–8.
View Article
Google Scholar

[37] View Article

[38] Google Scholar

[ref11] 11. Spilg EG, McNeill K, Sabri E, Duffy MC, Ananny L, Graham ID, et al. A cross-sectional study of the interrelationship between burnout, empathy and resilience in academic physicians. Psychol Health Med. 2021; 19: 1–8. pmid:34281438
View Article
PubMed/NCBI
Google Scholar

[40] View Article

[41] PubMed/NCBI

[42] Google Scholar

[ref12] 12. Lima FF, de Lima Osório. Empathy: assessment instruments and psychometric quality–a systematic literature review with a meta-analysis of the past ten years. Front Psychol. 2021; 12: 781346 pmid:34899531
View Article
PubMed/NCBI
Google Scholar

[44] View Article

[45] PubMed/NCBI

[46] Google Scholar

[ref13] 13. Ding YJ, de Paula Ferreira R, Pereira ART, Osório AAC. Multimethod Assessment of Empathic abilities in university students. Cad. de Pós-Graduação em Distúrbios do Desenv. 2015; 15(2): 8–17
View Article
Google Scholar

[48] View Article

[49] Google Scholar

[ref14] 14. Carvalho S, Leite J, Galdo-Álvarez S, Gonçalves OF. The emotional movie database (EMDB): A self-report and psychophysiological study. Appl psychophysiol biofeedback. 2012; 37(4): 279–294. pmid:22767079
View Article
PubMed/NCBI
Google Scholar

[51] View Article

[52] PubMed/NCBI

[53] Google Scholar

[ref15] 15. Dziobek I, Rogers K, Fleck S, Bahnemann M, Heekeren HR, Wolf OT, et al. Dissociation of cognitive and emotional empathy in adults with Asperger syndrome using the Multifaceted Empathy Test (MET). J Autism Dev Disord. 2008; 38(3): 464–473. pmid:17990089
View Article
PubMed/NCBI
Google Scholar

[55] View Article

[56] PubMed/NCBI

[57] Google Scholar

[ref16] 16. Yu Z. The multifaceted empathy test scale: development, reliability and validity. Thesis, Anhui Medical University. 2018. Available from: https://www.cnki.net/kcms/doi/10.19405/j.cnki.issn1000-1492.2018.07.023.html.

[ref17] 17. Foell J, Brislin SJ, Drislane LE, Dziobek I, Patrick CJ. Creation and validation of an english-language version of the multifaceted empathy test (MET). J Psychopathol Behav Assess. 2018; 40(3): 431–439.
View Article
Google Scholar

[60] View Article

[61] Google Scholar

[ref18] 18. Lima FF. Translation, cross-cultural adaptation and psychometric study of the Multifaceted Test of Empathy (MET). Thesis, Ribeirão Preto Medical School—University of São Paulo. 2022. Available from: https://www.teses.usp.br/teses/disponiveis/17/17148/tde-01122022-123031/pt-br.php.

[ref19] 19. Kuypers KPC, Dolder PC, Ramaekers JG, Liechti ME. Multifaceted empathy of healthy volunteers after single doses of MDMA: A pooled sample of placebo-controlled studies. J Psychopharmacol. 2017 May;31(5):589–598. pmid:28372480
View Article
PubMed/NCBI
Google Scholar

[64] View Article

[65] PubMed/NCBI

[66] Google Scholar

[ref20] 20. Ze O, Thoma P, Suchan B. Cognitive and affective empathy in younger and older individuals. Aging & mental health. 2014; 18(7): 929–935. pmid:24827596
View Article
PubMed/NCBI
Google Scholar

[68] View Article

[69] PubMed/NCBI

[70] Google Scholar

[ref21] 21. Müller KD. Erstellung und Erprobung zweier Parallelversionen des Multidimensionalen Empathietests (MET) zur longitudinalen Erfassung des Empathievermögens. Doctoral dissertation. Charité-Universitätsmedizin of Berlin. 2021. Available from: https://refubium.fu-berlin.de/handle/fub188/29764.

[ref22] 22. MacCallum RC, Widaman KF, Zhang S, Hong S. Sample size in factor analysis. Psychological Methods. 1999; 4(1): 84–99.
View Article
Google Scholar

[73] View Article

[74] Google Scholar

[ref23] 23. Flick U. Introdução à pesquisa qualitativa. 3rd. Porto Alegre: Artmed. 2008; pp. 207–217.

[ref24] 24. Davis MH. Measuring individual differences in empathy: Evidence for a multidimensional approach. J Pers Soc Psychol. 1983; 44(1): 113.
View Article
Google Scholar

[77] View Article

[78] Google Scholar

[ref25] 25. Koller SH, Camino C, Ribeiro J. Adaptation and internal validation of two empathy scales for use in Brazil. Estud Psicol (Campinas). 2001; 18(3): 43–53.
View Article
Google Scholar

[80] View Article

[81] Google Scholar

[ref26] 26. Sinclair VG, Wallston KA. The development and psychometric evaluation of the Brief Resilient Coping Scale. Assessment. 2004; 11(1): 94–101. pmid:14994958
View Article
PubMed/NCBI
Google Scholar

[83] View Article

[84] PubMed/NCBI

[85] Google Scholar

[ref27] 27. Ribeiro JLP, Morais R. Portuguese adaptation of the Brief Resilient Coping Scale. Psicologia: Saúde e Doenças 2010; 11(1): 5–13.
View Article
Google Scholar

[87] View Article

[88] Google Scholar

[ref28] 28. Cohen S, Kamarck T, Mermelstein R. A global measure of perceived stress. J Health Soc Behav. 1983; 24(4):385–96. pmid:6668417
View Article
PubMed/NCBI
Google Scholar

[90] View Article

[91] PubMed/NCBI

[92] Google Scholar

[ref29] 29. Luft CDB. Sanches SDO, Mazo GZ, Andrade A. Brazilian version of the Perceived Stress Scale: translation and validation for the elderly. Rev Saúde Pública, 2007; 41(4), 606–615.
View Article
Google Scholar

[94] View Article

[95] Google Scholar

[ref30] 30. Arrais KC, Machado-de-Sousa JP, Trzesniak C, Santos Filho A, Ferrari MCF, Osório FL, et al. Social anxiety disorder women easily recognize fearfull, sad and happy faces: the influence of gender. Journal of Psychiatric Research. 2010; 44(8): 535–540. pmid:19962717
View Article
PubMed/NCBI
Google Scholar

[97] View Article

[98] PubMed/NCBI

[99] Google Scholar

[ref31] 31. Cortina JM. What is coefficient alpha? An examination of theory and applications. J Appl Psychol. 1993; 78(1): 98.
View Article
Google Scholar

[101] View Article

[102] Google Scholar

[ref32] 32. Hair JF, Black WC, Babin BJ, Anderson RE, Tatham RL. Análise multivariada de dados. 6th ed. Bookman editora. 2009.

[ref33] 33. Streiner DL, Norman GR, Cairney J. Health measurement scales: a practical guide to their development and use. 5th ed. Oxford University Press, USA. 2014.

[ref34] 34. Rasch G. Studies in mathematical psychology: Probabilistic models for some intelligence and attainment tests. 1st ed. Denmark: Danmarks Paedagogiske Institut. 1960.

[ref35] 35. Andrich D. Application of a psychometric rating model to ordered categories which are scored with successive integers. Applied Psychological Measurement. 1978; 2(4): 581–594.
View Article
Google Scholar

[107] View Article

[108] Google Scholar

[ref36] 36. Linacre JM. Winsteps® Rasch measurement computer program User’s Guide. Version 5.2.3. Portland, Oregon: Winsteps.com. 2022.

[ref37] 37. Wright BD, Linacre JM. Reasonable mean-square fit values. Rasch Measurement Transactions. 1996; 8: 370–371.
View Article
Google Scholar

[111] View Article

[112] Google Scholar

[ref38] 38. Bond TG, Fox CM. Applying the Rasch model: Fundamental Measurement in the Human Sciences, 2rd ed. New Jersey: Lawrence Ilrlbaum Associatcs. 2015.

[ref39] 39. Haukoos JS, Lewis RJ. Advanced statistics: bootstrapping confidence intervals for statistics with “difficult” distributions. Acad Emerg Med. 2005; 12(4): 360–365. pmid:15805329
View Article
PubMed/NCBI
Google Scholar

[115] View Article

[116] PubMed/NCBI

[117] Google Scholar

[ref40] 40. Raykov T, Marcoulides GA. Introduction to psychometric theory. 1st ed. New York, Routledge: 2010.

[ref41] 41. Osório FL, Salum GA, Donadon MF, Forni-dos-Santos L, Loureiro SR, Crippa JAS. Psychometrics properties of early trauma inventory self report–short form (ETISR-SR) for the Brazilian context. PLoS One. 2013; 8(10): e76337. pmid:24098478
View Article
PubMed/NCBI
Google Scholar

[120] View Article

[121] PubMed/NCBI

[122] Google Scholar

[ref42] 42. Sotoodeh Ghorbani S, Ghaffari M, Hashemi Nazari SS. Psychometric properties of Haj-Yahia’s questionnaire of violence against women in a sample of married women in Tehran, Iran. BMC Public Health. 2022 7;22(1):448 pmid:35255890
View Article
PubMed/NCBI
Google Scholar

[124] View Article

[125] PubMed/NCBI

[126] Google Scholar

[ref43] 43. Cristobal E, Flavián C, Guinalíu M. Perceived e-service quality (PeSQ): Measurement validation and effects on consumer satisfaction and web site loyalty. Manag Serv Qual, 2007. 17(3), 317–340.
View Article
Google Scholar

[128] View Article

[129] Google Scholar

[ref44] 44. Gerdes KE., Lietz CA, Segal EA. Measuring empathy in the 21st century: Development of an empathy index rooted in social cognitive neuroscience and social justice. Soc Work Res. 2011; 35(2): 83–93.
View Article
Google Scholar

[131] View Article

[132] Google Scholar

[ref45] 45. Carré A, Stefaniak N, d’Ambrosio F, Bensalah L, Besche-Richard C. The Basic Empathy Scale in adults (BES-A): fator structure of a revised form. Psychol Assess. 2013; 25(3): 679.
View Article
Google Scholar

[134] View Article

[135] Google Scholar

[ref46] 46. Alcorta-Garz A, González-Guerrero JF, Tavitas-Herrera SE, Rodríguez-Lara FJ, Hojat M. Validación de la escala de empatía médica de Jefferson em estudiantes de medicina mexicanos. Salud Mental. 2005; 28(5): 57–63.
View Article
Google Scholar

[137] View Article

[138] Google Scholar

[ref47] 47. Sherman JJ, Cramer A. Measurement of changes in empathy during dental school. J Dent Educ. 2005; 69(3): 338–345. pmid:15749944
View Article
PubMed/NCBI
Google Scholar

[140] View Article

[141] PubMed/NCBI

[142] Google Scholar

[ref48] 48. Chen D, Lew R, Hershman W, Orlander J. A cross-sectional measurement of medical student empathy. J Gen Intern Med. 2007; 22(10): 1434–1438. pmid:17653807
View Article
PubMed/NCBI
Google Scholar

[144] View Article

[145] PubMed/NCBI

[146] Google Scholar

[ref49] 49. Brody L. Gender, emotion, and the family. In Gender, Emotion, and the Family. Cambridge: Harvard University Press; 1999

[ref50] 50. Eisenberg N, Lennon R. Sex differences in empathy and related capacities. Psychological Bulletin. 1983; 94(1): 100.
View Article
Google Scholar

[149] View Article

[150] Google Scholar

[ref51] 51. Knickmeyer R, Baron-Cohen S, Raggatt P, Taylor K, Hackett G. Fetal testosterone and empathy. Horm Behav. 2006; 49(3): 282–292. pmid:16226265
View Article
PubMed/NCBI
Google Scholar

[152] View Article

[153] PubMed/NCBI

[154] Google Scholar

[ref52] 52. Fukushima H, Hiraki K. Perceiving an opponent’s loss: gender-related differences in the medial-frontal negativity. Soc Cogn Affect Neurosci. 2006 Sep;1(2):149–57. pmid:18985125
View Article
PubMed/NCBI
Google Scholar

[156] View Article

[157] PubMed/NCBI

[158] Google Scholar

[ref53] 53. Santos CMTM. Emotion recognition training in the development of Theory of Mind skills: a study with children with Autism Spectrum Disorder. Thesis. Universidade Lusófona do Porto. 2014. Avaliable from: https://recil.ensinolusofona.pt/handle/10437/5878.

[ref54] 54. Parreira TAP. Empathy and Recognition of Emotions: The Effect of Gender and the Relation of Empathy to the Target. Thesis. University of Algarve. 2013. Available from: http://hdl.handle.net/10400.1/6120

[ref55] 55. Wakabayashi A, Baron-Cohen S, Wheelwright S, Goldenfeld N, Delaney J, Fine D, et al. Development of short forms of the Empathy Quotient (EQ-Short) and the Systemizing Quotient (SQ-Short). Pers Individ Dif. 2006; 41(5): 929–940.
View Article
Google Scholar

[162] View Article

[163] Google Scholar

[ref56] 56. Vinayak S., & Judge J. (2018). Resilience and empathy as predictors of psychological wellbeing among adolescents. International Journal of Health Sciences and Research, 8(4), 192–200.
View Article
Google Scholar

[165] View Article

[166] Google Scholar

[ref57] 57. Spreng RN, McKinnon MC, Mar RA, Levine B. The Toronto Empathy Questionnaire: Scale development and initial validation of a 6ator-analytic solution to multiple empathy measures. J Perso Assess. 2009; 91(1): 62–71.

[ref58] 58. Campbell-Sills L, Stein MB. Psychometric analysis and refinement of the Connor–Davidson Resilience Scale (CD-RISC): Validation of a 10-item measure of resilience. J Trauma Stress. 2007; 20(6): 1019–1028. pmid:18157881

[ref59] 59. Gupta K, Kiran NC. Empathy and Perceived Stress among College Students. Int J Indian Psychol. 2021; 9(2): 220–231.

[ref60] 60. Jolliffe D, Farrington DP. Development and validation of the Basic Empathy Scale. J Adolesc. 2006; 29(4): 589–611. pmid:16198409

[ref61] 61. Luxton DD, Pruitt LD, Osenbach JE. Best practices for remote psychological assessment via telehealth technologies. Prof Psychol Res Pr. 2014; 45(1): 27–35.

Supporting information

S1 Fig. Flowchart regarding the inclusion/exclusion of participants in the computer-based (face-to-face) and online version samples.

S2 Fig. Item-person map of the MET cognitive subscale–computer-based (face-to-face) version.

S3 Fig. Item-person map of the MET emotional subscale–computer-based (face-to-face) version.

S4 Fig. Item-person map of the MET cognitive subscale–online version.

S5 Fig. Item-person map of the MET emotional subscale–online version.

S1 Table. Additional information on data collection.

S2 Table. Item-total correlation, internal consistency and test-retest reliability of the MET–computer-based (face-to-face) and online version.

S3 Table. Indicators related to the items of the cognitive and emotional subscales of the MET–computer-based (face-to-face) and online version, depending on emotional valence.

S4 Table. Indicators of validity based on external measures—known groups—MET cognitive and emotional subscales–computer-based (face-to-face) and online version.

S5 Table. Analysis of metric invariance between the computer-based (face-to-face) and online versions of the MET considering the release of factor loadings for each item.
