Figures
Abstract
There are several effective self-report instruments used by Chinese researchers to retrospectively assess exposure to childhood maltreatment. However, these measures do not assess the timing of exposure, restricting efforts to identify periods of development when childhood maltreatment maximally increases vulnerability to psychopathology and health outcomes. In the current study we created a Chinese version of the Maltreatment and Abuse Chronology of Exposure (MACE) scale, which assesses multiplicity (number of types of maltreatment experienced) and severity of maltreatment as well as when it occurred during childhood and adolescence. Rasch modeling was used for scale development in a sample of 812 undergraduate students. Item reduction analysis of the original 75 items produced a 58-item Chinese version with ten subdimensions. The new scale showed good three-week test-retest reliability, and good convergent validity with the Childhood Trauma Questionnaire (CTQ) and the revised Adverse Childhood Experiences Questionnaire (ACEQ-R). Variance decomposition analyses found that compared to the CTQ and ACE, the MACE Severity and Multiplicity scores explained higher variance in self-reported depression and anxiety symptom ratings on the Depression Anxiety Stress Scales (DASS). The results of the present study confirmed that the Chinese version of the MACE has sound psychometric properties in the Chinese cultural context. This new instrument will be a valuable tool for Chinese researchers, psychiatrists and psychologists to ascertain the type and timing of exposure to maltreatment.
Citation: Chen Y, Wang Z, Zheng X, Wu Z, Zhu J (2022) The Chinese version of the Maltreatment and Abuse Chronology of Exposure (MACE) scale: Psychometric properties in a sample of young adults. PLoS ONE 17(6): e0270709. https://doi.org/10.1371/journal.pone.0270709
Editor: Torsten Klengel, Harvard Medical School, UNITED STATES
Received: February 21, 2022; Accepted: June 15, 2022; Published: June 30, 2022
Copyright: © 2022 Chen et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: Data could be found in the Supporting information.
Funding: This research was supported by the grant from the National Natural Science Foundation of China (32000755) to Jianjun Zhu. The funder had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Childhood maltreatment is a relatively common adverse experience [1], which has been widely implicated as a risk factor for adult psychiatric disorders, such as major depressive disorder [2, 3], anxiety disorder [4, 5], post-traumatic stress disorder [6], and substance abuse [7]. Further, childhood maltreatment is associated with earlier onset of mood disorders, as well as more recurrent episodes and poorer treatment response [8]. The population-attributional risk fraction explained by childhood maltreatment and household dysfunction for a wide range of disorders is between 30% and 70% [5, 9–11].
There are numerous questionnaire measures that assess adults’ retrospective reports of the severity and multiplicity of exposure to childhood maltreatment during the first 18 years of life. The Childhood Trauma Questionnaire (CTQ) [12, 13] and the Adverse Childhood Experience Scale (ACE) [14] are currently the most commonly used instruments of this type. The 28-item CTQ assesses the severity of exposure to five types of maltreatment (i.e., physical abuse, emotional abuse, sexual abuse, emotional neglect and physical neglect), and the 18-item ACE measures the multiplicity of exposure to three categories of childhood abuse (i.e., emotional, physical, and sexual abuse) as well as five categories of household dysfunction (i.e., exposure to substance abuse, mental illness, violent treatment of mother or stepmother, incarceration for criminal behavior, and parental separation, divorce or death). Both of CTQ and ACE have been shown to have good reliability and validity [12–14]. Finkelhor et al. (2015) added items to the ACE to assess exposure to the adverse experiences of peer victimization, peer isolation/rejection, and community violence, constituting a revised Adverse Childhood Experiences Questionnaire (ACEQ-R) with satisfactory reliability and validity. Some researcher have introduced CTQ, ACE and ACEQ-R into China [15, 16] and demonstrated that these measures are good tools for evaluating child adversity of Chinese people.
Although the CTQ and the ACE are popular instruments for measuring child maltreatment, two critical limitations need to be mentioned. First, the CTQ and ACE do not assess some important types of maltreatment, such as witnessing violence between parents or towards siblings, although these have been shown to be strong risk factors for psychiatric symptoms [17–19]. Second, neither instrument collects information on the timing of exposures to maltreatment over the course of childhood and adolescence. This consideration is critically important as there may be periods of development when exposure to specific types of maltreatment is maximally associated with vulnerability for psychopathology and alterations in the structure and function of stress-susceptible brain regions [9]. Ascertaining whether the consequences of maltreatment vary as a function of when the exposure occurred may have important practical application. The information could guide research on when interventions are most likely to be effective, and prompt innovative approaches for preventing the onset and development of psychopathology [20].
Teicher and Parigger (2015) developed the Maltreatment and Abuse Chronology of Exposure (MACE) scale to address the limitations of earlier retrospective questionnaire measures of child maltreatment. First, the MACE assesses whether or not there was exposure, and the severity of the exposure, to ten types of maltreatment (emotional neglect, physical neglect, parental physical maltreatment, parental verbal abuse, non-verbal emotional abuse, peer emotional abuse, peer physical bullying, sexual abuse, witnessing violence toward siblings and witnessing interparental violence). Second, the MACE assesses the time period in which the maltreatment occurred. Each item asks for information about each year of childhood and adolescence, from 1 to 18. Research using the MACE has detected sensitive periods of maltreatment exposure associated with risk of psychopathology [21–23] as well as alterations in brain morphometry and function [24, 25]. This information could be useful for delineating periods of vulnerability to the most negative effects of maltreatment.
The MACE has shown excellent reliability and validity in U.S. samples [26]. The authors of the MACE have also made available the original set of 75 items, called the MACE-X, from which they extracted a 52 item US version of the MACE. From the larger set of items, researchers in multiple countries have been able to produce versions of the MACE appropriate for the local culture. The U.S., Norwegian, and German versions [26–28] of the MACE have demonstrated excellent reliability and validity. To our knowledge, this scale has not been introduced into China and owing to cultural differences across countries, it is necessary to explore whether the MACE is a valid measure of retrospective reports of childhood maltreatment in this cultural context. Thus, this study aimed to (1) establish the MACE-CH, the Chinese version of the MACE scale, and (2) test its psychometric properties among a general sample of university students ages 18 to 26. Rasch modeling, test-retest reliability, convergent validity and predictive validity would be examined.
Methods
Procedure
Permission for the study was obtained from the Research Ethics Committee of the authors’ university. The recruitment procedure began by handing out a brochure describing the project, making announcements and giving invitations to students. Participation in the study was entirely voluntary, and participants gave written consent to be part of the research. On the day of assessment, students were invited to participate voluntarily in their classes. Trained researchers administered the self-report questionnaires to students during class time. During data collection at baseline, we asked participants whether they could complete the MACE-X again after three months. Then 109 subjects completed the MACE-X once more to assess test-retest reliability. Participants were told that they should respond to the questionnaire items by themselves and that they were free to withdraw at any time during data collection. The confidentiality of the study was emphasized at the beginning of collection sessions.
Participants
Participants were 812 undergraduate students (ages 18 to 26, M = 19.99, SD = 1.21) from five universities in the provinces of Zhejiang, Anhui and Guangdong in China. These students completed a translated version of the 75 MACE-X items as the basis for developing MACE-CH. Their demographic characteristics reflected those of students in these four universities. They came from diverse parts of the area: rural areas (32.14%), county seats (31.24%), small-medium cities (22.79%), and metropolitan areas (13.83%). Most reported financial sufficiency during childhood, defined as the amount of money available to the family: much less than enough (6.92%), less than enough (22.22%), enough (65.36%), more than enough (5.23%), and much more than enough (0.26%). A subsample of 513 students (30.66% males; ages 18 to 26, M = 19.84, SD = 1.21) also completed CTQ [29], QACE-R [16] and Depression Anxiety Stress Scales [30] at the beginning of the study. Of these, 109 completed the Chinese version of MACE-X again three weeks later to assess test-retest reliability.
Initial items
The 75 item MACE-X assesses ten types of childhood maltreatment experienced before age 18. The types of maltreatment by parents or other adults were emotional neglect, physical neglect, physical maltreatment, verbal abuse, non-verbal emotional abuse, sexual abuse, witnessing interparental violence, and witnessing violence to siblings. The types of maltreatment by peers were peer emotional abuse and peer physical bullying. On each item, the respondent indicates "yes" or "no" separately for each year of exposure between the ages of 1 to 18. For the purposes of this study, the original English-language version of the MACE-X was translated into Chinese by three graduates who were fluent in both English and Chinese and knowledgeable about childhood adversity research. Further, we conducted a pilot study of 20 subjects to test if participants had any difficulty in understanding or responding to the items in the translated version of the MACE-X. Items that were identified as problematic in the pilot study were discussed and modified. Finally, the updated version was back-translated into English and was compared with the original English version to confirm that the Chinese translated version of the MACE-X was consistent with the English version.
Evaluation for item inclusion in a subscale
We aimed to develop ten subscales for the Chinese version of the MACE, corresponding to the ten types of maltreatment assessed by the MACE-X. Similarly, for statistical purposes we aimed for each subscale to include at least four items, and for these items to correspond to the MACE-X subscale items. A simple Rasch model was used to determine if final items measured the latent trait for each subscale. Rasch modeling was conducted in R version 4.0.3 with the eRm package [31] and ltm package [32].
We evaluated item fit using infit and outfit, which are the most widely used diagnostic Rasch mean-square fit statistics. Infit and outfit are calculated as the average squared residual for each person-item combination. Infit depicts unexpected responses to items with a difficulty close to the individual exposure levels, whereas Outfit is an outlier-sensitive fit that depicts responses to items with difficulty far from the individual. Acceptable ranges for mean square fit values are still up for debate. In the current analyses, outfit and infit values within the range of 0.50 to 1.50 were considered acceptable [33]. We plotted the test information function, which integrates the individual item information curves, in order to illustrate how well the information provided by each subscale identified participants in a category.
After items with acceptable Infit and Outfit were selected, we examined the measurement invariance using Andersen’s Likelihood Ratio (LR) test [34]. Participants were placed in one of two groups based on a median-split on age among 812 subjects or on gender among 513 subjects (gender information was not collected for other 300 subjects). When the Anderson LR test is not significant, the item parameter estimates in Rasch modeling are invariant across two or more groups.
Scoring algorithms and cutoffs
Based on the number of items positively endorsed by participants and person parameters in the Rasch model, subscales that included 5 or more items were scored for severity of exposure to the latent category. Typically these mean-centered logit scores range from -4 to +4. For the purposes of the current study they were further recalibrated to range from 0–10, enabling total exposure severity levels across the 10 subscales to fall in the range of 0 to 100. Because a Rasch model with only 4 items cannot export sufficient person parameters, subscales that contained 4 items were scored 0, 3, 5, 8 and 10 determined by a linear interpolation of the number of items that the participant positively endorsed.
The MACE and the QACE-R have nine overlapping categories (i.e., sexual abuse, emotional neglect, physical maltreatment, emotional abuse, physical neglect, witnessing interparental violence, peer emotional abuse and peer physical abuse). According to the procedures used to develop the English-language MACE, criterion (threshold) scores on the Chinese MACE to indicate exposure to these nine overlapping categories were created by comparing MACE severity scores to ACE (comparator) scores among 513 subjects. The R package “OptimalCutpoints,” which is based on ROC analyses, was used to select the optimal cutpoint for each subscale. Due to there being no corresponding subscale on the ACE, we operationally defined presence versus absence of witnessing violence to siblings as one selected item on the subscale.
Assessment of reliability and validity
To evaluate the test-retest reliability, 109 participants from the original sample again completed the items in the Chinese MACE three weeks later. Test-retest reliability was assessed using Pearson correlations among Chinese MACE overall total scores, the 10 subscale total scores, and the Chinese MACE total score in each age group. Convergent validity was assessed by comparing the Chinese MACE scores to the QACE-R and CTQ scores using Pearson correlations.
Assessment of utility as predictor of anxiety and depressive symptom scores
The utility of the Chinese MACE was then tested by examining its power in predicting scores on the Chinese version of the Depression Anxiety Stress Scales (DASS), a measure of self-rated stress, anxiety and depressive symptoms [30]. This measure has shown good reliability and validity in prior research [35]. For the purposes of this study, we used the anxiety and depression subscales as the outcome variables. Participants were instructed to assess their anxiety symptoms (e.g., “I was aware of dryness of my mouth”) or depressive symptoms (e.g., “I found it difficult to work up the initiative to do things.”) during the past week. Fourteen items (seven for each) were rated on a four-point scale ranging from 1 (did not apply to me at all) to 4 (applied to me very much of the time). The responses were averaged across items, with higher scores indicating higher anxiety or depressive symptoms. Consistent with a study by Teicher and Parigger (2015), we tested the Chinese MACE’s Multiplicity scores’ predictive power in relation to the QACE-R scores, and tested the MACE Severity scores’ predictive power in relation to the total CTQ scores. First, ordinary least squares regression was performed to calculate the relationship between the MACE scores and symptom ratings. By this way, we can ascertain whether MACE had significantly stronger or weaker predictive power than the comparator scale (i.e., QACE-R or CTQ). Then the R relaimpo package was used to conduct multiple regression analysis with variance decomposition. This analysis provided a more precise determination of the percentage of variance in the DASS scores that was explained by scores on the MACE and scores on the comparator scales. Gender, age and financial sufficiency were included as covariates to control for the confounding influence of sociodemographic variables.
Results
Rasch analyses of ten subscales
Parental physical maltreatment.
All six items of the considered items were included in this scale. These items had acceptable outfit and infit mean-square values (Table 1). None of the items had a fit index exceeding 1.5. However, one item had an outfit mean square fit less than 0.7. Fig 1 in S1 File shows the Item Characteristic Curves (ICC), Item Information Curves (IIC) and Test Information Function for the Rasch parental physical maltreatment subscale. Logit scores for the latent trait in the Rasch model ranged from -3.57 (no items selected) to 3.39 (all items selected). The Test Information Function indicated that the scale is more reliable in predicting medium to high exposure levels, with 37.03% of the overall information between logit scores of 0–2. Andersen’s likelihood ratio test was not significant when splitting for age, χ2(5) = 7.427, p = .191, indicating acceptable fit to the Rasch model. Andersen’s likelihood ratio test also showed no difference for males and females, χ2(5) = 3.080, p = .688. Based on ROC analyses, endorsement of at least two of the six selected items served as the operational definition of the presence of parental physical maltreatment.
Parental verbal abuse.
This scale consisted of all five of the considered items. The items had acceptable infit and outfit, with mean square values ranging from 0.70–1.25 (Table 2). Fig 2 in S1 File shows the ICC, IIC and Test Information Function for the Rasch parental verbal abuse scale. Logit scores of the latent trait in the Rasch model ranged from -2.88 (no items selected) to 2.81 (all items selected). The Test Information Function indicated that the scale was best at discriminating medium to high exposure levels with 42.38% of the overall information between logit scores of 0–2. Further, Andersen’s likelihood ratio test showed that the Rasch model did not significantly differ across the two age groups, χ2(4) = 5.262, p = .261. Andersen’s likelihood ratio test was also not significant when splitting for gender, χ2(4) = 8.972, p = .062. Based on ROC analyses, a threshold was set at two of the five selected items to operationally define presence versus absence of parental verbal abuse.
Non-verbal emotional abuse.
This scale included five of six considered items. One item with excessively high mean square outfit was eliminated. The remaining five items had acceptable infit and outfit mean square values, ranging from 0.83–1.01 (Table 3). Fig 3 in S1 File shows the ICC, IIC and Test Information Function for the Rasch non-verbal emotional abuse scale. Logit scores of the items representing the latent trait in the Rasch model ranged from -2.88 (no items selected) to 2.99 (all items selected). The Test Information Function indicated that the scale was best at discriminating for medium to high exposure levels with 37.69% of the overall information having logit scores between 0 and 2. The Andersen test was not significant when splitting for age, χ2(4) = .773, p = .942. When comparing males and females, the Andersen test was also not significant, χ2(4) = 4.561, p = .335. A threshold was set at one selected item to operationally define presence versus absence of non-verbal emotional abuse.
Peer emotional abuse.
All five considered items showing acceptable mean square infit and outfit statistics were included in this scale (Table 4). Fig 4 in S1 File shows the ICC, IIC and Test Information Function for the Rasch peer emotional abuse scale. Logit scores of the latent trait in the Rasch model ranged from -2.77 (no items selected) to 2.86 (all items selected). The Test Information Function indicated that the scale was best at discriminating for medium to high exposure levels with 42.08% of the overall information between logit scores of 0–2. The Andersen test was not significant when splitting for age, χ2(4) = 2.877, p = .579. When comparing males and females, the Andersen test was significant, χ2(4) = 49.548, p < .001. The Wald test indicated differential response patterns for males and females for item 41 (Z = 3.264, p = .001), item 42 (Z = 3.842, p < .001) and item 43 (Z = -5.742, p < .001). A threshold was set at two selected items to operationally define presence versus absence of peer emotional abuse.
Peer physical bullying.
All five considered items were included in this scale. The infit and outfit mean squares ranged from 0.66–1.14 (Table 5). Fig 5 in S1 File shows the ICC, IIC and Test Information Function for the Rasch peer physical bullying scale. Logit scores of the latent trait in the Rasch model ranged from -2.97 (no items selected) to 3.19 (all items selected). The overall Test Information Function indicated that this test provided the most information in the high exposure level of the trait with 37.9% of the overall information between logit scores of 2–4. The Andersen test was not significant when splitting for age, χ2(4) = 8.806, p = .066. When comparing males and females, the Andersen test was significant, χ2(4) = 11.569, p = .021. The Wald test indicated differential response patterns for males and females for item 45 (Z = 2.854, p = .004). A threshold was set at one selected item to operationally define presence versus absence of peer physical bullying.
Sexual abuse.
This scale consisted of eight of the twelve considered items involving adult familial, adult extrafamilial as well as peer sexual abuse. The infit and outfit mean squares of the remaining items ranged from 0.53 to 1.31 (Table 6). Fig 6 in S1 File showed the ICC, IIC and Test Information Function for the Rasch sexual abuse scale. Logit scores of the latent trait in the Rasch model ranged from -3.21 (no items selected) to 3.14 (all items selected). The overall Test Information Function indicated that this test provided the most information in the high exposure level of the trait with 35.66% of the overall information between logit scores of 2–4. The Andersen test was not significant when splitting for age, χ2(7) = 3.176, p = .868. When comparing males and females, the Andersen test was also not significant, χ2(7) = 11.332, p = .125. A threshold was set at one selected item to operationally define presence versus absence of sexual abuse.
Witnessing violence to siblings.
Four of the eight considered items involving witnessing physical and/or sexual abuse to siblings were included in this scale (Table 7). These items provided the best overall fit. Fig 7 in S1 File shows the ICC, IIC and Test Information Function for the Rasch scale for witnessing violence to siblings. The overall Test Information Function indicated that this test provided the most information in the high exposure level of the trait with 37.49% of the overall information between logit scores of 2–4. The Andersen test was not significant when splitting for age, χ2(3) = 4.281, p = .233. When comparing males and females, the Andersen test was significant, χ2(3) = 9.366, p = .025. The Wald test indicated differential response patterns for males and females for item 19 (Z = 2.695, p = .007) and item 25 (Z = -2.368, p = .018).
Witnessing interparental violence.
This scale consisted of six of eight considered items. Two items were removed because of high outfit mean square fit. The remaining items showed acceptable infit and outfit mean square values that ranged from 0.65 to 1.35 (Table 8). Fig 8 in S1 File shows the ICC, IIC and Test Information Function for the Rasch witnessing interparental violence scale. Logit scores of the latent trait in the Rasch model ranged from -3.64 (no items selected) to 3.86 (all items selected). The overall Test Information Function indicated that this test provided the most information in the high exposure level of the trait with 38.88% of the overall information between logit scores of 2–4. The Andersen test was not significant when splitting for age, χ2(5) = 5.625, p = .344. When comparing males and females, item 38 didn’t meet the requirement to be included in the Andersen test. For the remaining items, the Andersen test was also not significant when splitting for gender, χ2(4) = 1.822, p = .769. A threshold was set at one selected item to operationally define presence versus absence of witnessing interparental violence.
Emotional neglect.
This scale included eight of the nine considered items. Only one item with excessively high mean square outfit was removed. The remaining eight items had acceptable infit and outfit mean square values ranging from 0.75–1.16 (Table 9). Four of the items were reverse scored (items 57, 58, 74, 75 on the MACE-X). Fig 9 in S1 File shows the ICC, IIC and Test Information Function for the Rasch emotional neglect scale. Logit scores of the latent trait in the Rasch model ranged from -2.88 (no items selected) to 2.88 (all items selected). The Test Information Function indicated that the scale was best at discriminating for medium to high exposure levels with 38.85% of the overall information between logit scores of 2–4 and with 36.98% of the overall information between logit scores of 0–2. The Andersen test was not significant when splitting for age, χ2(7) = 10.521, p = 0.161. When comparing males and females, the Andersen test was significant, χ2(7) = 17.305, p = .016. The Wald test indicated differential response patterns for males and females for item 57 (Z = -2.579, p = .010) and item 53 (Z = 2.675, p = .007). A threshold was set at one selected item to operationally define presence versus absence of emotional neglect.
Physical neglect.
Six of seven considered items were included in this scale. These items showed acceptable infit and outfit mean square values ranging from 0.74–1.35 and provided the best overall fit (Table 10). Three of the items were reverse scored (items 60, 61, 73 on the MACE-X). Fig 10 in S1 File shows the ICC, IIC and Test Information Function for the Rasch physical neglect scale. Logit scores of the latent trait in the Rasch model ranged from -2.82 (no items selected) to 2.84 (all items selected). The Test Information Function indicated that the scale was best at discriminating for medium to high exposure levels with 40.04% of the overall information between logit scores of 2–4. The Andersen test was not significant when splitting for age, χ2(5) = 4.951, p = 0.422. When comparing males and females, the Andersen test was not significant, χ2(5) = 7.231, p = .204. A threshold was set at one selected item to operationally define presence versus absence of physical neglect.
Test–retest reliability
A subsample of 109 volunteers participated in the retest three weeks later. Using the Chinese version of the MACE with 58 items (S2 File) on the basis of the analyses above, test-retest reliability was excellent (defined as r > 0.8) for total scores (r = 0.887, p < 10−16), multiplicity scores (r = 0.835, p < 10−16), duration scores (r = 0.812, p < 10−16), overall degree of exposure to parental physical maltreatment (r = 0.885, p < 10−16), overall degree of exposure to witnessing interparental violence (r = 0.865, p < 10−16), and overall degree of exposure to emotional neglect (r = 0.821, p < 10−16). Test-retest reliability was very good (defined as 0.7 < r < 0.8) for the overall scores of exposure to the seven other types of maltreatment (Table 11). Similarly, test-retest reliability for MACE total scores at each age from years 1 to 18 showed very good to excellent reliability (Table 12).
Convergent validity
A subsample of 513 participants completed the Chinese MACE and two other maltreatment scales (QACE-R and CTQ) at the beginning of the study. Tests of convergent validity were conducted by assessing the correlations among the Chinese MACE scores (Total score and Multiplicity score) and scores on the QACE-R and CTQ. The QACE-R score was highly correlated with both the MACE Total score (r = 0.738, p < 10−16) and the MACE Multiplicity score (r = 0.706, p < 10−16). CTQ scores showed moderate to high correlations with the MACE Total score (r = 0.584, p < 10−16) and the MACE Multiplicity score (r = 0.569, p < 10−16). As expected, the MACE Total score was highly correlated with the MACE Multiplicity score (r = 0.934, p < 10−16). The correlations between each MACE subscale score and the corresponding CTQ and QACE-R subscale scores were all significant (p < 10−16) and in the expected direction (Table 13).
Comparison of MACE, QACE-R and CTQ scores as predictors of anxiety and depression (DASS)
We tested the predictive validity of the Chinese MACE in the same subsample of 513 participants. The MACE Severity score and the CTQ score were compared in terms of the strength of their associations with the DASS anxiety and depression scores. The MACE Multiplicity score and the QACE-R score were similarly compared using Ordinary Least Squares regression (Tables 14 and 15). The Williams test showed that anxiety was significantly (p = .05) more strongly correlated with the MACE Total score (r = 0.27) than with the CTQ score (r = 0.12). There was a significant difference in the strength of the correlation between depression and the MACE Total score (r = 0.27) and the correlation between depression and the CTQ score (r = 0.16). The MACE Multiplicity score and the QACE-R did not differ in terms of the strength of their association with either anxiety (r = 0.26 and r = 0.24, respectively) or depression (r = 0.23 and r = 0.26, respectively). Two multiple regression analyses with gender, age and financial sufficiency as covariates revealed significant effects of MACE Severity and Multiplicity scores on depressive and anxiety symptoms. Further, variance decomposition analyses found that MACE Severity scores explained more variance in symptom ratings of both anxiety and depression than the CTQ scores did (Table 14), and MACE Multiplicity scores explained more variance in symptom ratings of anxiety than the QACE-R scores did (Table 15).
Discussion
The MACE is a commonly used scale used to assess exposure to maltreatment. In this study we created a Chinese version of the MACE (MACE-CH). Scale development was based on the Chinese translation of the MACE-X [26], a set of items that can be adapted to create culture-specific versions of the scale. The Chinese version includes 58 of the 75 MACE-X items, and retained the ten-subscale structure of the MACE. The scale showed good internal consistency, three-week test-retest reliability, convergent validity with other maltreatment scales, and validity in predicting anxiety and depression. Based on Rasch modeling, each of the ten subscales consisted of 4–9 items. Subscales that had five or more items were scored for severity of exposure using latent logit scores in a Rasch model. However, in the current Chinese MACE, there were only four items in the subscale assessing witnessing violence to siblings. These items could not cover the latent trait by using logit scores. Instead, they were scored 0, 3, 5, 8 and 10 according to a linear interpolation of how many items were positively endorsed. The other nine subscales with 5 or more items were scored using logit scores.
The test-retest reliability for the Chinese MACE subscale scores was similar to that found in the original U.S. version of the MACE [26]. However, the time interval from test to retest was much shorter in our study—three weeks compared to 66 days on average in Teicher and Parigger’s (2015) study of the original MACE. The shorter time lag may account for the numerically higher test-retest reliability values for the physical and emotional neglect subscales in the Chinese MACE compared to the U.S. MACE. The test-retest reliability of the MACE severity scores for reports of maltreatment from age 1 to age 4 was also higher in the Chinese MACE than in the U.S. MACE. On the other hand, test-retest reliability in the current study was slightly lower than that in the Norwegian MACE [27] although the time interval from test to retest was very similar. The difference in test-retest reliability may have to do with the nature of the two samples. The sample in the Norwegian scale development study included both psychiatric outpatients and employees, and the average MACE score was high. By contrast, our study was conducted in a sample of healthy undergraduates.
Convergent validity was tested by comparing the Chinese MACE to two other measures of maltreatment, namely the CTQ and QACE-R. The three measures were moderately inter-correlated, suggesting that all three can be said to have convergent validity in measuring retrospective accounts of maltreatment. However, research on the original versions of these three measures showed that compared to the CTQ and ACE instruments, the MACE explained twice as much variance in psychiatric symptoms [26]. The Norwegian MACE also showed better predictive validity than the CTQ on SCL-90 in patients with moderately severe mental health problems. Our results were consistent with this earlier evidence. The Chinese MACE scores showed a significantly stronger correlation with depressive and anxiety symptoms than the CTQ and QACE-R scores did. Moreover, results of variance decomposition indicated that MACE Total scores accounted for on average 6.28% of the variance in symptom ratings, whereas the CTQ accounted for 1.04%. The MACE Multiplicity scores on average accounted for 1.28 times more of the variance in anxiety symptom ratings compared to the QACE-R score.
There is growing evidence that the most important determinants of psychological disorders is exposure to specific types of childhood maltreatment during specific windows of vulnerability [1, 36]. Using the U.S. version of MACE, Khan et al. (2015) reported that the most important predictor of suicidal ideation was parental verbal abuse at age 5 in males and sexual abuse at age 18 in females. A neuroimaging study in the U.S. also identified windows of vulnerability using the MACE to assess childhood maltreatment. The results showed that exposure to maltreatment during early childhood was significantly associated with blunted amygdala response, whereas early teen exposure was significantly associated with augmented amygdala response [25]. The availability of the Chinese MACE will allow similar research to be conducted in the Chinese cultural context.
This work presents four limitations that have to be considered. First, there is an imbalance in number of items per perpetrator within sexual abuse subscale. Only two items of parental sexual abuse were included, while for other adults 4 items were included. Other items of parental sexual abuse appear to be rarely reported events, contributing to very low Outfit MSQ. Thus, they are not included in final subscale of sexual abuse. It would be valuable in future to test whether there are different results when these items of parental sexual abuse are assessed in highly exposed individuals (e.g. individuals with mental illness). Second, the Chinese MACE subscale for witnessing violence to siblings had too few items to be represented by mean-centered logit scores. In future research, a subscale with a sufficient number of items could be further analyzed in a Rasch model. Third, the duration for testing the test-retest reliability is not ideal. Future work using a longer duration would be better. Last, in addition to the Depression and Anxiety Stress Scales, more scales rating psychopathology should be used to test the predictive power of MACE-CH.
Conclusion
This research created the first Chinese version of the MACE as a self-report measure of childhood maltreatment. The Chinese version is not simply a translation of the original measure, but is instead a version of the MACE that is appropriate for research in the Chinese cultural context. Chinese MACE retained the ten-subscale structure of the MACE and showed good internal consistency, three-week test-retest reliability, convergent validity with other maltreatment scales, and validity in predicting anxiety and depression. The development of this measure is a key advancement in Chinese researchers’ ability to identify the type and timing of childhood maltreatment. The MACE-CH will be provided as open access to help facilitate its free use.
Supporting information
S1 File. Figures from Rasch modeling for MACE-CH subscales.
https://doi.org/10.1371/journal.pone.0270709.s001
(DOCX)
Acknowledgments
We are very grateful to Dr. Martin H. Teicher (McLean Hospital, USA) for his permission and help.
References
- 1. Teicher MH, Samson JA, Anderson CM O K. The effects of childhood maltreatment on brain structure, function and connectivity. Nature Reviews Neuroscience. 2016;17(10):652–66. pmid:27640984
- 2. Bernet CZ, Stein MB. Relationship of childhood maltreatment to the onset and course of major depression in adulthood. Depression and Anxiety. 1999;9(4):169–74. Epub 1999/08/04. pmid:10431682.
- 3. Vallati M, Cunningham S, Mazurka R, Stewart JG, Larocque C, Milev RV, et al. Childhood maltreatment and the clinical characteristics of major depressive disorder in adolescence and adulthood. Journal of Abnormal Psychology. 2020;129(5):469–79. Epub 2020/04/03. pmid:32237880.
- 4. Quenneville AF, Kalogeropoulou E, Kung AL, Hasler R, Nicastro R, Prada P, et al. Childhood maltreatment, anxiety disorders and outcome in borderline personality disorder. Psychiatry Research. 2020;284:112688. Epub 2019/12/01. pmid:31784066.
- 5. Bruce LC, Heimberg RG, Blanco C, Schneier FR, Liebowitz MR. Childhood maltreatment and social anxiety disorder: implications for symptom severity and response to pharmacotherapy. Depression and Anxiety. 2012;29(2):131–8. Epub 2011/11/09. pmid:22065560.
- 6. Grassi-Oliveira R, Stein LM. Childhood maltreatment associated with PTSD and emotional distress in low-income adults: the burden of neglect. Child Abuse & Neglect. 2008;32(12):1089–94. Epub 2008/11/28. pmid:19036445.
- 7. Capusan AJ, Gustafsson PA, Kuja-Halkola R, Igelstrom K, Mayo L, Heilig M. Re-examining the link between childhood maltreatment and substance use disorder: a prospective, genetically informative study. Mol Psychiatr. 2021:1–9. pmid:33824431
- 8. Lippard ETC, Nemeroff CB. The Devastating Clinical Consequences of Child Abuse and Neglect: Increased Disease Vulnerability and Poor Treatment Response in Mood Disorders. American Journal of Psychiatry. 2020;177(1):20–36. Epub 2019/09/21. pmid:31537091.
- 9. Teicher MH, Samson JA, Anderson CM, Ohashi K. The effects of childhood maltreatment on brain structure, function and connectivity. Nat Rev Neurosci. 2016;17(10):652–66. pmid:27640984.
- 10. Dube SR, Felitti VJ, Dong M, Giles WH, Anda RF. The impact of adverse childhood experiences on health problems: evidence from four birth cohorts dating back to 1900. Preventive medicine. 2003;37(3):268–77. pmid:12914833
- 11. Dube SR, Felitti VJ, Dong M, Chapman DP, Giles WH, Anda RF. Childhood abuse, neglect, and household dysfunction and the risk of illicit drug use: the adverse childhood experiences study. Pediatrics. 2003;111(3):564–72. pmid:12612237
- 12. Bernstein DP, Fink L, Handelsman L, Foote J, Lovejoy M, Wenzel K, et al. Initial reliability and validity of a new retrospective measure of child abuse and neglect. American Journal of Psychiatry. 1994;151(8):1132–6. Epub 1994/08/01. pmid:8037246.
- 13. Bernstein DP, Ahluvalia T, Pogge D, Handelsman L. Validity of the Childhood Trauma Questionnaire in an adolescent psychiatric population. Journal of the American Academy of Child & Adolescent Psychiatry. 1997;36(3):340–8. Epub 1997/03/01. pmid:9055514.
- 14. Felitti VJ, Anda RF, Nordenberg D, Williamson DF, Spitz AM, Edwards V, et al. Relationship of childhood abuse and household dysfunction to many of the leading causes of death in adults. The Adverse Childhood Experiences (ACE) Study. American Journal of Preventive Medicine. 1998;14(4):245–58. Epub 1998/06/23. pmid:9635069.
- 15. He J, Zhong X, Gao Y, Xiong G, Yao S. Psychometric properties of the Chinese version of the Childhood Trauma Questionnaire-Short Form (CTQ-SF) among undergraduates and depressive patients. Child Abuse & Neglect. 2019;91:102–8. pmid:30856597
- 16. Wang Y, Lin P, Cao F. Validity and reliability of the Chinese version of the Revised Adverse Childhood Experience Questionnaire. Chinese Mental Health Journal. 2018;32:760–4.
- 17. Bayarri Fernandez E, Ezpeleta L, Granero R, de la Osa N, Domenech JM. Degree of exposure to domestic violence, psychopathology, and functional impairment in children and adolescents. Journal of Interpersonal Violence. 2011;26(6):1215–31. Epub 2010/08/21. pmid:20724296.
- 18. Teicher MH, Samson JA, Sheu YS, Polcari A, McGreenery CE. Hurtful words: association of exposure to peer verbal abuse with elevated psychiatric symptom scores and corpus callosum abnormalities. American Journal of Psychiatry. 2010;167(12):1464–71. Epub 2010/07/17. pmid:20634370.
- 19. Teicher MH, Vitaliano GD. Witnessing violence toward siblings: an understudied but potent form of early adversity. PLoS One. 2011;6(12):e28852. Epub 2012/01/05. pmid:22216127.
- 20. McLaughlin KA, Weissman D, Bitran D. Childhood Adversity and Neural Development: A Systematic Review. Annual Review of Developmental Psychology. 2019;1:277–312. Epub 2020/05/27. pmid:32455344.
- 21. Schalinski I, Teicher MH, Nischk D, Hinderer E, Muller O, Rockstroh B. Type and timing of adverse childhood experiences differentially affect severity of PTSD, dissociative and depressive symptoms in adult inpatients. Bmc Psychiatry. 2016;16. ARTN 295 pmid:27543114
- 22. Khan A, McCormack HC, Bolger EA, McGreenery CE, Vitaliano G, Polcari A, et al. Childhood maltreatment, depression, and suicidal ideation: critical importance of parental and peer emotional abuse during developmental sensitive periods in males and females. Front Psychiatry. 2015;6. ARTN 42 pmid:25870565
- 23. Inga S, Susanne B, Vanessa H, Martin HT, Michael O, Brigitte R. Environmental adversities and psychotic symptoms: The impact of timing of trauma, abuse, and neglect. Schizophr Res. 2019;205:4–9. pmid:29141785
- 24. Teicher MH, Anderson CM, Ohashi K, Khan A, McGreenery CE, Bolger EA, et al. Differential effects of childhood neglect and abuse during sensitive exposure periods on male and female hippocampus. Neuroimage. 2018;169:443–52. pmid:29288867
- 25. Zhu JJ, Lowen SB, Anderson CM, Ohashi K, Khan A, Teicher MH. Association of Prepubertal and Postpubertal Exposure to Childhood Maltreatment With Adult Amygdala Function. Jama Psychiat. 2019;76(8):843–53. pmid:31241756
- 26. Teicher MH, Parigger A. The ’Maltreatment and Abuse Chronology of Exposure’ (MACE) Scale for the Retrospective Assessment of Abuse and Neglect During Development. Plos One. 2015;10(2). ARTN e0117423 pmid:25714856
- 27. Fosse R, Skjelstad DV, Schalinski I, Thekkumthala D, Elbert T, Aanondsen CM, et al. Measuring childhood maltreatment: Psychometric properties of the Norwegian version of the Maltreatment and Abuse Chronology of Exposure (MACE) scale. Plos One. 2020;15(2). ARTN e0229661 pmid:32106231
- 28. Isele D, Teicher MH, Ruf-Leuschner M, Elbert T, Kolassa IT, Schury K, et al. KERF—An instrument for measuring adverse childhood experiences: Construction and psychometric evaluation of the German MACE (Maltreatment and Abuse Chronology of Exposure) scale. Z Kl Psych Psychoth. 2014;43(2):121–30.
- 29. Zhao X, Zhang Y, LI L, Zhou Y. Evaluation on reliability and validity of Chinese version of childhood trauma questionnaire. Chinese Journal of Tissue Engineering Research. 2005:209–11.
- 30.
Taouk M, Lovibond PF, Laube R. Psychometric properties of a Chinese version of the short Depression Anxiety Stress Scales (DASS21). Report for new South Wales transcultural mental health centre, Cumberland Hospital, Sydney. 2001.
- 31. Mair P, Hatzinger R. Extended Rasch modeling: The eRm package for the application of IRT models in R. J Stat Softw. 2007;20(9).
- 32. Rizopoulos D. ltm: An R package for latent variable modeling and item response theory analyses. J Stat Softw. 2006;17(5).
- 33. DiStefano C, Greer FW, Dowdy E. Examining the BASC-3 BESS Parent Form-Preschool Using Rasch Methodology. Assessment. 2019;26(6):1162–75. pmid:28770616
- 34. Andersen EB. A goodness of fit test for the Rasch model. Psychometrika. 1973;38(1):123–40.
- 35. Wang K, Shi H-S, Geng F-L, Zou L-Q, Tan S-P, Wang Y, et al. Cross-cultural validation of the depression anxiety stress scale–21 in China. Psychological assessment. 2016;28(5):e88. pmid:26619091
- 36. Dunn EC, Nishimi K, Powers A, Bradley B. Is developmental timing of trauma exposure associated with depressive and post-traumatic stress disorder symptoms in adulthood? Journal of psychiatric research. 2017;84:119–27. pmid:27728852