Childhood maltreatment has diverse, lifelong impact on morbidity and mortality. The Childhood Trauma Questionnaire (CTQ) is one of the most commonly used scales to assess and quantify these experiences and their impact. Curiously, despite very widespread use of the CTQ, scores on its Minimization-Denial (MD) subscale—originally designed to assess a positive response bias—are rarely reported. Hence, little is known about this measure. If response biases are either common or consequential, current practices of ignoring the MD scale deserve revision. Therewith, we designed a study to investigate 3 aspects of minimization, as defined by the CTQ’s MD scale: 1) its prevalence; 2) its latent structure; and finally 3) whether minimization moderates the CTQ’s discriminative validity in terms of distinguishing between psychiatric patients and community volunteers. Archival, item-level CTQ data from 24 multinational samples were combined for a total of 19,652 participants. Analyses indicated: 1) minimization is common; 2) minimization functions as a continuous construct; and 3) high MD scores attenuate the ability of the CTQ to distinguish between psychiatric patients and community volunteers. Overall, results suggest that a minimizing response bias—as detected by the MD subscale—has a small but significant moderating effect on the CTQ’s discriminative validity. Results also may suggest that some prior analyses of maltreatment rates or the effects of early maltreatment that have used the CTQ may have underestimated its incidence and impact. We caution researchers and clinicians about the widespread practice of using the CTQ without the MD or collecting MD data but failing to assess and control for its effects on outcomes or dependent variables.
Citation: MacDonald K, Thomas ML, Sciolla AF, Schneider B, Pappas K, Bleijenberg G, et al. (2016) Minimization of Childhood Maltreatment Is Common and Consequential: Results from a Large, Multinational Sample Using the Childhood Trauma Questionnaire. PLoS ONE 11(1): e0146058. https://doi.org/10.1371/journal.pone.0146058
Editor: James G. Scott, The University of Queensland, AUSTRALIA
Received: August 5, 2015; Accepted: December 11, 2015; Published: January 27, 2016
Copyright: © 2016 MacDonald et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the paper and its Supporting Information files.
Funding: The work from BB’s lab was supported by National Institutes of Health Grants MH071537 and HD071982. The work from DK’s lab was supported by the Australian Research Council and New South Wales Dept of Juvenile Justice, Australia. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Abbreviations: Childhood Trauma Questionnaire (CTQ), a self-report questionnaire used to comprehensively evaluate the levels of various types of trauma experienced during childhood; Emotional Abuse (EA), subscale of the CTQ that evaluates the level of emotional abuse experienced during childhood; Emotional Neglect (EN), subscale of the CTQ that evaluates the level of emotional neglect experienced during childhood; Minimization Denial (MD), subscale of the CTQ that evaluates the level to which a person minimizes or denies trauma they may have experienced during childhood; Physical Abuse (PA), subscale of the CTQ that evaluates the level of physical abuse experienced during childhood; Physical Neglect (PN), subscale of the CTQ that evaluates the level of physical neglect experienced during childhood; Sexual Abuse (SA), subscale of the CTQ that evaluates the level of sexual abuse experienced during childhood
Childhood maltreatment is both prevalent and impactful [1, 2]. Correlates of these adverse early experiences include increased stress responses, dysfunctional regulation of glucocorticoid signaling, impaired psychological functioning, adult intimate partner violence, a variety of mental illnesses [1, 7, 8], suicide attempts and suicides [9, 10], and all-cause morbidity and mortality [11, 12]. Due to its ubiquity, as well as its myriad, cumulative effects on the developing mind, brain, body, and relationships, early maltreatment is perhaps the most important general historical factor to assess in a variety of health care contexts [13, 14].
Though more nuanced and sensitive tools for quantifying early maltreatment are in development, one of the most commonly used and well-validated measures, with over 1,000 citations, is the Childhood Trauma Questionnaire (CTQ). This scale measures five categories of childhood maltreatment: Emotional, Sexual and Physical Abuse (EA, SA and PA), and Emotional and Physical Neglect (EN and PN) (Bernstein & Fink, 1998). Scores on the CTQ, specifically, correlate with both the onset and course of mental illness [1, 18, 19], markers of cellular aging, important psychological parameters like stereotype awareness and temperament [21, 22], as well as the structure, function and connectivity of critical brain regions associated with resilience and vulnerability to life stressors (e.g., the amygdala) [23–27].
Although evidence suggests moderate to good consistency of self-reports of maltreatment over time, the retrospective nature of the CTQ means that response bias has the potential to undermine its validity. Aware of this issue, and that underreporting is a greater risk than over-reporting, the CTQ scale’s authors included in it a 3-item response bias subscale called the Minimization-Denial (MD) scale. Attesting to this subscale’s perceived import, the MD scale survived the CTQ’s abridgement from a 70-item scale to its current 28-item version, the most ubiquitous version in current use.
In the CTQ manual, the scale’s authors warn that responses of “very often true” to any one of the three MD items may suggest underreporting of childhood trauma (Bernstein & Fink, 1998). Despite this caveat, the overwhelming majority of studies that report CTQ data do not mention the MD items or take them into account in analyses (but see  and  and  for discussion). Arguably, this curious, widespread, systematic omission assumes, de facto, either that: 1. the incidence of minimization (i.e. defined herein as a positive MD score) is too rare to warrant examination; or 2. the MD scale does not serve its intended purpose and has no bearing on results. Importantly—as far as we are aware—neither of these assumptions has been systematically examined until recently. Regarding the incidence of minimization: as discussed in a prior publication , it is relatively common (10–40% of respondents). Regarding the impact of minimization on reported rates of maltreatment—or on the relationship between maltreatment and outcomes of interest—information is lacking. To address the peculiar vacuum in the literature on the MD scale’s characteristics, frequency, and import, we designed the present study, examining the CTQ and MD scores of a large, varied population of clinical (psychiatric) and community subjects.
Regarding the specific details of the MD scale, items answered “very often true” (hereafter, “MD-positive”) convey a naïvely positive, almost idyllic representation of childhood experiences. These particular items somewhat hyperbolically suggest that: 1) there was “nothing” the person wanted to change about their family; 2) their childhood was “perfect”; and 3) their family was the “best […] in the world” (Bernstein & Fink, 1998). Notably, scoring of the MD scale differs from the regular CTQ items. While the CTQ’s abuse and neglect scales are scored based on sums of polytomous item ratings (range of 1 to 5), MD items are dichotomized: scores of 1 through 4 are coded as 0 and scores of 5 (“very often true”) are coded as 1. This dichotomous coding system is thought to isolate “exaggeratedly desirable responses” . In one of the few examinations of the MD items, Gerdner and Allgulander (2009) suggested that when raw, polytomous responses on these 3 items are summed, a new, nonpathological subscale (which they called Idealization of the Upbringing scale) can be created . They furthermore found that MD—but not Idealization of the Upbringing—is correlated with a social desirability scale, the Marlowe-Crowne Social Desirability Scale . Thus, it is also important to distinguish between dichotomous versus polytomous scoring of MD items, which appear to indicate different constructs.
Originally, the MD scale was validated against the Balanced Inventory of Desirable Responding. Both the self-deception and impression management subscales of the Balanced Inventory of Desirable Responding were strongly and positively correlated with MD, in contrast to their negative correlations with the CTQ’s five primary scales. As mentioned above, subsequent studies have confirmed that the MD scale correlates with other response bias measures. The real-world consequence of this response bias on the CTQ, however, is understudied. That is, even if we accept that the MD scale indicates a social desirability bias, it is still not clear 1) whether this bias has a positive or negative connotation; or 2) whether such a bias has a meaningful impact on the validity of the CTQ and its role in clinical research and practice.
Importantly, context influences response biases. For example, bias indicators are both common and well-studied in forensic settings. In terms of the CTQ specifically, in a study of 800 young offenders, 38.2% demonstrated elevated scores on the CTQ MD scale, indicating significant underreporting of abuse and neglect in this particular population. Outside of forensic settings, is the MD scale a valid marker of a consequential response bias? On one hand, some researchers have suggested that when a patient has no external motivation to deceive the examiner, certain response biases may be either inconsequential, or even indicative of good mental health (, and see  for another perspective on “minimization” of early maltreatment). On the other hand, the denial of traumatic events in childhood can be associated with severe mental disturbance [40, 41]. In short, even if the MD scale does reliably indicate a response bias, the impact or import of this bias on the CTQ scale’s validity is unclear.
The goals of the present study, then, were to answer three fundamental questions about minimization and its measurement with the MD scale. The first concerns its prevalence, asking: how common is it? The second concerns the MD scale’s characteristics, asking: Is MD characterized by types or degrees of response bias (i.e., is the latent construct discrete or continuous)? More pointedly, this second question concerns whether CTQ responses should be considered either valid or invalid based on a categorical interpretation of the MD subscale, or whether response bias is increasingly prevalent to the degree that MD scores are higher. The third and perhaps most important question concerns the consequence of minimization, asking: Does minimization moderate the discriminative validity of the CTQ in predicting a real-world outcome of interest (i.e. psychiatric illness)? Specifically, this final issue hinges on the well-established fact that childhood maltreatment is predictive of both internalizing and externalizing psychiatric disorders, is associated with almost half of childhood-onset disorders and nearly a third of later-onset disorders, and approximately doubles the likelihood of a broad range of adverse mental health outcomes [1, 42–44]. Accordingly, if response bias (here, MD positivity) indicates denial and the underreporting of maltreatment, and if this bias is consequential, the MD scale should moderate the discriminative validity of the CTQ, diminishing its ability to differentiate between psychiatric patients and community volunteers.
For this archival research study, a literature review was performed in peer-reviewed journals, recruiting research groups who had used the 28-item CTQ. Corresponding authors were contacted and asked to participate if their studies A) included the 28-item CTQ; and B) had a generous sample size (typically, at least 100 participants). Because our goal was to gather a large and generalizable sample, no further restrictions were placed on study inclusion. In all, de-identified, item-level data were collected from 24 samples provided by 21 researchers for a total of 19,652 participants. The studies included (see S1 Table) were conducted in Germany, the Netherlands, Norway, South Africa, South Korea, Sweden, Switzerland, Turkey, the United Kingdom, and the United States of America. In all, 7 different languages (and 7 different, validated versions of the CTQ [31, 33, 45–49]) were represented: English (n = 8,636), German (n = 7,557), Turkish (n = 1,301), Swedish (n = 1,026), Dutch (n = 488), Norwegian (n = 481), and Korean (n = 163). The mean age of participants was 38 (SD = 16); 63% (n = 12,037) were female. Complete data on race and ethnicity were not available on all participants. This study used information that was recorded by the investigators in such a manner that subjects could not be identified, directly or through identifiers linked to the subjects, and therefore was certified as exempt by the Human Research Protection Program of the University of California, San Diego School of Medicine. Specifically, patient records and information were anonymized and de-identified prior to analysis.
Thirty-one percent of participants (n = 6,131) were psychiatric patients and the remaining 69% (n = 13,521) were community-based individuals not actively seeking psychiatric treatment. As data were combined from multiple, independent studies with different screening procedures and instruments, not all participants were systematically screened for all DSM or ICD psychiatric disorders (see S1 Table).
Childhood Trauma Questionnaire.
As previously described, the CTQ is a 28-item self-report inventory with five subscales (EA, PA, SA, EN, and PN) and one response bias subscale, the minimization and denial scale (MD) (Bernstein & Fink, 1998). Each subscale is composed of 5 items (except MD, which is composed of 3) and requires respondents to rate statements using 1 of 5 polytomous response options: (1) “never true”, (2) “rarely true”, (3) “sometimes true”, (4) “often true”, and (5) “very often true”. Two items from the PN subscale and five items from the EN subscale are reverse-coded. The CTQ manual contains a table that classifies subscale scores as well as the CTQ total score into four severity categories: “none/minimal” (EA ≤ 8, PA ≤ 7, SA = 5, EN ≤ 9, PN ≤ 7, CTQ ≤ 36), “low to moderate” (EA > 8 and ≤ 12, PA > 7 and ≤ 9, SA > 5 and ≤ 7, EN > 9 and ≤ 14, PN > 7 and ≤ 9, CTQ > 36 and ≤ 51), “moderate to severe” (EA > 12 and ≤ 15, PA > 9 and ≤ 12, SA > 7 and ≤ 12, EN > 14 and ≤ 17, PN > 9 and ≤ 12, CTQ > 51 and ≤ 68), and “severe to extreme” (EA ≥ 16, PA ≥ 13, SA ≥ 13, EN ≥ 18, PN ≥ 13, CTQ ≥ 69). Per the CTQ’s scoring instructions, MD item scores of 1 through 4 were coded as 0 and scores of 5 were coded as 1. “MD positivity,” then, means the MD score is greater than 0. The psychometric properties of the CTQ have been extensively validated in a number of English-speaking samples [33, 50, 51] and in every language in the current study, including German, Swedish, Norwegian, Turkish, Korean, and Dutch. In the current study, the reliabilities (α) were as follows: EA = .87; PA = .83; SA = .94; EN = .89; PN = .62; and MD = .68.
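As a minimal sketch of the scoring rules just described (not the CTQ manual's official scoring code), the MD dichotomization and the total-score severity bands could be written as follows. The function names are hypothetical and introduced only for illustration; the cutoffs are the CTQ total-score bands quoted above.

```python
from typing import Sequence


def md_score(md_items: Sequence[int]) -> int:
    """Sum of dichotomized MD items: a rating of 5 counts as 1, ratings 1-4 as 0."""
    if len(md_items) != 3:
        raise ValueError("the MD subscale has exactly 3 items")
    if any(rating not in (1, 2, 3, 4, 5) for rating in md_items):
        raise ValueError("item ratings must be integers from 1 to 5")
    return sum(1 for rating in md_items if rating == 5)


def is_md_positive(md_items: Sequence[int]) -> bool:
    """MD positivity means a dichotomized MD score greater than 0."""
    return md_score(md_items) > 0


def ctq_total_severity(total: int) -> str:
    """Map a CTQ total score (25 items rated 1-5, so 25-125) to its severity band."""
    if not 25 <= total <= 125:
        raise ValueError("CTQ total must lie between 25 and 125")
    if total <= 36:
        return "none/minimal"
    if total <= 51:
        return "low to moderate"
    if total <= 68:
        return "moderate to severe"
    return "severe to extreme"
```

For example, a respondent who rates the three MD items 5, 4, and 5 has an MD score of 2 and is MD-positive, whereas ratings of 4, 4, and 4 yield an MD score of 0 despite a fairly favorable description of childhood.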
Besides documenting the frequency of minimization, our second goal was to determine whether the MD construct measured by the CTQ is best represented as a taxon (i.e., different types of minimization) or as a dimension (i.e., degrees of minimization). To do so, we relied on taxometric analyses [52–56], procedures that determine if relations among observed data are better accounted for by the presence of dimensional or categorical latent structure. We analyzed data using three separate taxometric procedures (mean above minus below a cut, MAMBAC; maximum eigenvalue, MAXEIG; and latent mode factor, L-Mode) with Ruscio’s (2012) taxometric program for R. Inverted U-shaped graphs for the MAMBAC procedure, peaked graphs for the MAXEIG procedure, and bimodal distributions of factor scores for the L-Mode procedure are all suggestive of taxonic structure. As part of the software used, taxonicity was judged based on parallel analyses of categorical and dimensional comparison data. Specifically, the approach compares MAMBAC, MAXEIG, and L-Mode curves based on the observed data to curves based on categorical and dimensional simulations. The curves are plotted against the simulated data for comparison. Additionally, the results are summarized using the comparison curve fit index (CCFI). CCFI values range from 0 to 1; values closer to 0 indicate dimensional structure and values closer to 1 indicate categorical structure. If the taxometric results suggest dimensional structure, MD should be treated continuously, with higher scores indicative of increasing levels of minimization and denial; if the taxometric results suggest categorical structure, MD should be treated discretely, with scores used to determine either absence or presence of minimization, with no middle option.
To address our third goal, determining whether the MD subscale scores impact the discriminative validity of the CTQ for a meaningful, real-world variable, we examined whether the MD scale moderated the relationships between CTQ total scores (or subscale scores) and patient versus community status, using a multilevel generalized linear model allowing a random intercept effect for language (nL = 7). Fixed effects included gender, age, standardized MD total scores, and standardized CTQ total scores. We also included an interaction/moderation term for CTQ by MD. Data were analyzed using the lme4 package for R. A logistic link with a binomial error distribution was used. In addition to standard output, we computed coefficients scaled as odds ratios [exp(b)] and partial correlation coefficients (ρXY.Z) for each effect.
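One way to write out the model described in this paragraph is sketched below. The notation is an assumption introduced here for illustration (i indexes participants, j indexes the 7 languages, and the β and u symbols do not appear in the original); the normally distributed random intercept is the standard assumption for this class of model rather than something the text states explicitly.

```latex
\operatorname{logit} P(\text{patient}_{ij} = 1)
  = \beta_0 + u_j
  + \beta_1\,\text{gender}_{ij}
  + \beta_2\,\text{age}_{ij}
  + \beta_3\,\text{MD}_{ij}
  + \beta_4\,\text{CTQ}_{ij}
  + \beta_5\,(\text{CTQ}_{ij} \times \text{MD}_{ij}),
\qquad u_j \sim \mathcal{N}(0, \sigma_u^2)
```

Under this sketch, moderation corresponds to a nonzero β5: the slope relating standardized CTQ scores to the log-odds of patient status is β4 + β5·MD, so a negative β5 means the CTQ discriminates less well as MD scores rise.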
The average CTQ total score was 40.95 (SD = 15.56) across all samples, 38.78 (SD = 14.98) in community samples, and 45.91 (SD = 18.79) in patient samples. Means are reported in Table 1, and descriptive associations between CTQ severity ratings and the clinical versus community criterion variable are reported in Table 2. Table 2 also reports correlations between CTQ scores and the community versus clinical criterion. Patients consistently reported more childhood maltreatment compared to community participants (Fig 1); correlations representing these effects were in the small to medium range. As such, being in the clinical group was positively associated with CTQ total scores (rpb = .20; p < .001). Fig 1 illustrates the relative percentages of clinical versus community participants in each severity quartile of childhood maltreatment.
X-Axis: Quartiles of childhood maltreatment based on total CTQ scores: none, low, moderate, and severe. Y-Axis: The percentage of subjects whose CTQ scores fall into that severity quartile. Within each quartile, the bar depicted on the left represents the percentage of clinical subjects (n = 5429–5876), and the bar on the right represents the percentage of community subjects (n = 12432–12915). Notably, the largest relative percentage of community subjects was in the “none” maltreatment quartile. That trend was reversed in the “moderate” and “severe” categories, where double the percentage of subjects were in the clinical group.
The average MD scale score was 0.66 (SD = 0.96) across all samples, 0.46 (SD = 0.83) in patient samples, and 0.74 (SD = 1.00) in community samples (Table 1). 42% of community samples were MD positive, versus 28% of clinical samples, and clinical samples scored significantly lower on the MD scale (rpb = -.14, p < .001) (Table 3). MD scores demonstrated a strong negative correlation with CTQ total scores (-0.53; p < 0.001) (Table 4).
Fig 2 presents averaged curves, along with categorical and dimensional comparisons, for all three taxometric procedures. As can be seen, the averaged MAMBAC curve is highly consistent with the dimensional comparison data. Moreover the MAMBAC CCFI value of .12 supports a dimensional MD construct. The MAXEIG curve is more ambiguous, as is the MAXEIG CCFI value of .45, and only weakly favors dimensional structure. Although the L-Mode plot is somewhat multimodal, it is difficult to determine whether the curve is more consistent with either the categorical or dimensional comparison data. The L-Mode CCFI value of .31 suggests the latter.
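To make the reading of these CCFI values concrete, the short sketch below labels each reported value and averages them. The function name and the 0.45 to 0.55 "ambiguous" band are assumptions introduced for illustration, and the mean CCFI is computed here only as a convenience; neither the band nor the mean is reported in the study itself.

```python
from statistics import mean


def interpret_ccfi(ccfi: float, ambiguous_band: tuple[float, float] = (0.45, 0.55)) -> str:
    """Label a CCFI value: near 0 suggests dimensional, near 1 categorical structure.

    The ambiguous band is an illustrative assumption, not a value from the study.
    """
    if not 0.0 <= ccfi <= 1.0:
        raise ValueError("CCFI values range from 0 to 1")
    low, high = ambiguous_band
    if low <= ccfi <= high:
        return "ambiguous"
    return "dimensional" if ccfi < low else "categorical"


# CCFI values as reported in the text for the three taxometric procedures.
reported = {"MAMBAC": 0.12, "MAXEIG": 0.45, "L-Mode": 0.31}

labels = {name: interpret_ccfi(value) for name, value in reported.items()}
mean_ccfi = mean(reported.values())

for name, value in reported.items():
    print(f"{name}: CCFI = {value:.2f} -> {labels[name]}")
print(f"mean CCFI = {mean_ccfi:.2f} -> {interpret_ccfi(mean_ccfi)}")
```

With this band, MAMBAC and L-Mode fall on the dimensional side and MAXEIG sits at the edge of the ambiguous range, which matches the qualitative description above.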
Top row: left panel—average MAMBAC curve for the observed data (dark line) in comparison to simulated taxonic data (light lines representing one standard deviation above and below the mean); right panel—average MAMBAC curve for the observed data (dark line) in comparison to simulated dimensional data (light lines representing one standard deviation above and below the mean). Middle row: left panel—average MAXEIG curve for the observed data (dark line) in comparison to simulated taxonic data (light lines representing one standard deviation above and below the mean); right panel—average MAXEIG curve for the observed data (dark line) in comparison to simulated dimensional data (light lines representing one standard deviation above and below the mean). Bottom row: left panel—average L-Mode curve for the observed data (dark line) in comparison to simulated taxonic data (light lines representing one standard deviation above and below the mean); right panel—average L-Mode curve for the observed data (dark line) in comparison to simulated dimensional data (light lines representing one standard deviation above and below the mean). Inverted U-shaped graphs for the MAMBAC procedure, peaked graphs for the MAXEIG procedure, and bimodal distributions of factor scores for the L-Mode procedure are all suggestive of taxonic structure.
Given that the overall taxometric results suggest that MD is consistent with a dimensional rather than categorical construct, a multilevel model was fitted to the data assuming a continuous MD variable (i.e., subscale total scores). The overall model’s pseudo R2 was .23. Without the main effect of MD, or the interaction between CTQ and MD, the pseudo R2 was .14. The main effects of gender, age, CTQ total scores, and MD subscale scores were all significant. Patients were more likely to be younger (b = -0.02 [CI95% = -0.02, -0.01], SE = 0.001, p < 0.01, exp(b) = 0.98, ρXY.Z = 0.10), male (b = 0.19 [CI95% = 0.10, 0.28], SE = 0.05, p < 0.01, exp(b) = 1.21, ρXY.Z = 0.03), have higher CTQ total scores (b = 0.35 [CI95% = 0.30, 0.41], SE = 0.03, p < 0.001, exp(b) = 1.43, ρXY.Z = 0.11), and lower MD total scores (b = -0.09 [CI95% = -0.16, -0.03], SE = 0.03, p < 0.01, exp(b) = 0.91, ρXY.Z = 0.02). The interaction between CTQ and MD total scores was also significant. CTQ total scores were less accurate in predicting patient status when MD subscale scores were high (b = -0.08 [CI95% = -0.15, -0.01], SE = 0.04, p = 0.03, exp(b) = 0.92, ρXY.Z = 0.01). We next examined whether MD moderated associations between CTQ subscale scores and the clinical versus community criterion variable. Although the main effects for all subscale scores on the criterion were significant and negative (i.e., patients reported more abuse and neglect), the only significant interaction term was between EN and MD. As with CTQ scores, EN subscale scores were less accurate in predicting patient status when MD subscale scores were high (b = -0.16 [CI95% = -0.22, -0.10], SE = 0.03, p < .001, exp(b) = 0.85, ρXY.Z = 0.04). Overall, these results indicate that MD subscale scores have a small but significant moderating effect on the relation between CTQ total scores and the clinical versus community criterion variable, and that this moderation effect is particularly pronounced for the EN subscale.
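The size of the moderation effect can be illustrated with a small calculation from the reported coefficients. This is a sketch under the assumption that CTQ and MD scores are both standardized (as stated in the Methods), so the slope for CTQ at a given MD level is the CTQ coefficient plus the interaction coefficient times the MD z-score; the function and constant names are hypothetical.

```python
import math

# Coefficients as reported in the text (log-odds scale, standardized predictors).
B_CTQ = 0.35        # main effect of CTQ total score
B_CTQ_X_MD = -0.08  # CTQ-by-MD interaction


def odds_ratio(b: float) -> float:
    """Convert a log-odds coefficient to an odds ratio, exp(b)."""
    return math.exp(b)


def ctq_slope_at_md(md_z: float, b_ctq: float = B_CTQ,
                    b_interaction: float = B_CTQ_X_MD) -> float:
    """Conditional log-odds slope of CTQ at a given standardized MD score."""
    return b_ctq + b_interaction * md_z


# Compare the CTQ slope one SD below the mean MD score, at the mean, and one SD above.
for md_z in (-1.0, 0.0, 1.0):
    slope = ctq_slope_at_md(md_z)
    print(f"MD z = {md_z:+.0f}: CTQ slope = {slope:.2f}, "
          f"odds ratio per CTQ SD = {odds_ratio(slope):.2f}")
```

On these rounded coefficients, the odds ratio per standard deviation of CTQ falls from roughly 1.54 at one standard deviation below the mean MD score to roughly 1.31 at one standard deviation above it. Because the published coefficients are rounded, values computed this way may differ slightly from the exp(b) figures reported in the text.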
In this analysis of the Childhood Trauma Questionnaire’s (CTQ) Minimization and Denial (MD) scale, we report three main findings. First, despite the fact that its import has been marginalized in the vast majority of studies that utilize the CTQ, in this large, multinational sample, minimization (defined as MD positivity; see Methods) is not rare, occurring in about thirty percent of the CTQ scales from clinical subjects and forty percent of the scales from community subjects. Second, results indicate that the latent MD construct is characterized by degrees rather than types of response bias. That is, people vary along a continuum between low and high levels of minimization. Third, our data indicate that the MD subscale is consequential. Specifically, the strength of association between the CTQ and the probability of being in the patient sample (if CTQ was high), or being in the community sample (if CTQ was low), was attenuated by MD scores. This latter result provides evidence, for the first time as far as we are aware, that MD scores moderate the discriminative validity of the CTQ for a meaningful clinical outcome measure. Importantly, then, given that minimization is both common and consequential, these findings call into question the current practice of ignoring the MD scale, and support the scale’s intended function: earmarking certain people’s CTQ results for further investigation, analyses, or exclusion.
Consistent with prior findings that childhood maltreatment is correlated with a wide range of mental disorders [1, 12, 62], CTQ total scores (especially the EN and EA subscales) significantly predicted patient versus community status. Simply put, participants who reported more childhood experiences of abuse and neglect were more likely to be psychiatric patients. Though causality cannot be inferred from a cross-sectional sample, this result is consistent with previous meta-analytic research that addresses issues of causality, and indicates that retrospectively-assessed childhood trauma has a causal role in increasing the risk for a wide range of psychiatric illness, including psychotic illness, mood disorders, dissociative disorders, anxiety disorders, substance use disorders, and personality disorders [1, 2, 62, 63]. Numerically, comparing our results with Baker’s review of another large (n > 1400), heterogeneous, combined sample of clinical and community CTQ scores demonstrates a striking similarity between mean scores on the two subscales which showed the largest differences between clinical and community samples in our study: EA (10.1 vs 7.8) and EN (12.5 vs 9.4) compared with Baker’s EA (11.4 vs 8.5) and EN (12.5 vs 9.7). Though subtypes of maltreatment co-occur more often than not, and though historically, the impact of psychological maltreatment has been perhaps underemphasized (but see ), our findings again emphasize the unique impact of this more occult and less-studied subtype of maltreatment [62, 66].
Regarding minimization, MD scores were negatively related to being in the patient sample (i.e., decreased log odds). As mentioned in the introduction, the dichotomized scoring of MD items is designed specifically to identify response bias. It is possible that polytomous scoring of MD items (i.e., Gerdner’s aforementioned Idealization of the Upbringing construct; see ) would have a stronger effect. We chose not to explore this option, but it represents a possible direction for new research. High MD scores also attenuated the otherwise strong associations between high CTQ total scores and being in the patient sample as well as low CTQ total scores and being in the community sample. Thus, it appears that low CTQ scores in the presence of high MD scores are more likely to result in false negative diagnoses/classifications, consistent with the original design and purpose of the MD scale . Given that childhood maltreatment increases the risk of psychiatric illness, and that the MD score attenuates that relationship, the most straightforward interpretation of our results is that they support the validity of the MD scale in detecting minimization and denial of trauma. Interestingly, when we examined the impact of MD on CTQ subscale scores, we found the EN subscale was particularly sensitive to the impact of minimization and denial. Reasons for this may include content overlap (four EN and two MD items contain the word “family”, for example), as well as the reality that EN (along with EA) was one of the subscales of the CTQ most predictive of our criterion variable in the first place.
Pragmatically, the results of this study suggest that the inclusion of MD-positive respondents in published studies using the CTQ may lead to attenuated relations between CTQ total and subscale scores (especially EN) and the various outcomes reported. In other words, findings from studies that: 1) used the 25-item CTQ (which excludes the MD subscale); or 2) use the 28-item CTQ (but fail to exclude MD-positive participants from analyses) may actually represent a conservative estimate of the true association between childhood trauma and its sequelae: exclusion of MD-positive participants from such analyses could strengthen associations. That said, future research is needed to determine how to best handle participants with high MD scores.
Expanding on this latter point, although the current findings support the practice of removal of MD-positive participants due to potential reporting bias (for example ), at least three practical issues warrant consideration. First, we found evidence that MD is characterized by degrees rather than types of response bias, and therefore, there is no simple cutoff MD score for valid versus invalid responding. Second, removing MD positive cases may result in restriction of range problems (i.e. a reduced range of CTQ scores), and may attenuate statistical relations with outcome variables in research studies. It is possible that more comprehensive or multidimensional psychometric models  may be able to account for the attenuating impact of MD without throwing away data. Third, the actual degree of attenuation produced by MD appears to be small, and therefore, its effect is likely most noticeable when sample sizes are large, when there are a large number of MD positive scores, and when outcome criteria have moderate base rates and strong associations with CTQ scales. The absence of these sample characteristics likely explains why some previous studies have failed to find an effect of MD moderating the association between CTQ scores and clinical variables (e.g.,).
We highlight five limitations to the findings presented here. First, although the large sample size was a strength, it is possible that combining distinct, multinational and multilingual samples created unknown biases in the results. Though each study included in the analysis used a valid and reliable translation of the scale (see references above), measurement bias (a lack of measurement invariance) due to language or cultural differences among the samples is a possibility. Second, participants in the community sample were not systematically screened for psychiatric disorders. As such, although the patient versus community criterion variable is meaningful in itself, it is not a pure indicator of psychiatric illness due to criterion-group contamination. To the extent that community participants had undiagnosed or unacknowledged psychiatric illness (which is likely), our results may actually underestimate the moderating effect of MD on the CTQ’s discriminative validity. Third, the relatively small absolute degree of moderation we found may be influenced by our particular sample. Other samples (for example, with a higher percentage of very low or very high CTQ scores) might have shown either higher or lower degrees of moderation. Fourth, although we recognize the importance of brevity in response bias measures, the taxometric findings in this study are limited by the relatively small number of MD items. Taxometric analysis, moreover, cannot confirm the validity of the MD construct, and an assessment of the MD scale’s validity was not our aim. Fifth and last is the issue of the reliability of the MD scale. Most studies that report on the CTQ’s reliability do not mention the reliability of the MD scale [16, 47, 70–73]. Some researchers who have examined this issue report that the MD scale has low test-retest reliability (Daeho Kim and Linde Martin, personal communication), whereas others find that it has satisfactory internal consistency (Arne Gerdner, personal communication). In this particular sample, the MD scale was in fact more reliable than the PN scale (see Methods), whose factor structure has repeatedly been questioned [31, 73–75]. Replication studies investigating response bias with a scale that contains a greater number of items are recommended.
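The recommendation for a longer response bias scale is consistent with a standard psychometric relationship that the text does not spell out. Under the Spearman-Brown prophecy formula, lengthening a scale by a factor k with comparable items raises its predicted reliability to kr / (1 + (k - 1)r), where r is the current reliability. The sketch below applies that formula to hypothetical values; the starting reliability of 0.60 is an assumption chosen for illustration and is not a figure from this study.

```python
def spearman_brown(reliability, length_factor):
    """Predicted reliability after multiplying scale length by length_factor.

    Assumes the added items are parallel to the existing ones, which is the
    usual (and often optimistic) assumption behind the formula.
    """
    if not 0.0 <= reliability <= 1.0:
        raise ValueError("reliability must lie in [0, 1]")
    if length_factor <= 0:
        raise ValueError("length_factor must be positive")
    return (length_factor * reliability) / (
        1.0 + (length_factor - 1.0) * reliability
    )


# Hypothetical starting reliability, not an estimate from the CTQ data.
baseline = 0.60
for k in (1, 2, 3):
    predicted = spearman_brown(baseline, k)
    print(f"length x{k}: predicted reliability = {predicted:.2f}")
```

With this assumed baseline, doubling the number of items gives a predicted reliability of 0.75. The gain depends on the new items measuring the same construct as well as the existing ones do, which would itself need to be established empirically.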
In conclusion, our results call into question the widespread disuse of the CTQ’s MD scale and suggest that this frequently ignored response bias scale has a small but significant moderating effect on the CTQ’s discriminative validity. Clinical researchers and practitioners using the CTQ to study the prevalence or correlates of childhood maltreatment are advised to carefully identify study participants and patients with positive MD scores, particularly in the presence of very low CTQ scores, and to consider whether their response data can be considered valid. Finally, to the extent that our findings hold, many of the reported effects of childhood maltreatment assessed by the CTQ may actually be more significant than reported.
S1 Dataset. This Excel spreadsheet contains the pooled raw data from all of the collaborating investigators.
This has been submitted at the request of the publishing entity, so that other researchers may also have access to the dataset used in our analyses.
S1 Table. Samples included in the analysis.
This table lists all of the data sets used for this research by primary investigator, providing the number of community members in each sample; the number of clinical patients in each sample (alongside the type of clinical sample); the language used by that research group; and a reference to where else these results were published. Validation studies for the non-English versions of the CTQ: German (Wingenfeld et al., 2010); Swedish (Gerdner & Allgulander, 2009); Norwegian (Dovran et al., 2013); Turkish (Sar et al., 2012); Korean (Kim et al., 2011); Dutch (Thombs et al., 2009).
Disclaimer: The contents of this manuscript do not reflect the views of the Department of Veterans Affairs or the United States Government.
Conceived and designed the experiments: KM AS MT. Performed the experiments: RFD TK HG BB MB NS KW MD MV UD LC CF AC HK VS DTK MLS ER JL CL SR IS MBS CSW HJG MH GB GL AG DK. Analyzed the data: MT KM AS BS. Contributed reagents/materials/analysis tools: MT KM AS BS KP RFD TK HG BB MB NS KW MD MV UD LC CF AC HK VS DK ER JL CL SR IS MLS CSW HJG MH GB GL AG DTK MBS. Wrote the paper: KM AS MT. Performed background research: BS KP. Collated, organized, and processed raw data: BS KP. Collated author feedback: BS KP. Edited the final MS: BS KP.
- 1. Norman RE, Byambaa M, De R, Butchart A, Scott J, Vos T. The long-term health consequences of child physical abuse, emotional abuse, and neglect: a systematic review and meta-analysis. PLOS Med. 2012;9(11):e1001349. pmid:23209385
- 2. Gilbert R, Widom CS, Browne K, Fergusson D, Webb E, Janson S. Burden and consequences of child maltreatment in high-income countries. Lancet. 2009 Jan 3;373(9657):68–81. pmid:19056114
- 3. Shenk CE, Noll JG, Putnam FW, Trickett PK. A prospective examination of the role of childhood sexual abuse and physiological asymmetry in the development of psychopathology. Child Abuse Negl. 2010 Oct;34(10):752–61. pmid:20850183
- 4. Danese A, Pariante CM, Caspi A, Taylor A, Poulton R. Childhood maltreatment predicts adult inflammation in a life-course study. Proc Natl Acad Sci U S A. 2007 Jan 23;104(4):1319–24. pmid:17229839
- 5. Bradley B, Defife JA, Guarnaccia C, Phifer J, Fani N, Ressler KJ, et al. Emotion dysregulation and negative affect: association with psychiatric symptoms. J Clin Psychiatry. 2011 May;72(5):685–91. pmid:21658350
- 6. Barrios YV, Gelaye B, Zhong Q, Nicolaidis C, Rondon MB, Garcia PJ, et al. Association of Childhood Physical and Sexual Abuse with Intimate Partner Violence, Poor General Health and Depressive Symptoms among Pregnant Women. PLOS One. 2015;10(1):e0116609. pmid:25635902
- 7. Kessler RC, Davis CG, Kendler KS. Childhood adversity and adult psychiatric disorder in the US National Comorbidity Survey. Psychol Med. 1997 Sep;27(5):1101–19. pmid:9300515
- 8. Nanni V, Uher R, Danese A. Childhood maltreatment predicts unfavorable course of illness and treatment outcome in depression: a meta-analysis. Am J Psychiatry. 2012 Feb;169(2):141–51. pmid:22420036
- 9. Hoertel N, Franco S, Wall MM, Oquendo MA, Wang S, Limosin F, et al. Childhood maltreatment and risk of suicide attempt: a nationally representative study. J Clin Psychiatry. 2015 Jul;76(7):916–23; quiz 23. pmid:26231006
- 10. Dube SR, Anda RF, Felitti VJ, Chapman DP, Williamson DF, Giles WH. Childhood abuse, household dysfunction, and the risk of attempted suicide throughout the life span: findings from the Adverse Childhood Experiences Study. JAMA. 2001 Dec 26;286(24):3089–96. pmid:11754674
- 11. Wegman HL, Stetler C. A meta-analytic review of the effects of childhood abuse on medical outcomes in adulthood. Psychosom Med. 2009 Oct;71(8):805–12. pmid:19779142
- 12. Anda RF, Felitti VJ, Bremner JD, Walker JD, Whitfield C, Perry BD, et al. The enduring effects of abuse and related adverse experiences in childhood. A convergence of evidence from neurobiology and epidemiology. Eur Arch Psychiatry Clin Neurosci. 2006 Apr;256(3):174–86. pmid:16311898
- 13. Shonkoff JP, Boyce WT, McEwen BS. Neuroscience, molecular biology, and the childhood roots of health disparities: building a new framework for health promotion and disease prevention. JAMA. 2009 Jun 3;301(21):2252–9. pmid:19491187
- 14. Rossiter A, Byrne F, Wota AP, Nisar Z, Ofuafor T, Murray I, et al. Childhood trauma levels in individuals attending adult mental health services: An evaluation of clinical records and structured measurement of childhood trauma. Child Abuse Negl. 2015 Jan 27.
- 15. Teicher MH, Parigger A. The 'Maltreatment and Abuse Chronology of Exposure' (MACE) Scale for the Retrospective Assessment of Abuse and Neglect During Development. PLOS One. 2015;10(2):e0117423. pmid:25714856
- 16. Bernstein DP, Fink L, Handelsman L, Foote J, Lovejoy M, Wenzel K, et al. Initial reliability and validity of a new retrospective measure of child abuse and neglect. Am J Psychiatry. 1994 Aug;151(8):1132–6. pmid:8037246
- 17. Baker AJ, Maiorino E. Assessments of emotional abuse and neglect with the CTQ: Issues and estimates. Children and Youth Services Review. 2010;32:740–8.
- 18. Bernet CZ, Stein MB. Relationship of childhood maltreatment to the onset and course of major depression in adulthood. Depress Anxiety. 1999;9(4):169–74. pmid:10431682
- 19. Klein DN, Arnow BA, Barkin JL, Dowling F, Kocsis JH, Leon AC, et al. Early adversity in chronic depression: clinical correlates and response to pharmacotherapy. Depress Anxiety. 2009;26(8):701–10. pmid:19434623
- 20. Shalev I, Moffitt TE, Sugden K, Williams B, Houts RM, Danese A, et al. Exposure to violence during childhood is associated with telomere erosion from 5 to 10 years of age: a longitudinal study. Mol Psychiatry. 2013 May;18(5):576–81. pmid:22525489
- 21. van Zelst C, van Nierop M, van Dam DS, Bartels-Velthuis AA, Delespaul P. Associations between Stereotype Awareness, Childhood Trauma and Psychopathology: A Study in People with Psychosis, Their Siblings and Controls. PLOS One. 2015;10(2):e0117386. pmid:25705878
- 22. Sudbrack R, Manfro PH, Kuhn IM, de Carvalho HW, Lara DR. What doesn't kill you makes you stronger and weaker: How childhood trauma relates to temperament traits. J Psychiatr Res. 2015 Jan 15.
- 23. Teicher MH, Anderson CM, Ohashi K, Polcari A. Childhood maltreatment: altered network centrality of cingulate, precuneus, temporal pole and insula. Biol Psychiatry. 2014 Aug 15;76(4):297–305. pmid:24209775
- 24. Dannlowski U, Stuhrmann A, Beutelmann V, Zwanzger P, Lenzen T, Grotegerd D, et al. Limbic scars: long-term consequences of childhood maltreatment revealed by functional and structural magnetic resonance imaging. Biol Psychiatry. 2012 Feb 15;71(4):286–93. pmid:22112927
- 25. Berglund KJ, Balldin J, Berggren U, Gerdner A, Fahlke C. Childhood maltreatment affects the serotonergic system in male alcohol-dependent individuals. Alcohol Clin Exp Res. 2013 May;37(5):757–62. pmid:23384117
- 26. Grant MM, Wood K, Sreenivasan K, Wheelock M, White D, Thomas J, et al. Influence of Early Life Stress on Intra- and Extra-Amygdaloid Causal Connectivity. Neuropsychopharmacology. 2015 Jan 29.
- 27. Swartz JR, Knodt AR, Radtke SR, Hariri AR. A neural biomarker of psychological vulnerability to future life stress. Neuron. 2015 Feb 4;85(3):505–11. pmid:25654256
- 28. Fergusson DM, Mullen PE. Childhood Sexual Abuse: an evidence based perspective. Thousand Oaks (California): SAGE; 1999.
- 29. Maughan B, Rutter M. Retrospective reporting of childhood adversity: issues in assessing long-term recall. J Pers Disord. 1997 Spring;11(1):19–33. pmid:9113820
- 30. Bernstein DP, Stein JA, Newcomb MD, Walker E, Pogge D, Ahluvalia T, et al. Development and validation of a brief screening version of the Childhood Trauma Questionnaire. Child Abuse Negl. 2003 Feb;27(2):169–90. pmid:12615092
- 31. Gerdner A, Allgulander C. Psychometric properties of the Swedish version of the Childhood Trauma Questionnaire-Short Form (CTQ-SF). Nord J Psychiatry. 2009;63(2):160–70. pmid:19021077
- 32. MacDonald K, Thomas ML, MacDonald TM, Sciolla AF. A Perfect Childhood? Clinical Correlates of Minimization and Denial on the Childhood Trauma Questionnaire. J Interpers Violence. 2014 Jun 30.
- 33. Bernstein DP, Fink L. Childhood Trauma Questionnaire: A Retrospective Self Report. Manual. San Antonio, TX: The Psychological Corporation, Harcourt Brace and Company; 1998.
- 34. Crowne DP, Marlowe D. A new scale of social desirability independent of psychopathology. J Consult Psychol. 1960 Aug;24:349–54. pmid:13813058
- 35. Paulhus DL. Measurement and control of response bias. In: Robinson JP, Shaver PR, Wrightsman LW, editors. Measurement of personality and social psychological attitudes. San Diego, CA: Academic Press; 1991. p. 17–53.
- 36. Rogers R, editor. Clinical assessment of malingering and deception (3rd ed.). New York: Guilford Press; 2008.
- 37. Kenny DT, Lennings CJ, Nelson P. The mental health of young offenders serving orders in the community: Implications for rehabilitation. Journal of Offender Rehabilitation (Special Edition). 2008;45(1 and 2):123–48.
- 38. Uziel L. Rethinking Social Desirability Scales: From Impression Management to Interpersonally Oriented Self-Control. Perspectives on Psychological Science. 2010;5(3):243–62. pmid:26162157
- 39. Varia R, Abidin RR, Dass P. Perceptions of abuse: effects on adult psychological and social adjustment. Child Abuse Negl. 1996 Jun;20(6):511–26. pmid:8800526
- 40. Lewis DO, Yeager CA, Swica Y, Pincus JH, Lewis M. Objective documentation of child abuse and dissociation in 12 murderers with dissociative identity disorder. Am J Psychiatry. 1997 Dec;154(12):1703–10. pmid:9396949
- 41. Dalenberg CJ, Brand BL, Gleaves DH, Dorahy MJ, Loewenstein RJ, Cardena E, et al. Evaluation of the evidence for the trauma and fantasy models of dissociation. Psychol Bull. 2012 May;138(3):550–88. pmid:22409505
- 42. Scott KM, Smith DR, Ellis PM. Prospectively Ascertained Child Maltreatment and Its Association With DSM-IV Mental Disorders in Young Adults. Arch Gen Psychiatry. 2010 Jul;67(7):712–9. pmid:20603452
- 43. Green JG, McLaughlin KA, Berglund PA, Gruber MJ, Sampson NA, Zaslavsky AM, et al. Childhood adversities and adult psychiatric disorders in the national comorbidity survey replication I: associations with first onset of DSM-IV disorders. Arch Gen Psychiatry. 2010 Feb;67(2):113–23. pmid:20124111
- 44. Keyes KM, Eaton NR, Krueger RF, McLaughlin KA, Wall MM, Grant BF, et al. Childhood maltreatment and the structure of common psychiatric disorders. Br J Psychiatry. 2012 Feb;200(2):107–15. pmid:22157798
- 45. Wingenfeld K, Spitzer C, Mensebach C, Grabe HJ, Hill A, Gast U, et al. [The German version of the Childhood Trauma Questionnaire (CTQ): preliminary psychometric properties]. Psychother Psychosom Med Psychol. 2010 Nov;60(11):442–50. pmid:20200804
- 46. Dovran A, Winje D, Overland SN, Breivik K, Arefjord K, Dalsbo AS, et al. Psychometric properties of the Norwegian version of the Childhood Trauma Questionnaire in high-risk groups. Scand J Psychol. 2013 Aug;54(4):286–91. pmid:23672336
- 47. Kim D, Park SC, Yang H, Oh DH. Reliability and validity of the Korean version of the Childhood Trauma Questionnaire-Short Form for psychiatric outpatients. Psychiatry Investig. 2011 Dec;8(4):305–11. pmid:22216039
- 48. Thombs BD, Bernstein DP, Lobbestael J, Arntz A. A validation study of the Dutch Childhood Trauma Questionnaire-Short Form: factor structure, reliability, and known-groups validity. Child Abuse Negl. 2009 Aug;33(8):518–23. pmid:19758699
- 49. Sar V, Ozturk E, Ikikardes E. Validity and Reliability of the Turkish Version of Childhood Trauma Questionnaire. Turkiye Klinikleri J Med Sci. 2012;32(4):1054–63.
- 50. Scher CD, Stein MB, Asmundson GJ, McCreary DR, Forde DR. The childhood trauma questionnaire in a community sample: psychometric properties and normative data. J Trauma Stress. 2001 Oct;14(4):843–57. pmid:11776429
- 51. Spinhoven P, Penninx BW, Hickendorff M, van Hemert AM, Bernstein DP, Elzinga BM. Childhood Trauma Questionnaire: factor structure, measurement invariance, and validity across emotional disorders. Psychol Assess. 2014 Sep;26(3):717–29. pmid:24773037
- 52. Schmidt NB, Kotov R, Joiner TE Jr. Taxometrics: Toward a new diagnostic scheme for psychology. Washington, D.C.: American Psychological Association; 2004.
- 53. Meehl PE. Bootstraps taxometrics: Solving the classification problem in psychopathology. American Psychologist. 1995;50:266–75. pmid:7733538
- 54. Meehl PE, Yonce LJ. Taxometric analysis: I. Detecting taxonicity with two quantitative indicators using means above and below a sliding cut (MAMBAC procedure). Psychol Rep. 1994;74:1059–274.
- 55. Ruscio J, Haslam N, Ruscio AM. Introduction to the taxometric method: A practical guide. Mahwah, NJ: Lawrence Erlbaum Associates Publishers; 2006.
- 56. Waller NG. Carving nature at its joints: Paul Meehl's development of taxometrics. J Abnorm Psychol. 2006;115:210–5. pmid:16737384
- 57. R Development Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria; 2011.
- 58. Ruscio J, Walters GD, Marcus DK, Kaczetow W. Comparing the relative fit of categorical and dimensional latent variable models using consistency tests. Psychol Assess. 2010;22:5–21. pmid:20230147
- 59. Haynes SN, Smith GT, Hunsley JD. Scientific foundations of clinical assessment. New York: Routledge; 2011.
- 60. Hox JJ. Multilevel analysis: Techniques and applications (2nd ed.). New York: Routledge/Taylor & Francis Group; 2010.
- 61. lme4: Linear mixed-effects models using Eigen and S4. R package version 1.1-5 [database on the Internet]. 2014. Available from: http://CRAN.R-project.org/package=lme4.
- 62. Carr CP, Martins CM, Stingel AM, Lemgruber VB, Juruena MF. The role of early life stress in adult psychiatric disorders: a systematic review according to childhood trauma subtypes. J Nerv Ment Dis. 2013 Dec;201(12):1007–20. pmid:24284634
- 63. Heim C, Shugart M, Craighead WE, Nemeroff CB. Neurobiological and psychiatric consequences of child abuse and neglect. Dev Psychobiol. 2010 Nov;52(7):671–90. pmid:20882586
- 64. Felitti VJ, Anda RF, Nordenberg D, Williamson DF, Spitz AM, Edwards V, et al. Relationship of childhood abuse and household dysfunction to many of the leading causes of death in adults. The Adverse Childhood Experiences (ACE) Study. Am J Prev Med. 1998 May;14(4):245–58. pmid:9635069
- 65. Claussen AH, Crittenden PM. Physical and psychological maltreatment: relations among types of maltreatment. Child Abuse Negl. 1991;15(1–2):5–18. pmid:2029672
- 66. Wright MO, Crawford E, Del Castillo D. Childhood emotional maltreatment and later psychological distress among college students: the mediating role of maladaptive schemas. Child Abuse Negl. 2009 Jan;33(1):59–68. pmid:19167067
- 67. Reckase MD. Multidimensional item response theory. New York: Springer; 2009.
- 68. Millsap RE. Statistical approaches to measurement invariance. New York: Routledge/Taylor & Francis Group; 2011.
- 69. Molnar BE, Buka SL, Kessler RC. Child sexual abuse and subsequent psychopathology: results from the National Comorbidity Survey. Am J Public Health. 2001 May;91(5):753–60. pmid:11344883
- 70. Li XB, Liu JT, Zhu XZ, Zhang L, Tang YL, Wang CY. Childhood trauma associates with clinical features of bipolar disorder in a sample of Chinese patients. J Affect Disord. 2014 Oct 15;168:58–63. pmid:25036010
- 71. Garrusi B, Nakhaee N. Validity and reliability of a Persian version of the Childhood Trauma Questionnaire. Psychol Rep. 2009 Apr;104(2):509–16. pmid:19610481
- 72. Paivio SC, Cramer KM. Factor structure and reliability of the Childhood Trauma Questionnaire in a Canadian undergraduate student sample. Child Abuse Negl. 2004 Aug;28(8):889–904. pmid:15350772
- 73. Grassi-Oliveira R, Cogo-Moreira H, Salum GA, Brietzke E, Viola TW, Manfro GG, et al. Childhood Trauma Questionnaire (CTQ) in Brazilian samples of different age groups: findings from confirmatory factor analysis. PLOS One. 2014;9(1):e87118. pmid:24475237
- 74. Klinitzke G, Romppel M, Hauser W, Brahler E, Glaesmer H. [The German Version of the Childhood Trauma Questionnaire (CTQ)—Psychometric Characteristics in a Representative Sample of the General Population]. Psychother Psychosom Med Psychol. 2012 Feb;62(2):47–51. pmid:22203470
- 75. Kim D, Bae H, Han C, Oh HY, Macdonald K. Psychometric properties of the Childhood Trauma Questionnaire-Short Form (CTQ-SF) in Korean patients with schizophrenia. Schizophr Res. 2013 Mar;144(1–3):93–8. pmid:23352775