Skip to main content
Browse Subject Areas

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

The dimensionality and latent structure of mental health difficulties and wellbeing in early adolescence

  • Louise Black ,

    Roles Conceptualization, Formal analysis, Methodology, Visualization, Writing – original draft

    Affiliation Manchester Institute of Education, University of Manchester, Manchester, United Kingdom

  • Margarita Panayiotou,

    Roles Conceptualization, Methodology, Supervision, Writing – review & editing

    Affiliation Manchester Institute of Education, University of Manchester, Manchester, United Kingdom

  • Neil Humphrey

    Roles Funding acquisition, Investigation, Project administration, Resources, Supervision, Writing – review & editing

    Affiliation Manchester Institute of Education, University of Manchester, Manchester, United Kingdom


Research with adults and older adolescents suggests a general factor may underlie both mental health difficulties and wellbeing. However, the classical bifactor model commonly used to demonstrate this general trait has recently been criticised when a unidimensional structure is not supported. Furthermore, research is lacking in this area with children and early adolescents. We present confirmatory factor analysis models to explore the structure of psychopathology and wellbeing in early adolescents, using secondary data from a large U.K. sample (N = 1982). A simple correlated factors structure fitted the data well and revealed that wellbeing was just as related to internalising as this was to externalising symptoms. The classical bifactor solution also fitted the data well but was rejected as the general factor explained only 55% of the total common variance. S-1 models were therefore used to explore general covariance in a more robust way, and revealed that a general internalising distress factor could play an important role in all item responses. Gender and income differences in mental health were also explored through invariance testing and correlations. Our findings demonstrate the importance of considering mental health difficulties and wellbeing items together, and suggestions are made for how their correspondence could be controlled for.


Both mental ill health and positive wellbeing in young people are associated with outcomes such as academic attainment and social functioning [15], as well as demographic and environmental correlates [614]. The majority of mental health problems have first onset in adolescence [15], and can result in significant disability [6, 8, 9]. Furthermore, it is widely agreed that adolescence, ranging from ages 10–24, is critical to functioning in later life [1618], while recent evidence suggests young people’s mental health may be deteriorating [6, 12].

Despite this clear need to understand the form of mental health, particularly in young people, its conceptualisation and measurement have been inconsistent. A historic focus on disorder remains the basis for measurement [19], even though the absence of disorder symptoms consistently fails to fully explain wellbeing in young people [15, 7, 13, 14]. The limitations of categorical diagnoses are also becoming increasingly clear, with criticisms focussing predominantly on stigmatisation via poorly evidenced medical models [19], and a lack of validity for discrete disorders [20, 21]. For instance, hyperactivity disorders have been criticised as pathologising typical and expected behaviour in children and adolescents, particularly boys [22], and studies have repeatedly failed to discern groups experiencing one externalising disorder without other comorbid problems [2325]. Symptom-level and hierarchical approaches, on the other hand, are emerging as useful ways to understand structure, risk and comorbidity in mental health difficulties. Such approaches have demonstrated consistent covariance between symptoms, cutting across traditional disorder taxonomies [20, 2632]. In fact, not only is there strong evidence of general covariance between symptoms of mental health, longitudinal research (from birth to midlife) suggests that experiencing symptoms of mental disorder is the norm, with only a small minority remaining completely symptom-free over time [33]. This supports the current shift in understanding, in which taxonomic approaches to mental disease classification are being rejected. Continuous dimensional frameworks are instead being adopted and encouraged, to reflect evidence that mental health symptoms seem to be extreme and distressing variations in typical processes rather than indicative of categorical diagnoses [34].

While dual-factor approaches have sought to gain a more comprehensive view of child and adolescent mental health by capitalising on the benefits of wellbeing measures [2], they too have typically resorted to simplistic categorical approaches. Though a moderate relationship between psychopathology and wellbeing has been consistently demonstrated [3538], a focus has emerged which has emphasised their dissociation, forcing participants into one of four categories [15, 13, 14]. At either extreme, these are content and free of symptoms (flourishing), and dissatisfied and suffering symptoms (languishing). Also included, however, are the more surprising groups of individuals who are symptom-free and dissatisfied, and satisfied but symptomatic. This approach has demonstrated the important finding that absence of symptoms is not synonymous with the presence of wellbeing. However, it distracts from the known association between the two constructs, and finding that the majority of participants are straightforwardly either flourishing or languishing [15, 13, 14]. Nevertheless, wellbeing approaches do not appear to suffer from the outdated biases outlined above, and in young people there is also strong correspondence between different instruments and wellbeing subtypes, suggesting strong construct validity [10]. Given the association of mental health difficulties and wellbeing, the need for continuous approaches to mental health, and the relative strengths of wellbeing measures, there is therefore an opportunity to consider these outcomes together as part of a comprehensive structure.

Despite this, robust methods interrogating the measurement structure of wellbeing and mental health difficulties in early adolescence have yet to be employed, despite the existence of theoretical frameworks such as complete mental health, the two-continua approach, or the dual-factor model [2, 38, 39]. The current study addresses this major gap, building on research with adults and older adolescents [38, 40, 41].

Mental health difficulties and wellbeing

Wellbeing is typically considered to comprise positive (cognitive) evaluations of life, positive affect and the absence of negative affect [42]. These three aspects are typically considered to form hedonic wellbeing, while eudaimonic wellbeing captures aspects beyond pleasure, reflecting how well a person feels they align with their own values and ideals [43]. In young people, these different approaches to wellbeing have been shown to be highly related [10].

The present analysis draws on instruments designed for general population screening and will therefore focus on internalising and externalising symptoms. Though this means not all disorders and symptom-types are covered, this approach builds on previous research [7], provides insight into the two most common forms of mental health difficulties in childhood [8, 9], and is supported by evidence that broad internalising and externalising spectra can explain covariance across disorders [26].

Internalising is typically considered to include depressive and anxious type disorders and is therefore concerned with somatic, worry and sadness symptoms [26, 44]. There is, therefore, some conceptual crossover between this aspect of mental health difficulties and wellbeing, given that they are each is concerned with happiness or unhappiness. This can be seen in measures such as the General Health Questionnaire 12 (GHQ-12), which is sometimes considered to be a symptom measure, and sometimes a wellbeing instrument capturing negative affect [40, 45].

In children, externalising symptoms and disorders typically include conduct and attentional problems [46, 47]. Given the controversy surrounding attentional problems mentioned above, the current study focuses particularly on conduct problems. Though externalising symptoms often share comorbidity with internal distress symptoms, when considered alone these are behavioural and related to disinhibition [44].

Gender differences in child and adolescent mental health

The prevalence of disorders between genders is complex in each developmental period. Between ages 6 and 11 boys are up to twice as likely to suffer from severe mental health difficulties, but levels of internalising symptoms are similar [7, 8, 48]. However, between 11 and 14, girls are substantially more likely to suffer from internalising problems [6, 49]. Bifactor modelling has also yielded inconsistent results: While some research has suggested a general mental health factor was not associated with gender in early adolescence [28], a study with slightly older participants suggested it was [41]. The expression of mental health is therefore linked to gender in a complex way at the beginning of adolescence (around age 11), and warrants further investigation.

Wellbeing also shows consistent complex differences for gender, varying significantly by domain [10, 11]. Typically, girls show higher satisfaction with school and social relationships, while boys are happier with their appearance [11, 12]. Overall, wellbeing is higher for boys in some countries and for girls in others [11]. In the U.K., child and adolescent boys were shown to have higher overall happiness [12]. From a unidimensional perspective, this is incongruent with the finding in the same country that boys are at greater risk of mental health difficulties [48]. However, it perhaps echoes the finding that U.K. adolescent girls are at particular risk of depression [6, 49]. The complexity of gender relationships with mental health difficulties and wellbeing challenges assumptions of unipolarity, and suggests empirical evidence of their structure is needed.

Family income differences in child and adolescent mental health

Though country-level economic factors show no or very little association with children and adolescents’ wellbeing or mental health difficulties, household-level income is significantly associated with these outcomes [6, 10, 11, 48, 50]. While patterns for income are more straightforward than for gender, with children from poorer backgrounds reporting greater mental health difficulties and lower wellbeing, the extent to which income explains each outcome is quite different. Family income consistently more strongly predicts variability in mental health difficulties than wellbeing [6, 10, 11, 48, 50]. The existence of this relationship for both outcomes in varying strength, suggests their composite structure may provide insight into the role of income for mental health.

Problems with the existing dual-factor approach

When mental health difficulties and wellbeing are analysed independently (i.e. any covariance is not accounted for), they do appear to be somewhat distinct. For instance, longitudinal research suggests that, even among the minority who never experience mental disorder, over 20% have been found to report low life satisfaction [33]. Similarly, the two constructs have been found to have a discrete set of correlates, as well as some shared predictors in early adolescence [7]. It remains unclear, however, to what extent items for each construct overlap and tap similar dimensions. For instance, while Patalay et al. [7] aggregated internalising and externalising symptoms (likely only moderately correlated; see [47]), and then found the corresponding coefficient between mental health difficulties and wellbeing to be only -.20, Kinderman et al. [51] treated wellbeing and internalising psychopathology as related latent factors, and these were correlated at -.82. The conceptual overlap between internalising and wellbeing alluded to above may explain this discrepancy between correlations since though both referred to outcomes as mental ill health, Kinderman et al. [51] included only depression and anxiety.

Given that mental health difficulties and wellbeing are known to be correlated, [37, 38], it seems illogical not to control for this association. Furthermore, since results are likely biased, already suggested by Patalay and Fitzsimons’[7] surprisingly low correlation between the two constructs and dimensionality is assumed rather than tested, conclusions based on analyses ignoring the association of mental health difficulties and wellbeing should be treated with caution.

Problems with existing approaches to modelling mental health

The definitions above make clear that mental health difficulties represent a broad range of symptoms, some of which intuitively relate to wellbeing, and that these constructs show complex relationships with gender and income. Complex measurement models are already common in mental health research since high rates of comorbidity and correlations between items have led researchers to model symptoms or disorders together through bifactor structures, termed psychopathology or p-factor models [27]. These models have been used to argue for a general transdiagnostic factor and two studies have extended these to include wellbeing [40, 41]. Despite appropriately controlling for wellbeing, these studies have focused on older samples and age generalisability cannot be assumed [6, 10, 48]. These studies also have theoretical and methodological problems leaving many questions unanswered. For instance, the study by Böhnke et al. [40] was restricted since the measure used for mental health difficulties (the GHQ-12) has been argued by some to mainly capture negative affect [45]. Therefore the finding by Böhnke et al. [40] of a strong general factor explained almost entirely by GHQ-12 indicators is arguably unsurprising, since this measure could be expected to strongly mirror wellbeing instruments [10, 45].

While Böhnke et al. [40] studied adults in the general population, St Clair et al. [41] aimed to understand the structure of mental health in a sample of older adolescents and young adults. While symptom measures were included, these tended to be old, based on categorical diagnoses, or poorly validated [5255], and self-esteem was also included as a measure of positive mental health with no clear theoretical justification. This is therefore at odds with contemporary spectra approaches [26], and may explain why an arguably uninterpretable result emerged: The best fitting model was a bifactor solution, but items did not always load on both general and specific factors, some loadings were low and even reversed on specific factors, and crossloadings seemed to be allowed, such that wellbeing and self-esteem items were allowed to load on a shared positive factor as well as two separate specific factors. Eid et al. [56] point out that such problematic solutions can arise where bifactor models are misapplied, while the questionable choice of measures, unsupported by theory is likely to have contributed to the results outlined above. There is, therefore, a clear need to study the complex structure of mental health in adolescents using more appropriate measures.

Beyond these specific problems with dual-factor bifactor studies, there has recently been a great deal of criticism of bifactor modelling more generally, which the current study aims to address. Firstly, where there are correlations between all indicators, as is the case in mental health models, a general factor which accounts for this covariance will always occur, even where this pattern of covariance arises for another reason, such as network structures, where one symptom leads to another [57]. Secondly, bifactor structures are highly parameterised and tend to overfit the data such that sample and measure complexity (e.g. cross loadings and correlated residuals) can be absorbed by the general factor, making the bifactor structure apparently better fitting even when this is not the case [58]. Thirdly, though evaluating competing models is important to avoid selecting a model based on close fit alone, when others may be viable or better, model comparison between correlated factors, second-order and bifactor solutions as is typically conducted could lead to false conclusions [5759]. While these structures have substantially different interpretations, they are mathematically very close and sometimes even equivalent (depending on the number of factors). As a result, differences may not be attributable to superior structure, but instead be an artefact of the sample, unmodeled complexity or an alternative explanation for covariance such as mutualism in which problems co-occur [5759]. Relative fit of such models must therefore be interpreted with caution.

Recent criticisms have also proposed that the classical bifactor model (see Fig 1B) is not psychometrically well defined, since a single source of variability (the participants) is used to define a dual decomposition of a single score into two random variables, which ought to each have a distinct source of randomness [56]. This means that latent general and specific factors are unrelated while simultaneously being a function of the true score of the same indicators. Where these specific factors have substantial variance and salient loadings, these are therefore uninterpretable since they represent constructs that are wholly orthogonal to each other and the general factor, while this general factor simultaneously represents shared covariance [56, 60]. If we consider the general factor to represent liability for all symptoms, the residual specific factors must represent something wholly unrelated to the symptoms captured by the general factor [60]. On the other hand, if we consider a specific internalising factor to represent specific depressive, somatic and anxious symptomology, we must assume that the general factor does not include these in the same way. Given that both general and specific factors are generated from the same responses to the same item set, it is impossible to substantively distinguish these orthogonal true score variables as the constraints of the bifactor model require [56].

Fig 1. Confirmatory factor analysis model examples.

(A) Correlated factors model. (B) Classical bifactor model. (C) S-1 model.

In order to estimate a meaningful general factor that captures the covariance of all items, one specific factor can be removed [56]. This allows the general factor to become a function of the true score of the items with no specific factor, so that it can become well defined psychometrically as a random variable. The general factor in this model, known as S-1 (see Fig 1C), however, has a slightly different interpretation. For instance, if the specific wellbeing factor is removed (S-1wellbeing), the general factor represents general wellbeing accounting for the covariance of this construct with internalising and externalising items. The specific internalising and externalising factors, on the other hand, would represent the residual variance not explained in these items by the general wellbeing domain. We argue that this model should be considered, not only because it is statistically more robust than the classical bifactor model, but also because it provides an opportunity to generate an interpretable measurement structure in the presence of general covariance but not essential unidimensionality.

Despite such criticisms, some argue bifactor models can be successfully used when essential unidimensionality is supported, such that the specific factors represent noise (e.g. method factors) [59, 60]. Such a structure was found for mental health difficulties and wellbeing in adults [40], suggesting that this should be tested in adolescence (despite the potential noise introduced by GHQ-12 noted above). Furthermore, bifactor models provide a platform to examine dimensionality via a robust method, the Explained Common Variance (ECV) index [6163]. Though the question of dimensionality has underpinned much dual-factor research, this has yet to be statistically explored. However, for the reasons described above, and despite common practice [28, 41], we suggest that bifactor structures should not be accepted and interpreted merely based on model fit, especially when unidimensionality is not supported.

It has also been recently pointed out that measurement structures, such as bifactor models, should not be interpreted as evidence of broader construct validity, beyond measures employed [60]. The purpose of this study, however, is to demonstrate an example of models and methods needed, given that mental health difficulties and wellbeing are routinely used together as outcomes in adolescent research [25, 13]. We therefore aim to provide evidence of their measurement structure so that bias through failing to account for covariance, can be avoided, rather than to present a definitive structure.

The current study

On the basis of the evidence reviewed above, several predictions were made. Firstly, latent wellbeing would be correlated with latent mental health difficulties factors, particularly internalising, at moderate levels (hypothesis 1). This hypothesis was operationalised in a correlated factors model (see Fig 1A). Secondly, we predicted that a classical bifactor solution (see Fig 1B) would fit the data well, but that this would not be essentially unidimensional as found by Böhnke et al. [40], since we used more clearly dissociated measures, and research with adolescents has also suggested multidimensionality (hypothesis 2) [41]. Thirdly, if hypotheses one and two were supported, we predicted that an S-1wellbeing model (see Fig 1C) would provide a useful and robust structure to account for the covariance of mental health difficulties and wellbeing (hypothesis 3). This model would provide an indication of wellbeing corrected for symptoms. Finally, given that group differences have been noted across gender and income for both outcomes, we explored invariance and associations for the strongest model, based on a balance of psychometric rigor, interpretability and fit (hypothesis 4).


We conducted secondary analysis of baseline data from an evaluation of locally developed interventions designed to prevent mental health problems in young people from 12 areas of England (HeadStart) [64]. The University College London Research Ethics Committee granted ethical approval, and parental consent was given for early adolescents to complete the secure online surveys during their usual school day. Teachers read out an information sheet to pupils before these were completed. This emphasised pupils’ confidentiality and their right to withdraw.


A total of 1982 pupils in their final year of primary education (1051 male, 53%) were drawn from 59 schools in England. Pupils’ age ranged between 10.75 and 12.25 (M = 11.21, SD = .30). The sample was not drawn to be representative since it reflected the areas participating in the HeadStart programme. As such, statements of special educational needs were below average (1.3% compared to the national average of 2.8%), while those with registered additional needs not meeting the threshold for a statement was above the national average (21.7% compared to 15.4%) [65]. The percentage of participants from white, non-ethnic minority backgrounds was also slightly above the national average for primary schools (74% compared to 70%) [66], while the number of those exposed to a language at home other than English was similar (20% compared to 19%) [66]. In terms of deprivation, 24% of participants were eligible for free school meals (FSM) when data were collected. This is above the national average of 15.6% [66], but typical of U.K. early adolescents’ mental research in schools [67].


Self-report measures (see S1 Appendix) were used since at age 11 these are a valid indication of early adolescents’ internal perspectives [68]. Though externalising symptoms can be more accurately reported by a parent or teacher, internalising and wellbeing symptoms are considered to be more reliable from the child’s perspective [68]. Given that informant type may have an impact on the modelling structure and therefore act as a confound, the limitation of self-report for externalising was seen to be outweighed by the strength of using a single informant in the specific analysis conducted.

Mental health difficulties.

Mental health difficulties was measured through the Me and My School (M&MS; also referred to as Me and My Feelings) questionnaire, which consists of 10 internalising, and six externalising items [69]. This measure was designed to provide a similar screening function to the Strengths and Difficulties Questionnaire [70], but for a younger age range. Participants responded never, sometimes or always (coded one to three) to brief statements (e.g. “I worry a lot”). Possible scores therefore ranged from 10–30 for internalising and 6–18 for externalising, assuming no missing responses. M&MS has been found to be psychometrically robust, with good internal consistency (in 11–12 year-olds, externalising α = .80, internalising α = .77); concurrent validity, r = .67 - .70, for equivalent, and r = .22–24 for non-equivalent subscales of the Strengths and Difficulties Questionnaire; and good known-groups validity between clinical and non-clinical populations [71]. M&MS contains one reverse-coded item in the externalising subscale (item 14 “I am calm”).


Wellbeing was measured by the four-item Child Outcome Rating Scale (CORS) [72]. Four aspects (me, school, family and everything) were responded to by clicking on a smooth line between a happy and sad face. For online administration, this line was measured from 0–100, but then divided by 10 for analysis to match the paper version and facilitate model convergence. Possible scores therefore ranged between 0–10 for each item. CORS has been found to be psychometrically robust with good internal consistency (α = .84), test-retest reliability (r = .60), and concurrent validity (care-taker CORS, r = .63, care-taker Youth Outcome Questionnaire, r = -.43)[72]. These researchers also found good responsiveness and known-groups validity between clinical and non-clinical samples.

Family income.

Pupil FSM eligibility is captured in a number of ways in England [73]. In the current study, data were used on whether pupils had ever been eligible for FSM, rather than their current status, since transitions in and out of poverty as well as persistent and current poverty, have all been shown to be associated with child and adolescent mental health [50]. Of the sample, 43% (N = 860) had ever been eligible for FSM.


Survey data were collected in schools in spring 2015 through a secure online portal and subsequently matched to individual socio-demographic characteristics drawn from the National Pupil Database.

Statistical analysis

Confirmatory factor analysis (CFA) was conducted using Weighted Least Squares with Means and Variance adjustment (WLSMV) in Mplus 8.1. One exception to this was the CFA of the CORS instrument, for which robust maximum likelihood was used since all items were continuous. WLSMV was selected to account for the categorical nature of the M&MS measure [74], handle the substantial floor effects associated with screening measures [75], and because this estimator has been shown to produce minimal bias with clustered data [76]. In addition, correlated residuals, which are better handled by WLSMV [77], were of particular interest in the current study given the tendency of the classical bifactor model to absorb unmodeled complexity of this kind [58]. Finally, WLSMV is recommended where there are a large number of variables and factors, and sample size is large [77], as was the case in the current study.

Chi-square statistics are reported but not used to judge fit given their known sensitivity to sample size. The Comparative Fit Index (CFI), Tucker Lewis Index (TLI) and Root Mean Square Error of Approximation (RMSEA), and its 90% confidence interval (CI) are reported to indicate model fit, with values close to .95 for CFI and TLI, and .06 for RMSEA, typically interpreted as good fit [78]. However, given the overfitting problems associated with bifactor solutions, these indices were interpreted alongside the psychometric rigor of each model as well as other indices such as the ECV.

Evaluation of error variances.

Given the problems with not modelling correlated systematic error where this is indicated by modification indices and theoretically supported [58, 59], this was investigated in all instruments and solutions before final models were estimated. Individual CFAs of each instrument were therefore conducted in addition to the models shown in Fig 1, so that systematic error could be evaluated here as well. The evaluation of each instrument at this stage also allowed assessment of how well factors were indicated by items, via loadings. In addition to this we calculated Cronbach’s α as basic description of subscale reliability to further ensure all items were appropriate for subsequent analysis.

While in a strict sense bifactor modeling assumes zero error covariances, where this error is systematic (e.g. due to similar wording), the question of correlated errors is one that can be tested [79, 80]. Furthermore, while correlating error terms limits the causal power of the latent factor [81], dimensional covariance between measures was of interest in the current study rather than latent disorders. We therefore included correlated error terms in the current analysis, in line with Reise et al. [59].

Evaluation of mental health models.

Intra cluster correlations for indicator variables were calculated to assess non-independence due to sampling from schools. Since these were relatively low (.004-.067), clustering was accounted for using the type = complex option in Mplus, which adjusts the chi-square statistics and standard errors based on non-independence [82]. After estimating the models described in hypotheses 1–3, these were compared using chi-square difference testing: Each of the correlated factors and S-1 models were nested in the bifactor solution following Reise [83].

Explained common variance.

ECV represents a ratio of variance explained by the general factor to that explained by the specific factors, while the Percentage of Uncontaminated Correlations (PUC) provides the percentage of correlations that inform on the general factor relative to the specific factors [61]. When PUC is higher (more correlations relate to the general than the specific factors), less bias is introduced by misfitting a unidimensional structure to multidimensional data. High PUC in combination with moderate to high ECV suggests that though a bifactor, multidimensional structure fits well, there is a strong case for modelling the construct as unidimensional. This is because the general factor would account for most of the variance, and factor loadings in a unidimensional model would likely be very similar to those on the general factor [62]. Reise et al. [61] suggest that PUC > .80 and ECV > .60 may be sufficient to consider unidimensionality.

Group differences.

Gender and income measurement invariance were tested for the final model through multigroup CFA. To account for the categorical nature of the M&MS items, a three-step procedure was employed: This involved the estimation of baseline models in each subgroup separately; a configural measurement invariance model, where all loading, threshold and intercept parameters were freely estimated in both groups; and a scalar measurement invariance model where loadings and intercepts/thresholds were considered in tandem, and constrained to be equal across groups [84]. Model-based associations between latent mental health factors and gender and income were then explored via individual regression statements, rather than correlations, due to the categorical nature of the exogenous variables income and gender.


Preliminary analysis

Gender was available for every child, ever FSM eligibility was missing for .9% of the sample, while for M&MS and CORS items, missing data ranged from .6–2.6%. Data were assumed to be missing at random, due to absence on the day of data collection, error or omission of individual items, or lack of up-to-date records from the National Pupil Database. The trivial amount of missing data confirmed that results would likely not be negatively affected by using the limited information estimator WLSMV [77].

Descriptive statistics and correlations are presented in Table 1. As expected, observed wellbeing was moderately associated with both observed mental health difficulties domains, though not with gender or family income. Family income was also not significantly associated with internalising. Externalising symptoms were inversely related to being a girl, as expected.

Table 1. Descriptive statistics and bivariate correlations.

Evaluation of measurement models and correlated error variances


Although acceptable internal consistency was found for both M&MS subscales (externalising α = .776; internalising α = .792), preliminary CFA indicated a poor factor loading for one item (“I am shy”, λ = .291), which was consistent with other analyses [28, 69]. This item also had a low item total correlation (r = .257), and its removal improved internal consistency (α = .799). Furthermore, we felt this item could be interpreted as conceptually different from the others (see S1 Appendix), as it is the only one clearly linked to social functioning. The fit of the initial two-factor M&MS scale, χ2 (103) = 549.444, p < .001, RMSEA = .047 (90% CI = .043-.051), CFI = .955, TLI = .947, remained good following this item’s removal, χ2 (89) = 511.309, p < .001, RMSEA = .049 (90% CI = .045-.053), CFI = .958, TLI = .951.

Modification indices supported three pairs of correlated residuals between items with similar conceptual content and or wording. These were M&MS items 1 and 3: “I feel lonely” with “Nobody likes me”; M&MS items 5 and 6: “I worry when I am at school” with “I worry a lot”; and M&MS items 7 and 8 “I have problems sleeping” with “I wake up in the night”. The inclusion of these correlated error terms resulted in good model fit, χ2 (86) = 262.342, p < .001, RMSEA = .032 (90% CI = .028-.037), CFI = .983, TLI = .979, so this modified structure was taken forward.


While internal consistency for CORS was acceptable (α = .745), the model fit of a unidimensional structure was poor, χ2 (2) = 24.831, p < .001, RMSEA = .076 (90% CI = .51-.104), CFI = .976, TLI = .928. Modification indices supported the inclusion of one pair of correlated errors due to conceptual and wording overlap: CORS items 1 and 3 “how am I doing” with “how am I doing at school”. The inclusion of this error correlation substantially improved fit, χ2 (1) = 1.281, p = .258, RMSEA = .012 (90% CI = .000- .062), CFI = 1, TLI = .998, and was therefore taken forward.

Dual-factor mental health models

Hypothesis 1 was supported since the correlated factors model had excellent fit to the data (See Table 2), and significant loadings for all items (λ ≥ .43, see Fig 2). Furthermore, the estimated correlation between latent internalising and wellbeing was found to equal that between the two latent mental health difficulties dimensions (r = -.58). Latent externalising was also found to be substantially related to latent wellbeing, though to a lesser degree than was internalising (r = -.42).

Although these clear relationships were found between constructs, a unidimensional structure was not supported, as predicted in hypothesis two (PUC = .67, ECV = .55). The classical bifactor model did, however, show excellent fit to the data (see Table 2), and each item had at least one salient loading on the general or specific factor (see Fig 3). In addition to the lack of unidimensionality, inspection of the parameter estimates revealed further problems. Four internalising items had very low loadings on the specific factor (unhappy λ = .28; unliked λ = .15; sleep problems λ = .18; wakeup λ = .08), and the factor variance for internalising was also low compared to the externalising factor, which was on the same response scale (ξ = .13 versus ξ = .36). While it could be argued that internalising acted as a particularly good indicator of the general factor, we interpret this result in line with Eid et al. [56], and suggest that this is evidence of a vanishing factor, a result identified as consistent with the psychometric misspecification of classical bifactor solutions. Though the classical bifactor model therefore showed superior fit to other models estimated, it was rejected based on the ECV and disappearing internalising factor.

Contrary to hypothesis 3, the S-1wellbeing model was also rejected for a number of reasons. It showed inferior fit compared to the correlated model (which was less likely to overfit), the internalising factor remained relatively weak, consistent with the classical bifactor model, and the general wellbeing factor was more strongly defined by internalising than wellbeing items (see Fig 4). This suggested that general wellbeing covariance in mental health difficulties items was not a good representation of the data. In light of this, and the vanishing internalising factor found in the classical bifactor solution, post-hoc analysis of an S-1internalising model was conducted (see Fig 5). This model showed almost identical fit to the correlated factors model (see Table 2) and unlike the S-1wellbeing model, the general factor was this time most strongly defined by its unique items. The general factor in S-1internalising can therefore be interpreted as modelling general internalising distress (GID) that is tapped not only by items designed to do so, but also variance of this construct captured by externalising and wellbeing items.

Difference testing was conducted between models where possible (based on number of parameters and the Nesting and Equivalence Test, NET) [85]. Of the possible comparisons, the classical bifactor model was the best as expected. It has been suggested that comparisons between models of the types we explored here should be interpreted with caution due to mathematical closeness [57]. Indeed, fit statistics revealed the correlated factors and S-1internalising models to be extremely similar, though the latter appeared to be slightly worse based on qualitative inspection of fit statistics (this was necessary since the NET procedure revealed these models were not nested). Though the correlated factors model was therefore likely the best given its relative parsimony [74], and we recommend it be retained where possible in similar analysis, hypothesis 4 was considered in both correlated factors and S-1internalising models since each are useful for different scenarios (see discussion below).

Measurement invariance testing.

Invariance testing was therefore conducted on both of these models and results can be seen in Table 3. Partial measurement invariance was supported for gender in both models, with the items “I cry a lot” showing non-invariance in both, and the item “How am I doing at school” showing non-invariance in the correlated factors model. Full measurement invariance was supported for income in both models, though a small negative residual variance (-.14) was found for CORS4 (“How is everything going?”) in the ever FSM group for the S-1internalising model. This impossible result appeared to arise from the correlated error term between the CORS items “How am I doing?” and “How am I doing at school?”, which was retained in the model since it was significant and meaningful, r = .26. In line with Muthén [86], the residual variance of CORS4 was fixed to zero since this parameter was non-significant (p = .84), and fixing this to zero did not substantially change the model fit. Since full measurement invariance is frequently seen to be untenable [87], we interpreted these results as indicating that models functioned reasonably well across the groups studied.

In order to estimate the association of latent mental health factors with gender and income, non-invariant items were removed from both correlated factors and S-1internalising models [8890] . Their removal resulted in slightly better fitting models (correlated factors without non-invariant items, χ2 = 311.847*(113), RMSEA = .030, (90% CI = .026-.034) CFI = .978, TLI = .967; S-1internalising without non-invariant item, χ2 = 364.857*(121); RMSEA = .032 (90% CI = .028-.036); CFI = .973; TLI = .966 ) possibly due to removal of noise, and or the fact that CFI is known to be sensitive to the number of items [91]. For both models, wellbeing was not significantly associated with gender, internalising was modestly associated with being a girl, and externalising was substantially associated with being a boy (see Table 4). In line with the observed score correlations in Table 1, only externalising was significantly associated with low family income in either the correlated factors or S-1internalising models.

Table 4. Gender and income associations with mental health factors.


The aim of the current study was to further our understanding of the structure of mental health difficulties and wellbeing in early adolescence, using secondary data from a large U.K. sample (N = 1982). Despite existing theoretical frameworks (e.g., two-continua approach) [39], the robust analysis of the measurement structure of mental health difficulties and wellbeing, and especially in younger populations, has been lacking from the extant literature. Given recent limitations pertaining to common methodological approaches, such as bifactor modeling [5659], alternative methodologies were considered (ECV, S-1), and competing CFA models were estimated, which allowed for a more robust representation of the comprehensive mental health model.

Overall, unidimensionality was not supported in the current study. Instead, our results demonstrate that mental health difficulties and wellbeing are distinct but related constructs and should therefore be considered alongside each other within late childhood-early adolescent research. The simple correlated factors structure fitted the data well and revealed that wellbeing was just as related to internalising difficulties as this was to externalising symptoms. Despite the superior fit of the bifactor model, this was rejected in the current study, as the general factor explained only 55% of the total common variance. Results from the S-1 models further revealed that a general internalising distress factor could play an important role in all item responses. Partial gender and full income measurement invariance were established for the correlated and S-1internalizing models. However, given that the correlated model was the most parsimonious, with a slightly better fit than that of S-1internalizing, we considered that to be the most theoretically and statistically plausible model of comprehensive mental health.

In line with previous findings [38], medium to large latent correlations were observed between wellbeing and mental health difficulties domains. The present study, however, accounted for the known distinction between childhood internalising and externalising symptoms [47], rather than conflating these as has sometimes been the case [7]. This also enabled comparison of effect sizes for estimated correlations between all latent constructs in the correlated factors model and demonstrated that wellbeing was no more dissociated from mental health difficulties constructs than these were from one another. This strengthens the idea that wellbeing may be used to calibrate psychopathology scores [40], and provides clear justification for the inclusion of wellbeing in mental health models.

In contrast to previous research [28, 41], we did not accept the classical bifactor solution as the final model, despite its superior fit. Since the general factor explained only 55% of the total common variance, the classical bifactor model was substantively uninterpretable, and was therefore rejected. In other words, while some previous research has suggested symptoms of mental health difficulties and wellbeing could be considered a single continuum [40], in line with hypothesis 2 our findings did not support this. We found that when internalising, externalising and wellbeing were modelled together in a large sample of early adolescents, these constructs should be treated as distinct but related factors. As suggested earlier, our choice of M&MS as a mental health difficulties measure capturing more than just negative affect, and the age of our sample, are likely to have contributed to our contrasting results. It should also be noted that this lack of support for unidimensionality is somewhat consistent with research with older adolescents [41], though in contrast to this work, we followed recent criticisms and rejected the multidimensional bifactor solution [56, 60]. This was in part facilitated by our inclusion of the ECV, which had not been considered in mental health difficulties and wellbeing bifactor models previously, and reinforces the importance of not solely relying on model fit.

Insights from stochastic measurement theory also allowed models with better defined factors to be estimated [56]. Though our hypothesised S-1wellbeing model presented a poor fit, parameter estimates in the classical bifactor solution led to post-hoc analysis of an S-1internalising model which explained the data well. This post-hoc analysis was conducted since internalising appeared to be weakened as a specific factor in the classical bifactor and S-1wellbeing solutions, but showed strong loadings on the general factors in both models. In line with Eid et al. [56], we therefore considered a model in which specific internalising was removed, allowing internalising items to define the general factor. Since relatively stable general loadings were also observed across the classical bifactor and both S-1 models, GID covariance may have been responsible for each of these models’ general factors. Moreover, in the S-1wellbeing model the strongest loadings on the general factor were seen for internalising, rather than wellbeing items as would be expected. Statistical comparison was not possible between the correlated factors and S-1internalising models, and in fact it has been suggested anyway that comparison of such models is problematic, due to their mathematical closeness [57]. Nevertheless, the correlated model appeared to have slightly better fit than the S-1internalising model, and since this was the simpler solution, we suggest that this should be preferred where possible.

This is not say, however, that the S-1internalising model is inadmissible, as such a model would be able to address certain research questions unanswerable by the correlated factors solution. For instance, where the specific role of external correlates is of interest for particular mental health domains, as explored by Patalay et al. [7], S-1internalising would allow researchers to estimate the effects of these on GID, externalising behaviour and wellbeing separately, while controlling for each of the other outcomes. While S-1internalising was considered less optimal, particularly since it had more parameters, in combination with the other models and ECV results, it provides further insight into previous research. For this reason, our discussion focuses on the interpretation of both the correlated and S-1internalising models.

For instance, together, our models shed light on previous findings relating to internalising. Specifically, externalising and wellbeing group factors have tended to show substantial loadings after accounting for a general factor, whereas internalising loadings have behaved differently, becoming small, sometimes insignificant, and even negative on occasion [28, 40, 41]. The S-1internalising model could clarify this since it represents the influence of a latent internalising trait on responses to all mental health difficulties and wellbeing items. Such a structure could therefore underlie other bifactor solutions, since the consistent presence of relatively weak specific internalising suggests that this could be defining other general factors found [28, 40, 41, 56].

Theoretically GID is also consistent with the wider literature, since some of the covariance with wellbeing could be explained by the conceptual overlap (e.g. happiness and unhappiness). Covariance with externalising, on the other hand could reflect known comorbidity, which is thought to arise for a number of complex reasons, including method factors as well as cascading or predisposing effects [20, 92, 93]. Previous research has often combined internalising and externalising symptoms when considering the relationship of mental health difficulties to wellbeing [1, 3, 13]. However, our study suggests this may be problematic since both overlap and dissociation between constructs was found. It is possible that overlap at the latent level explains response patterns, and that dimensions such as those we propose should be considered rather than summed scores. While some research has categorised young people according to flourishing, languishing, etc., latent dimensional approaches could yield different results. For instance, in the S-1internalising model it is possible that those with considerable GID show tendencies towards languishing, while those with behavioural externalising symptoms, separate from distress, could show higher wellbeing. A symptomatic but content group could therefore arise under circumstances in which the behavioural aspect of externalising is tapped as psychopathology in early adolescents who are not distressed, and therefore in turn report high wellbeing.

The estimation of both S-1 models in the current study, in combination with the calculation of ECV in the bifactor model, clarified the covariance structure of the items. This is namely that just over half of all common variance could be explained by a classical general factor, but that this is likely due to shared internalising variance across all items. While the current study draws on a relatively new area of work [56], current findings support the wider utility of S-1 models. These have not only addressed some of the concerns raised around bifactor modeling [56, 60], but also added substantive theoretical insight.

Having explored the covariance structure of mental health domains, our final aim was to shed light on their complex relationships with gender and family income. Externalising symptoms are often associated with boys, and emphasis tends to be on girls reporting higher internalising symptoms because of elevated rates in later adolescence [6, 49]. However, there is evidence that internalising symptoms also play an important role in boys’ psychopathology and externalising symptoms [67, 93]. For instance, initial lower levels of internalising were shown to predict lower levels of externalising at a later time point in both boys and girls [67].

Consistent with these studies, our results suggest only a weak association of internalising distress with gender in early adolescence. For both the correlated and S-1internalising models internalising (at the specific level for the former, and global GID level for the latter) showed a small association with being a girl. Therefore, when specific externalising behaviour (not associated with GID) was accounted for in the S-1internalising model, girls still showed only slightly higher levels of GID than boys. Similarly when the effect of latent internalising on externalising item responses was accounted for, the association of being a boy with externalising behaviour was notably much larger. This therefore suggests that while behavioural problems were associated with being male, this was particularly the case after controlling for GID. Furthermore, when poor behaviour (not associated with distress) was accounted for, girls still showed only slightly higher levels of internalising distress than boys. An alternative explanation for this finding could be that externalising psychopathology is entirely distinct from internalising, and remained associated with being a boy for this reason. However, five of the six externalising items had salient loadings on the GID factor (λ = .38-.64), suggesting that these items were well defined by GID, and these constructs were therefore not entirely separate.

As with gender, the associations found in the current study between mental health factors and income advance previous work which treated these factors as a single variable [7]. It was unsurprising that wellbeing did not show significant associations with low income [10]. However, it was more unexpected that only externalising was significantly and substantially related to this outcome [7], though similar conduct and emotional domains have shown stronger associations to income for the former than the latter [50]. The discrepancy in significance may therefore be due to the use of a larger sample by Fitzsimons et al. [50].

Beyond the benefits of adding S-1 models to understand covariance and relationships to key outcomes, the modeling approach was also strengthened by the inclusion of correlated errors. These were included to avoid overfitting in an entirely locally independent bifactor model, such that covariance beyond specific latent constructs would be absorbed by the general factor [58, 59]. These were carefully evaluated according to item content, wording and modification indices. Though inclusion of such parameters weakens the causal power of the latent trait, it is untenable to assume no relationship between conceptually similar items such as “I have problems sleeping” and “I wake up in the night” [81]. While CFA was used, the current study was somewhat exploratory, investigating the dimensionality of mental health difficulties and wellbeing, therefore allowing for relationships beyond hypothesised factors. In addition, consistent with recent calls [34], our analysis was focused at a symptom level. It therefore did not assume causal disorders, but rather considered the covariance structure of items. Nevertheless, it remains important to understand that there are associations between items beyond the latent traits modelled. As stated previously, the analysis of comprehensive mental health put forward here is not an attempt to conceptualise a definitive structure of “positive” and “ill” mental health. If such an approach were adopted, the violation of local independence would be potentially more serious in our view. Rather, our hypotheses, findings and discussion were designed to interrogate measurement assumptions routinely made for these outcomes in research with young people.

It is clear that epidemiological measures, such as those used here, can be problematic in terms of item content for local independence assumptions. While some would argue that alternative approaches to latent trait models should therefore be adopted, we feel that the robust analysis of dimensionality and covariance provided here was a key first step, before further exploration or alternative approaches considering mental health difficulties and wellbeing items together could be employed. If strong relationships between constructs had not been found in the present analysis, there would be little value in further study. It could be argued that analysis of the kind we have presented should have been employed even sooner, before analysis of correlates was considered. Our critical review of the literature and findings also suggest that categorical treatment of these outcomes can be problematic, and does not appear to be a good representation of the data. This reinforces that previous treatment of the outcomes as such [15, 13, 14] may lead to false conclusions.

However, it should be noted that the latent trait account we have offered may not be the only reason items covaried as they did, and that other approaches such as network analysis should be considered in future [94]. It has also been demonstrated that complex bifactor solutions can overfit data when these account for unusual response patterns [59]. Estimating the percentage of respondents who fit the model to ascertain whether complex solutions account for a minority implausible response patterns as Reise et al. [59] did, would also be pertinent to dual-factor research, given the consistent finding that a minority are neither flouring nor languishing [15, 13, 14].

This was the first study to our knowledge to empirically explore the structure of latent mental health difficulties and wellbeing in early adolescence. Furthermore, we employed more appropriate measures and robust approaches to bifactor modelling than those commonly used [40, 41]. Unidimensionality was not supported, but clear justification was found for the inclusion of wellbeing in mental health models, and GID was found to explain responses to all items at a salient level. This study therefore draws together and improves on school psychology dual-factor [15, 13, 14], and mental health bifactor research [27, 28, 30]. While the former has tended to categorically dichotomise mental health difficulties and wellbeing, and therefore lose important information [34], the latter has generally failed to account for the statistical properties of bifactor models, leading to potentially misleading conclusions [56].

Despite the use of rigorous methodology, several limitations should also be acknowledged. Firstly, the exploration of any construct is tied to the measures used, and results will inevitably vary by instrument, as already seen in the contrast between the present study and that by Böhnke et al. [40]. Though well-validated instruments were selected, replication studies should consider employing alternative measures. Similarly, constructs were assessed via self-report measures for feasibility and design reasons and as already noted, externalizing symptoms may be more accurate when reported by an adult. However, wellbeing and internalizing symptoms are likely more valid from the young person’s perspective [68]. Informant reports are also limited in that the informant (e.g. parent, teacher) typically only observes the adolescent in a single context [95]. Use of mixed informants would also likely have acted as a confound since self and informant ratings are often only weakly or moderately correlated, particularly for children and adolescents [9698]. Though the sample size was substantial and met the recommended minimum N:q ratio (at 25.7:1), future research, particularly if more complex structural predictive components are added, should consider Monte Carlo simulations for decisions on sample size [99]. The representativeness of the sample may also be considered a limitation since poorer adolescents were overrepresented, though as stated previously, rates here were comparable to other U.K. school-based mental health research. FSM eligibility has also been criticised as a measure of socioeconomic status and proxy for family income [100], and though efforts were made to mitigate this through the use of everFSM, future studies should consider including more accurate and comprehensive measures of family income. Finally, this study used the relatively new ECV and PUC indices. While some thresholds have been recommended for these [61], further research is needed to confirm their accuracy.


In the first study of its kind, early adolescents’ comprehensive mental health was explored using a large sample and robust analytical strategy. Previous research in mental health and school psychology has been extended, with our results clarifying how general factors may arise, through thorough investigation via the ECV and S-1 models. Clear correspondence was found between internalising and externalising symptoms, and wellbeing, and evidence suggested common GID variance was meaningfully predictive of responses to all items. This research therefore offers insight into comorbidity and dual-factor response patterns, since it suggests that common internalising may contribute across mental health domains. Given the problems with bifactor modelling in previous research, and categorical approaches often taken, our analysis provides the first robust platform from which relationships between wellbeing and mental health difficulties domains can be explored further.

Supporting information

S1 Appendix. Items of Me and My School and Child Outcome Rating Scale questionnaires.



The data used in this study were collected as part of the HeadStart learning programme. The authors are therefore grateful for the work of the wider research teams at the Anna Freud Centre and the University of Manchester for their role in coordinating the evaluation, as well as collecting and managing the data. The authors also acknowledge the National Pupil Database from which demographic data were obtained. Finally, we are extremely grateful to all students who took part in this study, as well as the local authorities and schools for their help in recruiting them.


  1. 1. Antaramian SP, Scott Huebner E, Hills KJ, Valois RF. A Dual-Factor Model of Mental Health: Toward a More Comprehensive Understanding of Youth Functioning. American Journal of Orthopsychiatry. 2010;80(4):462–72. pmid:20950287
  2. 2. Greenspoon PJ, Saklofske DH. Toward an Integration of Subjective Well-Being and Psychopathology. Social Indicators Research. 2001;54(1):81–108.
  3. 3. Lyons MD, Huebner ES, Hills KJ. The Dual-Factor Model of Mental Health: A Short-Term Longitudinal Study of School-Related Outcomes. Social Indicators Research. 2013;114(2):549–65.
  4. 4. Suldo S, Thalji A, Ferron J. Longitudinal academic outcomes predicted by early adolescents’ subjective well-being, psychopathology, and mental health status yielded from a dual factor model. The Journal of Positive Psychology. 2011;6(1):17–30.
  5. 5. Suldo S, Thalji-Raitano A, Kiefer SM, Ferron JM. Conceptualizing High School Students' Mental Health Through a Dual-Factor Model. School Psychology Review. 2016;45(4):434–57.
  6. 6. Patalay P, Fitzsimons E. Mental ill-health among children of the new century: trends across childhood with a focus on age 14. September 2017. London: Centre for Longitudinal Studies; 2017.
  7. 7. Patalay P, Fitzsimons E. Correlates of Mental Illness and Wellbeing in Children: Are They the Same? Results From the UK Millennium Cohort Study. J Am Acad Child Adolesc Psychiatry. 2016;55(9):771–83. pmid:27566118
  8. 8. Kovess-Masfety V, Husky MM, Keyes K, Hamilton A, Pez O, Bitfoi A, et al. Comparing the prevalence of mental health problems in children 6–11 across Europe. Social Psychiatry and Psychiatric Epidemiology. 2016;51(8):1093–103. pmid:27314494
  9. 9. Polanczyk GV, Salum GA, Sugaya LS, Caye A, Rohde LA. Annual Research Review: A meta-analysis of the worldwide prevalence of mental disorders in children and adolescents. Journal of Child Psychology and Psychiatry. 2015;56(3):345–65. pmid:25649325
  10. 10. Bradshaw J, Rees G. Exploring national variations in child subjective well-being. Children and Youth Services Review. 2017;80:3–14.
  11. 11. Dinisman T, Ben-Arieh A. The Characteristics of Children’s Subjective Well-Being. Social Indicators Research. 2016;126(2):555–69.
  12. 12. Pople L, Society TCs, Rees G. <the-good-childhood-report-2017_full-report_0.pdf>. 2017. p. 1–64.
  13. 13. Lyons MD, Huebner ES, Hills KJ, Shinkareva SV. The Dual-Factor Model of Mental Health:Further Study of the Determinants of Group Differences. Canadian Journal of School Psychology. 2012;27(2):183–96.
  14. 14. Suldo S, Shaffer EJ. Looking Beyond Psychopathology: The Dual-Factor Model of Mental Health in Youth. School Psychology Review. 2008;37(1):52–68.
  15. 15. Jones PB. Adult mental health disorders and their age at onset. British Journal of Psychiatry. 2013;202(s54):s5–s10.
  16. 16. Sawyer SM, Azzopardi PS, Wickremarathne D, Patton GC. The age of adolescence. The Lancet Child & Adolescent Health. 2018;2(3):223–8.
  17. 17. Patton GC, Sawyer SM, Santelli JS, Ross DA, Afifi R, Allen NB, et al. Our future: a Lancet commission on adolescent health and wellbeing. The Lancet. 2016;387(10036):2423–78.
  18. 18. Dahl RE, Allen NB, Wilbrecht L, Suleiman AB. Importance of investing in adolescence from a developmental science perspective. Nature. 2018;554:441. pmid:29469094
  19. 19. Kinderman P, Sellwood W, Tai S. Policy implications of a psychological model of mental disorder. Journal of Mental Health. 2009;17(1):93–103.
  20. 20. Krueger RF, Markon KE. Reinterpreting Comorbidity: A Model-Based Approach to Understanding and Classifying Psychopathology. Annual Review of Clinical Psychology. 2006;2(1):111–33.
  21. 21. Carragher N, Krueger RF, Eaton NR, Slade T. Disorders without borders: current and future directions in the meta-structure of mental disorders. Social Psychiatry and Psychiatric Epidemiology. 2015;50(3):339–50. pmid:25557024
  22. 22. Moncrieff J, Timimi S. The social and cultural construction of psychiatric knowledge: an analysis of NICE guidelines on depression and ADHD. Anthropology & Medicine. 2013;20(1):59–71.
  23. 23. Sondeijker FEPL Ferdinand RF, Oldehinkel AJ, Veenstra R, De Winter AF, Ormel J, et al. Classes of adolescents with disruptive behaviors in a general population sample. Social Psychiatry and Psychiatric Epidemiology. 2005;40(11):931–8. pmid:16222441
  24. 24. van Lier PAC, Verhulst FC, van der Ende J, Crijnen AAM. Classes of disruptive behaviour in a sample of young elementary school children. Journal of Child Psychology and Psychiatry. 2003;44(3):377–87. pmid:12635967
  25. 25. de Nijs PFA, van Lier PAC, Verhulst FC, Ferdinand RF. Classes of Disruptive Behavior Problems in Referred Adolescents. Psychopathology. 2007;40(6):440–5. pmid:17709974
  26. 26. Forbes MK, Tackett JL, Markon KE, Krueger RF. Beyond comorbidity: Toward a dimensional and hierarchical approach to understanding psychopathology across the life span. Development and Psychopathology. 2016;28(4pt1):971–86. pmid:27739384
  27. 27. Caspi A, Houts RM, Belsky DW, Goldman-Mellor SJ, Harrington H, Israel S, et al. The p Factor:One General Psychopathology Factor in the Structure of Psychiatric Disorders? Clinical Psychological Science. 2014;2(2):119–37. pmid:25360393
  28. 28. Patalay P, Fonagy P, Deighton J, Belsky J, Vostanis P, Wolpert M. A general psychopathology factor in early adolescence. Br J Psychiatry. 2015;207(1):15–22. pmid:25906794
  29. 29. Carragher N, Teesson M, Sunderland M, Newton NC, Krueger RF, Conrod PJ, et al. The structure of adolescent psychopathology: a symptom-level analysis. Psychological Medicine. 2015;46(5):981–94. pmid:26620582
  30. 30. Castellanos-Ryan N, Brière FN, O'Leary-Barrett M, Banaschewski T, Bokde A, Bromberg U, et al. The structure of psychopathology in adolescence and its common personality and cognitive correlates. Journal of Abnormal Psychology. 2016;125(8):1039–52. pmid:27819466
  31. 31. Tackett JL, Lahey BB, van Hulle C, Waldman I, Krueger RF, Rathouz PJ. Common genetic influences on negative emotionality and a general psychopathology factor in childhood and adolescence. Journal of Abnormal Psychology. 2013;122(4):1142–53. pmid:24364617
  32. 32. Waldman ID, Poore HE, van Hulle C, Rathouz PJ, Lahey BB. External validity of a hierarchical dimensional model of child and adolescent psychopathology: Tests using confirmatory factor analyses and multivariate behavior genetic analyses. Journal of abnormal psychology. 2016;125(8):13.
  33. 33. Schaefer JD, Caspi A, Belsky DW, Harrington H, Houts R, Horwood LJ, et al. Enduring mental health: Prevalence and prediction. Journal of abnormal psychology. 2017;126(2):212. pmid:27929304
  34. 34. Krueger RF, Kotov R, Watson D, Forbes M, Eaton N, Ruggero C, et al. Progress in achieving quantitative classification of psychopathology. World Psychiatry. in press.
  35. 35. Ravens-Sieberer U, Gosch A, Rajmil L, Erhart M, Bruil J, Power M, et al. The KIDSCREEN-52 Quality of Life Measure for Children and Adolescents: Psychometric Results from a Cross-Cultural Survey in 13 European Countries. Value in Health. 2008;11(4):645–58. pmid:18179669
  36. 36. Clarke A, Friede T, Putz R, Ashdown J, Martin S, Blake A, et al. Warwick-Edinburgh Mental Well-being Scale (WEMWBS): Validated for teenage school students in England and Scotland. A mixed methods assessment. BMC Public Health. 2011;11(1):487.
  37. 37. Spinhoven P, Elzinga BM, Giltay E, Penninx BWJH. Anxious or Depressed and Still Happy? PLOS ONE. 2015;10(10):e0139912. pmid:26461261
  38. 38. Keyes CLM. Mental Illness and/or Mental Health? Investigating Axioms of the Complete State Model of Health. Journal of Consulting and Clinical Psychology. 2005;73(3):539–48. pmid:15982151
  39. 39. Westerhof GJ, Keyes CLM. Mental Illness and Mental Health: The Two Continua Model Across the Lifespan. Journal of Adult Development. 2010;17(2):110–9. pmid:20502508
  40. 40. Böhnke JR, Croudace TJ. Calibrating well-being, quality of life and common mental disorder items: psychometric epidemiology in public mental health research. The British Journal of Psychiatry. 2016;209(2):162–8. pmid:26635327
  41. 41. St Clair MC, Neufeld S, Jones PB, Fonagy P, Bullmore ET, Dolan RJ, et al. Characterising the latent structure and organisation of self-reported thoughts, feelings and behaviours in adolescents and young adults. PLOS ONE. 2017;12(4):e0175381. pmid:28403164
  42. 42. Diener E. Subjective well-being. The science of happiness and a proposal for a national index. The American psychologist. 2000;55(1):34–43. pmid:11392863
  43. 43. Ryan RM, Deci EL. On Happiness and Human Potentials: A Review of Research on Hedonic and Eudaimonic Well-Being. Annual Review of Psychology. 2001;52(1):141–66.
  44. 44. American Psychiatric Association. Diagnostic and Statistical Manual of Mental Disorders (DSM-5®). Washington, UNITED STATES: American Psychiatric Publishing; 2013.
  45. 45. Vanhoutte B. The Multidimensional Structure of Subjective Well-Being In Later Life. Journal of Population Ageing. 2014;7(1):1–20. pmid:25089162
  46. 46. Achenbach TM, Edelbrock CS. Psychopathology of Childhood. Annual Review of Psychology. 1984;35(1):227–56.
  47. 47. Goodman A, Lamping DL, Ploubidis GB. When to Use Broader Internalising and Externalising Subscales Instead of the Hypothesised Five Subscales on the Strengths and Difficulties Questionnaire (SDQ): Data from British Parents, Teachers and Children. Journal of Abnormal Child Psychology. 2010;38(8):1179–91. pmid:20623175
  48. 48. Gutman L, Joshi H, Parsonage M, Schoon I. Children of the new century. Mental health findings from the Millennium Cohort Study London: Centre for Mental Health2015.
  49. 49. Fink E, Patalay P, Sharpe H, Holley S, Deighton J, Wolpert M. Mental Health Difficulties in Early Adolescence: A Comparison of Two Cross-Sectional Studies in England From 2009 to 2014. Journal of Adolescent Health. 2015;56(5):502–7. pmid:25907650
  50. 50. Fitzsimons E, Goodman A, Kelly E, Smith JP. Poverty dynamics and parental mental health: Determinants of childhood mental health in the UK. Social Science & Medicine. 2017;175:43–51.
  51. 51. Kinderman P, Schwannauer M, Pontin E, Tai S. Psychological Processes Mediate the Impact of Familial Risk, Social Circumstances and Life Events on Mental Health. PLOS ONE. 2013;8(10):e76564. pmid:24146890
  52. 52. Bamber D, Tamplin A, Park RJ, Kyte ZA, Goodyer IM. Development of a Short Leyton Obsessional Inventory for Children and Adolescents. Journal of the American Academy of Child & Adolescent Psychiatry. 2002;41(10):1246–52.
  53. 53. Raine A. The SPQ: a scale for the assessment of schizotypal personality based on DSM-III-R criteria. Schizophrenia bulletin. 1991;17(4):555. pmid:1805349
  54. 54. Reynolds CR, Richmond BO. What i think and feel: A revised measure of children's manifest anxiety. Journal of Abnormal Child Psychology. 1978;6(2):271–80. pmid:670592
  55. 55. Horwood J, Salvi G, Thomas K, Duffy L, Gunnell D, Hollis C, et al. IQ and non-clinical psychotic symptoms in 12-year-olds: results from the ALSPAC birth cohort. British Journal of Psychiatry. 2008;193(3):185–91. pmid:18757973
  56. 56. Eid M, Geiser C, Koch T, Heene M. Anomalous results in G-factor models: Explanations and alternatives. Psychological Methods. 2017;22(3):541–62. pmid:27732052
  57. 57. van Bork R, Epskamp S, Rhemtulla M, Borsboom D, van der Maas HLJ. What is the p-factor of psychopathology? Some risks of general factor modeling. Theory & Psychology. 2017;27(6):759–73.
  58. 58. Murray AL, Johnson W. The limitations of model fit in comparing the bi-factor versus higher-order models of human cognitive ability structure. Intelligence. 2013;41(5):407–22.
  59. 59. Reise SP, Kim DS, Mansolf M, Widaman KF. Is the Bifactor Model a Better Model or Is It Just Better at Modeling Implausible Responses? Application of Iteratively Reweighted Least Squares to the Rosenberg Self-Esteem Scale. Multivariate Behavioral Research. 2016;51(6):818–38. pmid:27834509
  60. 60. Bonifay W, Lane SP, Reise SP. Three Concerns With Applying a Bifactor Model as a Structure of Psychopathology. Clinical Psychological Science. 2017;5(1):184–6.
  61. 61. Reise SP, Scheines R, Widaman KF, Haviland MG. Multidimensionality and Structural Coefficient Bias in Structural Equation Modeling:A Bifactor Perspective. Educational and Psychological Measurement. 2013;73(1):5–26.
  62. 62. Rodriguez A, Reise SP, Haviland MG. Applying Bifactor Statistical Indices in the Evaluation of Psychological Measures. Journal of Personality Assessment. 2016;98(3):223–37. pmid:26514921
  63. 63. Ten Berge JMF, Sočan G. The greatest lower bound to the reliability of a test and the hypothesis of unidimensionality. Psychometrika. 2004;69(4):613–25.
  64. 64. Anna Freud Centre. HeadStart Pilot Evaluation [internet]. London: Anna Freud Centre; n.d. [Available from:
  65. 65. Department for Education. Special educational needs in England: January 2015. 2015.
  66. 66. Department for Education. Schools, pupils and their characteristics: January 2015. 2015.
  67. 67. Panayiotou M, Humphrey N. Mental health difficulties and academic attainment: Evidence for gender-specific developmental cascades in middle childhood. Development and Psychopathology. 2017:1–16.
  68. 68. Humphrey N, Wigelsworth M. Making the case for universal school-based mental health screening. Emotional and Behavioural Difficulties. 2016;21(1):22–42.
  69. 69. Deighton J, Tymms P, Vostanis P, Belsky J, Fonagy P, Brown A, et al. The Development of a School-Based Measure of Child Mental Health. Journal of Psychoeducational Assessment. 2013;31(3):247–57. pmid:25076806
  70. 70. Goodman R. The Strengths and Difficulties Questionnaire: A Research Note. Journal of Child Psychology and Psychiatry. 1997;38(5):581–6. pmid:9255702
  71. 71. Patalay P, Deighton J, Fonagy P, Vostanis P, Wolpert M. Clinical validity of the Me and My School questionnaire: a self-report mental health measure for children and adolescents. Child and Adolescent Psychiatry and Mental Health. 2014;8(1):17.
  72. 72. Duncan B, Sparks J, Miller S, Bohanske R, Claud D. Giving Youth a Voice: A Preliminary Study of the Reliability and Validity of a Brief Outcome Measure for Children, Adolescents, and Caretakers. Journal of Brief Therapy. 2006;5(2):71–88.
  73. 73. Gorard S. A cautionary note on measuring the pupil premium attainment gap in England. British journal of education, society and behavioural science. 2016;14(2):1–8.
  74. 74. Brown TA. Confirmatory factor analysis for applied research: Guilford Publications; 2015.
  75. 75. Li C-H. Confirmatory factor analysis with ordinal data: Comparing robust maximum likelihood and diagonally weighted least squares. Behavior Research Methods. 2016;48(3):936–49. pmid:26174714
  76. 76. Hox J, Maas C, Brinkhuis M. The effect of estimation method and sample size in multilevel structural equation modeling. Statistica Neerlandica. 2010;64(2):157–70.
  77. 77. Mutheén BO, Mutheén LK, Asparouhov T. Estimator choices with categorical outcomes. Mplus and Mplus2015.
  78. 78. Lt Hu, Bentler PM. Cutoff criteria for fit indexes in covariance structure analysis: Conventional criteria versus new alternatives. Structural Equation Modeling: A Multidisciplinary Journal. 1999;6(1):1–55.
  79. 79. Raykov T, Marcoulides GA. Introduction to psychometric theory: Routledge; 2011.
  80. 80. Raykov T, Marcoulides GA, Patelis T. The Importance of the Assumption of Uncorrelated Errors in Psychometric Theory. Educational and Psychological Measurement. 2015;75(4):634–47. pmid:29795836
  81. 81. Cramer AOJ, Sluis S, Noordhof A, Wichers M, Geschwind N, Aggen SH, et al. Measurable Like Temperature or Mereological Like Flocking? On the Nature of Personality Traits. European Journal of Personality. 2012;26(4):451–9.
  82. 82. Mutheén LK, Mutheén BO. Mplus User’s Guide. Eighth Edition. Los Angeles, CA: Mutheén & Mutheén; 1998–2017.
  83. 83. Reise SP. The Rediscovery of Bifactor Measurement Models. Multivariate Behavioral Research. 2012;47(5):667–96. pmid:24049214
  84. 84. Bowen NK, Masa RD. Conducting Measurement Invariance Tests with Ordinal Data: A Guide for Social Work Researchers. Journal of the Society for Social Work and Research. 2015;6(2):229–49.
  85. 85. Asparouhov T, Mutheén BO. Nesting and Equivalence Testing in Mplus [internet]. 2018 [1–17]. Available from:
  86. 86. Mutheén LK. Negative Residual Variance [internet]. 2007 [Available from:
  87. 87. Byrne BM, Shavelson RJ, Muthén B. Testing for the equivalence of factor covariance and mean structures: The issue of partial measurement invariance. Psychological Bulletin. 1989;105(3):456–66.
  88. 88. Millsap RE, Kwok OM. Evaluating the impact of partial factorial invariance on selection in two populations. Psychol Methods. 2004;9(1):93–115. pmid:15053721
  89. 89. Sass DA. Testing Measurement Invariance and Comparing Latent Factor Means Within a Confirmatory Factor Analysis Framework. Journal of Psychoeducational Assessment. 2011;29(4):347–63.
  90. 90. Cheung GW, Rensvold RB. Testing Factorial Invariance across Groups: A Reconceptualization and Proposed New Method. Journal of Management. 1999;25(1):1–27.
  91. 91. Kenny DA, McCoach DB. Effect of the Number of Variables on Measures of Fit in Structural Equation Modeling. Structural Equation Modeling: A Multidisciplinary Journal. 2003;10(3):333–51.
  92. 92. Lilienfeld SO. Comorbidity Between and Within Childhood Externalizing and Internalizing Disorders: Reflections and Directions. Journal of Abnormal Child Psychology. 2003;31(3):285–91. pmid:12774861
  93. 93. Moilanen KL, Shaw DS, Maxwell KL. Developmental cascades: Externalizing, internalizing, and academic competence from middle childhood to early adolescence. Development and Psychopathology. 2010;22(3):635–53. pmid:20576184
  94. 94. Borsboom D, Cramer AOJ. Network Analysis: An Integrative Approach to the Structure of Psychopathology. Annual Review of Clinical Psychology. 2013;9(1):91–121.
  95. 95. Marsh JK, De Los Reyes A. Explaining away disorder: The influence of context on impressions of mental health symptoms. Clinical Psychological Science. 2018;6(2): 189–202.
  96. 96. De Los Reyes A, Augenstein TM, Wang M, Thomas SA, Drabick DAG, Burgers DE, Rabinowitz J. The validity of the multi-informant approach to assessing child and adolescent mental health. Psychological Bulletin. 2015;141(4):858–900. pmid:25915035
  97. 97. Patalay P, Fitzsimons E. Mental ill-health among children of the new century: Trends across childhood with a focus on age 14. London: Centre for Longitudinal Studies; 2017.
  98. 98. Patalay P, Fitzsimons E. Development and predictors of mental ill-health and wellbeing from childhood to adolescence. Social Psychiatry and Psychiatric Epidemiology. 2018;53(12):1311–1323. pmid:30259056
  99. 99. Kline R. Principles and practice of structural equation modeling Fourth Edition. New York: The Guilford Press; 2015.
  100. 100. Ilie S, Sutherland A, Vignoles A. Revisiting free school meal eligibility as a proxy for pupil socio‐economic deprivation. British Educational Research Journal. 2017;43(2):253–74.