Figures
Abstract
Purpose
This study aimed to understand whether incorporating wellbeing as another dimension within general factor models of mental health is (a) feasible and (b) useful.
Methods
Data from two time points (Year 7 and Year 9) for 15258 adolescents who participated in the HeadStart programme in England was used. In Stage 1, we used structural equation modelling on time point 1 data to test different latent variable models incorporating psychopathology and wellbeing dimensions. In Stage 2, we tested whether the latent factors identified in Stage 1 significantly predicted impairment at time point 2.
Results
A general factor model incorporating a shared underlying dimension between (lack of) wellbeing and psychopathology as well as unique specific factors had good fit to the data at Stage 1. Further, although both general factor models with and without wellbeing fit the data well at Stage 1, only the general factor model with wellbeing met all required fit thresholds when regressions to predict impairment were added in. The model without any general factor (correlated factors model) met pre-defined fit thresholds but had lower fit indices.
Conclusion
The incorporation of wellbeing into general factor models may help represent more nuanced mental health states and may be useful in predicting future functional states, however such a model needs further replication with comprehensive measures and comparison with alternative models to verify its validity and utility.
Citation: Chokhani R, Lereya ST, Deighton J (2025) Incorporating wellbeing into general factor models: A more complete mental state? PLoS One 20(11): e0335657. https://doi.org/10.1371/journal.pone.0335657
Editor: Zheng Zhang, South China Normal University, CHINA
Received: April 23, 2025; Accepted: October 13, 2025; Published: November 17, 2025
Copyright: © 2025 Chokhani et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: An anonymised and archived version of the dataset without 4 sociodemographic variables is available through the UK Data Service at: http://doi.org/10.5255/UKDA-SN-9150-1 The code for this study is available at: https://osf.io/2egv9/.
Funding: R.C. is supported by the Wellcome Trust (218497/Z/19/Z). Wellcome Trust played no role in the study design, data collection and analysis, decision to publish and preparation of this manuscript. The data in this study was collected as part of the HeadStart learning programme and supported by funding from The National Lottery Community Fund. The content is solely the responsibility of the authors, and it does not reflect the views of The National Lottery Community Fund.
Competing interests: The authors have declared that no competing interests exist.
Introduction
General factor modelling of mental health has gained traction in the last two decades, perhaps best exemplified in research on a general factor of psychopathology or the ‘p factor’ [1,2]. For proponents, such general factor modelling appears to have advantages on both theoretical and empirical fronts. Theoretically, it represents a transdiagnostic rather than a disorder-specific model of mental ill health, which better explains high rates of comorbidity in mental disorder diagnoses and better accounts for individuals waxing and waning through symptoms over a time period [3]. Empirically, it appears to strongly predict outcomes as diverse as future psychopathology [4], academic attainment and school functioning [4,5], criminal convictions [6] and suicide attempts [1]. The p factor is also related to common constructs in the nomological networks of mental health disorders, including socioemotional and cognitive constructs [7–10]. Although the p factor’s substantive meaning is strongly contested (see [11] and [12] for a review), its consistent identification across different populations, samples, measures and statistical models [4–6,8–10,13–16], its stability and homotypic continuity [17,18] and its empirical association with variables of interest remains formidable [8–10].
Even if general factors do not represent etiologically substantive constructs, they model mental health status in a transdiagnostic and dimensional manner, as compared to the categorical diagnoses approach [19]. In this study, we conceptualize general factors as useful indices of the general mental health status of an individual on a continuum. As such, they may have utility in predicting future mental or functional states or other related outcomes. Further, from a measurement perspective, general factor models incorporating specific factors are a useful way to understand and model the different constructs that should be considered when measuring mental health status and to test the relationship of these constructs to each other.
Bifactor models have commonly been used to model general factors of mental health, often including a general factor dimension as well as specific internalizing and externalizing dimensions, with additional specific psychopathology dimensions representing thought disorder, hyperactivity and substance use variably added [1,14,18]. We argue that wellbeing could be another dimension in such a general factor model. In 2005, Keyes proposed the complete state model, which argued that mental illness and mental health are distinct, but related constructs [20]. This model conceptualized the presence of mental health as the presence of emotional, psychological and subjective wellbeing which may co-occur with both the presence and absence of mental disorder [20,21]. This model has been empirically tested using three different approaches. First, Confirmatory Factor Analysis (CFA) models have found an oblique two-factor model having the best fit to data, suggesting that wellbeing and psychopathology are distinct but correlated constructs (see [22] for a review). Second, many studies have found unique as well as shared predictors of wellbeing and mental illness [23–26], lending further support to the idea of distinct constructs with some overlap. Third, studies have classified individuals into quadrants based on combinations of high/low mental illness and high/low wellbeing; two out of these subgroups would typically not be represented in unidimensional psychopathology models (those with low wellbeing but no mental illness; those with mental illness but reporting high wellbeing). These studies have indeed found distinct relationships between being situated in a particular quadrant with both outcome and predictor variables. Notably, having low wellbeing even in the absence of mental illness predisposes one to worse functioning and poorer resilience [20,27]. Similarly, complete mental health is associated with better functioning than just low psychopathology [28]. While both the quadrant approach and the dual-factor CFA approach appropriately tend to highlight the distinctness of wellbeing and psychopathology, they may downplay their shared aspects and may not fully represent the complex relationship between wellbeing and psychopathology [29].
Including wellbeing in general factor models along with psychopathology could address some of these issues and represent a more nuanced model. Such a general factor could represent a mental health status that indexes shared variance between psychopathology and wellbeing and thus could better distinguish between, for example, people with low wellbeing/low psychopathology and high wellbeing/low psychopathology. Current general factor models including only psychopathology would not be able to capture this difference. Further, similar to the argument for p factor models better accounting for dynamic symptom changes, such a model would better account for individuals who may cycle between high psychopathology and low wellbeing states, but remain with some form of distress overall. Finally, since wellbeing has been found to additionally distinguish functional outcomes even amongst individuals with mental illness [30], such a general factor could be particularly useful in predicting future functional impairment. In other words, the general factor could be expected to retain the utility of being a parsimonious construct that captures general mental health status and yet be a better predictor of future functional states than a general factor that does not take wellbeing into account. At the same time, the model would retain the distinct aspects of wellbeing and psychopathology, as captured by the specific factors that are free of their shared variance. The model, as a whole, could then be considered to represent a more complete mental health status.
Some previous literature has incorporated wellbeing into general factor models [19,31–33]. Böhnke & Croudace [31] found that Goldberg’s General Health Questionnaire (GHQ-12) and the Warwick-Edinburgh Mental Wellbeing Scale (WEMWBS) loaded onto a strong general factor and concluded that the two instruments may not measure distinct constructs. Black et al. [19] used pilot data of the main study the current article uses data from and found a general factor, which they labelled ‘internalising distress’, that explained the covariance between internalizing psychopathology, externalizing psychopathology and wellbeing. However, their measure of wellbeing was the Child Outcomes Rating Scale, which was originally designed as a self-report outcome measure in child and adolescent therapy and does not strictly measure wellbeing [34]. Finally, St Clair et al. [32] measured various psychopathologies (e.g., depression, generalized anxiety, OCD) as well as self-esteem and wellbeing and found a bifactor model fit well, with a general factor and five specific factors (representing self-confidence, antisocial behaviour, worry, aberrant thought and low mood). They also interpreted the general factor as a construct representing ‘distress’ and found it was associated with self-reported hazardous behaviour such as substance use and self-harm. However, the researchers allowed cross-loadings and adjusted the model as per modification indices, which may have led to overfitting. Van Erp Taalman Kip and Hutschemaekers [33] fit a model incorporating wellbeing and mental illness with a clinical population and found that while a dual-factor model was supported, the wellbeing factor only explained a small amount of the variance. They concluded that “if independency between the factors is a prerequisite, then only one continuum emerges with psychopathology and wellbeing as bipolar opposites” (p. 1725). However, as noted above, we cannot consider independency between wellbeing and psychopathology to be a prerequisite, as that is likely to be untrue. Further, as the authors concluded, in a clinical sample, the importance of wellbeing may be subsumed by the dominant role of psychopathology. Finally, while these studies have fit models incorporating both psychopathology and wellbeing, to our knowledge, no study has, as yet, evaluated the association of such a general factor to a prospective variable indexing a future functional state.
Functional impairment can be defined as the difficulty in handling the routine demands of everyday life [35] and can be conceptualized as a crucial consequence of mental illness [36,37], albeit it can also be argued to have a bidirectional relationship with mental illness. Empirical research has found that functional impairment is a better predictor of mental health service use than symptoms [38] and is also a valuable measure of treatment effectiveness, in addition to symptom change measures [39,40]. Both objective (e.g., psychiatric hospitalizations) and subjective (e.g., self-report) measures have been used to measure functional impairment. We argue that both have value, as the relationship between objective and subjective measures has been found to be stable, but weak, suggesting that both capture unique aspects of functional impairment [36]. Further, in concert with the recent calls to increase young people’s voices in research [41], there is a value-based argument to use self-reports of impairment in research, which highlight the young person’s unique perspective on their own impairment. Hence, in this study, we focus on self-reported subjective functional impairment, acknowledging that this may only capture one dimension of functional impairment.
Aims and objectives
This study aimed to understand whether incorporating wellbeing as another dimension within general factor models of mental health is (a) feasible and (b) useful in predicting future impairment. We refer to the general factor in such a model as gwb to conceptually distinguish it from the p factor.
Research Question 1. Can a model with a gwb factor be identified in a school-based adolescent sample?
Research Question 2. Is the gwb factor independently and prospectively associated with future impairment?
Based on previous literature [19,32], we predicted that a general factor model with wellbeing would fit the data satisfactorily and that gwb would represent a negative mental state, with negative loadings of the wellbeing items and positive loadings of the psychopathology items. We predicted that gwb would have a significant positive association with future functional impairment, even accounting for the effects of concurrent functional impairment. We also planned to estimate a correlated factor model for comparison purposes and predicted that this model would also have satisfactory fit. Finally, the specific factors in the correlated factors model would also have significant positive (Internalizing, Externalizing) and negative (Wellbeing) associations with future impairment.
Methodology
We conducted secondary analysis of data collected between 2016/17–2018/19 as part of the HeadStart programme. HeadStart was a 6-year large-scale programme which aimed to explore methods to improve mental health and resilience in young people in England. We included data collected at two time points (Year 7 and Year 9).
Participants
Overall, the HeadStart programme accessed a school-based sample of 67871 adolescents from six different local authorities in England (Blackpool, Cornwall, Hull, Kent, Newham and Wolverhampton). These six local authorities were identified based on levels of deprivation and engagement with the programme. Hence, the sample might not be representative of all school children in England.
The analytic sample for this study was 15258 adolescents in Year 7 who had participated in Wave 1 and filled out the Strengths and Difficulties Questionnaire and the Short Warwick Edinburgh Mental Wellbeing Scale (defined as responding to at least one question on each measure). As compared to national-level data published by Department for Education in January 2017 [42,43], the study sample at baseline had a higher percentage of children who were White as compared to other ethnicities (study: 74.0%; national: 70.9%, for state-funded secondary schools) and a lower percentage of children with special educational needs (SEN) (study: 11.6%; national: 14.4%, for all school types). The percentage of children ever eligible for Free School Meals (FSM) in our sample was 35.2%. Out of the initial sample, 11535 participants (75.6%) were followed up in Year 9, two years later.
Procedures
Data collection for the study was conducted between January 2017 – June 2021. At the start of the study, parents and carers of children in the longitudinal group were provided with a written information sheet and opt-out consent form, with a deadline of at least two weeks from the date of issue. Opt-outs could be submitted via Freepost, phone, or email to the Data Manager. All children in an eligible year group in participating schools (excepting those whose carers had opted out) were invited to fill out the measures. For each survey session, written child assent was sought and recorded via computer at the beginning of the session. Students filled out measures during their usual school day, in the presence of a teacher or adult. This procedure was approved by the University College London Research Ethics Committee (ID number 7963/003). Further details are available in the HeadStart National Evaluation Report [44]. The data used in the current study were accessed after completion of the study on 30th October 2023 and these data were anonymized, i.e., researchers could not identify participants. This use was covered under the original ethical approval for the study.
Measures
All measures were based on youth-self-report, except socio-demographic characteristics.
Socio-demographic characteristics.
Socio-demographic data, i.e., gender, ethnicity, children with SEN, eligibility for FSM and Index of Deprivation Affecting Children (IDACI) score were obtained through a data linkage with the National Pupil Database. The IDACI score is an area-level deprivation index and measures the proportion of all children aged 0–15 living in income deprived families in a particular area [45].
Psychopathology.
The 25-item Strengths and Difficulties Questionnaire (SDQ) was used to measure psychopathology [46,47]. The SDQ contains a total of five subscales: emotional problems, conduct problems, peer problems, hyperactivity and prosocial behaviour. The emotional problems subscale was considered to represent internalizing symptoms and the conduct problems subscale was considered to represent externalizing symptoms, following Patalay et al. [4]. Respondents use a 3-point Likert scale, where higher scores generally indicate higher psychopathology, excepting for one item, which is reverse scored.
Wellbeing.
The 7-item Short Warwick Edinburgh Mental Well-being Scale (SWEMWBS) was used to measure subjective wellbeing [48]. Despite appearing to tap psychological and eudaimonic aspects of wellbeing to a greater extent than hedonic wellbeing, the SWEMWBS has been found to be a reliable and unidimensional measure of mental well-being [48]. Respondents use a 5-point Likert scale, where higher scores indicate higher wellbeing and no items are reverse scored.
Functional impairment.
In this study, we measured functional impairment using the impact supplement of the SDQ, which measures impairment in social and occupational domains: home life, friendships, classroom learning and leisure activities [38]. Item 1 in the impact supplement was: ‘Overall, do you think you have difficulties in one or more of the following areas?’. Individuals who reported no perceived difficulties on Item 1 are considered as having an impact score of zero [49]. For individuals who report any perceived difficulties, the impact score is typically computed using five items. We further a priori decided to exclude the ‘distress’ item (Item 3; which is typically included in the impact score) as this item might be conceptually overlapping with both internalizing difficulties and wellbeing. Hence, we computed impact score by adding Items 4–7, answered on a 4-point Likert scale. Overall, the impact score ranged from 0–12.
Data analyses
All analyses were conducted using R version 4.3.1 [50].
Missing data.
As previously indicated, out of the analysable sample of 15258 at baseline, 11535 participants had been followed up two years later while 3723 had been not. Chi-squared tests and t-tests comparing these two groups on baseline socio-demographic characteristics and the measures of interest were significant (S1 Table), indicating that the sample who were followed up were significantly different on these baseline indicators than the sample who were not followed up. In particular, the sample who were not followed up were more likely to be boys, those eligible for FSM, those with SEN provision and those with lower wellbeing, higher symptom scores and impairment at baseline. Children from White ethnic groups were also less likely to be followed up as compared to children from Black, Asian and Other ethnic groups, but this was not true when comparing to a Mixed ethnic group. Overall, this is indicative of an attrition bias in the data with at least a Missing at Random (MAR) mechanism, although Missing Not At Random (MNAR) is also possible (see [51] for an explanation of missingness mechanisms). To account for this attrition bias, we included the biased socio-demographic variables as covariates in the analyses and imputed missing data on the initial sample of n = 15258 using multiple imputation by chained equations [52–54].
We used the R package mice version 3.16.0 [55] to conduct multiple imputation under MAR assumption. Even if data are Missing Not At Random, multiple imputation can produce estimates robust against MNAR [56]. For the final dataset entered into the imputation, variable-wise missingness ranged from 0–28.5% and case-wise missingness ranged from 0–92%. All variables (predictors, covariates and outcome variables) included in the analysis were included in the imputation model. All covariates shown to be associated with missingness, as above, were included. Further, we also included all other variables available in the dataset as auxiliary variables in the imputation model, excepting variables excluded on administrative grounds (e.g., participant id) and substantive grounds (e.g., all variables measured post the COVID-19 pandemic). For computational efficiency, we also excluded variables at time point 2 for whom the outbound statistic for imputing future impairment (the main outcome variable) was 0.15 or below [56]. The predictive mean matching method was used for imputation because a) it works well when data do not meet the assumption of multivariate normality b) only plausible values are imputed, allowing distributions to be preserved and c) it is computationally faster. Given available time and complexity of the model, we imputed a total number of m = 20 datasets at 10 iterations and visually checked for convergence, which is suggested to be acceptable [56]. The summary statistics of the imputed datasets are available in S2 Table.
Structural equation modelling: Measurement model.
To address Research Question 1, we estimated measurement models that tested whether our theorized models fit the data as per pre-defined fit thresholds. The semTools package version 0.5–6.932 in R [57] was used to run models on imputed datasets using the functions cfa.mi and sem.mi and to pool parameter estimates and standard errors. For the structural models, we converted pooled unstandardized estimates to pooled standardized estimates using a custom function (available in OSF code files), as this functionality was not available in semTools at the time of analyses.
In accordance with literature that generally tests a bifactor and correlated factor model [13], we initially specified two models:
Bifactor gwb model. This model was a general factor model using items indexing both psychopathology and wellbeing (Fig 1(a)). It had three specific factors (Internalizing, Externalizing and Wellbeing) and one general factor, which we called the general factor with wellbeing or gwb. In concert with recommendations [13], we refer to these specific factors as gwb-free specific factors, to conceptually distinguish them from specific factors in a correlated factors model. The specific factors were specified to be orthogonal to each other as the general factor is theoretically supposed to capture the variance shared by the specific factors.
(a) Bifactor gwb model (b) Correlated factors model (c) Bifactor gwb with method factor model (d) Bifactor model with method factors (e) Bifactor p model. Structural models. (f) Partial mediation model. For the partial mediation model, the latent factor of interest is gwb factor or p factor or specific factors (for correlated factors model), specific factors at baseline are gwb-free specific factors or p-free specific factors, depending on the measurement model and control variables are gender, ethnicity, FSM, SEN and IDACI score. In all models, rectangles indicate observed variables and ovals indicate latent variables.
Correlated factors model. This model used items indexing both psychopathology and wellbeing, but it did not have a general factor (Fig 1(b)). It was specified to have three specific factors (Internalizing, Externalizing and Wellbeing factors) which were allowed to correlate.
In a post-hoc decision, we also specified the following models:
Bifactor gwb model with a method factor. This model was the same as the bifactor gwb model, with the addition of a method factor for all negatively worded items (Fig 1(c)). This model was specified to account for wording effects that can occur when some items are negatively worded (nine out of 10 SDQ items) and others are positively worded (all SWEMWBS items and one SDQ item). Wording effects could confound the distinctness of the dimensions of psychopathology and wellbeing. To account for this, we estimated a method factor for all negatively worded items. Loadings of all items on the method factor were constrained to equality to aid identification. When we included a method factor for positively phrased items as well (i.e., all the SWEMWBS items + one item from the SDQ), we found that the variance estimated for this latent factor was negative, despite the model overall having a better fit. Notably, Item 7, a reverse-worded item, is tricky to interpret as positive or negative (‘I usually do as I am told’) and has shown inconsistent results in previous analyses [58]. Hence, we decided to only include a method factor for negative wording effects in this model, which is in line with previous work on similar questionnaires that has focused on negative wording effects but not positive wording effects [31].
Bifactor p model. We also specified a general factor model using items indexing only psychopathology (Fig 1(e)). It had two specific factors (Internalizing, Externalizing) and a general factor, which we called the psychopathology factor, in accordance with the literature on the p factor. This model did not include a method factor since all the items excepting one (SDQ Item 7) were negatively worded. This model was specified for descriptive comparison purposes.
Bifactor model with two method factors. This was a general factor model where all the items loaded on the general factor (Fig 1(d)). There were no specific factors for internalizing, externalizing and wellbeing. There were two method factors for positively worded and negatively worded items. This model was specified to understand whether the general factor along with two method factors is a sufficient representation of the underlying structure of the data.
We used the WLSMV estimator in all models, to better represent the categorical nature of the indicator variables. Cluster-robust standard errors with ordinal data are not supported yet in semTools so we could not account for cluster in the analysis model. However, previous studies have shown that this is a small effect [59].
Structural equation modelling: Measurement and structural models.
Subject to the measurement models having satisfactory fit indices and factor loadings, the corresponding structural model was then simultaneously estimated along with the selected measurement models. The aim of the structural model was to relate the latent factor of interest (the gwb factor, the specific factors from the correlated factors model, the p factor) to future impairment. We included concurrent impairment as a mediator between the latent factor of interest and future impairment, to ensure that we could test for the direct effect of the latent factor on future impairment, accounting for effects due to concurrent impairment. We also included gender, ethnicity, SEN, FSM and IDACI score at baseline as covariates (Fig 1(f)). In the case of general factor models, we also included the gwb-free or p-free specific factors from the same model at baseline as covariates.
Model fit and model comparisons.
All models were checked for fit using the following fit indices thresholds [13,60], which were specified a priori: Comparative Fit Index (acceptable fit > 0.90, excellent fit > 0.95); Tucker Lewis Index (acceptable fit > 0.90, excellent fit > 0.95); RMSEA (acceptable fit < 0.08; excellent fit < 0.06); SRMR (fit < 0.08). For the full model with the measurement and structural parts, we manually specified a baseline model for computation of the incremental fit indices, which was the typical independence or null model. Unstandardized estimates were used to test against the null hypothesis. Standardized estimates were used to descriptively compare factor loadings and effect sizes.
For bifactor measurement models, we also calculated bifactor indices such as omega hierarchical and explained common variance [13,61,62]. Omega total (ω) refers to the proportion of variance in an observed total score that can be attributed to all modelled sources of variance (general and specific factors). Omega hierarchical (ωH) refers to the proportion of variance in an observed total score that can be attributed only to the general factor, partitioning out the specific factors. Omega hierarchical subscale (ωHS) refers to the proportion of variance in an observed subscale score that can be attributed only to the specific factors, controlling for the general factor. Finally, Explained Common Variance represents the proportion of all common variance explained by a factor and ranges from 0 to 1 [13,61].
Our primary objective was to test each model individually for fit. However, we also attempted to compare models as a secondary objective. Where models were nested, they were compared using scaled chi-square difference tests. Most models were non-nested so could not be compared in this manner. Relative fit indices such as AIC and BIC that are typically used to compare non-nested models are not available for the WLSMV estimator. Hence, we descriptively compared non-nested models.
Results
Structural equation modelling: Measurement models
Table 1 depicts the fit indices for all five measurement models estimated. Although all models had satisfactory fit indices as per the more lenient fit thresholds, only Model 3 (bifactor gwb with method) and Model 4 (bifactor p) came close to meeting all the conservative thresholds. The chi-squared test was significant; however this was due to the sensitivity of the chi-squared test statistic to large samples. Table 2 depicts additional bifactor indices. Below, each model’s fit indices and factor loadings are summarized. Full results tables are available in S3 Table.
Bifactor gwb model. For the bifactor gwb model, the overall model fit was nearly excellent (χ2(102, 15258) = 5032.5, p < 0.01, CFI = 0.948, TLI = 0.931, RMSEA = .056, SRMR = 0.049). Standardized factor loadings had a range between 0.25–0.64 for the gwb-free internalizing factor, 0.34–0.53 for the gwb-free wellbeing factor, 0.41–0.77 for the gwb-free externalizing factor and an absolute range of 0.27–0.68 for the complete mental state factor. The mean of the absolute standardized factor loadings was moderate for all latent factors (0.48 for the gwb-free internalizing factor, 0.51 for the gwb-free externalizing factor, 0.44 for gwb-free wellbeing factor and 0.44 for the gwb factor). Although some of these loadings are relatively low, low factor loadings are common with child and adolescent populations (e.g., Patalay et al., 2015; Black et al., 2019). Highest standardized loadings on the gwb factor included ‘I am often unhappy, downhearted and tearful’ (0.68), ‘I’ve been thinking clearly’ (−0.62), ‘I’ve been dealing with problems well’ (−0.56), ‘I’ve been feeling relaxed’ (−0.55) and ‘I get very angry and often lost my temper’ (0.54).
Following Caspi et al. [13], the omega hierarchical for gwb was acceptable (0.60) and suggested that a total score of SDQ and SWEMWBS items would predominantly reflect individual differences on the general factor. Following Stucky and Edelen’s [62] guidelines, the proportion of common variance explained by the general factor in this model (47%) was not high enough to warrant a unidimensional model.
Correlated factors model. For the correlated factors model, the overall model fit was not excellent, but acceptable (χ2(116, 15258) = 7566.8, p < 0.01, CFI = .922, TLI = 0.908, RMSEA = .065, SRMR = 0.064). Standardized factor loadings had a range between 0.54–0.85 for the internalizing factor, 0.48–0.77 for the wellbeing factor and 0.51–0.78 for the externalizing factor. The mean standardized loadings were fairly high (0.66 for the internalizing factor, 0.63 for the externalizing factor and 0.63 for the wellbeing factor). Further, the covariance was 0.44 between internalizing and externalizing factors, −0.52 between internalizing and wellbeing factors and −0.48 between externalizing and wellbeing factors.
Bifactor gwb with method factor. For the bifactor gwb model with the method factor, the overall model fit was excellent (χ2(101, 15258) = 3994.2, p < 0.01, CFI = 0.959, TLI = 0.945, RMSEA = 0.050, SRMR = 0.045). Standardized factor loadings had slightly reduced for the gwb-free internalizing factor (Mean: 0.45) and the gwb-free externalizing factor (Mean: 0.50) as compared to the original bifactor model. Standardized loadings on the gwb-free wellbeing factor were low in this model (Mean: 0.25), with four items, in particular, having low factor loadings that albeit were significantly different from zero (“feeling relaxed”, “dealing with problems well”, “thinking clearly”, “been able to make up my own mind”). The mean standardized loading on the gwb factor was similar (Mean: 0.44) although the contributions of the internalizing and externalizing items had reduced, presumably due to some of their variance now being accounted for by the method factor. In turn, the contributions of wellbeing items had increased, especially those items that now had low loadings on the gwb-free and method-free wellbeing specific factor. In terms of items, highest standardized loadings were similar, including ‘I’ve been thinking clearly’ (−0.62), ‘I’ve been dealing with problems well’ (−0.56), ‘I’ve been feeling relaxed’ (−0.55), ‘I am often unhappy, downhearted and tearful’ (0.54) and ‘I’ve been feeling useful’ (−0.54). Notably, although ‘I get very angry and often lose my temper’ still had the highest loading on gwb out of the externalizing items, its standardized loading had reduced from 0.54 to 0.41.
The omega hierarchical for gwb was acceptable (0.68) and suggested that a total score of SDQ and SWEMWBS items would predominantly reflect individual differences on the general factor. For the specific factors, the omegas hierarchical subscale for Int, Ext and WB were 0.34, 0.39 and 0.12 respectively, suggesting that the subscale total scores would reflect the general factor more than the intended factor, particularly for the Wellbeing subscale. The proportion of common variance explained by the general factor in this model (48%) was also not high enough to warrant a unidimensional model [62].
Bifactor p. As per the fit indices, this model had excellent fit (χ2(25, 15258) = 1154.1, p < 0.01, CFI = 0.971, TLI = 0.949, RMSEA = .054, SRMR = 0.042). Standardized factor loadings had a range of 0.17–0.64 for the internalizing factor (Mean: 0.47), 0.40–0.69 for the externalizing factor (Mean: 0.53) and 0.31–0.72 for the p factor (Mean: 0.42). For the first time in all models, one of the factor loadings on the general factor (in this case, p factor) was not significant and had an estimate of 0.08 (Item 7: I usually do as I am told). Highest standardized loadings on the p factor included ‘I am often unhappy, downhearted or tearful’ (0.72), ‘I get very angry and often lose my temper’ (0.55) and ‘I get a lot of headaches, stomachaches or sickness’ (0.54).
In this model, the omega hierarchical for the general factor, p, was lower than usually considered acceptable [13], suggesting that the variance in the total score of all items explained by the p factor was moderate and the specific factors were also important in explaining variance in the observed total score. Similarly, the proportion of common variance explained by the general factor in this model (43%) was also not high enough to warrant a unidimensional model [62].
Bifactor with method. This model was estimated to test the hypothesis that a general factor along with two method factors was sufficient to explain the data. The model had acceptable fit as per lenient fit indices (χ2(102, 15258) = 7031.8, p < 0.01, CFI = 0.927, TLI = 0.903, RMSEA = .067, SRMR = 0.059). The standardized loadings on the general factor gwb ranged between 0.21–0.76 (Mean: 0.46). However, the factor loadings for the negative wording factor clearly split along the internalizing items (all positive loadings) and externalizing items (all negative loadings) of the SDQ, suggesting that it is not representing a negative wording factor, as we would expect such a factor to have loadings from all the specified items in the same direction. Similarly, for the positive wording factor, the factor loadings split between the one SDQ item (positive loading) and all the SWEMWBS items (all negative loadings), again suggesting that it is not representative of a positive wording factor. Hence, we considered this a misspecified model based on factor loadings.
Finally, based on our reviewer’s suggestion, we also fit an additional pure unidimensional model with a general factor (gwb) on which all items loaded. There were no specific factors or method factors. This model was fit on an archived version of the dataset, since we no longer had access to the original dataset. This model showed poor fit (χ2(119, 15255) = 23258.556, p < 0.01, CFI = 0.758, TLI = 0.723, RMSEA = 0.113, SRMR = 0.116), suggesting that a single general factor is not adequate to explain the variance in the data. Further information as well as the explanation of minor differences between the original and archived dataset are reported in S7 Text.
Since Model 1 (bifactor gwb) and Model 3 (bifactor gwb with method factor) were nested, they could be compared using a scaled chi-square difference test. This indicated that Model 3 (bifactor gwb with method factor) was superior (∆χ2 = 500, df = 1, p < 0.001). Since the method factor in the bifactor model included 9/10 SDQ items, it is difficult to conclusively establish that it truly represents the method effect of negative-phrased items only. It could also be capturing elements of shared psychopathology that are non-specific to internalizing and externalizing but that are not shared with wellbeing. However, constraining the loadings of items on the method factor to equality should work against this.
Hence, for the next stage of the analysis, we carried forward the correlated factors model, the bifactor gwb with a method factor model and bifactor p model.
Structural equation modelling: Measurement and structural models
For all three selected models, we specified a partial mediation model along with the original measurement model. Table 3 depicts the fit indices for these structural models.
As seen in Table 3, only the bifactor gwb mediation model with the method factor and the adjusted correlated factors model met the pre-defined fit thresholds. None of the models met the more conservative criteria for fit indices. Tables 4 and 5 contain the regression coefficient estimates and the estimates for direct and indirect paths for the bifactor gwb mediation model and the adjusted correlated factors model. Below, key findings from each model are summarized.
Bifactor gwb mediation model with method factor. The overall model fit of the bifactor gwb partial mediation model with the method factor was satisfactory (χ2(263, 15258) = 7210.0, p < 0.01, CFI = 0.939, TLI = 0.925, RMSEA = 0.042, SRMR = 0.039). The estimate for the direct path from the general factor gwb to future impairment was positive and significant (unstandardized estimate: 2.51, p < 0.001; pooled standardized estimate: 0.31). The estimate for the indirect path from the general factor to future impairment via concurrent impairment was also positive and significant, though lower in terms of effect size (unstandardized estimate: 0.59, p < 0.001; pooled standardized estimate: 0.07). With respect to the gwb-free and method-free specific factors, both the Int and Ext factors were significantly and positively associated with concurrent impairment (Int: pooled standardized estimate: 0.26; Ext: pooled standardized estimate = 0.15), however the relationship with impairment became weaker over time (Int: pooled standardized estimate: 0.11; Ext: pooled standardized estimate = 0.059). The gwb-free and method-free Wellbeing factor had small but significant positive associations with impairment in this model. With respect to the control variables, SEN status and FSM status were positively and significantly associated with concurrent impairment, with the relationship maintained over time. Gender and ethnicity were associated with concurrent impairment, with males (as compared to females) reporting lower impairment and Asian and Black adolescents (as compared to white adolescents) reporting lower impairment. The relationship of gender and ethnicity with impairment became stronger over time. IDACI score was significantly associated with concurrent impairment, but the relationship became non-significant over time (See S4 Table for full results of control variables).
Adjusted Correlated Factors Mediation Model. This model simultaneously included the internalizing, externalizing and wellbeing factors. Thus, it can be considered an adjusted correlated factors model as it estimates the unique effect of each specific factor, adjusting for the effect of the other specific factors. The overall model fit was satisfactory, but descriptively worse as compared to the bifactor gwb mediation model (χ2 (280, 15258) = 10146.3, p < 0.01, CFI = 0.913, TLI = 0.900, RMSEA = 0.048 and SRMR = 0.056). The internalizing and externalizing factors had positive and significant associations with concurrent impairment (internalizing: unstandardized estimate: 1.82, p < 0.001, pooled standardized estimate: 0.36; externalizing: unstandardized estimate: 0.72, p < 0.001, pooled standardized estimate: 0.21), while the wellbeing factor had a negative and significant association with concurrent impairment (unstandardized estimate: −0.81, p < 0.001, pooled standardized estimate: −0.14). Hence, the strongest association was with the internalizing factor and the weakest association with the wellbeing factor. All associations with impairment decreased over time, with the internalizing factor maintaining the relatively strongest association (unstandardized estimate: 0.76, p < 0.001, pooled standardized estimate: 0.14), followed by the externalizing factor (unstandardized estimate: 0.28, p < 0.001, pooled standardized estimate: 0.075) and then the wellbeing factor (unstandardized estimate: −0.50, p < 0.001, pooled standardized estimate: −0.08). The pattern of associations with the control variables remained similar to the bifactor gwb mediation model (See S4 Table for full results). We also estimated separate unadjusted models for the internalizing factor, externalizing factor and wellbeing factor as the latent factor of interest associated with impairment; none of these models met the pre-defined fit thresholds (S5 Table).
Bifactor p mediation model. The overall model fit of the bifactor p partial mediation model was adequate as per some fit criteria but did not meet all fit thresholds (χ2(119, 15258) = 4567.9, p < 0.01, CFI = .915, TLI = 0.884, RMSEA = .050, SRMR = 0.036). Hence, we decided not to interpret path coefficients from this model.
The bifactor gwb mediation model satisfactorily fit the data whereas the correlated factors mediation model just about fit the data and the bifactor p model failed to meet all required fit indices. Owing to the non-nested nature of the bifactor and correlated factor structural models, we could not statistically compare them. Hence, we considered the bifactor gwb mediation model as representing the best model fit to our data, with the correlated factors mediation model as also viable.
Sensitivity analysis
To test the robustness of the results, we also estimated the bifactor gwb with method model, the bifactor p model and the correlated factors model with the MLR estimator with full information maximum likelihood to account for missing data (S6 Table A and B). The overall pattern of results was similar, but the incremental fit indices were lower for all models (bifactor gwb with method: CFI: 0.907; TLI: 0.886; bifactor p: CFI: 0.894; TLI: 0.856; correlated factors: CFI: 0.871; TLI: 0.852). The fit for just the measurement models were very good for both bifactor models (bifactor gwb with method: CFI: 0.950; TLI: 0.932; bifactor p: CFI: 0.958; TLI: 0.924) but not for the correlated factors model (CFI: 0.906; TLI: 0.890). With the WLSMV estimator without multiple imputation (Cases used: 8709), the results were similar to the current results (S6 Table C). Finally, we tested for the analytic decision of carrying forward the bifactor gwb model with the method factor for the structural model; in case we had carried forward the bifactor gwb model without the method factor instead, it would still have met the prespecified fit indices at the structural model stage (CFI: 0.934; TLI: 0.919; RMSEA: 0.043; SRMR: 0.043), the associations with impairment for the general factor were similar and the broad conclusions would have remained unchanged (S6 Table D).
Discussion
This study aimed to test whether a general factor model with a wellbeing dimension is feasible and useful. Our first research question related to feasibility, i.e., whether such a general factor gwb could be identified in a school-based adolescent sample. To this end, we estimated various latent variable models incorporating psychopathology and wellbeing dimensions.
We found that most models had acceptable, but not excellent fit to the data, namely, the bifactor gwb model, the bifactor gwb with method model, the bifactor p model and the correlated factors model. Notably, while the correlated factors model had acceptable fit as per our pre-defined fit indices, the fit indices were descriptively lower than the other models. Further, in a direct model comparison test, the bifactor gwb with method model was superior to the bifactor gwb model. Overall, this indicates that, for representing the overall structure of mental health, the bifactor gwb model with the method factor, the bifactor p model and the correlated factors model are all viable representations of the underlying data structure, with a tentative preference for the bifactor models, which is similar to what has been found previously with child and adolescent samples [4,14–16]. Neither the bifactor model with the method factors only or the pure unidimensional model (the latter fit on the archived dataset) were viable models, suggesting that a single general factor does not adequately represent the structure of mental health. This was also corroborated by the explained common variance in all bifactor models, which suggested that unidimensionality was not warranted. This is consistent with previous findings about unidimensional models in the literature [4,14–16]. Overall, this suggests that there are both shared and distinct aspects between the internalizing, wellbeing and externalizing dimensions; if there was nothing shared, the general factor models would not fit and if there was nothing distinct, the unidimensional model would fit adequately. This is further supported by the finding that there were moderate associations between the specific factors in the correlated factors model. While moderate associations between the internalizing and externalizing are well-established, it is interesting that wellbeing too had moderate (negative) associations with the internalizing and externalizing factor, especially the internalizing factor, which it has previously been suggested to conceptually overlap with to a greater extent [19].
The shared underlying construct (gwb) between (lack of) wellbeing and psychopathology appears to be a negative mental state. The mean standardized absolute factor loadings on the general factor for the bifactor gwb with method model was 0.44, which was similar to the mean loading on the general factor for the bifactor p model (0.42). The items with highest loadings on the gwb factor were ‘I am unhappy, downhearted and tearful’, ‘I get very angry and often lose my temper’, ‘I’ve been thinking clearly’, ‘I’ve been dealing with problems well’ and ‘I’ve been feeling relaxed’. Of note, most studies have found such a general factor to represent poor rather than good mental states [19,31,32]; it could be that when measuring a construct such as wellbeing, it is easier to find a shared commonality on lack of wellbeing whereas positive wellbeing states might be more multidimensional [31]. Such a shared commonality between (lack of) wellbeing and psychopathology could represent more substantive, higher-order tendencies for poorer psychological wellbeing and higher psychopathology, especially since there are shared genetic influences between mental health and mental illness [21]. For example, this general factor could be capturing a construct such as self-directedness from Cloninger’s psychobiological model of personality [63,64]. Self-directedness is conceptualized as a self-regulatory domain of human personality, with a heritable component, and has been linked to both different aspects of wellbeing (e.g., positive affect, life satisfaction; [63]) as well as psychopathology (e.g., emotional and behaviour problems; [65]). However, we also note that substantive interpretations of general factors have been subject to much debate and general factors could also be indexing current mental health status (e.g., distress or impairment), rather than representing antecedents of wellbeing and illness [11].
Comparing standardized factor loadings on the specific factors in the correlated factors model to the gwb-free specific factors in the bifactor gwb with method model could suggest which items are more indicative of the general factor and the specific factors [1]. With respect to internalizing items, the factor loadings of “headaches” and “unhappy” substantially reduced from the correlated-factors to the bifactor gwb model, indicating that these items were more indicative of the general factor, whereas the factor loadings for “worry” and “fears” did not change much, indicating that these items formed an internalizing-specific factor even after the general factor has been partialled out. With respect to externalizing items, the factor loadings of “angry” and “lie” reduced substantially from the correlated factors to the bifactor model, while the factor loading of “steal” reduced moderately, the factor loading of “obedient” remained similar and the factor loading of “fight” on the specific factor actually increased in the bifactor model. This indicates that “steal”, “obedient” and “fight” might have been capturing quite specific dimensions of externalizing whereas “angry” and “lie” might have been more indicative of the general factor. With respect to the wellbeing items, the factor loadings for “feeling relaxed” “thinking clearly” “dealing with problems” and “make up mind” reduced substantially from the correlated factors to the bifactor model, indicating that these items were more indicative of the general factor, while the loadings for “feeling useful” and “feeling close” reduced moderately and “optimistic” remained similar. Overall, for this dataset, the gwb-free and method-free INT factor seems to be capturing aspects of anxiety and fear, whereas the gwb-free and method-free EXT factor seems to be capturing aspects of antisocial behaviour. The gwb-free and method-free WB factor is more difficult to interpret and less robust due to its low factor loadings; however, it is notable that all wellbeing items loading on the general factor seem to capture internal aspects of lack of wellbeing whereas “feeling close” is the only item that implies more external aspects such as relationships. Emotional states of low mood, anger, (lack of) feeling useful, (lack of) feeling relaxed, (lack of) thinking clearly and so on appear to be more transdiagnostic as they are more shared between internalizing, externalizing and wellbeing dimensions rather than remaining within the specific factors. While the current study has limitations in terms of the specific measures used, it provides a proof of concept that representing the relationship between (lack of) wellbeing and psychopathology in the form of a model including both general and specific factors allows one to acknowledge both the shared and distinct aspects of (lack of) wellbeing with psychopathology [29]. One could imagine a scenario wherein estimated factor scores on such a general factor and the gwb-free specific factors can together be used to more accurately and comprehensively describe the mental health status of individuals. In such a hypothetical scenario, an adolescent’s mental health status could be described using multiple dimensions. Their score on the general factor could be a parsimonious representation of their general (negative) mental state, which would usefully incorporate lack of wellbeing in addition to psychopathology. Additionally, their score on the internalizing factor could be representative of their fear/anxiety state, their score on the externalizing factor representative of their antisocial behaviour and their score on the wellbeing factor representative of the extent to which they report distinct wellbeing states. While the specific aspects of such a model will certainly need to be refined—for example, the consideration of other dimensions beyond internalizing and externalizing would be very reasonable, based on previous studies having done the same [14,18]—such a dimensional representation of a “complete mental state” might have advantages over more discrete representations such as the quadrant model, which inherently force the individual to be in one of a limited number of categories [19,29]. Notably, while factor scores provide model-based estimates of the latent dimensions, our bifactor reliability indices are more informative regarding the common practice of using unit-weighted total scores. In our models, these indices suggested that simple total scores across all items would predominantly reflect the general factor and subscale totals would also primarily reflect the general factor, with some unique variance from the specific factor. This suggests that factor scores might allow for a cleaner distinction between general and specific factors, whereas unit-weighted total scores may be less interpretable as proxies for their respective specific factors. Hence, using structural equation modelling can be useful in future research (a) to understand the complex relationship between wellbeing and psychopathology and (b) to distinguish between common and specific sources of variance in a cleaner manner.
Our second research question related to the utility of a general factor involving both psychopathology and wellbeing dimensions in independently and positively predicting future impairment. To do this, we estimated structural models with the latent factor of interest as the predictor variable and concurrent and future impairment as mediator and outcome variables.
We found that on this dataset, a general factor model incorporating psychopathology, wellbeing and impairment fit better than a general factor model with just psychopathology and impairment. Specifically, the bifactor gwb partial mediation model, which predicted future impairment from the gwb factor via concurrent impairment had good fit indices. The adjusted correlated factors partial mediation model, which predicted future impairment from the internalizing, externalizing and wellbeing factors via concurrent impairment also had satisfactory fit indices. However, the bifactor p mediation model, which predicted future impairment from the p factor via concurrent impairment, did not meet all pre-defined fit thresholds. Overall, this suggests that a ‘complete mental state’ model which includes the dimensions of internalizing, externalizing, wellbeing as well as their shared aspects, when associated with concurrent and future impairment, represents a more satisfactory fit to the underlying data structure, than a model that just incorporates the dimensions of internalizing, externalizing and their shared aspects. Note that since the bifactor p mediation model failed to meet criteria by only a small margin, we cannot outright reject this as a viable model, but consider it to be worse-performing on this dataset than the bifactor gwb mediation model.
Further, we found that the gwb factor is significantly and positively associated with future impairment measured two years later. For every 1 standard deviation increase in gwb scores at time 1, impairment scores two years later tended to be higher by 0.31 standard deviations. The direct path from gwb to future impairment was stronger than the indirect path, which had small effect sizes, suggesting that gwb is independently associated with future impairment, even after accounting for concurrent impairment. This finding is aligned with the literature that has found wellbeing to be associated with poor and good functioning [20,28,30]. What is interesting that the shared underlying dimension between (lack of) wellbeing and psychopathology has the strongest association with future functioning as compared to any of the specific factors. This is suggestive of the potential utility of incorporating both wellbeing and psychopathology into measurements of general mental states to inform identification of those at risk for future negative outcomes. Unexpectedly, the weakest path across all models was the path from concurrent impairment to future impairment, albeit it was still positive and significant. This suggests that the general factor might account for most of the expected relationship between concurrent and future impairment. Although we did not change our model to keep with our initial goal of confirmatory analyses, future research might consider using other models than mediation models to represent the relationships between these variables.
Beyond the general factor, even the gwb-free Int and Ext factors may be useful in predicting future impairment. These specific factors seem to have retained significant, albeit low, associations with impairment, even after partialling out the general factor. This is conceptually similar to the adjusted correlated factor model, which estimates the unique effect of each specific factor adjusting for the other factors. On the other hand, unadjusted models for these specific factors did not have good fit, suggesting that only using one of the specific factors might not be adequate in predicting future impairment. However, there is more of a caveat to interpreting the associations of the gwb-free specific factors with impairment; for the gwb-free Ext/WB factors, the effect sizes were low, with relatively higher standard errors, suggesting lower confidence in these associations; this is similar to what Caspi et al. [13] also concluded regarding the stability and interpretation of the specific factors.
Overall, our study tentatively supports the feasibility of a ‘complete mental state’ model which includes a general factor representing the shared underlying dimension between psychopathology and (lack of) wellbeing as well as specific factors representing internalizing, externalizing and wellbeing specific dimensions. In terms of implications for research, our study supports previous calls that argue against a simplistic conceptualization of psychopathology and wellbeing as totally unidimensional or totally independent and adds to the evidence that that mental illness and wellbeing have both shared and distinct dimensions [19,22,29,32]. A key area for future research might be to attain conceptual and empirical agreement on what the shared and distinct dimensions capture. Notably, in our model, the wellbeing-specific factor was the most difficult to interpret and least robust in terms of its associations with future impairment; this could be because of the limitations of the comprehensiveness of the wellbeing measure used, which has been found to be unidimensional [66]. One priority for the future may be to identify what, if any, are the distinct aspects of wellbeing that are not shared with psychopathology. These could be aspects such as having positive relationships, finding meaning and experiencing positive affect, which are typically not captured by measures of psychopathology but are part of wellbeing models [64,67].
In terms of implications for policy and practice, our study provides some empirical evidence to support calls to measure wellbeing in addition to mental health symptoms when measuring mental health status [68,69], especially for school-based non-clinical samples, as the incorporation of wellbeing in the model seemed to lead to best fit and aided in predicting future impairment. Previous literature has identified the unique contribution of wellbeing and quality of life to predicting protective factors, outcomes and referrals to CAMHS services in the UK, over and above mental health symptoms [27,28,70]. Considering that CAMHS services are often overloaded with referrals, leading to a prioritization of adolescents in crisis [71], measuring both wellbeing and mental health symptoms and conducting an integrated assessment of mental health status based on these measures might prove particularly informative in screening for targeted preventative interventions to prevent young people from reaching such crisis points. Such a measurement is likely to be more inclusive and capture more variability in mental health status [72], which could be particularly useful for school and community samples, where the goal is early identification. The longitudinal nature of our study is a strength here, as it suggests the importance of wellbeing to predicting functional outcomes over time. Secondly, while recognizing that brief self-report measures are probably the most pragmatically useful measures of mental health status in such settings, our study also tentatively argues for going beyond sum score approaches to synthesizing the information from these self-report measures, especially when both mental illness and wellbeing measures are used. Even if existing validated measures are drawn upon for purposes of pragmatics and consistency, there can still be rich information available about the transdiagnostic and distinct positive and negative dimensions of mental health states. Further, different dimensions might be differentially predictive of outcomes (e.g., the externalizing dimension in our study seemed to particularly capture “fighting” and “stealing” and not just “anger”, which could be a useful marker of antisocial behaviour).
Our study has the following limitations. Firstly, in the case of all structural equation modelling, what one gets is what one puts in. Our results are necessarily a product of the measures we had, which in this case were limited to the SDQ and the SWEMWBS. Hence, we do not claim to have the most accurate representation of the wellbeing and psychopathology dimensions but rather aim to provide a proof-of-concept of how these dimensions are likely related. Using more comprehensive measures of psychopathology and wellbeing would be important to determine the most appropriate number of specific factors in the model as well as to flesh out what the wellbeing-specific dimension would represent. Secondly, both our predictor variable and outcome variable had the same informant (the adolescents themselves), leading to a risk of common method variance. The fact that the variables in question were measured two years apart somewhat alleviates this, albeit our mediator variable was measured at the same time as the predictor. Finally, it would be remiss to not acknowledge the growing criticism of bifactor models and overfitting with large sample sizes [12], which represents a risk in our sample as well. However, as Caspi et al. [13] argue, the statistical approach taken to general factor modelling (S-1 vs bifactor vs other models) does not seem to influence findings. The more relevant argument is the conceptual argument around the substantive validity and utility of a general factor itself. Here, we do not make any claims about etiological substantiveness of a general factor but use it to conceptualize a model representing the shared and distinct aspects of psychopathology and wellbeing. By doing so, we argue that it is not enough to merely include wellbeing as an additional measure when measuring mental health status in adolescents, but there also needs to be a deeper consideration of how the dimensions of psychopathology and wellbeing are related and can be modelled to provide a more accurate, comprehensive and useful picture of a young person’s mental health status.
Supporting information
S1 Table. Missingness by variables of interest at baseline.
https://doi.org/10.1371/journal.pone.0335657.s001
(DOCX)
S2 Table. Results of multiple imputation for variables of interest.
https://doi.org/10.1371/journal.pone.0335657.s002
(DOCX)
S3 Table. Factor loadings for measurement models.
https://doi.org/10.1371/journal.pone.0335657.s003
(DOCX)
S4 Table. Factor loadings and parameter estimates for structural models.
https://doi.org/10.1371/journal.pone.0335657.s004
(DOCX)
S5 Table. Unadjusted models with specific factors.
https://doi.org/10.1371/journal.pone.0335657.s005
(DOCX)
Acknowledgments
The authors wish to thank Jan Boehnke for providing comments on an earlier version of the manuscript.
References
- 1. Caspi A, Houts RM, Belsky DW, Goldman-Mellor SJ, Harrington H, Israel S, et al. The p Factor: One General Psychopathology Factor in the Structure of Psychiatric Disorders? Clin Psychol Sci. 2014;2(2):119–37. pmid:25360393
- 2. Lahey BB, Applegate B, Hakes JK, Zald DH, Hariri AR, Rathouz PJ. Is there a general factor of prevalent psychopathology during adulthood? J Abnorm Psychol. 2012;121(4):971–7. pmid:22845652
- 3. Ronald A. Editorial: The psychopathology p factor: will it revolutionise the science and practice of child and adolescent psychiatry? J Child Psychol Psychiatry. 2019;60(5):497–9. pmid:30968422
- 4. Patalay P, Fonagy P, Deighton J, Belsky J, Vostanis P, Wolpert M. A general psychopathology factor in early adolescence. Br J Psychiatry. 2015;207(1):15–22. pmid:25906794
- 5. Lahey BB, Rathouz PJ, Keenan K, Stepp SD, Loeber R, Hipwell AE. Criterion validity of the general factor of psychopathology in a prospective study of girls. J Child Psychol Psychiatry. 2015;56(4):415–22. pmid:25052460
- 6. Pettersson E, Lahey BB, Larsson H, Lichtenstein P. Criterion Validity and Utility of the General Factor of Psychopathology in Childhood: Predictive Associations With Independently Measured Severe Adverse Mental Health Outcomes in Adolescence. J Am Acad Child Adolesc Psychiatry. 2018;57(6):372–83. pmid:29859553
- 7. Carver CS, Johnson SL, Timpano KR. Toward a Functional View of the P Factor in Psychopathology. Clin Psychol Sci. 2017;5(5):880–9. pmid:29057170
- 8. Brislin SJ, Martz ME, Joshi S, Duval ER, Gard A, Clark DA, et al. Differentiated nomological networks of internalizing, externalizing, and the general factor of psychopathology ('p factor’) in emerging adolescence in the ABCD study. Psychol Med. 2022;52(14):3051–61. pmid:33441214
- 9. Shields AN, Reardon KW, Brandes CM, Tackett JL. The p factor in children: Relationships with executive functions and effortful control. Journal of Research in Personality. 2019;82:103853.
- 10. Tackett JL, Lahey BB, van Hulle C, Waldman I, Krueger RF, Rathouz PJ. Common genetic influences on negative emotionality and a general psychopathology factor in childhood and adolescence. J Abnorm Psychol. 2013;122(4):1142–53. pmid:24364617
- 11. Smith GT, Atkinson EA, Davis HA, Riley EN, Oltmanns JR. The General Factor of Psychopathology. Annual Review of Clinical Psychology. 2020;16:75–98.
- 12. Watts AL, Greene AL, Bonifay W, Fried EI. A critical evaluation of the p-factor literature. Nat Rev Psychol. 2023;3(2):108–22.
- 13. Caspi A, Houts RM, Fisher HL, Danese A, Moffitt TE. The general factor of psychopathology (p): Choosing among competing models and interpreting p. Clin Psychol Sci. 2024;12(1):53–82. pmid:38236494
- 14. Castellanos-Ryan N, Brière FN, O’Leary-Barrett M, Banaschewski T, Bokde A, Bromberg U, et al. The structure of psychopathology in adolescence and its common personality and cognitive correlates. J Abnorm Psychol. 2016;125(8):1039–52. pmid:27819466
- 15. Laceulle OM, Vollebergh WAM, Ormel J. The Structure of Psychopathology in Adolescence. Clinical Psychological Science. 2015;3(6):850–60.
- 16. Stochl J, Khandaker GM, Lewis G, Perez J, Goodyer IM, Zammit S, et al. Mood, anxiety and psychotic phenomena measure a common psychopathological factor. Psychol Med. 2015;45(7):1483–93. pmid:25394403
- 17. Snyder HR, Young JF, Hankin BL. Strong Homotypic Continuity in Common Psychopathology-, Internalizing-, and Externalizing-Specific Factors Over Time in Adolescents. Clin Psychol Sci. 2017;5(1):98–110. pmid:28239532
- 18. Murray AL, Eisner M, Ribeaud D. The Development of the General Factor of Psychopathology “p Factor” Through Childhood and Adolescence. J Abnorm Child Psychol. 2016;44(8):1573–86. pmid:26846993
- 19. Black L, Panayiotou M, Humphrey N. The dimensionality and latent structure of mental health difficulties and wellbeing in early adolescence. PLoS One. 2019;14(2):e0213018. pmid:30807608
- 20. Keyes CLM. Mental illness and/or mental health? Investigating axioms of the complete state model of health. J Consult Clin Psychol. 2005;73(3):539–48. pmid:15982151
- 21. Keyes CLM, Martin CC. The Complete State Model of Mental Health. In: Slade M, Oades L, Jarden A, eds. Wellbeing, Recovery and Mental Health. Cambridge University Press; 2017; p. 86–98.
- 22. Iasiello M, van Agteren J, Muir Cochrane E. Mental Health and/or Mental Illness: A Scoping Review of the Evidence and Implications of the Dual-Continua Model of Mental Health. EB. 2020;2020.
- 23. Kinderman P, Tai S, Pontin E, Schwannauer M, Jarman I, Lisboa P. Causal and mediating factors for anxiety, depression and well-being. Br J Psychiatry. 2015;206(6):456–60. pmid:25858180
- 24. Lereya ST, Patalay P, Deighton J. Predictors of mental health difficulties and subjective wellbeing in adolescents: A longitudinal study. JCPP Adv. 2022;2(2):e12074. pmid:37431456
- 25. Patalay P, Fitzsimons E. Correlates of Mental Illness and Wellbeing in Children: Are They the Same? Results From the UK Millennium Cohort Study. J Am Acad Child Adolesc Psychiatry. 2016;55(9):771–83. pmid:27566118
- 26. Rees G. The Association of Childhood Factors with Children’s Subjective Well-Being and Emotional and Behavioural Difficulties at 11 years old. Child Ind Res. 2017;11(4):1107–29.
- 27. Jefferies P, Fritz J, Deighton J, Ungar M. Analysis of Protective Factors in Schoolchildren in England Using the Dual-factor Model of Mental Health. Res Child Adolesc Psychopathol. 2023;51(7):907–20. pmid:36786892
- 28. Arslan G, Allen K-A. Complete mental health in elementary school children: Understanding youth school functioning and adjustment. Curr Psychol. 2020;41(3):1174–83.
- 29. Agteren J, Iasiello M. Advancing our understanding of mental wellbeing and mental health: The call to embrace complexity over simplification. Australian Psychologist. 2020;55(4):307–16.
- 30. Westerhof GJ, Keyes CLM. Mental Illness and Mental Health: The Two Continua Model Across the Lifespan. J Adult Dev. 2010;17(2):110–9. pmid:20502508
- 31. Böhnke JR, Croudace TJ. Calibrating well-being, quality of life and common mental disorder items: psychometric epidemiology in public mental health research. Br J Psychiatry. 2016;209(2):162–8. pmid:26635327
- 32. St Clair MC, Neufeld S, Jones PB, Fonagy P, Bullmore ET, Dolan RJ, et al. Characterising the latent structure and organisation of self-reported thoughts, feelings and behaviours in adolescents and young adults. PLoS One. 2017;12(4):e0175381. pmid:28403164
- 33. van Erp Taalman Kip RM, Hutschemaekers GJM. Health, well-being, and psychopathology in a clinical population: Structure and discriminant validity of Mental Health Continuum Short Form (MHC-SF). J Clin Psychol. 2018;74(10):1719–29. pmid:29603212
- 34. Duncan BL, Sparks JA, Miller SD, Bohanske RT, Claud DA. Giving youth a voice: A preliminary study of the reliability and validity of a brief outcome measure for children, adolescents, and caretakers. J Brief Ther. 2006;5(2).
- 35. Naglieri J, Goldstein S, (eds). Assessing Impairment: From Theory to Practice. Boston, MA: Springer US; 2009.
- 36. McKnight PE, Kashdan TB. The importance of functional impairment to mental health outcomes: a case for reassessing our goals in depression treatment research. Clin Psychol Rev. 2009;29(3):243–59. pmid:19269076
- 37.
Barkley RA. Attention-Deficit Hyperactivity Disorder: A Handbook for Diagnosis and Treatment, 3rd Ed. Attention-Deficit Hyperactivity Disorder: A Handbook for Diagnosis and Treatment, 3rd Ed. New York, NY: The Guilford Press; 2006.
- 38. Goodman R. The Extended Version of the Strengths and Difficulties Questionnaire as a Guide to Child Psychiatric Caseness and Consequent Burden. Child Psychology Psychiatry. 1999;40(5):791–9.
- 39. McKnight PE, Monfort SS, Kashdan TB, Blalock DV, Calton JM. Anxiety symptoms and functional impairment: A systematic review of the correlation between the two measures. Clin Psychol Rev. 2016;45:115–30. pmid:26953005
- 40. Weiss MD, McBride NM, Craig S, Jensen P. Conceptual review of measuring functional impairment: findings from the Weiss Functional Impairment Rating Scale. Evid Based Ment Health. 2018;21(4):155–64. pmid:30314990
- 41. Deighton J, Croudace T, Fonagy P, Brown J, Patalay P, Wolpert M. Measuring mental health and wellbeing outcomes for children and adolescents to inform practice and policy: a review of child self-report measures. Child Adolesc Psychiatry Ment Health. 2014;8:14. pmid:24834111
- 42.
Anonymous. Schools, Pupils and Their Characteristics: January 2017. UK: Department for Education; 2017.
- 43.
Anonymous. Special Educational Needs in England: January 2017. UK: Department for Education; 2017.
- 44.
Evidence-Based Practice Unit. HeadStart National Evaluation Final Report. London, UK; 2023.
- 45.
Ministry of Housing, Communities and Local Government. The English Indices of Deprivation 2019. Statistical Release. London, UK; 2019.
- 46. Goodman R. The Strengths and Difficulties Questionnaire: a research note. J Child Psychol Psychiatry. 1997;38(5):581–6. pmid:9255702
- 47. Goodman R. Psychometric properties of the strengths and difficulties questionnaire. J Am Acad Child Adolesc Psychiatry. 2001;40(11):1337–45. pmid:11699809
- 48. Stewart-Brown S, Tennant A, Tennant R, Platt S, Parkinson J, Weich S. Internal construct validity of the Warwick-Edinburgh Mental Well-being Scale (WEMWBS): a Rasch analysis using data from the Scottish Health Education Population Survey. Health Qual Life Outcomes. 2009;7:15. pmid:19228398
- 49.
Anonymous. Scoring the SDQ. 2016. [Last accessed: 11/4/2024]. Available from: https://www.sdqinfo.org/py/sdqinfo/c0.py
- 50.
R Core Team. R: A Language and Environment for Statistical Computing. 2023.
- 51.
Little RJA, Rubin DB. Statistical Analysis with Missing Data/ Roderick J.A. Little, Donald B. Rubin. 2nd ed. Wiley Series in Probability and Statistics. Hoboken, NJ: Wiley-Interscience; 2002.
- 52.
Raghunathan TE, Lepkowski JM, Hoewyk JV, Solenberger P. A multivariate technique for multiply imputing missing values using a sequence of regression models. 12.
- 53. van Buuren S. Multiple imputation of discrete and continuous data by fully conditional specification. Stat Methods Med Res. 2007;16(3):219–42. pmid:17621469
- 54. Azur MJ, Stuart EA, Frangakis C, Leaf PJ. Multiple imputation by chained equations: what is it and how does it work? Int J Methods Psychiatr Res. 2011;20(1):40–9. pmid:21499542
- 55. Buuren S van, Groothuis-Oudshoorn K. mice: Multivariate Imputation by Chained Equations inR. J Stat Soft. 2011;45(3).
- 56. Van Buuren S. Flexible Imputation of Missing Data, Second Edition. 2nd ed. Boca Raton, Florida: Chapman and Hall/CRC; 2018.
- 57.
Jorgensen TD, Pornprasertmanit S, Schoemann AM, Rosseel Y. semTools: Useful Tools for Structural Equation Modeling. 2022.
- 58. Duinhof EL, Lek KM, de Looze ME, Cosma A, Mazur J, Gobina I, et al. Revising the self-report strengths and difficulties questionnaire for cross-country comparisons of adolescent mental health problems: the SDQ-R. Epidemiol Psychiatr Sci. 2019;29:e35. pmid:31046859
- 59. Demkowicz O, Panayiotou M, Qualter P, Humphrey N. Longitudinal relationships across emotional distress, perceived emotion regulation, and social connections during early adolescence: A developmental cascades investigation. Dev Psychopathol. 2024;36(2):562–77. pmid:36734229
- 60. Hu L, Bentler PM. Cutoff criteria for fit indexes in covariance structure analysis: Conventional criteria versus new alternatives. Structural Equation Modeling: A Multidisciplinary Journal. 1999;6(1):1–55.
- 61. Rodriguez A, Reise SP, Haviland MG. Evaluating bifactor models: Calculating and interpreting statistical indices. Psychol Methods. 2016;21(2):137–50. pmid:26523435
- 62. Stucky BD, Edelen MO. Using Hierarchical IRT Models to Create Unidimensional Measures From Multidimensional Data. In: Handbook of Item Response Theory Modeling Routledge. 2014. p. 201–24.
- 63. Moreira PAS, Inman RA, Cloninger CR. Disentangling the personality pathways to well-being. Sci Rep. 2023;13(1):3353. pmid:36849800
- 64. Cloninger CR, Zohar AH. Personality and the perception of health and happiness. J Affect Disord. 2011;128(1–2):24–32. pmid:20580435
- 65. Kim SJ, Lee SJ, Yune SK, Sung YH, Bae SC, Chung A, et al. The relationship between the biogenetic temperament and character and psychopathology in adolescents. Psychopathology. 2006;39(2):80–6. pmid:16391509
- 66. Black L, Humphrey N, Panayiotou M, Marquez J. Mental Health and Well-being Measures for Mean Comparison and Screening in Adolescents: An Assessment of Unidimensionality and Sex and Age Measurement Invariance. Assessment. 2024;31(2):219–36. pmid:36864693
- 67.
Seligman ME. Flourish: A Visionary New Understanding of Happiness and Wellbeing. New York, NY:Simon and Schuster; 2012.
- 68. Black L, Panayiotou M, Humphrey N. Internalizing symptoms, well-being, and correlates in adolescence: A multiverse exploration via cross-lagged panel network models. Dev Psychopathol. 2022;34(4):1477–91. pmid:34128457
- 69. Bartels M, Cacioppo JT, van Beijsterveldt TCEM, Boomsma DI. Exploring the association between well-being and psychopathology in adolescents. Behav Genet. 2013;43(3):177–90. pmid:23471543
- 70. Yoon Y, Deighton J, Wickersham A, Edbrooke-Childs J, Osborn D, Viding E, et al. The role of mental health symptomology and quality of life in predicting referrals to special child and adolescent mental health services. BMC Psychiatry. 2021;21(1):366. pmid:34301207
- 71. Edbrooke-Childs J, Deighton J. Problem severity and waiting times for young people accessing mental health services. BJPsych Open. 2020;6(6):e118. pmid:33040766
- 72. Alexander LM, Salum GA, Swanson JM, Milham MP. Measuring strengths and weaknesses in dimensional psychiatry. J Child Psychol Psychiatry. 2020;61(1):40–50. pmid:31423596