Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Measurement invariance of the Center for Epidemiological Studies-Depression scale and associations with genetic risk in older adults

  • Toni T. Saari ,

    Roles Conceptualization, Formal analysis, Writing – original draft, Writing – review & editing

    toni.saari@helsinki.fi

    Affiliations Institute for Molecular Medicine Finland (FIMM), HiLIFE, University of Helsinki, Helsinki, Finland, Department of Neurology, University of Eastern Finland, Kuopio, Finland

  • Maarit Piirtola,

    Roles Conceptualization, Writing – review & editing

    Affiliations Institute for Molecular Medicine Finland (FIMM), HiLIFE, University of Helsinki, Helsinki, Finland, UKK Institute for Health Promotion Research, Tampere, Finland

  • Aino Aaltonen,

    Roles Writing – review & editing

    Affiliation Institute for Molecular Medicine Finland (FIMM), HiLIFE, University of Helsinki, Helsinki, Finland

  • Teemu Palviainen,

    Roles Data curation, Writing – review & editing

    Affiliation Institute for Molecular Medicine Finland (FIMM), HiLIFE, University of Helsinki, Helsinki, Finland

  • Anni Varjonen,

    Roles Data curation, Writing – review & editing

    Affiliation Institute for Molecular Medicine Finland (FIMM), HiLIFE, University of Helsinki, Helsinki, Finland

  • Valtteri Julkunen,

    Roles Writing – review & editing

    Affiliation Department of Neurology, University of Eastern Finland, Kuopio, Finland

  • Juha O. Rinne,

    Roles Funding acquisition, Investigation, Writing – review & editing

    Affiliations Turku PET Centre, Turku University Hospital, Turku, Finland, University of Turku, Turku, Finland

  • Jaakko Kaprio,

    Roles Funding acquisition, Investigation, Writing – review & editing

    Affiliation Institute for Molecular Medicine Finland (FIMM), HiLIFE, University of Helsinki, Helsinki, Finland

  • Eero Vuoksimaa

    Roles Funding acquisition, Investigation, Writing – original draft, Writing – review & editing

    Affiliation Institute for Molecular Medicine Finland (FIMM), HiLIFE, University of Helsinki, Helsinki, Finland

Abstract

Background

As populations are aging, it needs to be ensured that valid depression rating scales are available across old adulthood. Center for Epidemiological Studies-Depression scale (CES-D) is a common depression rating scale, however, few studies have assessed its validity in individuals with age over 90 and/or cognitive impairment. We examined the factor structures of 20-, 15-, and 8-item CES-D scales, their measurement invariance for age and cognition, and associations with genetic risk of depression.

Methods

Participants were from a population-based older Finnish Twin Cohort study including 71–79-year-olds from the MEMTWIN II (n = 1034 for exploratory and n = 664 for confirmatory factor analyses) and 90+ year-olds from the NONAGINTA (n = 134, confirmatory factor analyses) sub-studies. Associations of polygenic risk score of major depressive disorder (MDD-PRS) with CES-D scales were examined in MEMTWIN II.

Results

Exploratory factor analyses (n = 1034) suggested four- (CES-D 20) and three-factor (CES-D 8) structures and these models fit well in confirmatory analyses (n = 664). Unidimensional models had good (CES-D 15 & 20) or fair fit (CES-D 8). Results supported scalar invariance of all CES-D versions for age and cognitive status. Higher MDD-PRS was associated with more depressive symptoms in different CES-D versions.

Conclusions

Different CES-D versions are adequate for measuring depressive symptoms across age groups and cognitive spectrum in old age. Genetic risk of depression predicts depressive symptoms even in old age.

Introduction

Late-life depression is a growing burden to patients, health care systems, and societies [1]. In a recent meta-analysis of older adults, depressive symptoms were more prevalent in over 75-year-old individuals compared to under 75-year-old individuals [2], likely reflecting increased health and psychosocial challenges in old age [3, 4]. However, the stability of depression prevalence across the adult life has also been stated [5]. Prevalence estimates of depression in older adults vary from 7% to nearly 40% [2, 6], and similar figures have been reported in older adults with cognitive impairment [7, 8].

The Center for Epidemiological Studies Depression scale (CES-D) is a self-report instrument for depressive symptoms with psychometric support in diverse populations, including older adults [913]. In the original study, Radloff [9] suggested a four-domain structure of the CES-D which has received meta-analytic support [14]. To reduce the length, various short forms of the CES-D have been developed. An 8-item version of the CES-D has received psychometric support in studies of older adults [3, 1517], but the number of factors in these studies has varied. In terms of diagnostic properties, the CES-D 8 has good sensitivity and specificity when comparing it to the long version cut-off for clinically significant depressive symptoms [13].

Despite the validity evidence for the CES-D as a measure of depressive symptoms in older adults, few studies have examined whether the CES-D is invariant for age and cognition. Comparison of depressive symptoms across age and cognitive groups is only meaningful if measurement invariance has been established [3, 18]. In older adults, the CES-D has demonstrated measurement invariance for age comparing 20–54 to over 55-year-olds [19] and 50–64 to over 65-year-olds [3]. In over 65-year-olds, the CES-D 8 has demonstrated temporal measurement invariance, meaning the factor structure has stayed the same across multiple follow-ups of the same persons [15]. However, there is no information about measurement invariance comparing older adults and those who are 90 years or older, the fastest growing population segment in many countries. Thus, it remains unclear whether the CES-D has a similar factor structure in the oldest-old compared to younger old adults.

Relatively few studies have investigated whether the CES-D is invariant for cognitive status. In one study, diagnostic accuracy of the CES-D did not markedly differ between individuals with and without dementia [20]. Another study using differential item functioning found small differences between cognitively healthy and cognitively impaired, but these differences were not sufficiently large to suggest that the CES-D functions differently across cognitive states [21]. Considering that the proportion and number of older adults in populations around the world are increasing [22] and the risk of dementia increases with age [23], it must be ensured that suitable depression rating scales are available for older adults across age and cognitive spectra.

Assessing factor structures and measurement invariance can inform about the validity of CES-D and its short forms among older individuals. In addition, investigating the association of genetic risk of depression with different CES-D versions can provide further evidence for the utility of CES-D. Polygenic risk score of major depressive disorder (MDD-PRS), summing up the small effects of multiple single nucleotide polymorphisms (SNPs) with 8.7% SNP-based heritability, explains 1.9% of the genetic liability to major depressive disorder and is related to severity of depression [24]. It has been suggested that genetic factors related to depression vulnerability persist in aging [25], but this has been scarcely examined [26]. Inflammatory, vascular and neurodegenerative markers of accelerated biological aging may contribute to depression in late life [25], but these etiological factors may not be represented in MDD-PRS that is derived mainly from samples of middle-aged participants [24]. Thus, it needs to be examined whether the MDD-PRS is associated with CES-D in late old age.

The aim of this study was to compare factor structures of different versions of the CES-D and to test the measurement invariance regarding old-age groups and cognitive status by using a population-based sample including those in their 70’s and those who are 90 years or older. We also investigated the association of MDD-PRS with different CES-D versions. We found that different versions of the CES-D showed measurement invariance for age and cognitive status, promoting the use of this instrument in studies of aging. Furthermore, MDD-PRS showed associations with all CES-D versions and most subscales.

Materials and methods

Participants

We used two sub-studies based on a population-based older Finnish Twin Cohort (FTC) study that includes Finnish twins from same-sex pairs born before 1958 [27]. A total of 1698 individuals (age M = 73.6±1.6, born 1938–1944) were from MEMTWIN II (participation rate of 67%) sub-study with data collected between August 29, 2013 to July 4, 2017 [28]. We split the MEMTWIN II sample by requiring distinct family numbers: the first co-twins of a pair or twins without a co-twin formed the exploratory sample (n = 1034) and the second co-twins from full pairs formed the confirmatory sample (n = 664).

The older subsample were individuals from the FTC who were over 90 years old or reached 90 years of age during the ongoing NONAGINTA study data collection that started June 1, 2020. We included the first 150 participants (reached on January 10, 2023) with a mean age of 91.4 (SD 1.9) years and participation rate of 28%. Of these, 134 had CES-D data (see S1 File for additional information about the samples).

MEMTWIN II study protocol was approved by the Ethics Committee of the Hospital District of Southwest Finland and NONAGINTA study protocol was approved by the Ethics Committee of the Hospital District of Helsinki and Uusimaa. For both sub-studies, written informed consents were obtained and the principles outlined in the declaration of Helsinki were followed.

Measures

Center for Epidemiological Studies Depression scale.

We used the Finnish version of the CES-D 20 and its shorter versions as measures of depressive symptoms [29]. The CES-D is a 20-item self-report instrument assessing depressive symptoms on a four-point scale (0–3), with a maximum score of 60 points (Table 1). Four items are reverse-coded (Good, Hopeful, Happy, Enjoyed). The CES-D has shown good validity and reliability in studies of older adults [12]. As all items of the CES-D were required for factor analyses, only participants with complete CES-D data were used in this study. Additionally, the scale was administered in pen-and-paper format, leading us to omit responses that fell between options (e.g., participant marked their choice between point 1 and 2) as equal number of response options are required for factor analyses.

thumbnail
Table 1. Items of the CES-D 20 and their inclusion in published subscales.

https://doi.org/10.1371/journal.pone.0312194.t001

Modified Telephone Interview for Cognitive Status.

The modified Telephone Interview for Cognitive Status (TICS-m) was used as a measure of cognitive functioning [28], see S1 File for more details.

Polygenic risk score of major depressive disorder.

Technical details of genotyping, imputation and polygenic risk score calculation have been described elsewhere [30]. MDD-PRS was derived using GWAS summary statistics of major depressive disorder [24] and used as a continuous predictor of CES-D scores. The total number of single nucleotide polymorphisms (SNP) used for polygenic score calculation was 1147810. The number of samples used for the original genome-wide association study was 173005.

Statistical analyses

The exploratory sample (n = 1034) from MEMTWIN II was used for exploratory factor analyses (EFA) of the CES-D 20 and the CES-D 8. EFAs were ran first instead of trying to confirm previous factor structures because 1) we used the Finnish version of the CES-D and different language versions have yielded different factor structures for both short and long forms, 2) many previous factor analyses have used analytic approaches which might not be ideal for ordinal indicators, leading to possible biases in factor structures [31], and 3) the split samples still exceeded common recommendations for sample sizes in factor analysis [32]. Fig 1 shows which analyses were conducted in each subsample of MEMTWIN II.

thumbnail
Fig 1. Exploratory and confirmatory factor analyses.

First, exploratory factor analyses were conducted for all 20 items of the CES-D and for the 8-item subset. Then, these structures were tested in confirmatory factor analysis in addition to two theory-driven models by Van de Velde et al. (2010) and Edwards et al. (2010). CFA = confirmatory factor analysis; EFA = exploratory factor analysis.

https://doi.org/10.1371/journal.pone.0312194.g001

Confirmatory factor analyses (CFA) were run in the confirmatory MEMTWIN II sample (n = 664) based on the factor structures identified in EFAs for both CES-D 8 and CES-D 20. We were also interested to see whether we could confirm two structures proposed in the earlier literature. First, we considered the 15-item unidimensional structure reported in a sample of young individuals from the US [10]. In their study, the authors suggested that the positively worded four items in the CES-D 20 are not necessary and the items Unfriendly and Dislike were interchangeable (with the item Unfriendly dropped from the 15-item version). Furthermore, the researchers used polychoric correlations and an estimation method suitable for ordinal data, lending their results suitable for replication. We considered the 15-item structure worth investigating as it is more comprehensive than the CES-D 8, but unidimensional unlike the CES-D 20.

Next, owing to the challenges posed by the positively worded items also in the CES-D 8 items, we considered a unidimensional CES-D 8 structure with added covariances between Happy & Enjoy, which has resulted in good fits in large studies [11, 16]. The CES-D 20 and the subscales under investigation in this study are shown in Table 1. Additional information about analyses examining suitability of the data for factor analyses, model estimation and reporting are in S1 File.

Measurement invariance testing was performed for age group comparisons (70-year-olds from the MEMTWIN II vs 90-year-olds from the NONAGINTA) and cognitive status (impaired versus unimpaired in MEMTWIN II). First, we estimated configural invariance (M1), where only the number and pattern of factor loadings are equal. Next, invariance for thresholds was investigated (M2), followed by invariance for thresholds and factor loadings (M3, metric invariance) and thresholds, factor loadings and intercepts (M4, scalar invariance). These four invariance models were tested with the CES-D 20 and CES-D 8 models derived from our exploratory analyses and the two theory-driven models–CES-D 15 [10] and CES-D 8 [11] with correlated residuals–mentioned previously (structures from Fig 1 confirmatory analyses). For comparison of latent means across groups, scalar invariance should be met [33]. Measurement invariance testing was done following Wu & Estabrook [34] using semTools for generating appropriate lavaan syntax for model specification. Theta parameterization was used in the models.

Differences between age groups and in cognitively impaired versus cognitively healthy were compared with regression analyses. MDD-PRS associations with CES-D versions were studied with linear regression analysis adjusting for age, sex and population stratification (10 genetic principal components). Clustered standard errors were used to account for non-independence of observations (twins within families, which were the primary sampling units).

All analyses were conducted with R version 4.2.2 [35]. R packages psych [36], lavaan [37], and semTools [38] were used for factor analyses and measurement invariance testing. Polychoric correlations were used in model estimation and scaled fit indices are reported. Regression analyses with clustered standard errors were performed using estimatr [39], and adjustments for multiple comparisons in these analyses were made following Holm [40]. All R code is available at https://osf.io/kxzta/.

Results

Sample characteristics

Demographic and clinical characteristics of the samples are described in Table 2. A total of 260 (18.7%) MEMTWIN II participants were classified as cognitively impaired based on TICS-m cut-offs. The missing data distributions of CES-D items did not differ between cognitively unimpaired and cognitively impaired groups (χ2 [19] = 7.43, p = 0.99) or between the MEMTWIN II and NONAGINTA cohorts (χ2 [19] = 6.43, p = 0.99). Additional results concerning response style and missing values are presented in S2 File.

thumbnail
Table 2. Clinical and demographic characteristics of the study sample.

https://doi.org/10.1371/journal.pone.0312194.t002

Exploratory factor analyses

A total of 1034 individuals (first member of a twin pair and all twins without a co-twin in the study) comprised the exploratory sample and these data were used for EFA. Prior to EFA, Kaiser-Meyer-Olkin coefficient of sampling adequacy was examined and found adequate for the 20 items and 8 items (.93 and .85, respectively). Bartlett’s test of sphericity was found highly significant for both item pools (p < .01×10−70). Polychoric parallel analyses were ran using the psych package and 50 iterations; the results suggested four to five factors for the full scale and three factors for the 8-item subscale.

A four-factor solution in the EFA for the CES-D 20 (factors explaining 11–20% of the common variance) was considered the best, although this solution still had issues with fit (Root Mean Square Error of Approximation [RMSEA] = 0.084 [90% CI 0.079–0.089], Tucker-Lewis Index [TLI] = 0.901). The 20 items loaded largely onto the same factors as in the original Radloff [9] study with loadings ranging .45–.75. The only exceptions were Bothered and Sleep, which now loaded onto a depressed affect factor rather than to a factor with Appetite, Effort and Going. Of note, Dislike and Unfriendly loaded on their own factor, and all positively worded items and Failure loaded on their own factor. A five-factor solution yielded a negligible improvement in fit, and the fifth factor explained under 10% of the variance. Weighing these findings against the increase in model complexity and deviation from models in the prior literature led us to consider the more parsimonious four-factor model as the better option.

For the 8-item model, 3 factors were extracted with a better fit (RMSEA = .068 [90% CI .049-.089], TLI = .97) and loadings .30–.67, the lowest loading being on the item Sleep. The three-factor CES-D 8 model had a similar structure to the CES-D 20 but with fewer items and neither item of the Unfriendly/Dislike factor. A pair of two-item factors emerged (Enjoyed & Happy and Effort & Going), explaining 20% and 19% of the common variance, respectively, with the final four-item factor (Sad, Depressed, Lonely, Sleep) explaining 30% of the variance. Items with positive content loaded onto their own factors in both CES-D 20 and CES-D 8.

Confirmatory factor analyses

In the CFA, the two EFA-driven models and two theory-driven models generally fit well to the data (Table 3). A slight exception was found for the CES-D 8 unidimensional model, which had RMSEA = 0.078 (90% CI 0.063–0.093) and non-overlapping RMSEA confidence intervals compared to the other three models, but comparative fit indices and SRMR were also within commonly accepted boundaries in this model. The lowest reliability values were found for the 2-item Dislike & Unfriendly subscale (α = .63 and ωT = .67); as expected with factor models with varying loadings, ωT provided higher reliability estimates than α. Factor loadings varied .55-.96 in CES-D 20, .53-.90 in CES-D 8 three-factor model, .53-.83 in CES-D 15 and .53-.81 in the unidimensional CES-D 8 (Table 4). Of note, the item Sleep had the lowest factor loading in all models. The correlations between factors were ≥ 0.55 in the multi-factor models (S1 Table).

thumbnail
Table 3. Summary of the confirmatory factor analysis results.

https://doi.org/10.1371/journal.pone.0312194.t003

thumbnail
Table 4. Summary of factor loadings and reliabilities in confirmatory factor analyses.

https://doi.org/10.1371/journal.pone.0312194.t004

Measurement invariance for age and cognition

Measurement invariance for age was analyzed comparing 70- to 90-year-olds using MEMTWIN II and NONAGINTA samples, respectively. Models with configural invariance, threshold invariance, threshold invariance with loading invariance, and threshold invariance with loading invariance and intercept invariance are presented in Table 5. The 1-factor CES-D 8 model showed initially only a mediocre fit that improved in more constrained models. All scalar invariance models showed a decrease in fit compared to the metric invariance models as assessed by the χ2-test, and all but the CES-D 8 3-factor model also showed a decrease in fit when comparing metric models to models with threshold invariance. However, considering the stability (or even improvement) of the other fit indices, we interpret the χ2-test results insufficient to reject invariance in more constrained models. These results indicated scalar measurement invariance for age. The mean differences between older and younger old adults were significant for the CES-D 20 (β = 2.67 [95% CI 1.97–3.37), the CES-D 15 (β = 1.87 [95% CI 1.27–2.48) and the CES-D 8 (β = 1.15, [95% CI .82–1.47] with over 90-year-olds having markedly more depressive symptoms (Table 2).

thumbnail
Table 5. Summary of measurement invariance tests for age and cognitive groups.

https://doi.org/10.1371/journal.pone.0312194.t005

After excluding 48 individuals with incomplete data for calculating TICS-m total score, measurement invariance for cognition was analyzed in the remaining MEMTWIN II participants (unimpaired cognition n = 1390, impaired cognition n = 260; Table 5). In measurement invariance testing for cognition, all models fit well with the added equality constraints except for the unidimensional CES-D 8. For the 1-factor CES-D 8 model, RMSEA values in the configural model indicate a mediocre fit; the more constrained models show better RMSEA values, but the confidence interval in the scalar model still exceeds the cut-off for good fit. In all, scalar measurement invariance for cognition was supported by these analyses. No significant differences were found in CES-D 20, CES-D 15 or CES-D 8 total scores between cognitively unimpaired and cognitively impaired individuals (S2 Table).

Associations with genetic risk for depression

Finally, we looked at the associations of genetic risk (MDD-PRS) to CES-D scores (Table 6). MDD-PRS was similarly associated with all three CES-D total scores (CES-D 20: β = 0.084 [95% CI 0.029–0.138]; CES-D 15: β = 0.071 [95% CI 0.017–0.126]; CES-D 8: β = 0.082 [95% CI 0.027–0.137]). The analyses were adjusted for age, sex and 10 first genetic principal components for population stratification, the clustering of individuals as twin siblings was taken into account and the analyses were corrected for multiple comparisons [40]. The associations between MDD-PRS and CES-D 20 and CES-D 8 subscale scores varied from β = .075 to β = .081 in significant associations. In contrast, MDD-PRS did not have statistically significant associations to CES-D 20 subscale 2 (Concentrating, Appetite, Effort, Talk, Going), subscale 4 (Unfriendly, Dislike) or CES-D 8 subscale 3 (Effort, Going). These associations were also smaller in magnitude (β = .029 to β = .045). In MDD-PRS decile plots (Fig 2), the differences between different deciles on CES-D scores were quite low, but CES-D scores were the highest in the top decile for MDD-PRS.

thumbnail
Fig 2. Box-and-whiskers plot of CES-D scores by MDD-PRS deciles.

(A) CES-D 20. (B) CES-D 15. (C) CES-D 8. Boxes show lower quartile, median and upper quartiles, whiskers indicate a maximum of 1.5 times interquartile range from the corresponding upper and lower quartiles. CES-D = Center for Epidemiological Studies depression scale; MDD-PRS = polygenic risk score of major depressive disorder.

https://doi.org/10.1371/journal.pone.0312194.g002

thumbnail
Table 6. Associations between polygenic risk scores for depression and CES-D scores (n = 1529).

https://doi.org/10.1371/journal.pone.0312194.t006

Discussion

We found psychometric support for the shorter and longer forms of the CES-D in older adults. Our analyses suggest that the CES-D measures the same constructs in the presence or absence of cognitive impairment and in over 90-year-olds and those in their 70’s. Over 90-year-olds had substantially more depressive symptoms than individuals in early old age and higher polygenic risk score associated with higher CES-D scores.

The CES-D 20 has some limitations as some items may load onto a factor based on shared variance unrelated to depression (e.g. wording of the items, overlap in item content in Unfriendly and Dislike) [19]. We found the 15-item unidimensional structure [10] without the positive items and the redundant Unfriendly item to have a good fit to the data and the scale showed measurement invariance for cognition and age. Using the 15-item version as a unidimensional measure of depressive symptoms when the study participation is not overly burdensome seems reasonable.

The 3-factor CES-D model showed a better fit than the unidimensional model for 8-item version [11], comparable fit to the CES-D 15 and the correlations among the three factors were ≥ 0.55. Both CES-D 8 models showed measurement invariance for cognition and age and the total score of CES-D 8 showed similar magnitude of association with MDD-PRS as the full-length CES-D. Overall, our results support using both, the 3-factor and the unidimensional CES-D 8 in older adults, particularly when the aim is to minimize the burden of administering a depression rating scale. Thus, for studies of older individuals, the CES-D 8 seems adequate for measuring depressive symptoms even in late old age and in those with cognitive impairment.

Our results showing measurement invariance for age, together with the previously demonstrated temporal measurement invariance [15], indicate that the CES-D can be used to compare depressive symptoms across age strata in older adults. Our study also adds to the literature on the self-assessment of depressive symptoms in individuals with cognitive impairment. Due to anosognosia and cognitive impairment, informant- or clinician-ratings of depression are used with individuals with cognitive decline. However, informant-rated symptoms correlate poorly with those of self-report [41]. Importantly, the CES-D items seem invariant for cognitive status [21] and the diagnostic accuracy seems to be similar in individuals with mild dementia and cognitively unimpaired older adults [20]. A recent study also suggests that the discordance between self-reported and informant-reported depressive symptoms is not related to cognitive impairment [42]. Using self-reported depressive symptoms may provide data that could not be accessed with informant ratings, and it appears that these data are as meaningful as they are in older adults without cognitive impairment.

In our study, over 90-year-olds had almost two-fold scores in CES-D compared to those in their 70’s: this does not necessarily mean that depressive symptoms increase from old age to very old age, but can reflect cohort differences whereby younger cohorts experience less depressive symptoms than older cohorts at the same age over 20 years ago [43]. The data collection in NONAGINTA coincided with the COVID-19 pandemic, possibly influencing the CES-D scores in 90-year-olds.

Finally, the associations of MDD-PRS with CES-D scores were of similar magnitude in different versions. An earlier study with two-factor CES-D 8 found associations between MDD-PRS and both factors [26]. In contrast, we found that the factor comprising items “Going” and “Effort” had no association with MDD-PRS. This lack of association may relate to these items capturing aspects of apathy and executive dysfunction that are not specific to depression.

The CFA methods used here are appropriate for ordinal data, but as a limitation we note that they may yield higher values than maximum likelihood-based approaches. Improvement of fit indexes with added equality constraints for age and cognition is somewhat surprising, but has been reported in previous depression scale measurement invariance literature using ordinal data [44]. This observation may relate to insensitivity of some fit indexes, especially with high degrees of freedom [45]. Furthermore, individuals with severe dementia likely did not participate in our study, limiting the range of conclusions about the invariance for cognitive status. Previous research indicates that cognitive decline begins to affect self-reported depressive symptoms at the moderate stage of dementia [46]. It is also possible that anosognosia could have affected the scores of individuals with cognitive impairment. Therefore, not adjusting for anosognosia can be considered a limitation of the study.

The strength of our study was using population-based samples and comparisons between cohorts from the same underlying population including representants also from the oldest-old age group, comprising the fasted growing age group in the world [47]. Furthermore, our sample size allowed us to examine both data-driven and theory-driven factor structures of the CES-D. The measurement invariance analyses in this study use the relatively recent advances in conducting these analyses using ordinal data [34]. We also used a validated measure of cognitive performance with an education adjustment to reduce the number of false positive cases of cognitive impairment [28].

Conclusion

In conclusion, the CES-D and its short forms can be used in older adults with cognitive impairment or in 90-year-olds. The polygenic risk for depression is captured by the CES-D in older adults.

Supporting information

S1 File. Additional information about participants and methods.

https://doi.org/10.1371/journal.pone.0312194.s001

(DOCX)

S2 File. Missing values and response style.

https://doi.org/10.1371/journal.pone.0312194.s002

(DOCX)

S1 Table. Correlations among factors in multi-factor models.

https://doi.org/10.1371/journal.pone.0312194.s003

(DOCX)

S2 Table. CES-D score differences between those with cognitive impairment and those with normal cognition in the MEMTWIN II cohort.

https://doi.org/10.1371/journal.pone.0312194.s004

(DOCX)

Acknowledgments

We thank the participants of the older Finnish Twin Cohort study. We thank Noora Lindgren, Kristiina Saanakorpi and Ulla Kulmala-Gråhn for data collection in the MEMTWIN II study and Mia Urjansson for data collection in the NONAGINTA study.

References

  1. 1. Zivin K, Wharton T, Rostant O. The Economic, Public Health, and Caregiver Burden of Late-life Depression. Psychiatr Clin North Am. 2013 Dec;36(4):631–49. pmid:24229661
  2. 2. Zenebe Y, Akele B, W/Selassie M, Necho M. Prevalence and determinants of depression among old age: a systematic review and meta-analysis. Ann Gen Psychiatry. 2021 Dec;20(1):55. pmid:34922595
  3. 3. Missinne S, Vandeviver C, Van de Velde S, Bracke P. Measurement equivalence of the CES-D 8 depression-scale among the ageing population in eleven European countries. Soc Sci Res. 2014 Jul;46:38–47. pmid:24767588
  4. 4. Blazer DG. Depression in late life: review and commentary. J Gerontol Med Sci. 2003;58A(3):249–65. pmid:12634292
  5. 5. Gana K, Boudouda NE, Salanova T, Broc G. What Do EURO-D Scores Capture? Disentangling Trait and State Variances in Depression Symptoms Across the Adult Life Span in Nine European Nations. Assessment. 2023 Feb 25;107319112311538. pmid:36840515
  6. 6. World Health Organization. Mental health of older adults [Internet]. 2017. Available from: https://www.who.int/news-room/fact-sheets/detail/mental-health-of-older-adults
  7. 7. Leung DKY, Chan WC, Spector A, Wong GHY. Prevalence of depression, anxiety, and apathy symptoms across dementia stages: A systematic review and meta‐analysis. Int J Geriatr Psychiatry. 2021 Sep;36(9):1330–44. pmid:33905138
  8. 8. Apostolova LG, Di LJ, Duffy EL, Brook J, Elashoff D, Tseng CH, et al. Risk Factors for Behavioral Abnormalities in Mild Cognitive Impairment and Mild Alzheimer’s Disease. Dement Geriatr Cogn Disord. 2014;37(5–6):315–26. pmid:24481207
  9. 9. Radloff LS. The CES-D Scale: A Self-Report Depression Scale for Research in the General Population. Appl Psychol Meas. 1977 Jun;1(3):385–401.
  10. 10. Edwards MC, Cheavens JS, Heiy JE, Cukrowicz KC. A reexamination of the factor structure of the Center for Epidemiologic Studies Depression Scale: Is a one-factor model plausible? Psychol Assess. 2010;22(3):711–5. pmid:20822284
  11. 11. Van de Velde S, Bracke P, Levecque K, Meuleman B. Gender differences in depression in 25 European countries after eliminating measurement bias in the CES-D 8. Soc Sci Res. 2010 May;39(3):396–404.
  12. 12. Michela B, Cataldi F, Carlucci L, Padulo C, Fairfield B. Assessment of late-life depression via self-report measures: a review. Clin Interv Aging. 2018 Oct;Volume 13:2021–44. pmid:30410319
  13. 13. Briggs R, Carey D, O’Halloran AM, Kenny RA, Kennelly SP. Validation of the 8-item Centre for Epidemiological Studies Depression Scale in a cohort of community-dwelling older people: data from The Irish Longitudinal Study on Ageing (TILDA). Eur Geriatr Med. 2018 Feb;9(1):121–6. pmid:34654281
  14. 14. Shafer AB. Meta-analysis of the factor structures of four depression questionnaires: Beck, CES-D, Hamilton, and Zung. J Clin Psychol. 2006 Jan;62(1):123–46. pmid:16287149
  15. 15. Schlechter P, Ford TJ, Neufeld SAS. The Eight-Item Center for Epidemiological Studies Depression Scale in the English Longitudinal Study of Aging: Longitudinal and Gender Invariance, Sum Score Models, and External Associations. Assessment. 2022 Dec 13;107319112211389. pmid:36511122
  16. 16. Karim J, Weisz R, Bibi Z, ur Rehman S. Validation of the Eight-Item Center for Epidemiologic Studies Depression Scale (CES-D) Among Older Adults. Curr Psychol. 2015 Dec;34(4):681–92.
  17. 17. O’Halloran AM, Kenny RA, King-Kallimanis BL. The latent factors of depression from the short forms of the CES-D are consistent, reliable and valid in community-living older adults. Eur Geriatr Med. 2014 Apr;5(2):97–102.
  18. 18. Meredith W. Measurement invariance, factor analysis and factorial invariance. Psychometrika. 1993 Dec;58(4):525–43.
  19. 19. Hertzog C, Alstine JV, Usala PD, Hultsch DF, Dixon R. Measurement Properties of the Center for Epidemiological Studies Depression Scale (CES-D) in Older Populations. Psychol Assess. 1990;2(1):64–72.
  20. 20. Papassotiropoulos A, Heun R, Maier W. The impact of dementia on the detection of depression in elderly subjects from the general population. Psychol Med. 1999 Jan;29(1):113–20. pmid:10077299
  21. 21. Fieo R, Mukherjee S, Dmitrieva NO, Fyffe DC, Gross AL, Sanders ER, et al. Differential item functioning due to cognitive status does not impact depressive symptom measures in four heterogeneous samples of older adults: DIF due to cognitive status in depression scales. Int J Geriatr Psychiatry. 2015 Sep;30(9):911–8.
  22. 22. World Health Organization. World Report on Ageing and Health. Geneva: World Health Organization; 2015. 246 p.
  23. 23. Alzheimer’s Association. 2023 Alzheimer’s disease facts and figures. Alzheimers Dement. 2023 Apr;19(4):1598–695. pmid:36918389
  24. 24. Wray NR, Ripke S, Mattheisen M, Trzaskowski M, Byrne EM, Abdellaoui A, et al. Genome-wide association analyses identify 44 risk variants and refine the genetic architecture of major depression. Nat Genet. 2018 May;50(5):668–81. pmid:29700475
  25. 25. Szymkowicz SM, Gerlach AR, Homiack D, Taylor WD. Biological factors influencing depression in later life: role of aging processes and treatment implications. Transl Psychiatry. 2023 May 10;13(1):160. pmid:37160884
  26. 26. Frank P, Ajnakina O, Steptoe A, Cadar D. Genetic susceptibility, inflammation and specific types of depressive symptoms: evidence from the English Longitudinal Study of Ageing. Transl Psychiatry. 2020 May 12;10(1):140. pmid:32398645
  27. 27. Kaprio J, Bollepalli S, Buchwald J, Iso-Markku P, Korhonen T, Kovanen V, et al. The Older Finnish Twin Cohort—45 Years of Follow-up. Twin Res Hum Genet. 2019 Aug;22(4):240–54. pmid:31462340
  28. 28. Lindgren N, Rinne JO, Palviainen T, Kaprio J, Vuoksimaa E. Prevalence and correlates of dementia and mild cognitive impairment classified with different versions of the modified Telephone Interview for Cognitive Status (TICS‐m). Int J Geriatr Psychiatry. 2019 Dec;34(12):1883–91. pmid:31469194
  29. 29. Autio T. CES-D, The Center for Epidemiologic Studies Depression Scale. TOIMIA-mittarit. [Internet]. 2020 [cited 2023 Aug 28]. Available from: https://www.terveysportti.fi/apps/dtk/tmi/article/tmm00085/search/CESD
  30. 30. Kujala UM, Palviainen T, Pesonen P, Waller K, Sillanpää E, Niemelä M, et al. Polygenic Risk Scores and Physical Activity. Med Sci Sports Exerc. 2020 Jul;52(7):1518–24. pmid:32049886
  31. 31. Baglin J. Improving Your Exploratory Factor Analysis for Ordinal Data: A Demonstration Using FACTOR. Pract Assess Res Eval. 2014;19:1–15.
  32. 32. Fabrigar LR, Wegener DT, MacCallum RC, Strahan EJ. Evaluating the Use of Exploratory Factor Analysis in Psychological Research. Psychol Methods. 1999;4(3):272–99.
  33. 33. Putnick DL, Bornstein MH. Measurement invariance conventions and reporting: The state of the art and future directions for psychological research. Dev Rev. 2016 Sep;41:71–90. pmid:27942093
  34. 34. Wu H, Estabrook R. Identification of Confirmatory Factor Analysis Models of Different Levels of Invariance for Ordered Categorical Outcomes. Psychometrika. 2016 Dec;81(4):1014–45. pmid:27402166
  35. 35. R Core Team. R: A Language and Environment for Statistical Computing [Internet]. Vienna, Austria: R Foundation for Statistical Computing; 2022. Available from: https://www.R-project.org/
  36. 36. Revelle W. psych: Procedures for Personality and Psychological Research [Internet]. Northwestern University, Evanston, Illinois, USA; 2022. Available from: https://CRAN.R-project.org/package=psych
  37. 37. Rosseel Y. lavaan: An R Package for Structural Equation Modeling. J Stat Softw [Internet]. 2012 [cited 2023 Feb 14];48(2). Available from: http://www.jstatsoft.org/v48/i02/
  38. 38. Jorgensen TD, Pornprasertmanit S, Schoemann AM, Rosseel Y. semTools: Useful tools for structural equation modeling. [Internet]. 2022. Available from: https://CRAN.R-project.org/package=semTools
  39. 39. Blair G, Cooper J, Coppock A, Humphreys M, Sonnet L. estimatr: Fast Estimators for Design-Based Inference. [Internet]. 2022. Available from: https://CRAN.R-project.org/package=estimatr
  40. 40. Holm S. A Simple Sequentially Rejective Multiple Test Procedure. Scand J Stat. 1979;6(2):7.
  41. 41. Saari TT, Hallikainen I, Hintsa T, Koivisto AM. Network structures and temporal stability of self- and informant-rated affective symptoms in Alzheimer’s disease. J Affect Disord. 2020 Nov;276:1084–92. pmid:32771860
  42. 42. Joe E, Segal-Gidan F, Cummings JL, Galasko D, Johnson DK, Ross L, et al. Association Between Self and Proxy Reported Depression and Quality of Life in Mild-Moderate Alzheimer’s Disease. Am J Geriatr Psychiatry. 2023 Aug;S1064748123003895. pmid:37827916
  43. 43. Kekäläinen T, Koivunen K, Pynnönen K, Portegijs E, Taina Rantanen. Cohort Differences in Depressive Symptoms and Life Satisfaction in 75- and 80-Year-Olds: A Comparison of Two Cohorts 28 Years Apart. J Aging Health. 2023 Mar 22;089826432311647. pmid:36947727
  44. 44. Stochl J, Fried EI, Fritz J, Croudace TJ, Russo DA, Knight C, et al. On Dimensionality, Measurement Invariance, and Suitability of Sum Scores for the PHQ-9 and the GAD-7. Assessment. 2022 Apr;29(3):355–66. pmid:33269612
  45. 45. Savalei V, Brace JC, Fouladi RT. We need to change how we compute RMSEA for nested model comparisons in structural equation modeling. Psychol Methods [Internet]. 2023 Jan 9 [cited 2023 Feb 6]; Available from: http://doi.apa.org/getdoi.cfm?doi=10.1037/met0000537 pmid:36622720
  46. 46. McGivney SA, Mulvihill M, Taylor B. Validating the GDS Depression Screen in the Nursing Home. J Am Geriatr Soc. 1994 May;42(5):490–2. pmid:8176142
  47. 47. United Nations. Shifting demographics [Internet]. 2023 [cited 2023 Aug 28]. Available from: https://www.un.org/en/un75/shifting-demographics