Depression is experienced as a persistent low mood or anhedonia accompanied by behavioural and cognitive disturbances which impair day-to-day functioning. However, the diagnosis is largely based on self-reported symptoms, and there are no neurobiological markers to guide the choice of treatment. In the present study, we examined the prognostic and diagnostic potential of the structural neural correlates of depression.
Methodology and Principal Findings
Subjects were 37 patients with major depressive disorder (mean age 43.2 years), medication-free, in an acute depressive episode, and 37 healthy individuals. Following the MRI scan, 30 patients underwent treatment with the antidepressant medication fluoxetine or cognitive behavioural therapy (CBT). Of the patients who subsequently achieved clinical remission with antidepressant medication, the whole brain structural neuroanatomy predicted 88.9% of the clinical response, prior to the initiation of treatment (88.9% patients in clinical remission (sensitivity) and 88.9% patients with residual symptoms (specificity), p = 0.01). Accuracy of the structural neuroanatomy as a diagnostic marker though was 67.6% (64.9% patients (sensitivity) and 70.3% healthy individuals (specificity), p = 0.027).
Conclusions and Significance
The structural neuroanatomy of depression shows high predictive potential for clinical response to antidepressant medication, while its diagnostic potential is more limited. The present findings provide initial steps towards the development of neurobiological prognostic markers for depression.
Citation: Costafreda SG, Chu C, Ashburner J, Fu CHY (2009) Prognostic and Diagnostic Potential of the Structural Neuroanatomy of Depression. PLoS ONE 4(7): e6353. https://doi.org/10.1371/journal.pone.0006353
Editor: Katharina Domschke, University of Muenster, Germany
Received: May 6, 2009; Accepted: June 22, 2009; Published: July 27, 2009
Copyright: © 2009 Costafreda et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: SC was supported by a Wellcome Trust Value in People Award. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
While neurodegenerative disorders such as Alzheimer’s disease have diagnostic structural and functional brain abnormalities, the diagnosis of other psychiatric disorders is based entirely on clinical signs and symptoms. Investigation of objective, neurobiological markers would support diagnostic systems and treatment decisions. The potential of a biomarker though depends on its predictive power at the level of the individual.
We have previously found that the functional neuroimaging correlates of core affective processing have significant potential as a diagnostic marker for depression. The functional neuroanatomy of implicit processing of sad facial expressions showed an accuracy of 86% in identifying individuals in an acute depressive episode, while verbal working memory had a more limited but still significant diagnostic accuracy. Sad facial expressions are socially relevant, emotional cues which engage a distributed network of regions that show an abnormal response during an acute depressive episode. Moreover, the neural pattern to sad faces also demonstrated high prognostic potential for the prediction of clinical response to cognitive behavioural therapy (CBT).
In the present study, we investigated the structural neuroanatomy of depression as a prognostic and diagnostic marker for depression. As a marker of clinical response in depression, we previously found that regional volumes in the anterior cingulate, temporal cortices and basal ganglia were correlated with the rate of clinical improvement. That analysis though was limited to the original sample, and the predictive response in novel data was not explicitly examined. In schizophrenia, Davatzikos et al. reported a diagnostic accuracy of 81% from whole brain structural neuroimaging features. However, global cerebral volume in major depression is comparable to healthy individuals, in contrast to schizophrenia. Instead, structural deficits in depression appear to be more localised within a distributed pattern, which includes the hippocampus, subgenual anterior cingulate, orbitofrontal and middle frontal cortices, and basal ganglia [reviewed in: 10].
We expected the structural correlates of depression to show significant predictive potential for treatment with antidepressant medication, implicating regions which would include the anterior cingulate cortex, while the predictive potential for treatment with CBT was less clear. As a potential diagnostic marker, we expected a lower accuracy than observed in schizophrenia, which would encompass a distributed network including the anterior cingulate and prefrontal regions, hippocampus, and basal ganglia.
The structural neuroanatomy of acutely depressed patients, before the initiation of treatment, correctly predicted clinical remission to treatment with the antidepressant medication fluoxetine with an accuracy of 88.9% (88.9% of patients in clinical remission (sensitivity) and 88.9% patients with residual symptoms (specificity), p = 0.01). Clinical remission was predicted by greater grey matter density in the right rostral anterior cingulate cortex (BA 32), left posterior cingulate cortex (BA 31), left middle frontal gyrus (BA 6), and right occipital cortex (BA 19) (Figure 1). Regions which predicted residual symptoms were the orbitofrontal cortices bilaterally (BA 11), right superior frontal cortex (BA 10) and left hippocampus. The structural neuroanatomy did not show a significant prediction of clinical remission to CBT.
In the top panel, sagittal views are presented which show medial regions of decreased grey matter density which contributed to the diagnosis of depression (coloured in green) in the right subgenual anterior cingulate (BA 25) and precuneus (BA 7). No regions of increased grey matter in patients with depression relative to healthy individuals contributed to the diagnosis. In the lower panel, increased grey matter density in the anterior and posterior cingulate cortices (red) increased the probability of clinical remission to treatment with the antidepressant medication fluoxetine. Greater density in the orbitofrontal cortex (blue) increased the odds of residual symptoms of depression following antidepressant medication. Regions depicted were selected as relevant to the classification of patients as achieving remission or non-remission clinical status following fluoxetine treatment by every cross-validated support vector machine classification model. Sagittal views are presented in MNI space at z = −4, 10, 12 and 14.
As a diagnostic marker, the accuracy was 67.6% from whole brain structural neuroanatomy (64.9% patients with depression (sensitivity) and 70.3% healthy individuals (specificity), p = 0.027). Decreased grey matter density in the following regions showed the highest contribution to the diagnosis of depression: right subgenual anterior cingulate (BA 25), medial frontal gyrus (BA 11), superior temporal cortex (BA 22), precuneus (BA 7), hippocampus and thalamus, as well as in the left inferior parietal cortex (BA 40), occipital (BA 19) cortex, and cerebellum. No regions of increased grey matter in depressed patients relative to healthy individuals contributed to the diagnosis.
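To make the reported percentages concrete, the following minimal Python sketch computes sensitivity, specificity and overall accuracy from confusion-matrix counts. The counts themselves are not stated in the text: 24 of 37 patients and 26 of 37 healthy individuals correctly classified are assumptions inferred from the reported 64.9% and 70.3%, and the function name `classification_metrics` is purely illustrative.

```python
def classification_metrics(true_pos, false_neg, true_neg, false_pos):
    """Return (sensitivity, specificity, accuracy) as fractions."""
    sensitivity = true_pos / (true_pos + false_neg)   # patients correctly identified
    specificity = true_neg / (true_neg + false_pos)   # controls correctly identified
    total = true_pos + false_neg + true_neg + false_pos
    accuracy = (true_pos + true_neg) / total
    return sensitivity, specificity, accuracy


# Assumed counts (inferred from the reported percentages, not given in the text):
# 24 of 37 patients and 26 of 37 healthy individuals correctly classified.
sens, spec, acc = classification_metrics(true_pos=24, false_neg=13,
                                         true_neg=26, false_pos=11)
print(f"sensitivity {sens:.1%}, specificity {spec:.1%}, accuracy {acc:.1%}")
```

Under these assumed counts the three values round to 64.9%, 70.3% and 67.6%. Applying the same function to 8 of 9 remitters and 8 of 9 non-remitters (again an inference) gives 88.9% for each measure in the fluoxetine group.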
Regions which contributed to the prediction of treatment response were distinct from those relevant for diagnosis as there was no overlap anywhere in the brain between their respective brain patterns.
Whole brain structural neural correlates of depression identified 89% of patients who subsequently had a full clinical response to the antidepressant medication fluoxetine. The structural neuroanatomy of depression has significant potential as a prognostic marker of treatment response with antidepressant medication. In contrast, the structural neuroanatomy showed limited potential as a diagnostic measure for depression.
The findings support functional and structural neuroimaging studies implicating the anterior cingulate cortex as a marker of clinical response to antidepressant medication, but also identified a more widespread network which included the posterior cingulate. The anterior and posterior cingulate cortices are strongly interconnected, and their functions are complementary with the anterior cingulate subserving executive functions linked to emotional and autonomic responses while the posterior cingulate has a more evaluative role that is postulated to direct activity in the anterior cingulate. The data also point to a more widespread network of regions which are predictive of clinical response, including the hippocampus which may reflect stress-induced neuroplastic changes. In particular, the present study suggests that grey matter density in a set of regions predicts how well an individual patient will respond to antidepressant treatment. In contrast, whole brain functional responses to sad faces showed high predictive potential to CBT treatment.
Regions important for individual diagnosis have been featured within the cortico-striato-pallido-thalamic loops, which include the medial and orbital prefrontal cortices, amygdala, hippocampus, medial thalamus, and striatum, and cortico-cortical circuits from the medial prefrontal cortex connecting the parahippocampus, posterior cingulate and superior temporal cortices. In depression, volumetric and cellular deficits have most consistently been identified in the hippocampus, but also in the anterior and posterior cingulate, orbitofrontal, lateral temporal and occipital cortices, and amygdala. However, the structural neuroanatomy only showed limited potential for diagnosis, suggesting that structural abnormalities in depression are slight in contrast to other psychiatric disorders, such as schizophrenia. Instead, functional brain activity to sad facial expressions may be a more accurate diagnostic marker of depression.
A limitation of the present study was the small sample sizes in the prediction of clinical response, which may not have provided sufficient power to find an effect for CBT. Although such negative findings should be treated with caution, one interpretation would be that structural brain regions predictive of response to CBT, should they exist, may be more subtle than those predictive of fluoxetine response. Yet, as the sample for the CBT treatment group was sufficient to detect a predictive potential of functional MRI, it is possible that if structural effects exist, they might be more subtle than functional ones. Another limitation was that the pharmacological treatment was a single medication from the class of serotonergic reuptake inhibitors. The predictive potential for other antidepressant medications and from other classes requires further investigation. Moreover, the specificity of the predictive marker is somewhat equivocal as there was no placebo treatment arm. All patients in the present study were medication-free and suffering from an acute depressive episode at the time of the MRI scan. The generalisability of our findings to patients with more chronic forms of depression and the effects of medication from different classes, such as noradrenergic or combined noradrenergic and serotonergic mechanisms, require further investigation.
In summary, the structural neural correlates of depression show high prognostic potential for treatment with the antidepressant medication fluoxetine. However, the diagnostic accuracy with structural neuroanatomy was more limited, while greater diagnostic potential may be found with functional neural correlates. The present findings may provide an initial step towards developing personalised clinical treatment options.
Materials and Methods
All participants provided written informed consent in accordance with the guidelines of the Institute of Psychiatry and South London and Maudsley (SLAM) NHS Trust Ethics (Research) Committee. Patients were 37 right-handed individuals (mean age 41.9 years, SD 8.9; 28 women) meeting Diagnostic and Statistical Manual of Mental Disorders-IV (DSM-IV) criteria for major depression by Structured Clinical Interview for DSM-IV, in an acute episode of moderate severity, having a minimum score of 18 on the 17-item Hamilton Rating Scale for Depression (HRSD) (mean HRSD 20.7, SD 2.2). Exclusion criteria were a history of neurological trauma resulting in a loss of consciousness, a current neurological disorder, history of diabetes or other medical disorder, other Axis I disorder including an anxiety disorder, history of substance abuse within 2 months of study participation, or an Axis II disorder. All patients were free of psychotropic medication for a minimum of 4 weeks at recruitment (8 weeks for fluoxetine) and patients in the CBT treatment group remained medication-free throughout the treatment. Healthy controls were 37 right-handed individuals matched for age, gender and IQ (mean age 42.2 years, SD 9.0; 28 women) with no history of a psychiatric disorder, neurological disorder or head injury resulting in a loss of consciousness, and an HRSD score ≤ 7 (mean HRSD 0.2, SD 0.6). There was no significant difference in age between groups (paired t-test, t = 0.17, df = 36, p = 0.87) or verbal IQ: patients 109.6, controls 114.1 (paired t-test, t = 1.16, df = 25, p = 0.25). All participants were recruited by advertisement from the local community, and all patients were outpatients.
Some of the patient group had participated in a treatment study of depression with the antidepressant medication fluoxetine 20 mg daily (18 depressed patients) or with CBT (12 depressed patients), in which clinical remission was defined as an HRSD ≤ 7 following 8 weeks of treatment with fluoxetine (9 patients achieved remission, 9 with residual symptoms) or 16 weeks with CBT (6 remission, 6 residual symptoms) (Table 1). The remaining patients only participated in a single MRI scan and declined the longitudinal treatment study.
Structural magnetic resonance imaging (MRI) data were acquired as 3D spoiled gradient recalled (SPGR) T1-weighted scans on a 1.5 T GE NV/i Signa system (General Electric, Milwaukee, Wisconsin) at the Maudsley Hospital, SLAM NHS Trust, London. The acquisition parameters were: TE = 8, TR = 24 ms, flip angle = 30°, field of view = 25 cm×25 cm, slice thickness = 1.3 mm, number of slices = 124, image matrix = 256×256×124.
Voxel-based morphometry (VBM) was applied to the structural MRI images using SPM5 (Wellcome Trust Centre for Neuroimaging, UCL, London, UK). The images were segmented into grey matter (GM), white matter (WM) and cerebrospinal fluid and imported into a rigidly aligned space. GM segments were then iteratively registered by non-linear warping to templates generated from all images in each group by the Diffeomorphic Anatomical Registration Through Exponentiated Lie algebra (DARTEL) toolbox. Modulation with additional scaling by the Jacobian determinants of the nonlinear deformation was applied to the normalized images to preserve the overall amount of each tissue class after normalisation. Images were smoothed with a 6 mm full width at half maximum (FWHM) Gaussian kernel. The outputs of this procedure were the population templates of GM and the deformation parameters of each individual to this template. The deformation parameters were then used to generate the modulated and normalized GM maps, which are in a standard space, and to conserve global GM volumes. The input features for the subsequent analysis were the smoothed modulated normalized GM images.
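As a small aside on the smoothing step, a Gaussian kernel specified by its FWHM has a standard deviation of FWHM / (2·sqrt(2·ln 2)). The Python sketch below converts the 6 mm FWHM quoted above into a standard deviation and builds a normalised one-dimensional kernel. The 1.5 mm voxel size, the three-sigma truncation and both function names are assumptions for illustration only; this is not the SPM5 implementation.

```python
import math


def fwhm_to_sigma(fwhm_mm):
    """Convert a Gaussian full width at half maximum to a standard deviation."""
    return fwhm_mm / (2.0 * math.sqrt(2.0 * math.log(2.0)))


def gaussian_kernel_1d(fwhm_mm, voxel_mm):
    """Normalised 1D Gaussian kernel sampled on the voxel grid (truncated at 3 sigma)."""
    sigma_vox = fwhm_to_sigma(fwhm_mm) / voxel_mm
    radius = int(math.ceil(3 * sigma_vox))
    weights = [math.exp(-0.5 * (i / sigma_vox) ** 2)
               for i in range(-radius, radius + 1)]
    total = sum(weights)
    return [w / total for w in weights]


sigma_mm = fwhm_to_sigma(6.0)                    # 6 mm FWHM, as stated in the text
kernel = gaussian_kernel_1d(6.0, voxel_mm=1.5)   # 1.5 mm voxel size is hypothetical
print(f"sigma = {sigma_mm:.3f} mm, kernel length = {len(kernel)} voxels")
```

Because a Gaussian is separable, a three-dimensional smoothing can be obtained by applying such a one-dimensional kernel along each image axis in turn.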
Given the very high dimensionality of the VBM output (thousands of voxels, or features, for each subject, each one corresponding to one dimension) and the expectation that only a few of these features would be meaningful for prediction, we applied a further feature selection step. We used whole-brain ANOVA filtering to select the areas of maximum group differences between patients and controls. First the t-value and degrees of freedom were estimated for each voxel in the training set. Then the t-map was converted into a p-map, and voxels with p-values above the threshold (uncorrected p = 0.005) were masked out and discarded for classification purposes.
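The filtering step described above can be sketched in a few lines of standard-library Python. The sketch assumes a pooled-variance two-sample t-test per voxel (equivalent to a two-group one-way ANOVA, since F = t²) with a two-sided p-value; the text does not state whether the test was one- or two-sided. The p-value is obtained by numerically integrating the Student t density with Simpson's rule. The function names and the toy data are hypothetical.

```python
import math
from statistics import mean, variance


def t_pdf(x, df):
    """Density of Student's t distribution with df degrees of freedom."""
    log_norm = (math.lgamma((df + 1) / 2) - math.lgamma(df / 2)
                - 0.5 * math.log(df * math.pi))
    return math.exp(log_norm - (df + 1) / 2 * math.log1p(x * x / df))


def two_sided_p(t, df, steps=2000):
    """Two-sided p-value via Simpson's rule on [0, |t|] (steps must be even)."""
    upper = abs(t)
    h = upper / steps
    total = t_pdf(0.0, df) + t_pdf(upper, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    central_mass = 2.0 * total * h / 3.0      # probability mass in [-|t|, |t|]
    return max(0.0, 1.0 - central_mass)


def select_voxels(group_a, group_b, p_threshold=0.005):
    """Return indices of voxels whose group difference has p <= p_threshold.

    group_a, group_b: lists of subjects, each a list of voxel values.
    Must be called on training subjects only, never on the held-out subject.
    """
    n_a, n_b = len(group_a), len(group_b)
    df = n_a + n_b - 2
    kept = []
    for v in range(len(group_a[0])):
        a = [subject[v] for subject in group_a]
        b = [subject[v] for subject in group_b]
        pooled_var = ((n_a - 1) * variance(a) + (n_b - 1) * variance(b)) / df
        std_err = math.sqrt(pooled_var * (1 / n_a + 1 / n_b))
        if std_err == 0:
            continue                           # constant voxel carries no information
        t = (mean(a) - mean(b)) / std_err
        if two_sided_p(t, df) <= p_threshold:
            kept.append(v)
    return kept


# Toy data (hypothetical): voxel 0 differs between groups, voxel 1 does not.
patients = [[1.0, 0.5], [1.1, 0.4], [0.9, 0.6], [1.05, 0.5]]
controls = [[0.1, 0.5], [0.2, 0.6], [0.0, 0.4], [0.15, 0.5]]
print(select_voxels(patients, controls))
```

Restricting the selection to the training subjects matters: if the held-out participant contributed to the t-map, the cross-validated accuracy would be optimistically biased.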
The support vector machine (SVM) is a supervised, multivariate classification method with optimal empirical performance in many classification settings that has previously been utilized in neuroimaging research. Supervised refers to the training step in which the differences between the groups to be classified are learned. With structural MRI data, individual images are treated as points located in a high dimensional space, defined by the GM voxel values of the ANOVA-thresholded maps. A linear decision boundary in this high dimensional space is defined by a hyperplane, and SVM finds the hyperplane that maximizes the margin between two training groups, i.e. the separation between the training subjects that are most ambiguous and difficult to classify. In the SVM classification, the whole multivariate VBM pattern over the set of thresholded areas jointly generated the significant classification results, and the significance of such results therefore refers to the whole pattern.
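The maximum-margin idea described above can be illustrated with a minimal linear classifier trained by full-batch subgradient descent on the regularised hinge loss. This is a teaching sketch, not the LIBSVM solver used in the study: the optimiser, the hyperparameter values (`lam`, `learning_rate`, `epochs`), the function names and the two-dimensional toy data are all assumptions.

```python
def train_linear_svm(X, y, lam=0.01, learning_rate=0.1, epochs=200):
    """Fit weights w and bias b by subgradient descent on
    lam/2 * ||w||^2 + mean(max(0, 1 - y_i * (w . x_i + b))).

    X: list of feature vectors; y: list of labels in {-1, +1}.
    """
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        grad_w = [lam * wj for wj in w]        # gradient of the regulariser
        grad_b = 0.0
        for xi, yi in zip(X, y):
            margin = yi * (sum(wj * xj for wj, xj in zip(w, xi)) + b)
            if margin < 1.0:                   # inside the margin or misclassified
                for j in range(d):
                    grad_w[j] -= yi * xi[j] / n
                grad_b -= yi / n
        w = [wj - learning_rate * gj for wj, gj in zip(w, grad_w)]
        b -= learning_rate * grad_b
    return w, b


def predict(w, b, x):
    """Classify x by the side of the hyperplane w . x + b = 0 it falls on."""
    return 1 if sum(wj * xj for wj, xj in zip(w, x)) + b >= 0 else -1


# Toy, linearly separable data (hypothetical): +1 and -1 stand for the two groups.
X = [[2, 2], [3, 3], [2, 3], [-2, -2], [-3, -3], [-2, -3]]
y = [1, 1, 1, -1, -1, -1]
w, b = train_linear_svm(X, y)
print([predict(w, b, x) for x in X])
```

Only the subjects with a margin below 1 contribute to the update, which mirrors the point made in the text that the boundary is determined by the training subjects that are hardest to classify.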
To examine whether the SVM classifier could be expected to predict diagnosis or prognosis in new patients, we trained the model with leave-one-out cross validation. For each cross validation iteration, the data were partitioned into training and test sets. A different participant from each group was excluded at each iteration, and the SVM classifier was trained on the data from the other subjects, after the ANOVA feature selection step. This classifier was then used to predict the status of the test participant based on their structural scan alone. The process was repeated leaving each participant out once, allowing an accuracy measure to be determined based on the number of test examples correctly classified. Statistical significance of the overall classification accuracy was determined by permutation testing, by repeating the cross-validation procedure 300 times with a different random permutation of the training group labels. The SVM classifier was implemented using freely available software (LIBSVM, http://www.csie.ntu.edu.tw/~cjlin/libsvm).
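A minimal Python sketch of leave-one-out cross-validation with a label-permutation test is given below. Several simplifications are assumed and should not be read as the study's implementation: one participant is left out per iteration (rather than one from each group), a nearest-centroid rule stands in for the SVM, the feature selection step that would be repeated inside every training fold is omitted, and the p-value is computed as (count + 1) / (permutations + 1), a common convention that the text does not specify. All names and the toy data are hypothetical.

```python
import random


def nearest_centroid_predict(train_X, train_y, x):
    """Stand-in classifier: assign x to the class whose mean is closest."""
    best_label, best_dist = None, float("inf")
    for label in sorted(set(train_y)):
        members = [xi for xi, yi in zip(train_X, train_y) if yi == label]
        centroid = [sum(col) / len(members) for col in zip(*members)]
        dist = sum((a - c) ** 2 for a, c in zip(x, centroid))
        if dist < best_dist:
            best_label, best_dist = label, dist
    return best_label


def loocv_accuracy(X, y, predict_fn):
    """Train on all subjects but one, test on the held-out subject, repeat."""
    correct = 0
    for i in range(len(X)):
        train_X = X[:i] + X[i + 1:]
        train_y = y[:i] + y[i + 1:]
        correct += predict_fn(train_X, train_y, X[i]) == y[i]
    return correct / len(X)


def permutation_p_value(X, y, predict_fn, n_permutations=300, seed=0):
    """Repeat the whole cross-validation with shuffled labels to build a null."""
    rng = random.Random(seed)
    observed = loocv_accuracy(X, y, predict_fn)
    at_least_as_good = 0
    for _ in range(n_permutations):
        shuffled = y[:]
        rng.shuffle(shuffled)
        if loocv_accuracy(X, shuffled, predict_fn) >= observed:
            at_least_as_good += 1
    return observed, (at_least_as_good + 1) / (n_permutations + 1)


# Toy data (hypothetical): two well-separated groups of six subjects each.
X = [[3.0, 2.9], [2.8, 3.2], [3.1, 3.0], [2.9, 2.7], [3.3, 3.1], [2.7, 3.0],
     [-3.0, -2.9], [-2.8, -3.2], [-3.1, -3.0], [-2.9, -2.7], [-3.3, -3.1], [-2.7, -3.0]]
y = [1] * 6 + [-1] * 6
accuracy, p_value = permutation_p_value(X, y, nearest_centroid_predict)
print(f"LOOCV accuracy = {accuracy:.3f}, permutation p = {p_value:.4f}")
```

With 300 permutations the smallest p-value this convention can return is 1/301, so the resolution of the test is set by the number of permutations rather than by the data.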
Conceived and designed the experiments: CF. Performed the experiments: SGC CF. Analyzed the data: SGC CC JA CF. Wrote the paper: SGC CF.
- 1. Herholz K, Carter SF, Jones M (2007) Positron emission tomography imaging in dementia. Br J Radiol 80: S160–S167.
- 2. Fu CHY, Mourao-Miranda J, Costafreda SG, Khanna A, Marquand A, et al. (2008) Pattern classification of sad facial processing: toward the development of neurobiological markers in depression. Biol Psychiatry 63: 656–662.
- 3. Marquand A, Mourao-Miranda J, Brammer MJ, Cleare AJ, Fu CHY (2008) Neuroanatomy of verbal working memory as a diagnostic biomarker for depression. NeuroReport 19: 1507–1511.
- 4. Haxby JV, Hoffman EA, Gobbini MI (2000) The distributed human neural system for face perception. Trends Cogn Sci 4: 223–233.
- 5. Fu CHY, Williams SCR, Cleare AJ, Brammer MJ, Walsh ND, et al. (2004) Antidepressant treatment attenuates the neural response to sad faces in major depression: a prospective, event-related functional MRI study. Arch Gen Psychiatry 61: 877–889.
- 6. Fu CHY, Williams SCR, Cleare AJ, Scott J, Mitterschiffthaler MT, et al. (2008) Neural responses to sad facial expressions in major depression following cognitive behavior therapy. Biol Psychiatry 64: 505–512.
- 7. Costafreda SG, Khanna A, Mourao-Miranda J, Fu CHY (2009) Neural correlates of sad faces predict clinical remission to CBT in depression. Neuroreport 20: 637–641.
- 8. Chen CH, Suckling J, Ooi C, Fu CHY, Williams SCR, et al. (2008) Functional coupling of the amygdala in depressed patients treated with antidepressant medication. Neuropsychopharm 33: 1909–1918.
- 9. Davatzikos C, Shen D, Gur RC, Wu X, Liu D, et al. (2005) Whole-brain morphometric study of schizophrenia revealing a spatially complex set of focal abnormalities. Arch Gen Psychiatry 62: 1218–1227.
- 10. Konarski JZ, McIntyre RS, Kennedy SH, Rafi-Tari S, Soczynska JK, et al. (2008) Volumetric neuroimaging investigations in mood disorders: bipolar disorder versus major depressive disorder. Bipolar Disord 10: 1–37.
- 11. Campbell S, Marriott M, Nahmias C, MacQueen GM (2004) Lower hippocampal volume in patients suffering from depression: a meta-analysis. Am J Psychiatry 161: 598–607.
- 12. Drevets WC, Price JL, Simpson JR Jr, Todd RD, Reich T, et al. (1997) Subgenual prefrontal cortex abnormalities in mood disorders. Nature 386: 824–827.
- 13. Caetano SC, Kaur S, Brambilla P, Nicoletti M, Hatch JP, et al. (2006) Smaller cingulate volumes in unipolar depressed patients. Biol Psychiatry 59: 702–706.
- 14. Bremner JD, Vythilingam M, Vermetten E, Nazeer A, Adil J, et al. (2002) Reduced volume of orbitofrontal cortex in major depression. Biol Psychiatry 51: 273–279.
- 15. Fu CHY, Walsh ND, Drevets WC (2003) Neuroimaging studies of mood disorders. In: Fu CHY, Russell T, Senior C, Weinberger DR, Murray RM, editors. Neuroimaging in Psychiatry. London: Martin & Dunitz. pp. 131–169.
- 16. Mayberg HS, Brannan SK, Mahurin RK, Jerabeck PA, Brickman JS, et al. (1997) Cingulate function in depression: a potential predictor of treatment response. Neuroreport 8: 1057–1061.
- 17. Kennedy SH, Evans KR, Krüger S, Mayberg HS, Meyer JH, et al. (2001) Changes in regional brain glucose metabolism measured with positron emission tomography after paroxetine treatment of major depression. Am J Psychiatry 158: 899–905.
- 18. Davidson RJ, Irwin W, Anderle MJ, Kalin NH (2003) The neural substrates of affective processing in depressed patients treated with venlafaxine. Am J Psychiatry 160: 64–75.
- 19. Baleydier C, Mauguiere F (1980) The duality of the cingulate gyrus in monkey. Neuroanatomical study and functional hypothesis. Brain 103: 525–554.
- 20. Vogt BA, Finch DM, Olson CR (1992) Functional heterogeneity in cingulate cortex: the anterior executive and posterior evaluative regions. Cereb Cortex 2: 435–443.
- 21. Frodl T, Meisenzahl EM, Zetzsche T, Höhne T, Banac S, Schorr C, et al. (2004) Hippocampal and amygdala changes in patients with major depressive disorder and healthy controls during a 1-year follow-up. J Clin Psychiatry 65: 492–499.
- 22. Fu CH, Williams SC, Brammer MJ, Suckling J, Kim J, Cleare AJ, et al. (2007) Neural responses to happy facial expressions in major depression following antidepressant treatment. Am J Psychiatry 164: 599–607.
- 23. Frodl T, Jäger M, Smajstrlova I, Born C, Bottlender R, Palladino T, et al. (2008) Effect of hippocampal and amygdala volumes on clinical outcomes in major depression: a 3-year prospective magnetic resonance imaging study. J Psychiatry Neurosci 33: 423–430.
- 24. Frodl TS, Koutsouleris N, Bottlender R, Born C, Jäger M, Scupin I, et al. (2008) Depression-related variation in brain morphology over 3 years: effects of stress? Arch Gen Psychiatry 65: 1156–1165.
- 25. Ongur D, Price JL (2000) The organization of networks within the orbital and medial prefrontal cortex of rats, monkeys and humans. Cereb Cortex 10: 206–219.
- 26. Kondo H, Saleem KS, Price JL (2005) Differential connections of the perirhinal and parahippocampal cortex with the orbital and medial prefrontal networks in macaque monkeys. J Comp Neurol 493: 479–509.
- 27. Shah PJ, Ebmeier KP, Glabus MF, Goodwin GM (1998) Cortical grey matter reductions associated with treatment-resistant chronic unipolar depression. Controlled magnetic resonance imaging study. Br J Psychiatry 172: 527–532.
- 28. Hamilton JP, Siemer M, Gotlib IH (2008) Amygdala volume in major depressive disorder: a meta-analysis of magnetic resonance imaging studies. Molec Psychiatry 13: 993–1000.
- 29. American Psychiatric Association (1994) Diagnostic and Statistical Manual of Mental Disorders, 4th Edition (DSM-IV). Washington DC: APA Press.
- 30. First MB, Spitzer RL, Gibbon M, Williams JBW (1995) Structured Clinical Interview for DSM-IV Axis I Disorders. New York: New York State Psychiatric Institute, Biometrics Research.
- 31. Hamilton M (1960) A rating scale for depression. J Neurol Neurosurg Psychiatry 23: 56–62.
- 32. Ashburner J, Friston KJ (2000) Voxel-based morphometry - the methods. Neuroimage 11: 805–821.
- 33. Ashburner J (2007) A fast diffeomorphic image registration algorithm. Neuroimage 38: 95–113.
- 34. Davatzikos C, Genc A, Xu D, Resnick SM (2001) Voxel-based morphometry using the RAVENS maps: methods and validation using simulated longitudinal atrophy. Neuroimage 14: 1361–1369.
- 35. Good CD, Johnsrude IS, Ashburner J, Henson RN, Friston KJ, Frackowiak RS (2001) A voxel-based morphometric study of ageing in 465 normal adult human brains. Neuroimage 14: 21–36.
- 36. Guyon I, Elisseeff AE (2003) An introduction to variable and feature selection. J Machine Learn Res 3: 1157–1182.
- 37. Vapnik VN (1995) The Nature of Statistical Learning Theory. New York: Springer.
- 38. Caruana R, Niculescu-Mizil A (2006) An empirical comparison of supervised learning algorithms. In: Proceedings of the 23rd International Conference on Machine Learning. New York: ACM. pp. 161–168.