Perspectives on the Efficacy of Antidepressants for Child and Adolescent Depression

Raz looks at the ongoing controversies surrounding the use of SSRI antidepressants in children.

P ractitioners of pediatric medicine may still be undecided as to whether the newer generation of antidepressant drugs is effective for child and adolescent depression (CAD) [1]. Since 1989, when selective serotonin reuptake inhibitors (SSRIs) were introduced in the United States, they have become the top-selling drug category; as many as one in eight adult Americans having tried at least one SSRI in the past ten years. Despite their popularity in treating adult depression, the effi cacy of SSRIs for CAD remains in dispute. In this article, I examine some of the core problems in medical research that have led to this disagreement.

Specifi city, Safety, and Effi cacy
Advances in molecular biology and neuroscience have fostered increasingly specifi c drugs. However, the pharmaceutical industry promotes an idea of drug specifi city that may extend beyond the existing data. For example, SSRIs may selectively block the reuptake of serotonin, as claimed by many SSRI manufacturers, but they also infl uence numerous postsynaptic serotonin receptor systems, instigating multiple neurochemical effects. Furthermore, certain neurotransmitter systems are so tightly entwined that affecting one inevitably infl uences others (e.g., selective norepinepherine reuptake inhibitors also infl uence the serotonergic system). Hence, drugs often have effects that seem unrelated to the presumed therapeutic outcome (e.g., tricyclic antidepressants [TCAs] and SSRIs have signifi cant effects on fast sodium channels and platelet function, respectively). And one drug can treat a variety of syndromes.
For example, SSRIs are effective for symptoms ranging from obsessivecompulsive disorder to panic and anxiety. Thus, specifi city, as defi ned by the pharmaceutical industry, is perhaps an overextended notion.
Antidepressant medications have become central to managing CAD [2]. Because double-blind trials of TCAs have failed to show greater effi cacy than placebo for treating CAD [3,4], and concerns have been raised about the side effects of TCAs, SSRIs have been seen as the viable option for treating CAD [5]. Indeed, the 21st century ushered in major clinical guidelines endorsing SSRIs as fi rstline pharmacotherapy for CAD in both North America and the United Kingdom [6,7]. Most rigorous studies that tested the safety and effi cacy of these medications in depressed adolescents began after these drugs were deemed "fi rst line" by the professional community of child and adolescent psychiatrists.
Yet the recent history of SSRIs is replete with inconsistent verdicts about their safety. For example, in January 2003, the US Food and Drug Administration (FDA) approved fl uoxetine for children and adolescents. However, about fi ve months later, concerns arose among psychiatrists about whether the drug was associated with suicidal thinking and behavior in children and adolescents. Nevertheless, in December 2003, the UK Medicines and Healthcare Products Regulatory Agency (MHRA) supported the use of fl uoxetine in children and adolescents [8,9]. It stated that for three other SSRIs (sertraline, citalopram, and escitalopram) the risks outweighed the benefi ts, while the balance of risks and benefi ts was "unassessable" for a fourth SSRI, fl uvoxamine [8].
In September 2004, based on a review of 24 trials of nine different antidepressant drugs that were used to treat CAD, obsessive compulsive disorder, or "other psychiatric disorders," the FDA also supported the use of fl uoxetine in treating CAD [10]. Prior to this, on March 22, 2004, the FDA had issued a "black box" warning label on all antidepressants, cautioning that these medications may "increase the risk of suicidal thinking and behavior (suicidality) in children and adolescents with major depressive disorder (MDD) and other psychiatric disorders" [11].
In its September 2004 review, the FDA endorsed SSRI safety as well as an arbitrary improvement criterion-some decrease, relative to placebo, on a reputable scale (e.g., a two-point drop on the Hamilton depression scale)-without addressing actual clinical effi cacy. However, the resulting assumption among some practitioners was that SSRIs in general are effective, as well as safe, for CAD. This conclusion is consistent with recent reviews [12,13], drawing on both published and unpublished data, reporting that at least fl uoxetine is seen as a safe and effi cacious treatment for CAD. The tendency to embrace SSRIs for CAD demonstrates a trend in pediatric mental health practice toward taking effi cacy for granted and focusing on safety.

Statistical Signifi cance versus Clinical Signifi cance
Paying too much attention to signifi cance tests and too little attention to the analysis methods [14] or other aspects of the data (e.g., the estimates of the magnitude of the effects [15]) may blur the difference between statistical signifi cance and clinical importance [16,17]. Furthermore, statistical signifi cance itself can be clinically meaningless.
For example, a hypothetical study with a large sample size might show that an average heart rate of 69 on placebo compared to 71 on a drug is statistically signifi cant, but this effect is likely to be clinically meaningless. In fact, the only standard for determining drug effi cacy for "quality-of-life illnesses," including CAD, is a placebocontrolled trial; a comparator (i.e., a "horse-race," or "drug A versus drug B") trial typically boosts the drug effect (P. Roose, personal communication and [18]). (This is partly due to the fact that when people know that they are being treated with either a more or less potent medicine, drug response tends to be more vigorous than when they know that they may be on either an actual drug or a placebo). Finally, scientifi c experiments rarely control for variables known to infl uence drug response (e.g., expectation, suggestion, motivation, site location, and trial length) (P. Roose, unpublished data).
Clinical signifi cance-a meaningful change in the symptomatic state or functioning of an individual patient-requires independent replication of results [19]. Different fi elds have different criteria for clinical signifi cance. Additionally, the statistical method used to calculate clinical signifi cance affects the estimates of meaningful change [14]. Indeed, some professional associations (e.g., the American Psychological Association) have deemed effect sizes and confi dence intervals to be more meaningful measures than signifi cance testing [20].
To determine clinical signifi cance through "risk-benefi t" analysis [21], one must weigh the potential benefi t of improved symptomatology, accompanied by adverse side effects, against the risk of leaving the disease untreated. Trying to address the issue of clinical signifi cance, researchers have considered such parameters as the number needed to treat (NNT), the number needed to harm, and the number needed to prevent [22]. In the case of depression and CAD, FDA approval of fl uoxetine implies that the FDA considered the number needed to harm to be reasonable. But what is a good value for NNT? Ideally, it would be as close as possible to one (i.e., we need to treat only one person in order to see a desired effect in one person), but actually the NNT tends to be much higher than one [23], and it is unclear what range of values permits clinicians to conclude that a favorable benefi t-torisk ratio exists.
In addition, the NNT must be interpreted by using a comparison group. For example, in a placebo controlled trial, NNT = 3 means that on average one of three patients will derive specifi c benefi t from the treatment above and beyond placebo, which is rarely used clinically. Thus, NNT may be more clinically meaningful as an active comparator than in relation to a placebo. The absence of clear criteria for clinical effi cacy is probably partially responsible for interpretation of the same data as being both for [24] and against [25] the effi cacy of a specifi c drug.
Lacking clear criteria for clinical signifi cance, statistical signifi cance is perhaps the most convenient substitute [26]. But researchers may pay too much attention to the results of signifi cance tests, thereby overlooking clinical signifi cance. Statistical signifi cance may not be a suffi cient criterion for recommending a drug [27]. In the case of adult antidepressants, FDA approval requires that two randomized clinical trials (RCTs) show that drug performance is at least two Hamiltonscale points better than a placebo; however, this arbitrary criterion does not signify clinical effi cacy. In fact, it is unclear what criteria should be used to assess clinical signifi cance. For example, how many studies are needed to convince a clinician that a drug is effi cacious? Clinical signifi cance relies on replication and probably requires a meta-analysis. Published Fluoxetine [38] X X X Venlafaxine [94] X a X X X Fluoxetine [29] X X X X (Paroxetine) [35] X X X X (Fluoxetine) [30] X X X X (Sertraline) b [95] X X X X (Citalopram) [54] X c X (Fluoxetine) [23] X Not included meta-analysis by [52]. Since either means or standard deviations were missing in 25% of these RCTs, only nine were amenable to meta-analytic scrutiny. When Kirsch and Antonuccio combined data from these nine studies for analysis, the placebo response was 87% of the drug response, 75% of the SSRI response, and 97% of the TCA response. These results seem to indicate that TCAs have no signifi cant pharmacological effect on CAD. They concluded that the effect of SSRIs may be statistically signifi cant, but possibly not clinically signifi cant.
While some psychopharmacologists dismiss investigators such as Kirsch and Antonuccio as "outliers" or inherently biased against the drug industry, such "outlying" accounts should nevertheless be examined. Opinions on either side of this issue should be considered, especially in the absence of defi nitive data and facts.

Market Forces
In the US, the pharmaceutical industry, Congress, and advocacy groups are known to both lobby and contribute generously to the FDA and may partially inspire its decisions and policies (P. Roose, personal communication). The infl uence of the pharmaceutical industry permeates science [39], and evidence points to the increasing commercial impact of biomedical research on scientifi c reporting [40,41]. Indeed, industry funding for research tends to yield favorable reports concerning the tested drug [42]. For example, among the authors of original research papers, reviews, and letters to the editor that were supportive of the use of specifi c drugs, 96% had fi nancial relationships with the drugs' manufacturers, whereas for publications deemed neutral or critical, the fi gures were only 60% and 37%, respectively [43,44].
Further, since negative results are often discounted or not published [45,46,47], the message conveyed to the popular press and the public is often positively skewed [48], emphasizing benefi ts over risks and predicting improbable breakthroughs [49]. This trend may create unrealistic expectations about scientifi c advances or products and may lead to inappropriate and expensive utilization patterns. However, commercial pressure is not the only source of ambitious interpretations; another is researchers who are eager to promote their latest fi ndings. Thus, confl icts of interest in SSRI trials for treating CAD may potentially cloud results.

Different Reviews Have Had Different Results
Although most previous reviews have been partisan, a few have presented a balanced account [1,50,51]. The fi rst, though least comprehensive, review of RCTs conducted on newer antidepressants for CAD examined six published studies, including studies on venlafaxine and three SSRIs: fl uoxetine (three studies), sertraline (one study), and paroxetine (one study) [52]. In their limited meta-analysis, the authors used a random effects model to pool averaged, selected outcomes across the fi ve SSRI studies. They found a small effect size of 0.26 (95% confi dence interval [CI], 0.13-0.40), which they described as equivalent to a three-tofour-point improvement on the revised children's depression rating scale (which ranges from 17 to 113). They concluded that a large benefi t from newer antidepressant drugs is unlikely. Reviewing the same studies, another report judged the effi cacy data to be inconclusive [53].
Since the above reviews [52,53] included in their analysis a negative study of fl uoxetine which involved a small (n = 30), clinically heterogeneous (mixed inpatients and outpatients) participant group [39], some later reviews opted to exclude this negative study from their analyses. For example, one study analyzed fi ve of the six published papers addressed in the above-mentioned reviews [52,53], as well as data from six unpublished studies accessed through collaboration with the UK's MHRA [50]. These unpublished studies included two investigations of venlafaxine, paroxetine, and citalopram, respectively (one of the citalopram studies was subsequently published [54]). After extracting raw data for outcome measures, including remission, response January 2006 | Volume 3 | Issue 1 | e9

Incentives for Conducting Pediatric Clinical Trials
Giving pharmaceutical companies an incentive to conduct pediatric tests, Congress passed the FDA Modernization Act in November 1997. Section 111 of this "pediatric exclusivity" partnership act offered drug sponsors six months (sometimes up to a year) of additional market exclusivity if they conducted pediatric studies on drugs still under some exclusivity provision. Under the FDA Modernization Act, the pharmaceutical company could continue to set the market price, keeping generic forms of the drug off the market. The original act has since been revised and extended through 2007 under the Best Pharmaceuticals for Children Act, but concerns still linger regarding disproportionate profi ts accrued by drug companies from the six-month extension compared to the cost of clinical trials, and the absence of commitment by the companies to publish or make readily available the safety and effi cacy results of these trials [92]. While it is easy to see why pharmaceutical companies are eager to conduct pediatric research (six-month worldwide exclusivity for fl uoxetine, for example, is estimated to be worth about a billion dollars), clinicians and patients rarely have access to these fi ndings [93].
to treatment, and depressive symptoms scores, the authors reanalyzed, and-where possible-meta-analyzed published and unpublished studies of each drug, using fi xed-effects and random-effects models. The authors concluded that only fl uoxetine had evidence of effi cacy that was robust: across two published trials, fl uoxetine was more likely than placebo to bring about remission (number needed to benefi t 6; 95% CI, 4-15), or a clinically meaningful response (number needed to benefi t 5; 95% CI, 4-13). The study did not draw on unpublished fl uoxetine effi cacy data.
The most recent systematic review of newer antidepressants for CAD to date [55] included all of the data previously reviewed [50,52,53] as well as unpublished data regarding two RCTs of nefazodone, and the recently published Treatment for Adolescents with Depression Study (TADS) [23], which compared fl uoxetine, placebo, and cognitive behavioral therapy (CBT) alone and in combination (see Table  1). After examining effi cacy outcomes and the infl uence of methodology (site selection, study population, study design, and outcome measures) on those outcomes, the authors found that the more methodologically sound SSRI studies tended to have better outcomes. For example, in the TADS, 60.6% (95% CI, 51%-70%) of adolescents responded to fl uoxetine alone, as opposed to a 34.8 (95% CI, 26%-44%) response rate for placebo. Based on these fi ndings and no evidence of differences among these drugs in adult populations, the authors concluded that most newer antidepressants are likely effective, and that different results have been largely due to methodological differences in studies of CAD. They concluded, therefore, that at least fl uoxetine is clinically effective for CAD.

Methodologic Oversights in Published Studies
The TADS concluded that "medical management of MDD with fl uoxetine, including careful monitoring for adverse events, should be made widely available" [23]. However, some researchers disagree with this conclusion [25].
The TADS explores four experimental arms: placebo, fl uoxetine, CBT, and CBT + fl uoxetine. Some of these treatment groups were "blind" (i.e., participants were unaware of what treatment they were receiving, as in the case for placebo) while others were informed (as in the case for CBT). However, Table 2 shows the diffi culty of interpreting a study that compares blinded and unblinded treatment groups (e.g., differences between treatment groups might be due to the varying infl uence of expectation). Thus, in the interpretation of the TADS fi ndings, subtle methodological caveats go unrecognized.
Relative to other studies, CBT does not fare well compared to both placebo and fl uoxetine in the TADS. However, other studies comparing antidepressants and CBT showed that both were moderately effective in relieving depression in adults [56]. Adult neuroimaging fi ndings suggest that the two methods work by improving the functioning of different brain circuits: CBT operates on cortical areas related to attention and comprehension, including the anterior cingulate cortex, whereas antidepressants operate on subcortical areas. Applying the adult data to children, these exploratory imaging results may provide both a context for testing drugs against nonpharmacological therapy and a basis for considering how to treat or even prevent depression in those who are most susceptible [57].
Despite being regarded as effi cacious in adults, meta-analysis of published RCTs indicates that 75% of antidepressant response in adults is duplicated by placebo [58]. This initial meta-analysis was amply critiqued by Klein for multiple limitations [59]. However, follow-up analyses using a different data set taken from the FDA, to which Klein's objections do not apply, again reported that about 80% of the response to antidepressants in adults was duplicated in placebo control groups [60,61]. Together with the notion that antidepressant medication effects are typically weaker in children than adults [58,60], these conclusions accord with earlier reviews that challenge the effects of antidepressants in CAD [4,[62][63][64][65][66][67]. Merely labeling a pill an antidepressant does not make it so. In fact, the existing data suggest that antidepressants are probably more effective in treating anxiety than depression [68].
Another limitation draws on the implications of using antidepressants in early life [69]. Serotonin acts as a brain and glial growth factor in early development. Some serotonin receptors act during development to establish normal anxiety-like behavior later in life, while others play a role in synapse formation [70,71]. Some exploratory fi ndings suggest that artifi cial perturbation of serotonin function in early life may alter the normal development of brain systems related to stress, motor development, and motor control [71,72]. Furthermore, early exposure to fl uoxetine produced abnormal emotional behaviors in adult mice [69]. The critical role of serotonin in the maturation of brain systems that modulate emotional function in the adult suggests that, in concert with genetic makeup, low serotonin levels during early development may increase vulnerability to psychiatric disorders [69,73]. Early exposure to SSRIs, therefore, can potentially exact a heavy price in later life [74,75].
These caveats suggest that, in addition to the potential implications of using antidepressants in early life, there are few compelling data sets, free of funding from drug companies, concerning the effi cacy of antidepressant medications over and above their placebo value for CAD. These caveats also show the diffi culty of assessing the clinical signifi cance January 2006 | Volume 3 | Issue 1 | e9 of the unique effects attributable to antidepressant medications. Because antidepressants work just slightly better than placebos, even according to data endorsed by the pharmaceutical industry, the image of antidepressants as more effective may be overreaching and perhaps a consequence of methodological artifacts [76].

Public Health versus Individual Decisions
The US Surgeon General makes decisions based on the greater good of a vast population: a mere twopoint improvement on the Hamilton depression scale may constitute a meaningful public-health benefi t (D. Shaffer, unpublished data). However, parents decide whether their depressed adolescent child should receive CBT, fl uoxetine, or start a vigorous exercise regime, based on an individually tailored risk-benefi t analysis.
Most clinicians recommend psychotherapy for mild to moderate CAD and reserve SSRIs for severe CAD or when therapy is not effective [21]. Yet given the large numbers of people suffering from depressive disorders (i.e., an estimated 1.5 million adolescents (12-18 years of age) with MDD in the US alone [77]), it is easy to see why offering therapy would be diffi cult (e.g., number of therapists and insurance considerations), thereby making the drug option more popular.
Despite early suggestions in the literature [78][79][80], accounts of the association between adult suicidality and the use of SSRIs have been inconclusive [81][82][83]. One early meta-analysis showed that SSRIs potentially decreased suicidal ideation as measured by a single question on the Hamilton depression score [83], but a more recent study reported a non-signifi cant increase in suicide rates between patients assigned to SSRIs and those assigned to placebo or other antidepressants [82]. Despite the FDA's black-box label, some accounts suggest that since the introduction of SSRIs the number of successful suicides has steadily declined [84,85]. One recent meta-analysis reported that SSRIs increased the risk of suicide attempts, but not completions, across all indications [86]. Another, a metaanalysis of drug company data that were submitted to the MHRA's safety review, reported that SSRIs did not appear to increase the risk of suicide attempts or thoughts [87].
Data regarding SSRIs and youth suicide are sparser, but no less controversial [12,13,88,89]. One study reported an inverse relationship between regional change in use of antidepressants and suicide [90]. Nonetheless, the highest possible standard should be applied to scientifi c data involving drug treatment of children because they are essentially involuntary patients: when a medication is prescribed for a young child, the adult caregiver ensures that the child takes the medication, regardless of the child's own desires. Yet, studies of adolescent compliance with medication treatment report notoriously low compliance outside of the controlled settings of clinical trials.
The sparse RCT fi ndings suggest that improvement may not always be clinically signifi cant. When evaluating a medication with side effects, potential clinical implications for later life, and questionable effectiveness, it should be compared to interventions such as exercise and CBT, which have shown some therapeutic effects on depression without medical side effects and risks [91]. Originally celebrated but recently disparaged by modern psychiatry, therapeutic rapport may prove more clinically signifi cant than drug specifi city.

Conclusions
Given all of these limitations, patients and physicians should demand stronger evidence for the effi cacy of antidepressants for CAD. Some advocates assert that rather than using medication with side effects and low effectiveness, children should be offered interventions that produce therapeutic effects on depression without the medical side effects and associated risks [91]. However, clinicians and laypeople must apply comparable standards for evaluating the effi cacy of drug and psychotherapy data. Whereas medical drug research occurs in a formally regulated, albeit imperfect, environment, safety and effi cacy in psychotherapy research are largely unregulated. Moreover, unlike drug assays, psychotherapy studies do not typically report adverse events, their meta-analyses are sparse, and their experimental design lacks a placebo condition (see Table 2).
Finally, although antidepressants undoubtedly affect brain biochemistry, interpreting these neural changes is controversial, and a risk-benefi t analysis of side effects and long-term health risks may cast a long shadow on the current preference for antidepressants as fi rst-line treatment for CAD. Only more studies, and the passage of enough time to examine the putative long-term effects, will determine the effi cacy of antidepressants in CAD. Clinicians, patients, families, and the public should be cognizant of these issues and exercise critical judgment as they make informed decisions.

January 2006 | Volume 3 | Issue 1 | e9
Patients and physicians should demand stronger evidence for the effi cacy of antidepressants.