This study explored qualitative treatment-subgroup interactions within data of a RCT with two cognitive behavioral treatments (CBT) for adolescents with ADHD: a planning-focused (PML) and a solution-focused CBT (SFT). Qualitative interactions imply that which treatment is best differs across subgroups of patients, and are therefore most relevant for personalized medicine.
Adolescents with ADHD (N = 159) received either PML or SFT. Pre-, post- and three-month follow-up data were gathered on parent-rated ADHD symptoms and planning problems. Pretreatment characteristics were explored as potential qualitative moderators of pretest to follow-up treatment effects, using an innovative analyses technique (QUINT; Dusseldorp & Van Mechelen, 2014). In addition, qualitative treatment-subgroup interactions for the therapeutic changes from pre- to posttest and from post- to follow-up test were investigated.
For the entire time span from pretest to follow-up only a quantitative interaction was found, while from posttest to follow-up qualitative interactions were found: Adolescents with less depressive symptoms but more anxiety symptoms showed more improvement when receiving PML than SFT, while for other adolescents the effects of PML and SFT were comparable.
Whereas subgroups in both treatments followed different trajectories, no subgroup was found for which SFT outperformed PML in terms of the global change in symptoms from pretest to three months after treatment. This implies that, based on this exploratory study, there is no need for personalized treatment allocation with regard to the CBTs under study for adolescents with ADHD. However, for a subgroup with comorbid anxiety symptoms but low depression PML clearly appears the treatment of preference.
Nederlands Trial Register NTR2142
Citation: Boyer BE, Doove LL, Geurts HM, Prins PJM, Van Mechelen I, Van der Oord S (2016) Qualitative Treatment-Subgroup Interactions in a Randomized Clinical Trial of Treatments for Adolescents with ADHD: Exploring What Cognitive-Behavioral Treatment Works for Whom. PLoS ONE 11(3): e0150698. https://doi.org/10.1371/journal.pone.0150698
Editor: Ingmar H.A. Franken, Erasmus University Rotterdam, NETHERLANDS
Received: September 2, 2015; Accepted: February 18, 2016; Published: March 15, 2016
Copyright: © 2016 Boyer et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: Data contain identifying information and may not be shared publicly. The data will be made available to requesters that meet criteria for access among others 1) the requester identifies him/her selves 2) will access controlled data for the sole purpose of addressing a question or conducting analyses that are in the interest of public health, in line with the spirit of informed consent 3) refrain from any attempt to retroactively identify patients in CTs; this includes linkage of CT data accessed with other databases or programs that could result in the identification of patients 4) not share the data with others 5) has obtained ethics approval as appropriate. The data will be available at the Department of Psychology of the University of Amsterdam from the Institutional Data Access/Data Storage. This request can be made to Marco Teunisse (M.Teunisse@uva.nl).
Funding: The research leading to the results reported in this paper was supported in part by the Netherlands Organization for Health Research & Development (Grant #157003006), by the Research Fund of KU Leuven (GOA/15/003), and by the Interuniversity Attraction Poles programme financed by the Belgian government (IAP/P7/06). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: I have read the journal's policy and the authors of this manuscript have the following competing interests: Bianca E. Boyer is co-developer and author of the manuals ‘Plan My Life’ and ‘Solution Focused Treatment’: she receives royalties for the sales of both interventions. Saskia van der Oord has been a paid consultant for Janssen Pharmaceuticals in the development of a serious game “Healseeker”: aimed at training cognitive functions. Also, she is co-developer and author of the manuals ‘Plan My Life’ and ‘Solution Focused Treatment’. However, she has no financial interest in the sales of the interventions. She has been a speaker for Medice & Shire. Pier Prins is a member of the non-profit foundation Gaming & Training which develops and facilitates the computerized cognitive training programme "Braingame Brian". Other authors declare no conflict of interest. This does not alter our adherence to PLOS ONE policies on sharing data and materials.
Attention Deficit Hyperactivity Disorder (ADHD) is a childhood neuro-developmental disorder that affects approximately 5–10% of the children  . Children with the disorder have symptoms of inattention, symptoms of hyperactivity/impulsivity or both, resulting in three subtypes of the disorder  : the inattentive subtype, the hyperactive/impulsive subtype and the combined subtype. The trajectories of these symptom-clusters differ, as symptoms of inattention usually persist during development, while symptoms of hyperactivity/impulsivity often diminish over time. About 65% of children with ADHD still meet criteria for the diagnosis in adolescence   In addition, about 62% of children with ADHD below 19 years have at least one comorbid disorder, whereas 34% even have two or more comorbid conditions . Prevalent comorbid disorders are oppositional defiant disorder (ODD), conduct disorder (CD), anxiety- and mood-disorders, adjustment disorder and substance use disorder . Also, more boys than girls have ADHD (3:1 respectively ) and girls appear to have less severe inattention, hyperactivity and impulsivity, but greater intellectual impairments than boys with ADHD . In addition, girls tend to have more internalizing comorbid disorders than boys, whilst boys with ADHD are at higher risk for externalizing psychiatric comorbidities than girls . Taken together, the group of individuals with ADHD is a heterogeneous one.
This heterogeneity makes it unlikely for one treatment to fit all and increases the need for personalized treatment. Randomized clinical trials (RCTs) can be valuable in revealing moderators of therapeutic change, investigating for whom what treatment is most effective . For example, in a recent RCT the effectiveness of two cognitive behavioral treatments (CBTs) for adolescents with ADHD was investigated . Even though some have been critical of the use of CBT in children with ADHD , others presume that adolescents might have enough cognitive capacity to benefit from CBT . Because adolescents with ADHD have planning problems in daily life that can cause impairment in school, family- and social functioning  and evidence-based non-pharmacological treatments for adolescents with ADHD are lacking , a CBT was developed focusing on planning skills: Plan My Life (PML) . In PML, every session a fixed, planning skills focused, subject and strategy is discussed and trained (e.g., a to-do list). In a multi-site RCT (N = 16 sites, N = 56 therapists ), this treatment was compared to a control CBT, without the proposed active element of enhancing planning skills: a solution-focused treatment (SFT ). Both PML and SFT are individual, manualized treatments consisting of 8 adolescent sessions and 2 parental sessions. Whereas in PML every week planning skills are actively learned by discussing a fixed subject, in SFT the adolescent/parent chooses a problem that is discussed using fixed questions in a solution focused manner, to lead the adolescent to a solution for the problem. To reduce drop-out, motivational interviewing is integrated within both treatments.
Pre-, post- and three-month follow-up data were gathered in 159 adolescents with ADHD (12 to 17 years), with parent-rated ADHD symptoms and planning problems as primary outcomes. Results showed a significant improvement of primary outcomes as well as comorbid symptoms, functioning and impairment (with large effect sizes) from pre- to posttest with maintenance of effects to three months after treatment on most measures, also when controlling for medication use. In addition, 15.2% of adolescents showed normalization of functioning at follow-up. However, only marginally significant treatment differences were found, in favor of PML: PML showed more reduction of parent-rated planning problems compared to SFT, and higher treatment satisfaction of parents and therapists . Also, due to the lack of an adequate control-group like a waitlist or a treatment as usual group, efficacy of both treatments could not be proven. The lack of differences in treatment outcome in this RCT could be due to heterogeneity of treatment effects in different subgroups of adolescents with ADHD: if for specific subgroups of adolescents one treatment is better than the other, while for other subgroups the reverse is true, this can result in comparable mean outcomes for the two treatments.
This phenomenon, where the optimal treatment for one subgroup differs from that for another subgroup, is referred to as a qualitative treatment-subgroup interaction, as opposed to a quantitative treatment-subgroup interaction, where the optimal treatment is the same in all subgroups but the size of the between-treatment difference differs across subgroups  . In both cases, the patient characteristic(s) defining the subgroups in question are called moderators of treatment effect . For example, Ogrodniczuk et al.  compared two forms of short-term individual psychotherapy (interpretative, supportive) for individuals suffering from depression. The authors found that for males interpretative therapy outperforms supportive therapy, whereas for females the reverse is true. This is a classic example of a qualitative treatment-subgroup interaction, where gender acts as a moderator of treatment effect. In the case of a quantitative treatment-subgroup interaction, the same treatment, for example interpretative therapy, would have outperformed supportive therapy in both genders, but for males the difference between the two treatments would have been larger than for females.
In the present study, a new and innovative statistical technique is used to explore these qualitative interactions: QUalitative INteraction Trees (QUINT ) (note that the meaning of the term “qualitative” as used in the context of qualitative treatment-subgroup interactions should not be confused with the meaning of the same term in the context of qualitative research methods). Earlier work on the detection of treatment-subgroup interactions primarily refers to situations in which clear a priori hypotheses exist about which subgroups of clients are involved in the interactions, or situations that involve a small number of potential moderator variables only. Examples include factorial analyses of variance (ANOVA), with one factor pertaining to treatment methods and another one to subgroups , and regression analyses with suitable interaction terms being included in the regression model (see e.g.,  ). In contrast, QUINT does not require a priori hypotheses or a limited number of potential moderator variables but rather induces subgroups involved in treatment-subgroup interactions during the actual data analysis. QUINT further differs from other methods that induce subgroups during the actual data analysis (for a review, see ) in that it focuses on subgroups involved in clinically significant qualitative treatment-subgroup interactions; these are of particular relevance for personalized treatment assignment. The goal of QUINT is to find a partition of the total group of adolescents with ADHD, based on their pretreatment characteristics, into two or three mutually exclusive types of subgroups that are characterized as follows: In the first subgroup type PML outperforms SFT; in the second subgroup type the reverse is true; and in the third (optional) subgroup type, the adolescents assigned to PML would show more or less the same outcome as the adolescents assigned to SFT. Each subgroup type may comprise one or several subgroups of adolescents as defined by a combination of one or several dichotomized patient pretreatment characteristics (for which associated cut-off scores are being provided by QUINT). For example, a combination of pretreatment scores for intellectual impairment and externalization larger than some critical values may imply that SFT yields better results than PML, whereas for other subgroups the reverse holds true. As such, the QUINT results may have straightforward implications for optimal treatment assignment strategies to support healthcare decision makers of adolescents with ADHD. Note, however, that the results of a QUINT analysis may also be that the total group of clients is not partitioned, that is, that no subgroups involved in a qualitative treatment-subgroup interaction can be identified. For an example of a recent successful application of QUINT in the field of psychotherapy research, see .
The present study is the first to investigate qualitative interactions in treatments for adolescents with ADHD, comparing two types of CBT. To our knowledge, there are no studies on qualitative interactions in the treatment of ADHD and this is the first study using QUINT to investigate this. Similar to other tree-based methods, QUINT relies on a large search space based on a large number of covariate split-point combinations and therefore needs large sample sizes . Even though this study is the largest RCT in adolescents with ADHD to date, for QUINT to show replicable results a larger sample size would be preferred. Dusseldorp and Van Mechelen  state that the safest way to deal with QUINT is as an exploratory tool to generate meaningful hypotheses that should be tested in follow-up confirmatory research with new RCTs. This RCTs should make use of a stratified sampling scheme in which the strata are constructed on the basis of the splitting variables and split points as identified by QUINT . This is especially the case in applications with relatively smaller sample sizes. Therefore, this study should be considered exploratory, aimed at generating hypotheses for further research on treatment allocation in adolescents with ADHD.
To date, no treatment moderation studies have been conducted in adolescents with ADHD. However, two research groups have investigated a wide range of treatment moderators (child and family characteristics) in younger children with ADHD (up to 12 years of age), using mainly traditional moderation analyses such as ANOVA in which clear a priori hypotheses about potential moderators are required]        . Although solely based on behavioral (parent) treatment of children with ADHD as compared to other types of treatment (i.e., medication, combined treatment or regular care), the following characteristics appeared to positively influence the effects of behavioral (parent) treatment on ADHD-symptoms: having no or one single comorbid disorder , in particular anxiety  , being older of age, having a mother with high parenting self-efficacy , or having no or one single DAT1 10-repeat allele . Due to a lack of treatment moderation studies in adolescents with ADHD, and taking the heterogeneity of our sample into consideration, we measured a broad range of pretreatment patient characteristics to explore as potentially relevant moderator variables in the present study: age, gender, full-scale IQ, medication use, parental education, ADHD subtype, ADHD severity, comorbid ODD/CD-, depressive- and anxiety symptoms, and overall impairment. In addition, planning problems were included as a potential moderator, as one treatment aimed at enhancing planning skills (PML), whereas the other treatment did not (SFT). Inclusion of moderators in this study was based on1) moderators in previous research (e.g., gender, medication use, IQ, age, parental education, comorbid internalizing disorders like depressive and/or anxiety, externalizing disorders like ODD and/or CD), and 2) because one can hypothesize one CBT to be a better fit than the other. For example, even though this has never been investigated one could argue that older adolescents, adolescents with higher IQs, or less ADHD symptoms or impairment or adolescents with parents who have higher education, have more cognitive capacity or support at home, and therefore do better in a more open treatment like SFT, while younger adolescents, or with lower IQs, more severe ADHD or impairment and less educated parents fare better with a more structured treatment like PML. Outcome measures in this study were the primary outcomes from our RCT: parent-rated ADHD-symptoms and planning problems .
To our knowledge, previous studies on moderation of treatment effects in children with ADHD have focused mainly on moderators of short-term effects of treatment between pre- and posttest (       ; with the exception of ). However, trajectories of treatment effects may differ between phases of treatment. The best-known example thereof is the MTA-study, where children who had received medication only or in combination with behavior therapy improved more from pre- to posttest than children who had received behavior therapy only or community care . However, 10 months after treatment these differences had diminished and approximately 2 years after treatment, groups did not differ significantly on any measure, an effect that remained stable until 8 years after pretest   . Also, even though in the MTA-study moderators of treatment effect were found from pre- to posttest     , no moderators of treatment effect were found from pretest to 36 months after treatment .
Therefore, in the present study, apart from investigating qualitative moderators of treatment effects from pretest to follow-up, also qualitative moderators of the immediate treatment effect (from pretest to posttest) and of treatment effect maintenance (from posttest to follow-up three months after treatment) will be examined. More specifically, using QUINT, we investigated the following primary research question: Are qualitative treatment–subgroup interactions present in the data of our RCT (from pretest to three months after treatment)? And, if so, which treatment is best for which subgroup of adolescents with ADHD? A secondary research question we investigated, was whether there are qualitative treatment-subgroup interactions in the two phases of the trajectory of therapeutic change from pretest to follow-up, namely from pretest to posttest, and from posttest to follow-up? Given the lack of prior research on moderators of treatment effects in adolescents with ADHD, and in particular on moderators involved in qualitative treatment-subgroup interactions, we stated no specific expectations. When indeed qualitative treatment-subgroup interactions would be found from pretest to follow-up, this will generate hypothesis that can be tested in a follow-up RCT. This would be highly relevant for personalized medicine, as it would enable allocation of adolescents with ADHD to specific treatments, resulting in higher treatment efficacy.
For a more detailed description of the sample, treatment content, procedures, research design, and approach to missing data, we refer to Boyer, Geurts, Prins, and Van der Oord .
This was a multi-center (16 sites), two-arm parallel-group randomized clinical trial conducted in the Netherlands (Clinical trials registration “Training adolescents with ADHD to plan and organize: Investigating short- and longterm effects of treatment”, http://www.trialregister.nl/trialreg/admin/rctview.asp?TC=2142, NTR2142). No important changes to methods were made after trial commencement. The date range for participant recruitment was September 2010 until October 2012, the date range for three-month follow-up was February 2011 until April 2013. The CONSORT checklist, original trial study protocol, translated trial study protocol, original ethics protocol, and translated ethics protocol are available as CONSORT Checklist in S1 CONSORT Checklist, Original Protocol in S1 Protocol, Translated Protocol in S2 Protocol, Original Ethics Protocol in S3 Protocol, and Translated Ethics Protocol in S4 Protocol.
Participants and procedure
All participants (N = 159) were adolescents aged 12 to 17 years (Mage = 14.4 years), who attended secondary school. Participants had received a prior DSM-IV-TR diagnosis of ADHD  by a child psychiatrist or certified psychologist, which was confirmed with the Diagnostic Interview Schedule for Children for DSM-IV parent version (DISC-IV ). The DISC-IV is a structured diagnostic interview based on DSM-IV, which establishes ADHD group membership based on a diagnostic algorithm, including a check for the presence of cross-situational impairment. Participants had a full scale IQ (FSIQ) > 80 measured by the short version of the Dutch Wechsler Intelligence Scale for Children (WISC-III  ). During treatment 124 adolescents used psychostimulants (no atomoxetine) and were requested to keep the dose stable until posttest. Note that after posttest adolescents could change their medication use or dose. Exclusion criteria included having a comorbid autism spectrum disorder, depression with suicidal ideations, acute familial crisis, CD or predominant addiction.
Adolescents applied for the study in one of sixteen participating mental health care centers. Written informed consent was obtained from adolescents as well as their parents. After the pretest assessment, adolescents were randomly assigned to either PML (N = 83) or SFT (N = 76), using covariate adaptive randomization  by a blinded and independent researcher, and were stratified on gender and medication use (yes/no). To control for therapist effects, the 56 participating therapists (who all at least had a master degree in psychology) provided both treatments. Attrition in both treatments was low (PML N = 4, SFT N = 4) and the length of treatment was comparable across treatments (MPML = 9.3 weeks, SDPML = 2.6 MSFT = 9.1 weeks, SDSFT = 2.9). Adherence was high in both treatments and no treatment contamination was found. Posttest took place within a week after treatment and follow-up test approximately three months after treatment. All assessments and treatments took place in the same outpatient mental health care center where the participant applied for treatment and were conducted by blinded research assistants. Parent-rated questionnaires were completed by the primary caregiver, usually the mother. The Ethics Committee of the University of Amsterdam approved this study (2010-KP-1079). Details of adolescent trial flow are presented in Fig 1.
Similarities: In our RCT we compared two treatments: PML and SFT . Both are individual treatments that consist of 8 adolescent sessions of 45–60 minutes and 2 parental sessions. In both treatments the adolescent and therapist work together using a workbook and treatment manual. In both treatments motivational interviewing plays an important role: the attitude of the therapist as well as wording in the workbook are based on motivational interviewing. For example, the adolescents choose their own treatment goals (not those of their parents or teachers), which were written down and were always visible during treatment. Also, assignments were not presented as “homework”, but were formulated as an experiment for the upcoming week. In that way, the adolescent was free to choose a treatment strategy that fitted his/her life and the strategy could be adjusted according to their experiences. For a more detailed description of both treatments, see Boyer et al .
Plan My Life: (PML ) is a CBT in which in every session a fixed subject is discussed (e.g. using a daily planner or a to do list). For each subject possible planning- and organization strategies are presented for which the adolescent can compose their own strategy. The chosen strategy is than posited as an experiment that he/she will try the following week. Whenever needed, negative thoughts about the new strategy are challenged and helping thoughts formulated. All session strategies that have been tried the past week are discussed, including successes, possible room for improvement, and associated cognitions.
In the Solution Focused Treatment (SFT ) every session the adolescent discusses a problem he/she encounters. Following fixed questions, the adolescent is guided towards a solution for the posited problem. The fixed questions are: 1) What is the subject you chose? Describe the situation, 2) How is the situation, as it is now, a problem for you?, 3) How would you like it to be?, 4) What are solutions you used in the past and what are other possible solutions to the problem?, 5) Does the situation, as it is now, have advantages?, 6) Would you like to change the situation now/later/not at all, 7) If you choose to change, what is your plan? If you choose to change later or not at all, what are your considerations (pros and cons)? In this treatment the therapist does not tell the adolescent or parent what to do and does not teach them a new skill, but guides them towards a solution using the fixed questions.
Differences between treatments: The important difference between both treatments lies in the content. Whereas in PML every week a fixed subject is discussed regarding planning and organizing and planning skills are actively learned, in SFT the adolescent/parent has to choose the subjects themselves and is guided to his/her own solution.
ADHD symptoms of the adolescents are measured using the Disruptive Behavior Disorder rating scale parent version (DBD  ). The DBD contains four scales composed of the DSM-IV criteria for ADHD Inattention, ADHD Hyperactivity/Impulsivity, Oppositional Defiant Disorder, and Conduct Disorder. In children with ADHD factor scores for parent ADHD ratings were found to have high internal consistency and adequate test-retest reliability . In this study the ADHD symptoms scale is calculated summarizing the ADHD Inattention and the ADHD Hyperactivity/Impulsivity scales (18 items, scores range from 0 to 54). Analyses are conducted on raw scores, with a higher score indicating more symptoms of ADHD.
Planning problems of the adolescent are rated by parents using the Dutch translation of the Behavior Rating Inventory of Executive Function (BRIEF  ): A normative behavioral rating scale for children 5 to 18 years old, designed to elicit everyday planning behavior as observed by the parents in natural everyday environments. The BRIEF has appropriate psychometric properties including internal consistency ranging from .80 to .98 and test-retest reliability ranging from .76 to .88 . The Plan/Organize scale measures the child’s capacity to anticipate future events, set goals, develop appropriate steps to carry out associated tasks or actions, and manage current and future-oriented task demands (12 items, scores range from 12 to 36). Analyses are conducted on raw scores, with higher scores indicating more planning problems.
Outcome measures are assessed at pretest one week before treatment, at posttest within a week after treatment and follow-up test approximately three months after treatment. The DBD was completed by 93.7% of the primary caregivers before treatment, 82.4% at posttest, and 67.3% at follow-up. The BRIEF was completed by 93.7% of the primary caregivers before treatment, 82.4% at posttest, and 66.7% at follow-up. To investigate our primary research question “Are qualitative treatment–subgroup interactions present in the data of our RCT?” analyses were conducted on the difference score between pretest and three-month follow-up on both the DBD and the BRIEF questionnaire. To answer our secondary research question: “Do the two treatment alternatives have a different effect on the trajectories of therapeutic change of ADHD-symptoms and planning problems for different subgroups of adolescents?”, we analyzed the difference between pre- and posttest on both the DBD and the BRIEF and the difference between post- and follow-up test on both the DBD and the BRIEF. The effect found between pre- and posttest is considered to be the short-term effect of treatment. The effect between post- and three-month follow-up test is considered to reflect the treatment maintenance.
In total, 12 patient characteristics were assessed as potential moderators involved in qualitative treatment-subgroup interactions. Demographic variables included were gender, age, and medication use. Age was evaluated as a continuous variable while gender (male/female) and medication use (yes/no) were treated as categorical variables. Also, the average education level of both parents was assessed on a 4-point Likert-scale, with 1 representing the lowest educational level and 4 the highest. Parental education was treated ordinal. Several additional baseline adolescent characteristics were also included in the analyses as moderators:
Full Scale IQ was measured by two subtests of the Dutch Wechsler Intelligence Scale for Children (WISC-III-NL ) that correlate highly with the Total IQ: Vocabulary and Block Design (FSIQ ). The variable full scale IQ was treated as a continuous variable.
ADHD-symptoms of the adolescent were measured at pretest using the DBD rating scale parent version  . The DBD contains four scales composed of the DSM-IV criteria for ADHD Inattention, ADHD Hyperactivity/Impulsivity, Oppositional Defiant Disorder, and Conduct Disorder (for psychometrics see outcome measure ADHD-symptoms). The ADHD symptoms scale is calculated, summarizing the ADHD Inattention and the ADHD Hyperactivity/Impulsivity scales, with a higher score indicating more symptoms of ADHD and was thus treated as a continuous variable.
ADHD subtype was assessed by the DISC-IV, a structured diagnostic interview, that establishes ADHD group membership and subtype status based on a diagnostic algorithm, including a check of cross-situational impairment . The DISC-IV has shown substantial reliability (kappa = .79) in diagnosis of ADHD by parent-report . Based on the DISC-IV the adolescents were classified into the inattentive subtype, the hyperactive/impulsive subtype and the combined subtype. This variable was treated as a categorical variable.
Planning problems of the adolescent were rated by parents at pretest using the BRIEF  . This is a normative behavioral rating scale for children 5 to 18 years old, designed to elicit everyday EF as observed by the parents in natural everyday environments (for psychometrics see outcome measure ADHD-symptoms). In this study the Plan/Organize was used: higher scores on this subscale indicates more planning problems, and was therefore treated as a continuous variable.
Depressive symptoms were measured using the self-reported Child Depression Inventory (CDI  ). The CDI has demonstrated high internal consistency of .94 for normal subjects and .80 for psychiatric populations . Scores range from 0 to 54, in which higher scores indicate more depressive symptoms, and was therefore treated as a continuous variable.
Anxiety symptoms were evaluated using the Screen for Child Anxiety Related Emotional Disorders (SCARED  ). The SCARED demonstrated good internal consistency (a = .74 to .93), test-retest reliability (intraclass correlation coefficients = .70 to .90) and discriminative validity (both between anxiety and other disorders and within anxiety disorders) . Adolescents filled in 69 items, which added up to a total anxiety score, ranging from 0 to 138: Higher scores indicate more anxiety symptoms, and was thus treated as a continuous variable.
ODD/CD symptoms (DBD  ) were measured with the 24 items of the ODD and CD scales of the parent rated DBD (for psychometrics see outcome measure ADHD-symptoms). Scores range from 0 to 72, in which higher scores indicate more externalizing symptoms, and was therefore treated as a continuous variable.
General Impairment was measured using the Impairment Rating Scale (IRS ). Parents answered six questions on a scale from 0 to 10, resulting in a total score ranging from 0 to 60, with higher scores indicating high general impairment, and was thus treated as a continuous variable. The IRS demonstrated acceptable to excellent stability over time, evidence of concurrent, convergent, and discriminant validity, and appeared highly effective for discriminating between children with and without ADHD .
In the analyses, raw scores were used. Table 1 shows the adolescents’ scores on all variables: Independent t-tests and chi-squared tests showed no group differences on all patient characteristics between both treatments.
Intent-to-treat analyses were conducted. Missing parent and adolescent data were imputed using stochastic regression . To address our research questions, we subjected the data to an analysis with the recently developed method QUINT . We will first elaborate on the conceptual basis of QUINT and later on in this section explain how we set out the analysis of our data using this method. As mentioned before, the goal of QUINT is to find the best partition of the total group of adolescents with ADHD, based on their pretreatment characteristics, into two or three mutually exclusive types of subgroups that are characterized as follows: In the first subgroup type, (℘1), the adolescents assigned to treatment PML would show a clearly better outcome than the adolescents assigned to SFT; in the second subgroup type, (℘2), the reverse is true; in the third (optional) subgroup type, (℘3), the adolescents assigned to PML would show more or less the same outcome as the adolescents assigned to SFT. The subgroups may comprise one or several types of adolescents as defined by different (combinations of) dichotomized pre-treatment characteristics (moderators).
QUINT is looking for an optimal partition of the total group of adolescents so that the qualitative treatment-subgroup interaction that is related to that partition has the largest possible practical significance. To achieve this, two conditions with regard to the subgroups ℘1 and ℘2 need to be satisfied: (a) In both subgroups the difference in outcome between PML and SFT should be substantial, and (b) each of the two subgroups should comprise a sufficient number of adolescents. QUINT uses a weighted compound criterion that implies that these two conditions are optimized simultaneously. The difference in outcome between the PML and SFT treatment conditions included in condition (a) can be formalized in terms of either a difference in treatment means or a treatment effect size (Cohen’s d ).
To optimize this criterion, QUINT uses a stepwise tree-building algorithm. We refer to King and Resick  for a conceptual introduction to tree methods in psychological treatment research, and note that QUINT is an innovative member of this family of methods that is custom-made to identify qualitative treatment-subgroup interactions. It should also be noted that QUINT, like related nonparametric methods, makes no assumptions about the probability distribution of the assessed variables. The QUINT algorithm sequentially splits the total group of adolescents into subgroups, while optimizing in each step the weighted compound criterion mentioned above, with the resulting series of splits being represented by a tree structure like Fig 2 (which we will further discuss in the Results section). Starting with the total group of adolescents in the so-called root node, each background characteristic is considered as a candidate splitting variable to divide this group into two child nodes . For each of the candidate splitting variables, all possible split points and corresponding assignments of the child nodes to subgroups ℘1 and ℘2 are evaluated; subsequently, the split point and assignment of the child nodes are chosen that maximize the QUINT criterion. Lastly, across all candidate splitting variables, the variable (along with its maximizing split point and assignment of child nodes to the subgroups) is selected that attains the highest value of the QUINT criterion. After this first split, the stepwise binary splitting procedure is continued. In each step, all end nodes (leaves) of the current tree then become candidate parent nodes. For each candidate parent node, the split (i.e., splitting variable, split point, and assignment of all end nodes or leaves of the tree to subgroups ℘1, ℘2 and ℘3) is selected that maximizes the QUINT criterion. The QUINT criterion values are subsequently compared across all candidate parent nodes and the node which implies the highest criterion value then is subdivided according to its optimal split. Note that from the second split on, leaves may be assigned to ℘1, ℘2 and ℘3 (instead of to ℘1 and ℘2 only as after the first split), and that after each split all leaves are allowed to be re-assigned to the three subgroups.
Smaller outcome values are preferred as these refer to more reduction of parent-rated ADHD symptoms. The ellipsis in the figure represents the root node, which corresponds to the complete group of clients. This ellipsis contains the split variable, with below it the corresponding split point. The rectangles represent the leaves of the tree, that is, the final subgroups of adolescents; each rectangle contains the sample size of the corresponding subgroup, the outcome means (and standard deviations) for the plan my life and solution-focused treatment conditions (ȲPML and ȲSFT), the uncorrected difference in means (ȲPML−ȲSFT), and the uncorrected corresponding effect size d. Assignment of the leaves to the partition classes is represented by ℘1 and ℘2.
The QUINT procedure uses three types of criteria to stop the tree building process: Firstly, after the first split it tests the presence of a qualitative interaction on the basis of a so-called qualitative interaction condition, which reads that in each of the two leaves, the absolute value of the treatment effect size exceeds a critical minimum effect size value (dmin). If QUINT does detect a qualitative treatment-subgroup interaction, the first split will be executed and the stepwise binary splitting procedure is continued. Secondly, the algorithm stops if no longer a split can be found that implies a higher criterion value than in the previous step. Thirdly, QUINT takes into account some additional stopping criteria including a maximum value for the number of leaves, and the fact that each leaf should contain a minimum number of adolescents assigned to PML and SFT.
The tree growing procedure may result in a large tree with a high criterion value for the data at hand, which may not be replicable with future data. QUINT controls for this so-called overfitting by a pruning procedure, which implies that the maximal tree is pruned back to an optimal subtree. This procedure yields a sequence of nested subtrees, for which the number of leaves (L) can be considered a complexity parameter. For each of these subtrees, QUINT has determined a criterion value. Yet, these values are positively biased since they have been determined on the basis of the same data as the ones that were used to build the tree. To overcome this problem, QUINT relies on a bootstrap procedure that yields an estimation of the biases, which further allows calculation of bias-corrected criterion values. The final tree selected by QUINT then is the one with the highest bias-corrected criterion value. Otherwise, QUINT also allows for a bootstrap-based bias correction procedure for the differential treatment effect sizes in the leaves of the finally selected tree, which may give insight into the generalizability of the QUINT solution. As the bootstrap samples have the same sample size as the data, the bootstrap-based bias correction procedure automatically takes into account possible small sample sizes of the data at hand. For a formalization and detailed description of the above, as well as for reports on the results of an extensive simulation study to address the problem of inferential errors (along with a number of strategies and recommendations to control for this), we refer to Dusseldorp and Van Mechelen . To analyze RCTs with QUINT, the R-package quint has been developed by Dusseldorp, Van Mechelen and Doove . In this package some freedom is left to the user (e.g., regarding parameters related to the qualitative interaction condition and tree complexity ). In the current version, clients with one or more missing values on any of the variables are omitted from the analysis (so-called listwise deletion). Therefore, if desired, missing data should be imputed before the QUINT analysis. The R-package quint can be freely downloaded from the Comprehensive R Archive Network (CRAN) and can handle continuous and dichotomous background characteristics. For the analyses in the present paper, we used a slightly extended version of this package that can also handle polytomous categorical background characteristics. This extension can be obtained from the authors.
In this study, the QUINT analyses were performed using the default effect size criterion. When using this criterion, Dusseldorp and Van Mechelen  propose to set the weight for the first constituent of the criterion equal to 1/log(1+ dmax), where dmax is the realistic maximum for treatment effect size. Based on a recent review and meta-analyses on psychological interventions for ADHD   , we set dmax equal to 1. As value for the weight of the second constituent of the optimization criterion, Dusseldorp and Van Mechelen  propose 1/log(0.50N), where N is the sample size. For reliable estimation of the mean outcome in a treatment group, we set the minimum number of adolescents assigned to PML and SFT in each leaf equal to 20. We used the default values for the critical minimum value in the qualitative interaction criterion for the absolute value of the standardized mean difference in treatment outcome (dmin = 0.30, based on results of a simulation study by Dusseldorp and Van Mechelen ), and the maximum number of leaves. Lastly, we set the number of bootstrap samples in the pruning procedure equal to 200, which is a relatively large number leading to more stable results than the default of 25 bootstrap samples. Note that we set out the analysis of our data based on simulation results of Dusseldorp and Van Mechelen (2014) for data sets with a number of potential treatment moderators ranging from 5 to 20 and a sample size equal or larger than 200. In this simulation study the impact of the number of moderators on Type I and Type II error rates appeared to be negligible. That being said, we cannot exclude that in the case of a sample size smaller than 200, there may be an effect of the number of potential treatment moderators on the risk of inferential errors.
To answer the first research question, “Are qualitative treatment–subgroup interactions present in the difference scores between pretest and three-month follow-up?”, we analyzed the difference scores for parent-rated ADHD (DBD). The test of the qualitative interaction condition after the first split (i.e., QUINT’s first stopping criterion, see Statistical analyses section) revealed that no qualitative treatment-subgroup interaction is present in the data. This result implies that there is no need for personalized treatment allocation for adolescents with ADHD when it comes to ADHD symptoms.
Next, we addressed for parent-rated ADHD the second research question on qualitative treatment-subgroup interactions in the two segments of the trajectory of therapeutic change from pretest to follow-up, namely from pretest to posttest, and from posttest to follow-up. With regard to the short-term effect of treatment, that is, the change in ADHD symptoms from pre- to posttest, QUINT again revealed no qualitative treatment-subgroup interaction. However, regarding the treatment maintenance (in other words, the change in ADHD-symptoms from post- to three-month follow-up test), QUINT did reveal a qualitative treatment-subgroup interaction in the data. The analysis resulted in a tree with two leaves (shown in Fig 2). These results show that adolescents with less ADHD symptoms (DBD pretest score ≤ 24.82), should preferably receive SFT over PML (Fig 2, Leaf 1); indeed, for these adolescents, improvement on ADHD symptoms between post- and follow-up test was on average 2.98 points higher when assigned to SFT compared to PML (d = 0.34). On the other hand, for adolescents with more ADHD symptoms at baseline (DBD pretest score > 24.82) greater treatment gains were observed when assigned to PML compared to SFT (Fig 2, Leaf 2). For these adolescents, improvement on ADHD symptoms was on average 3.29 points higher when assigned to PML compared to SFT (d = -0.37). After applying the bias-correction procedure to the two leaves, we found that the improvement on ADHD symptoms for the adolescents in Leaf 1 was on average 0.54 points higher when assigned to SFT compared to PML (d = 0.05), and that the improvement on ADHD symptoms for the adolescents in Leaf 2 was on average 1.17 points higher when assigned to PML compared to SFT (d = -0.11). The values of these bias-corrected effect sizes imply that the detected qualitative treatment-subgroup interaction is small.
In order to more clearly understand the subgroup-specific trajectories across all three assessments, we examined the change in the mean outcome values in the two leaves across the entire time span from pretest to follow-up. Fig 3 shows this change in ADHD symptoms for the induced subgroups of adolescents, while further drawing a distinction between the two treatment conditions. The figure suggests that whereas from posttest to follow-up the adolescents in Leaf 1 benefit relatively more from SFT and the adolescents in Leaf 2 benefit relatively more from PML, this does neither apply to the time from pretest to posttest, nor for the entire time span from pretest to follow-up. From pretest to posttest the subgroups rather seem to be involved in a quantitative treatment-subgroup interaction (with PML outperforming SFT in both subgroups, but with the advantage of PML being larger for the adolescents in Leaf 1). This interaction, however, appeared not to be significant in a post-hoc factorial analysis of variance on the difference score between pre- and posttest (with a first factor pertaining to treatment and a second one to the leaves, F(1,144) = 1.84, p = .177, ωp2 = .006). Furthermore, the relatively larger advantage of PML for the adolescents in Leaf 1 in the change in symptoms from pretest to posttest, is undone by the qualitative treatment-subgroup interaction in the change in symptoms from posttest to follow-up (leading to a non-significant treatment-leaf interaction with regard to the change in symptoms from pretest to follow-up, F(1,144) = 1.03, p = .311, ωp2 = .000).
With regard to the first research question “Are qualitative treatment–subgroup interactions present in the difference scores between pretest and three-month follow-up?”, we investigated these scores for parent-rated planning problems (BRIEF). QUINT revealed that no qualitative treatment-subgroup interaction is present in the data. This implies that there is no need for personalized treatment allocation for adolescents with ADHD when it comes to planning problems.
For the second research question on qualitative treatment-subgroup interactions in the two segments of the trajectory of therapeutic change from pretest to follow-up, difference scores between pre-and posttest, and between posttest and follow-up were subjected to QUINT analyses. With regard to the difference scores between pre- and posttest, QUINT revealed that no qualitative treatment-subgroup interaction is present, whereas for the difference scores between posttest and follow-up a qualitative interaction is present in the data. Regarding the latter, QUINT constructed a tree with three leaves. The pruning procedure indicated that this was also the optimal tree size. The structure of the tree is shown in Fig 4. From this figure it appears that for adolescents in Leaf 3 with more depressive symptoms at baseline (CDI pretest score 11.5) greater treatment gains were observed when assigned to SFT compared to PML: For these adolescents, improvement on planning problems was on average 2.97 points higher when assigned to SFT compared to PML (d = 0.71). On the other hand, adolescents in Leaf 2 with less depressive symptoms (CDI pretest score ≤ 11.5), and more anxiety symptoms (SCARED pretest score > 19.50) improvement on planning problems was on average 2.78 points higher when assigned to PML compared to SFT (d = -0.63). Finally, for adolescents in Leaf 1 with low depression and low anxiety levels, effects of both treatments are comparable. After applying the bias-correction procedure to the two leaves with the most extreme differential treatment effects, we found that the improvement on planning problems for the adolescents in Leaf 2 was on average 1.30 points higher when assigned to PML compared to SFT (d = -0.27), and that the improvement on planning problems for the adolescents in Leaf 3 was on average 1.73 points higher when assigned to SFT compared to PML (d = 0.36). These bias-corrected values imply that the effect size of the detected qualitative treatment-subgroup interaction is between small and medium. Post-hoc analyses revealed that the three subgroups do not differ from each other on other pre-treatment characteristics besides depressive and anxiety symptoms.
Smaller outcome values are preferred as these refer to more reduction of parent-rated planning problems. The ellipses in the figure represent the internal nodes containing the split variables (CDI, Child Depression Inventory; SCARED, Screen for Child Anxiety Related Emotional Disorders), with below each ellipsis the corresponding split point. The upper ellipsis represents the root node corresponding to the complete group of clients. The rectangles represent the leaves of the tree, that is, the final subgroups of adolescents; the quantities contained in the rectangles are analogous to those in Fig 2. Assignment of the leaves to the partition classes is represented by ℘1, ℘2 and ℘3.
In order to more clearly understand the subgroup-specific trajectories across all three assessments, we again examined the change in the mean outcome values in the three leaves across the entire time span. This change in symptoms is shown in Fig 5 for the parent-rated BRIEF for the three different subgroups, while further drawing a distinction between the two treatment conditions. Whereas from posttest to follow-up, the three subgroups are involved in a qualitative treatment-subgroup interaction, this is the case neither for the change in symptoms from pretest to posttest nor for the change in symptoms from pretest to follow-up. Rather, based on Fig 5, the three subgroups seem to be involved in a quantitative treatment-subgroup interaction with regard to the change in symptoms from pretest to posttest, with PML outperforming SFT in leaves 2 and 3, and with no between-treatment difference in Leaf 1. In a post-hoc ANOVA, however, this interaction appeared not be significant, F(2,142) = 0.99, p = .375, ωp2 = .000. With regard to the change in symptoms from pretest to follow-up, Fig 5 also suggests the presence of a quantitative treatment-subgroup interaction with PML outperforming SFT in Leaf 2, and with no between-treatment difference in leaves 1 and 3. In a post-hoc ANOVA, this interaction appeared to be significant, indeed, F(2,142) = 4.18, p = .017, ωp2 = .041. Moreover, a post-hoc Tukey test confirmed that PML significantly outperformed SFT in Leaf 2 whereas the treatment groups did not significantly different from each other in Leaves 1 and 3. Importantly, this implies that adolescents with less depressive symptoms (CDI pretest score ≤ 11.5 and more anxiety symptoms (SCARED pretest score > 19.5) should preferably receive PML, while for other adolescents the effects of PML and SFT are comparable.
This was the first study to explore moderators involved in qualitative treatment-subgroup interactions in adolescents with ADHD, using an innovative statistical technique (QUINT). Our results show that no qualitative treatment–subgroup interactions were present in the data of our RCT for the change in symtoms from pretest to three months after treatment, thereby answering our primary research question. This implies that, when comparing the two CBTs under study in adolescents with ADHD (one that aimed on enhancing planning problems and one that did not), there is no need for personalized treatment allocation when focusing on improvement of ADHD symptoms and planning problems as outcome variables. As noted above, Boyer et al.  reported for the same RCT a significant improvement in ADHD- and comorbid symptoms and general impairment of the adolescents with large effect sizes, but only marginal differences in effect between treatments. This lack of treatment differences is therefore not due to qualitative treatment-subgroup interactions.
Our second research question focused on the trajectories of therapeutic change for different subgroups of adolescents in both treatments, with a distinction between short-term effects of treatment and maintenance effects up to three months after treatment. Our results showed no qualitative treatment–subgroup interactions for the short-term effect of treatment. However, with regard to the treatment maintenance from posttest to follow-up, qualitative treatment-subgroup interactions were found: In terms of improvement in parent-rated planning problems, adolescents with more comorbid anxiety symptoms but with low depressive symptoms fared better with PML, while adolescents with high depressive symptoms showed better results with SFT. Yet, when taking the change in symptoms of the identified subgroups into consideration from pretest to three-month follow-up, only the results in favor of PML were upheld. In addition, in terms of improvement in parent-rated ADHD symptoms, effects of PML were more positive for adolescents with more severe ADHD, while SFT showed more positive effects for adolescents with less symptoms of ADHD. Nevertheless, these results had small effect sizes only and these subgroups were not upheld in the course from pretest to follow-up. The most robust results were therefore, that adolescents with more anxiety symptoms but low levels of depression fared better with PML than with SFT with regard to improvement of parent-rated planning problems.
QUINT provided specific cut-off scores for treatment allocation (CDI: cut-off point = 11.50; SCARED: cut-off point = 19.50). But as this study is exploratory, and found subgroups should be replicated in a follow-up RCT, for now these cut-off scores should not be implemented in clinical practice. If these cut-off scores are replicated they could augment the clinical relevance of this study enabling treatment assignment. Nevertheless, as other factors not included in this study like treatment preference, could also influence treatment effectiveness, treatment allocation should not be based solely on cut-off scores . What is also important to note, is that the subgroups generated by QUINT that are based on these cut-off scores, do not represent the normal or clinical range of scores on these questionnaires. In other words, the interpretation of these scores depends on the norms of the corresponding questionnaire. For example, effects of PML were more positive than SFT for adolescents with a score on the CDI below 11.50, but a higher score than 19.50 on the SCARED. Depending on the age and gender of the participant, a score of 19.50 on the SCARED can be considered a low to average score as compared to the norm . The cut-off point of the CDI can be considered average . All in all, this means that for adolescents with low to average depression, and with some to high anxiety (but not necessarily a clinical anxiety level), PML is the treatment of preference.
Our findings regarding the moderating role of comorbid anxiety symptoms on treatment outcome, are in line with results of previous studies. For example, a non-randomized study on the effects of CBT in adolescents with ADHD that amongst other elements included training of planning and organizing skills , also showed that a subgroup of adolescents with comorbid anxiety improved more in comparison to adolescents with ADHD only . Moreover, in the MTA-study the subgroup of children with comorbid anxiety showed more improvement when receiving behavioral treatment    than those without comorbid anxiety. This might indicate a subgroup within ADHD that is particularly, or at least differently, sensitive to effects of (cognitive) behavioral treatment. The MTA Cooperative Group suggested that, as anxiety reduces after treatment of ADHD, at least some anxiety in individuals with ADHD might be attributed to the stress of ADHD-related problems . This is consistent with the idea that anxiety, in individuals with ADHD, is more strongly associated with negative affectivity and disruptive behavior than with fearfulness or phobic symptoms . If this would be the case, in our study one would expect anxiety not to be a moderator, because in our RCT two CBTs were compared and after both ADHD as well as anxiety had improved . Because the more anxious group of adolescents with ADHD showed more positive results when receiving a CBT focusing on planning skills than when receiving a CBT without such an aim, this suggests that the anxiety experienced by adolescents with ADHD could be attributed to poorer executive functions (e.g.,   ). This is supported by research showing increased working memory deficits and increased rates of sluggish cognitive tempo in children with ADHD and comorbid anxiety . However, this mediation hypothesis needs further testing.
An alternative explanation for the superior effects of PML as compared to SFT in the subgroup with low depression but higher anxiety (see Leaf 2, Figs 4 and 5), could be the change of use of medication for ADHD. Keep in mind that adolescents could not change medication status (yes/no) or dose during treatment, but could only change these after posttest. When testing this hypothesis exploratory, we found an (albeit nonsignificant) trend that in this specific subgroup, adolescents receiving PML increased their dose of methylphenidate after posttest (M = 11.12, SD = 21.46), while adolescents receiving SFT decreased their dose (M = -1.69, SD = 13.88), t(24) = 1.72, p = 0.08 (note that due to missing values on medication dose at pretest and follow-up, this test is based on a subset of the adolescents in Leaf 2). While this is the case for this specific subgroup, this treatment difference is not found for the whole group of adolescents in this study. This may imply that adolescents with low depression and higher anxiety that received PML have become more motivated to adhere to medication as compared to SFT. But this finding may also imply that the increases in methylphenidate were responsible for the treatment gains of adolescents receiving PML. However this hypothesis needs further testing.
Further, a notable result is that pretreatment characteristics may influence the effects of treatment differently during different phases of treatment. In this study the pretest to follow-up change in symptoms of the found qualitative subgroups, clearly shows that the PML and SFT treatment alternatives have a different effect on the change in planning problems during treatment and during treatment maintenance for different subgroups of adolescents. This supports the idea of two clearly separate treatments; both follow different trajectories that seem to converge to comparable endpoints after three months. Moreover, these results show that it is important to consider the phase of treatment when investigating moderator-effects: in this study differential effects were found in different phases of treatment. This may imply that different characteristics determine the effect of treatment during different phases. To our knowledge, previous studies on moderation of treatment effects in individuals with ADHD, have focused primarily on effects of treatment between pre- and posttest and may therefore miss potential moderators        . We would therefore advise to take the phase of treatment into account when interpreting the effects of moderators and to also investigate moderators of follow-up treatment effects in future studies.
This study has several limitations. First, in this study several adolescent characteristics but no family characteristics, such as parenting self-efficacy, were included as potential moderators. Second, because all adolescents were included in a treatment condition, their parents who completed ratings scales (i.e. outcome measures) were potentially biased by expectancy effects. Therefore, in future studies more objective tests (i.e. blinded measures) assessing ADHD core symptoms are needed. Also, due to the lack of an adequate control-group in our RCT like a waitlist or a treatment as usual group effectiveness of both treatments could not be proven . Third, as in all analyses without a priori hypotheses, there is no guarantee that for all results that are generated using QUINT, a clinically satisfactory interpretation can be provided. Fourth, a broad concern related to the goal of finding subgroups involved in treatment-subgroup interactions pertains to the risk of inferential errors. On the one hand, these include Type II errors, which reflect a possible lack of power to detect true interactions (e.g., ). In general, a reliable detection of interactions requires larger samples than a reliable detection of main effects , and perhaps considerably larger than those enrolled in traditional clinical trials in the field of behavior therapy research. On the other hand, one should also beware of Type I errors, that is, erroneous claims about the occurrence of apparent interactions that cannot be replicated in follow-up studies    . Tree-based methods are especially vulnerable at this point as they rely on a very large search space based on a very huge number of covariate split-point combinations and therefore need large sample sizes . Even though this study is the largest RCT in adolescents with ADHD to date, for QUINT to show replicable results, a larger sample size would be preferred. To be sure, we made use of a bias-corrected bootstrap procedure to prevent ourselves from overfitting the data at hand. However, Dusseldorp and Van Mechelen  state that the safest way to deal with QUINT, especially in case of applications with relatively smaller sample sizes, as an exploratory tool to generate meaningful hypotheses. Further, these hypotheses should be tested in follow-up confirmatory research with new RCTs that make use of a stratified sampling scheme in which the strata are constructed on the basis of the splitting variables and split points as identified by QUINT . As a final note, we would like to emphasize that the fact that QUINT (as all post-hoc methods) brings with it the risk of inferential errors does not preclude that in trials like the one studied in this paper the use of subgroup analyses such as QUINT can be meaningful. Conducting an RCT requires lots of time, money and effort, which is mainly an argument for getting as much information as possible out of the data. Exploratory analyses such as the one reported in this paper are still of great value. However, it is of utmost importance that results are regarded as tentative until they can be replicated.
Taken together, this study has several clinical implications. Our RCT showed improvements with large effect sizes for both PML and SFT, but without between-treatment differences on most measures (except for parent-rated planning problems ). The present study supplements these findings by showing that there were no significant moderators of qualitative treatment-subgroup interactions of therapeutic change from pretest to three months after treatment: Both treatments harvest comparable results. There was however a quantitative interaction present in the data, that showed that for adolescents with ADHD with comorbid anxiety symptoms in combination with low levels of comorbid depression, PML appears to be the treatment of preference. Of additional clinical relevance are the cut-off points generated by QUINT, with which clinicians can directly allocate adolescents to the most effective treatment. All in all, personalized treatment assignment appears unnecessary, as the effects of two CBTs are comparable for all adolescents with ADHD, except for a subgroup of adolescents for whom PML is the treatment of preference.
S2 Protocol. Translated trial study protocol.
Conceived and designed the experiments: SVDO HG PP BB LD IVM. Performed the experiments: BB. Analyzed the data: BB LD IVM. Contributed reagents/materials/analysis tools: IVM. Wrote the paper: LD BB SVDO.
- 1. Faraone SV, Sergeant J, Gillberg C, Biederman J. The worldwide prevalence of ADHD: Is it an American condition? World Psychiatry. 2003; 2(2): 104.
- 2. Polanczyk G, de Lima MS, Horta BL, Biederman J, Rohde LA. The worldwide prevalence of ADHD: A systematic review and metaregression analysis. The American Journal of Psychiatry. 2007; 164(6): 942–948. pmid:17541055
- 3. American Psychiatric Association. Diagnostic and statistical manual of mental disorders, Fourth Edition-Text Revision. Washington, DC.; 2000.
- 4. American Psychiatric Association. Diagnostic and statistical manual of mental disorders (5th ed.).: Washington, DC; 2013.
- 5. Biederman J, Mick E, Faraone SV. Age-dependent decline of symptoms of Attention-Deficit/Hyperactivity Disorder: Impact of remission definition and symptom type. American Journal of Psychiatry. 2000; 157(5): 816–818. pmid:10784477
- 6. Faraone SV, Biederman J, Mick E. The age-dependent decline of attention deficit hyperactivity disorder: a meta-analysis of follow-up studies. Psychological Medicine. 2006; 36: 159–165. pmid:16420712
- 7. Yoshimasu K, Barbaresi WJ, Colligan RC, Voigt RG, Killian JM, Weaver AL, et al. Childhood ADHD is strongly associated with a broad range of psychiatric disorders during adolescence: A population-based birth cohort study. Journal of Child Psychology and Psychiatry. 2012; 53(10): 1036–1043. pmid:22647074
- 8. Gershon J. A meta-analytic review of gender differences in ADHD. Journal of Attention Disorders. 2002; 5(3): 143–154. pmid:11911007
- 9. Kraemer HC, Wilson GT, Fairburn CG, Agras WS. Mediators and moderators of treatment effects in randomized clinical trials. Archives of General Psychiatry. 2002; 59(10): 877. pmid:12365874
- 10. Boyer BE, Geurts HM, Prins PJM, der Oord SV. Two novel CBTs for adolescents with ADHD: The value of planning skills. European Child & Adolescent Psychiatry. 2014.
- 11. Sibley MH, Kuriyan AB, Evans SW, Waxmonsky JG, Smith BH. Pharmacological and psychosocial treatments for adolescents with ADHD: An updated systematic review of the literature. Clinical Psychology Review. 2014; 34: 218–232. pmid:24632046
- 12. Antshel KM, Olszewski AK. Cognitive behavioral therapy for adolescents with ADHD. Child and adolescent psychiatric clinics of North America. 2014; 23: 825–842. pmid:25220089
- 13. Abikoff H, Gallagher R, Wells KC, Murray DW, Huang L, Lu F, et al. Remediating organizational functioning in children with ADHD: Immediate and long-term effects from a randomized controlled trial. Journal of Consulting & Clinical Psychology. 2013; 81: 113–128.
- 14. Evans SW, Owens JS, Bunford N. Evidence-based psychosocial treatments for children and adolescents with Attention-Deficit/Hyperactivity Disorder. Journal of Clinical Child and Adolescent Psychology. 2014; 43: 527–551. pmid:24245813
- 15. Kuin M, Boyer BE, Van der Oord S. Plan My Life PML., cognitive behavioral planning focused treatment for adolescents with ADHD Dutch: Zelf Plannen. Houten: Lannoo Campus; 2013.
- 16. Boyer BE, Kuin M, Van der Oord S. Solution Focused Treatment SFT, Dutch: Zelf Oplossingen Bedenken. Houten: Lannoo Campus; 2014.
- 17. Byar DP. Assessing apparent treatment-covariate interactions in randomized clinical trials. Statistics in Medicine. 1985; 4(3): 255–263. pmid:4059716
- 18. Dusseldorp E, Mechelen IV. Qualitative interaction trees: A tool to identify qualitative treatment-subgroup interactions. Statistics in Medicine. 2014; 33(2): 219–237. pmid:23922224
- 19. Ogrodniczuk JS, Piper WE, Joyce AS, McCallum M. Effect of patient gender on outcome in two forms of short-term individual psychotherapy. Journal of Psychotherapy Practice and Research. 2001; 10: 69–78. pmid:11264331
- 20. Shaffer JP. Probability of directional errors with disordinal (qualitative) interaction. Psychometrika. 1991; 56(1): 29–38.
- 21. Dixon DO, Simon R. Bayesian subset analysis. Biometrics. 1991; 47(3): 871–881. pmid:1742443
- 22. Hayward RA, Kent DM, Vijan S, Hofer TP. Multivariable risk prediction can greatly enhance the statistical power of clinical trial subgroup analysis. BMC medical research methodology. 2006; 6(1): 18.
- 23. Doove LL, Dusseldorp E, Van Deun K, Van Mechelen I. A comparison of five recursive partitioning methods to find subgroups involved in meaningful treatment-subgroup interactions. Advances in Data Analysis and Classification. 2014; 8: 403–425.
- 24. Doove LL, Van Deun K, Dusseldorp E, Van Mechelen I. QUINT: A tool to detect qualitative treatment-subgroup interactions in randomized controlled trials. Psychotherapy Research. 2015.
- 25. Jensen PS, Hinshaw SP, Kraemer HC, Lenora N, Newcorn JH, Abikoff HB, et al. ADHD comorbidity findings from the MTA Study: Comparing comorbid subgroups. Journal of the American Academy of Child & Adolescent Psychiatry. 2001; 40(2): 147–158.
- 26. March J, Swanson J, Arnold L, Hoza B, Conners C, Hinshaw S, et al. Anxiety as a predictor and outcome variable in the MTA study: Analysis using hierarchical linear modeling. Journal of Abnormal Child Psychology. 2000; 28: 527–541. pmid:11104315
- 27. MTA Cooperative Group. Moderators and mediators of treatment response for children with Attention-Deficit/Hyperactivity Disorder. Archives of General Psychiatry. 1999; 56(12): 1088. pmid:10591284
- 28. Owens EB, Hinshaw SP, Kraemer HC, Arnold LE, Abikoff HB, Cantwell DP, et al. Which treatment for whom for ADHD? Moderators of treatment response in the MTA. Journal of Consulting and Clinical Psychology. 2003; 71(3): 540–552. pmid:12795577
- 29. Rieppi R, Greenhill LL, Ford RE, Chuang S, Wu M, Davies M, et al. Socioeconomic status as a moderator of ADHD treatment outcomes. Journal of the American Academy of Child & Adolescent Psychiatry. 2002; 41(3): 269–277.
- 30. Van den Hoofdakker BJ, Nauta MH, v d Veen-Mulders L, Sytema S, Emmelkamp PMG, Minderaa RB, et al. Behavioral parent training as an adjunct to routine care in children with Attention-Deficit/Hyperactivity Disorder: Moderators of treatment response. Journal of Pediatric Psychology. 2010; 35(3): 317–326. pmid:19633060
- 31. Van den Hoofdakker BJ, Nauta MH, Dijck-Brouwer DAJ, van der Veen-Mulders L, Sytema S, Emmelkamp PMG, et al. Dopamine transporter gene moderates response to behavioral parent training in children with ADHD: A pilot study. Developmental Psychology. 2012; 48(2): 567–574. pmid:22182296
- 32. Van den Hoofdakker BJ, Hoekstra PJ, van der Veen-Mulders L, Sytema S, Emmelkamp PMG, Minderaa RB, et al. Paternal influences on treatment outcome of behavioral parent training in children with attention-deficit/hyperactivity disorder. European Child & Adolescent Psychiatry. 2014; 23(11): 1071–1079.
- 33. Jensen P, Arnold L, Swanson J, Vitiello B, Abikoff H, Greenhill L, et al. 3-Year follow-up of the NIMH MTA Study. Journal of the American Academy of Child & Adolescent Psychiatry. 2007; 46: 989–1002.
- 34. MTA Cooperative Group. A 14-month randomized clinical trial of treatment strategies for Attention-Deficit/Hyperactivity Disorder. Archives of General Psychiatry. 1999; 56(12): 1073. pmid:10591283
- 35. Molina BS, Hinshaw SP, Swanson JM, Arnold LE, Vitiello B, Jensen PS, et al. The MTA at 8 years: prospective follow-up of children treated for combined-type ADHD in a multisite study. Journal of the American Academy of Child & Adolescent Psychiatry. 2009; 48: 484–500.
- 36. MTA Cooperative Group. National institute of mental health multimodal treatment study of ADHD follow-up: 24-month outcomes of treatment strategies for attention-deficit/hyperactivity disorder. Pediatrics. 2004; 113: 754–761. pmid:15060224
- 37. Shaffer D, Fisher P, Lucas CP, Dulcan MK, Schwab-Stone ME. NIMH diagnostic interview schedule for children version IV (NIMH DISC-IV): Description, differences from previous versions, and reliability of some common diagnoses. Journal of the American Academy of Child & Adolescent Psychiatry. 2000; 39(1): 28–38.
- 38. Kort W, Schittekatte M, Bosmans M, Compaan E, Vermeir G, Verhaeghe P. Wechsler Intelligence Scale for Children-III: Handleiding. Amsterdam: Pearson; 2005.
- 39. Sattler JM. Assessment of Children: Cognitive Applications (4th edition). San Diego, CA.; 2001.
- 40. Taves DR. Minimization: a new method of assigning patients to treatment and control groups. Clinical Pharmacology & Therapeutics. 1974; 15: 443–453.
- 41. Oosterlaan J, Scheres A, Antrop I, Roeyers H, Sergeant JA. Handleiding bij de Vragenlijst voor Gedragsproblemen bij Kinderen VvGK. Lisse: Swets Test Publishers.; 2000.
- 42. Pelham WE, Gnagy EM, Greenslade KE, Milich R. Teacher ratings of DSM-III-R symptoms for the disruptive behavior disorders. Journal of the American Academy of Child & Adolescent Psychiatry. 1992; 31(2): 210–218.
- 43. DuPaul GJ. Parent and Teacher Ratings of ADHD Symptoms: Psychometric Properties in a Community-Based Sample. Journal of Clinical Child Psychology. 1991; 20: 245–253.
- 44. Gioa GA, Isquith PK, Guy SC, Kenworth L. Behaviour rating inventory of executive function. Odessa, FL: Psychological Assessment Resources. 2000.
- 45. Smidts DP, Huizinga M. Handleiding Executieve Functies Gedragsvragenlijst. Amsterdam: Hogrefe; 2009.
- 46. Gioia GA, Isquith PK, Kenworthy L, Barton RM. Profiles of Everyday Executive Function in Acquired and Developmental Disorders. Child Neuropsychology. 2002; 8: 121–137. pmid:12638065
- 47. Sitarenios G, Kovacs M. Use of the CDI. The Use of Psychological Testing for Treatment Planning and Outcomes Assessment. In. New Jersey: Lawrence Erlbaum Associates; 1999. 267–298.
- 48. Timbremont B, Braet C. Children's Depression Inventory, Nederlandstalige versie, Handleiding.: Swets & Zeitlinger; 2002.
- 49. Saylor CF, Finch AJJ, Spirito A, Bennett B. The Children's Depression Inventory: A Systematic Evaluation of Psychometric Properties. Vol. 52, No. 6, 955–967. Journal of Consulting and Clinical Psychology. 1984; 52: 955–967.
- 50. Birmaher B, Khetarpal S, Brent D, Cully M, Balach L, Kaufman J, et al. The screen for child anxiety related emotional disorders (SCARED): Scale construction and psychometric characteristics. Journal of the American Academy of Child & Adolescent Psychiatry. 1997; 36(4): 545–553.
- 51. Muris P, Bodden D, Hale W, Birmaher B, Mayer B. SCARED-NL: Handleiding bij de gereviseerde Nederlandse versie van de Screen for Child Anxiety Related Emotional Disorders. Amsterdam: Boom Test Uitgevers; 2007.
- 52. Fabiano GA, Pelham WE. Measuring iImpairment in children with Attention-Deficit Hyperactivity Disorder. The ADHD Report. 2002; 10(2): 6–10.
- 53. Fabiano GA, Pelham WEJ, Waschbusch DA, Gnagy EM, Lahey BB, Chronis AM, et al. A Practical Measure of Impairment: Psychometric Properties of the Impairment Rating Scale in Samples of Children With Attention Deficit Hyperactivity Disorder and Two School-Based Samples. Journal of Clinical Child & Adolescent Psychology. 2006; 35: 369–385.
- 54. Baraldi AN, Enders CK. An introduction to modern missing data analyses. Journal of School Psychology. 2010; 48: 5–37. pmid:20006986
- 55. Cohen J. Statistical Power Analysis for the Behavioral Sciences.: Routledge; 1988.
- 56. King MW, Resick PA. Data mining in psychological treatment research: A primer on classification and regression trees. Journal of Consulting and Clinical Psychology. 2014; 82(5): 895–905. pmid:24588404
- 57. Dusseldorp E, Van Mechelen I, Doove LL. quint: Qualitative Interaction Trees. R package version 1.0. 2013. Available: http://cran.r-project.org/package=quint.
- 58. Dusseldorp E, Doove L, van Mechelen I. Quint: An R package for the identification of subgroups of clients who differ in which treatment alternative is best for them. Behavior Research Methods. 2015.
- 59. Daley D, van der Oord S, Ferrin M, Danckaerts M, Doepfner M, Cortese S, et al. Behavioral interventions in attention-deficit/hyperactivity disorder: A meta-analysis of randomized controlled trials across multiple outcome domains. Journal of the American Academy of Child & Adolescent Psychiatry. 2014; 53(8): 835—847.e5.
- 60. Sonuga-Barke EJS, Brandeis D, Cortese S, Daley D, Ferrin M, Holtmann M, et al. Nonpharmacological interventions for ADHD: Systematic review and meta-analyses of randomized controlled trials of dietary and psychological treatments. The American Journal of Psychiatry. 2013; 170(3): 275–289. pmid:23360949
- 61. Van Brunt K, Matza LS, Classi PM, Johnston JA. Preferences related to attention-deficit/hyperactivity disorder and its treatment. Patient Preference and Adherence. 2011; 5: 33–43. pmid:21311700
- 62. Safren SA, Perlman CA, Sprich S, Otto MW. Mastering your adult ADHD: A cognitive-behavioral treatment program therapist guide.: Oxford university press.; 2005.
- 63. Antshel KM, Faraone SV, Gordon M. Cognitive behavioral treatment outcomes in adolescent ADHD. Journal of Attention Disorders. 2012; 18(6): 483–495. pmid:22628140
- 64. Airaksinen E, Larsson M, Forsell Y. Neuropsychological functions in anxiety disorders in population-based samples: Evidence of episodic memory dysfunction. Journal of Psychiatric Research. 2005; 39(2): 207–214. pmid:15589570
- 65. Eysenck MW, Derakshan N, Santos R, Calvo MG. Anxiety and cognitive performance: Attentional control theory. Emotion. 2007; 7(2): 336–353. pmid:17516812
- 66. Tucker DM, Derryberry D. Motivated attention: Anxiety and the frontal executive functions. Neuropsychiatry, Neuropsychology, and Behavioral Neurology. 1992; 5: 233–252.
- 67. Schatz DB, Rostain AL. ADHD with comorbid anxiety: A review of the current literature. Journal of Attention Disorders. 2006; 10(2): 141–149. pmid:17085624
- 68. Pocock SJ, Assmann SE, Enos LE, Kasten LE. Subgroup analysis, covariate adjustment and baseline comparisons in clinical trial reporting: Current practice and problems. Statistics in Medicine. 2002; 21: 2917–2930. pmid:12325108
- 69. Lee S, Lei MK, Brody GH. Confidence intervals for distinguishing ordinal and disordinal interactions in multiple regression. Psychological Methods. 2015; 20(2): 245–258. pmid:25844629
- 70. Dixon DO, Simon R. Bayesian subset analysis. Biometrics. 1991; 47: 871–881. pmid:1742443
- 71. Rothwell PM. Subgroup analysis in randomized controlled trials: Importance, indications, and interpretation. The Lancet. 2005; 365: 176–186.
- 72. Wang R, Lagakos SW, Ware JH, Hunter DJ, Drazen JM. Statistics in medicine: Reporting of subgroup analyses in clinical trials. New England Journal of Medicine. 2007; 357: 2189–2194. pmid:18032770