12 Aug 2014: The PLOS ONE Staff (2014) Correction: Effectiveness of Treatment Approaches for Children and Adolescents with Reading Disabilities: A Meta-Analysis of Randomized Controlled Trials. doi:10.1371/journal.pone.0105843
Children and adolescents with reading disabilities experience a significant impairment in the acquisition of reading and spelling skills. Given the emotional and academic consequences for children with persistent reading disorders, evidence-based interventions are critically needed. The present meta-analysis extracts the results of all available randomized controlled trials. The aims were to determine the effectiveness of different treatment approaches and the impact of various factors on the efficacy of interventions. The literature search for published randomized-controlled trials comprised an electronic search in the databases ERIC, PsycINFO, PubMed, and Cochrane, and an examination of bibliographical references. To check for unpublished trials, we searched the websites clinicaltrials.com and ProQuest, and contacted experts in the field. Twenty-two randomized controlled trials with a total of 49 comparisons of experimental and control groups could be included. The comparisons evaluated five reading fluency trainings, three phonemic awareness instructions, three reading comprehension trainings, 29 phonics instructions, three auditory trainings, two medical treatments, and four interventions with coloured overlays or lenses. One trial evaluated the effectiveness of sunflower therapy and another investigated the effectiveness of motor exercises. The results revealed that phonics instruction is not only the most frequently investigated treatment approach, but also the only approach whose efficacy on reading and spelling performance in children and adolescents with reading disabilities is statistically confirmed. The mean effect sizes of the remaining treatment approaches did not reach statistical significance. The present meta-analysis demonstrates that severe reading and spelling difficulties can be ameliorated with appropriate treatment. 
To make it possible to provide evidence-based interventions to children and adolescents with reading disabilities, research should make greater use of blinded randomized controlled trials.
Citation: Galuschka K, Ise E, Krick K, Schulte-Körne G (2014) Effectiveness of Treatment Approaches for Children and Adolescents with Reading Disabilities: A Meta-Analysis of Randomized Controlled Trials. PLoS ONE 9(2): e89900. doi:10.1371/journal.pone.0089900
Editor: Karen Lidzba, University Children’s Hospital Tuebingen, Germany
Received: July 6, 2013; Accepted: January 29, 2014; Published: February 26, 2014
Copyright: © 2014 Galuschka et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This research was supported by grants from the Bundesverband für Legasthenie und Dyskalkulie e.V. (BVL) and Deutsche Gesellschaft für Kinder- und Jugendpsychiatrie, Psychosomatik und Psychotherapie (DGKJP). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Children, adolescents, and adults with reading disability (dyslexia) experience a significant impairment in the acquisition of reading accuracy, reading fluency, reading comprehension, and spelling skills, which cannot be accounted for by low IQ, visual acuity problems, neurological damage, or poor educational opportunities. Reading disability has a genetic basis, and the underlying neurobiological and cognitive causes are widely debated. Impairments in auditory speech perception and processing, as well as deficits in visual attention and perception, are considered the main causes of reading and spelling difficulties in dyslexia. Reading and spelling deficits influence an individual’s performance in most academic domains. In addition, there is strong evidence of a link between reading disabilities and externalizing disorders, generalized anxiety, and school-related anxiety.
The evidence-based development and evaluation of interventions for children and adolescents with reading disabilities are, therefore, of particular importance. A large number of interventions and therapies, derived from various treatment approaches, have been developed and evaluated. Several systematic reviews have already summarized the findings of studies that evaluated the effectiveness of reading and spelling interventions. One of the most influential reviews of the research literature was conducted by the National Reading Panel (NRP) in 2000. The review reports important results about the effectiveness of different types of reading instruction. Its main finding was that systematic instruction in letter-sound relations and in blending sounds to form words is most effective for improving reading and spelling skills in disabled readers. Despite the importance of this finding, 13 years after its publication, the NRP review needs to be updated in order to integrate recent empirical findings.
However, most current systematic reviews focus on the effectiveness of one specific treatment approach. Other reviews address preventive methods for children at risk for reading disability. Since randomized controlled trials (RCTs) are not widely used in this research domain, current systematic reviews and meta-analyses often include not only RCTs, but also low-quality primary research (e.g., non-randomized research designs). To the best of our knowledge, no systematic review has been published to date that includes all available RCTs, without focusing on a specific treatment approach, and that integrates the results quantitatively with statistical methods.
The present meta-analysis has two advantages over previously published work. First, due to the inclusion of exclusively RCTs, the observed effect sizes can most likely be attributed to the intervention. Second, because all available RCTs are integrated, it is possible to compare the effectiveness of different treatment approaches.
The goal of this meta-analysis is twofold. The first aim is to determine the efficacy of different treatment approaches on reading and spelling performance of reading disabled children and adolescents. The second aim is to explore the impact of various factors on the efficacy of these treatment approaches.
An extensive literature search was conducted. We searched for intervention studies that were published until June 2013 in the databases ERIC, PsycINFO, PubMed, and Cochrane with the following search terms: “dyslexia” or “developmental reading disorder” or “developmental dyslexia” or “developmental reading disability” or “reading disorder” or “word blindness” or “spelling disorder” or “developmental spelling disorder” or “specific spelling disorder” combined with “intervention” or “treatment” or “therapy” or “therapeutics” or “training” or “remediation”.
We also examined bibliographical references of systematic reviews and primary studies. To check for unpublished RCTs, we searched the websites clinicaltrials.com and ProQuest. In addition, we contacted experts by sending an e-mail to each member of the mailing list of the Society for the Scientific Studies of Reading (SSSR).
Study Selection Criteria
To be considered for this review, studies must have met the following criteria: (a) the aim of the study was to examine the efficacy of an intervention or remediation programme for children and adolescents with reading disabilities; (b) the manuscript was written in English; (c) the study design included an untrained control group or a placebo training group; (d) group allocation was randomized, using either parallel-group or cluster randomization (quasi-randomized controlled trials were not selected); (e) study subjects were children, adolescents or adults (no studies with adults could be included) whose reading performance was below the 25th percentile or at least one standard deviation (SD), one year, or one grade below the expected level; (f) the study included subjects with intelligence in the normal range (IQ≥85, or described as having normal intelligence by the study author); (g) poor reading occurred in the mother tongue; (h) one or more reading or spelling tests were administered before and after treatment; and (i) pre- and post-test results of the reading or spelling tests were reported in sufficient detail to allow the calculation of an effect size or could be requested from the authors. Figure 1 summarizes the process of selecting studies for the meta-analysis.
Coding of the RCTs
Coding was done independently by the first author and an associate using a structured coding sheet. First, data necessary for effect size calculation (mostly means and standard deviations of pre- and post-tests) was extracted. Next, methodological characteristics, intervention characteristics, and sample characteristics were coded.
The methodological characteristics included: (a) the dependent variable (reading speed, reading comprehension, reading accuracy, pseudoword reading accuracy, pseudoword reading speed, nonword reading accuracy, nonword reading speed, or spelling); (b) the sample size; and (c) the administered reading test and spelling test. The intervention characteristics included: (a) treatment approach; (b) spelling/writing activities included (yes or no); (c) duration of the intervention in weeks; (d) total amount of intervention in hours; (e) setting (group, individual, or computer); and (f) conductor (professional or nonprofessional).
Treatment approaches were classified into distinct categories based on the description of the intervention in the report. The categories closely match the topic areas of the NRP review. The category phonemic awareness instruction includes interventions that foster the ability to recognize and manipulate phonemes in words. This implies tasks for recognizing phonemes in words, blending phonemes to words, segmenting a word into its phonemes, eliminating a phoneme from a word, or adding a phoneme to a word. All tasks are presented and performed orally. The category phonics instruction includes interventions that systematically teach letter-sound correspondences and decoding strategies that involve blending or segmenting individual letters or phonemes or dividing a spoken or written word into syllables or onsets and rimes. These interventions comprise reading and writing activities. The category reading fluency training includes interventions that contain repeated oral word reading practice or guided repeated word reading. These interventions aim to improve word recognition skills. The category reading comprehension training includes interventions that comprise tasks in which participants learn to extract textual information, summarize it, and relate it to existing knowledge. The category auditory training includes interventions in which subjects are confronted with non-linguistic auditory stimuli and are trained to identify and distinguish these stimuli. The category medical treatment includes interventions in which participants receive drugs to enhance their reading and spelling performance. The category coloured overlays includes interventions in which study subjects read with coloured filters or coloured overlays.
Finally, sample characteristics were coded. These included (a) age (mean and range) and (b) severity of reading impairment. The severity of reading impairment was identified by the inclusion criteria used in the trials and consists of three categories. The category severe reading disability includes studies in which participants’ reading performance was at least 2 SD below the expected value, below the 2.5th percentile, at least two years below grade level, or showed a discrepancy between chronological age and reading age of at least two years. The category moderate reading disability includes studies in which participants’ reading performance was at least 1 SD below the expected value, at least one year below grade level, below the 16th percentile, or showed a discrepancy between chronological age and reading age of at least one year. The category mild reading disability includes studies in which participants’ reading performance was below the 25th percentile.
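The percentile cut-offs in the severity coding can be illustrated with a minimal sketch. It covers only the percentile criteria (below the 2.5th, 16th, and 25th percentile); the SD, grade-level, and reading-age criteria are left out. The function name `classify_severity` is hypothetical, and the original coding was applied to each trial's inclusion criterion, not to individual scores.

```python
from typing import Optional

def classify_severity(percentile_cutoff: float) -> Optional[str]:
    """Map a trial's percentile inclusion cut-off to a severity category.

    Illustrative only: uses the percentile thresholds named in the text.
    Returns None when the cut-off is not below the 25th percentile.
    """
    if percentile_cutoff < 2.5:
        return "severe"
    if percentile_cutoff < 16:
        return "moderate"
    if percentile_cutoff < 25:
        return "mild"
    return None

# Example: a trial that included children below the 10th percentile
category = classify_severity(10)
```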
Data Extraction and Effect Size Calculation
To evaluate the efficacy of an intervention, the effect size Hedges’ g was calculated by dividing the difference between the post-test performance scores of the experimental group (EG) and the control group (CG) by their pooled standard deviation, and multiplying the result by a correction factor.
Formula 1 - Hedges g
M = mean; EG = experimental group; CG = control group; n = number of study subjects; s = standard deviation; df = degrees of freedom.
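The formula itself did not survive in this copy of the text. As a hedged illustration, the sketch below uses the standard textbook form of Hedges' g, which matches the verbal description and the symbols in the legend: the mean difference divided by the pooled standard deviation, multiplied by the small-sample correction 1 − 3/(4·df − 1) with df = n_EG + n_CG − 2. The function name and the example numbers are assumptions for illustration, not values from the study.

```python
import math

def hedges_g(m_eg, s_eg, n_eg, m_cg, s_cg, n_cg):
    """Post-test Hedges' g for an experimental (EG) vs. control (CG) group."""
    df = n_eg + n_cg - 2
    # Pooled standard deviation of the two groups
    s_pooled = math.sqrt(((n_eg - 1) * s_eg**2 + (n_cg - 1) * s_cg**2) / df)
    # Standardized mean difference before correction
    d = (m_eg - m_cg) / s_pooled
    # Small-sample correction factor
    correction = 1 - 3 / (4 * df - 1)
    return d * correction

# Hypothetical post-test scores: EG mean 55, CG mean 50, both SD 10, n = 20 each
g = hedges_g(55, 10, 20, 50, 10, 20)
```

With equal standard deviations of 10 the pooled SD is also 10, so the uncorrected difference is 0.5 and the correction shrinks it slightly.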
If studies included more than one intervention group, but only one control group, every comparison of an intervention group with the control group was treated separately as an individual study. As a consequence, the control group was used to compute several effect sizes which are not independent from each other. An overweighting of the effect sizes was counteracted by dividing the sample size of the control group by the number of intervention groups. Similarly, if several control groups, but only one intervention group, were included, each comparison of a control group with the intervention group was treated as an individual study and the sample size of the intervention group was divided by the number of control groups.
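A minimal sketch of this adjustment follows. The helper name `split_shared_group` is hypothetical, and the text does not say whether the divided sample size was rounded, so the sketch keeps the fractional value.

```python
def split_shared_group(n_shared: int, n_comparisons: int) -> float:
    """Effective sample size of a group that is reused in several comparisons.

    Assumption: the shared group's n is divided evenly and not rounded.
    """
    if n_comparisons < 1:
        raise ValueError("n_comparisons must be at least 1")
    return n_shared / n_comparisons

# Hypothetical trial: one control group of 30 compared with 3 intervention groups
n_cg_per_comparison = split_shared_group(30, 3)
```

Each of the three comparisons would then enter the analysis with a control-group size of 10, so the shared control group is not counted three times when the comparisons are weighted.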
To reduce the risk of under- or overestimating effect sizes, some effect sizes were corrected for pre-test differences. If the difference between the pre-test scores of the experimental and the control group corresponded to an effect size equal to or greater than 0.20 (g≥0.20), the post-test score of the experimental group was corrected by adding or subtracting the difference between the pre-test scores. The effect size was then calculated on the basis of the corrected post-test score and the (uncorrected) pooled standard deviation. This was done because the formula described above does not take pre-test differences into consideration, which leads to an over- or underestimation of the true magnitude of the effect if there are significant differences between the groups before the start of the intervention.
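The correction rule can be sketched as follows. The function name is hypothetical, and applying the threshold to the absolute value of the pre-test effect size is an assumption: the text states g≥0.20 but also speaks of adding or subtracting, which suggests that differences in either direction were corrected.

```python
def corrected_post_mean(m_eg_post, m_eg_pre, m_cg_pre, g_pre, threshold=0.20):
    """Return the EG post-test mean, adjusted for a pre-test imbalance.

    g_pre is the effect size of the pre-test difference (EG minus CG).
    Assumption: the threshold is applied to |g_pre|.
    """
    if abs(g_pre) >= threshold:
        # Remove the head start (or handicap) the EG had at pre-test
        return m_eg_post - (m_eg_pre - m_cg_pre)
    return m_eg_post

# Hypothetical scores: EG started 3 points ahead (pre-test g = 0.30)
adjusted = corrected_post_mean(m_eg_post=60, m_eg_pre=48, m_cg_pre=45, g_pre=0.30)
```

The adjusted mean would then be combined with the control group's post-test mean and the uncorrected pooled standard deviation to obtain the effect size.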
A maximum of two effect sizes were calculated for each comparison of an experimental group with a control group, one for reading performance and one for spelling performance. The following measures of reading performance were considered adequate for effect size calculation: reading accuracy, reading speed, reading comprehension, nonword reading speed, nonword reading accuracy, pseudoword reading speed or pseudoword reading accuracy. To determine spelling performance, tests measuring spelling accuracy were considered adequate.
Some studies used multiple reading and spelling tests to determine treatment efficacy, including standardized measures and non-standardized measures of learning transfer, as well as non-standardized measures whose tasks closely matched the training content. Effect sizes were calculated based on standardized measures, which are generally considered to be measures of learning transfer, if these were available. If standardized measures were not available, non-standardized measures of learning transfer were used for effect size calculation (n = 3 studies). Self-constructed measures that matched the training content were not used for effect size calculation, because these measures may not generalize to material not specifically taught. Thus, all effect sizes are based on measures of learning transfer. If a study reported results for several comparable tests (e.g., several standardized tests measuring different aspects of reading such as reading speed and comprehension), an average effect size was calculated from the effect sizes for individual tests, separately for reading and spelling performance.
Non-standardized dependent measures are suspected to overestimate the true magnitude of an effect. Although all effect sizes are based on measures of learning transfer, it cannot be ruled out completely that the inclusion of studies without standardized measures introduced an artifact. For this reason, the main analyses were run with and without studies that used non-standardized measures. First, the analyses were conducted with all studies that met the inclusion criteria outlined above (i.e., studies with standardized or non-standardized measures; n = 22 studies; see Table 1). Second, the analyses were run with those studies that included standardized measures (n = 19 studies).
For studies that did not report means and standard deviations, effect sizes were calculated on the basis of other measures, for example t-test or F-test values. If a study did not report sufficient data, more information was requested from the corresponding author. If this request failed, co-authors were contacted.
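The text does not state which conversion formulas were used. As an assumption, the sketch below applies the widely used conversions for two independent groups: d = t·sqrt((n1 + n2)/(n1·n2)), and t = sqrt(F) for an F-test with one numerator degree of freedom, followed by the usual small-sample correction. Function names are hypothetical.

```python
import math

def g_from_t(t, n_eg, n_cg):
    """Hedges' g from an independent-samples t value (sign follows t)."""
    d = t * math.sqrt((n_eg + n_cg) / (n_eg * n_cg))
    df = n_eg + n_cg - 2
    return d * (1 - 3 / (4 * df - 1))

def g_from_f(f, n_eg, n_cg, eg_higher=True):
    """Hedges' g from a one-degree-of-freedom F value.

    F carries no sign, so the direction of the difference must be
    supplied from the reported group means.
    """
    t = math.sqrt(f) if eg_higher else -math.sqrt(f)
    return g_from_t(t, n_eg, n_cg)

# Hypothetical report: t = 2.0 with 20 participants per group
g_t = g_from_t(2.0, 20, 20)
```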
The methodological quality of the included studies was assessed independently by the first author and an associate with the checklist for randomized controlled trials by the Scottish Intercollegiate Guidelines Network. To assess selection bias, it was determined if an adequate concealment method was used. Centralised allocation, computerised allocation systems, and the use of opaque envelopes were regarded as adequate methods of concealment. To assess performance/detection bias, it was determined if the study was blinded. Blinding of the participants and therapists is difficult to ensure in cognitive treatment trials. Therefore, it was only appraised if the assessment of the outcome measures was conducted by a blinded person. To assess reporting bias, it was determined if the data was adequately reported.
All analyses were performed using Biostat software “Comprehensive Meta Analysis Version 2.2.064”. Because of substantial differences between the treatment approaches that were evaluated in the included studies, there is no reason to assume that all studies share an identical true effect size. Consequently, a random effects model was used for the meta-analysis.
Of the randomized-controlled trials that were identified by the literature search, only 22 met all inclusion criteria and could be included in the meta-analysis. Interrater-agreement for article inclusion or exclusion exceeded κ = 0.786. All discrepancies were resolved by discussion. Coding reliabilities (percentage of interrater-agreement) for study characteristics and data extraction averaged 87%. Again, all discrepancies were discussed and resolved.
Specifications regarding the methodological quality of the included trials were often incomplete. A sufficient description of the allocation concealment was missing in each of the 22 trials. Sixteen trials did not specify whether the dependent variables were assessed by a blinded person. Two trials stated explicitly that the outcome measures were assessed by a person who was aware of the study subjects’ group affiliation. Four studies performed a blind assessment of treatment outcomes. It can therefore be concluded that most studies are at risk of bias. Data was considered as adequately reported in all of the included trials. One trial had to be excluded from the analysis due to lack of information regarding outcome data. Attempts to contact the authors failed.
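To illustrate what a random effects model does, the following sketch pools effect sizes with the DerSimonian-Laird estimator of between-study variance. The choice of estimator is an assumption: the text names only the software and the model type. Function and variable names are hypothetical and the input values are invented.

```python
import math

def random_effects_pool(effects, variances):
    """Pool effect sizes under a random-effects model (DerSimonian-Laird).

    Returns (pooled mean, standard error, tau^2, Q).
    """
    k = len(effects)
    w = [1 / v for v in variances]  # inverse-variance (fixed-effect) weights
    sum_w = sum(w)
    fixed_mean = sum(wi * gi for wi, gi in zip(w, effects)) / sum_w
    # Cochran's Q: weighted squared deviations from the fixed-effect mean
    q = sum(wi * (gi - fixed_mean) ** 2 for wi, gi in zip(w, effects))
    c = sum_w - sum(wi**2 for wi in w) / sum_w
    # Between-study variance, truncated at zero
    tau2 = max(0.0, (q - (k - 1)) / c) if c > 0 else 0.0
    w_star = [1 / (v + tau2) for v in variances]  # random-effects weights
    mean = sum(wi * gi for wi, gi in zip(w_star, effects)) / sum(w_star)
    se = math.sqrt(1 / sum(w_star))
    return mean, se, tau2, q

# Two hypothetical comparisons with equal sampling variance
mean, se, tau2, q = random_effects_pool([0.2, 0.4], [0.04, 0.04])
ci_95 = (mean - 1.96 * se, mean + 1.96 * se)
```

When Q does not exceed its degrees of freedom, tau² is set to zero and the result coincides with a fixed-effect analysis. Otherwise the extra variance widens the confidence interval and evens out the study weights.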
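For readers unfamiliar with κ, the sketch below computes Cohen's kappa from a rater-by-rater agreement table. The counts are invented for illustration and do not reproduce the screening data of this review.

```python
def cohens_kappa(table):
    """Cohen's kappa for a square agreement table.

    table[i][j] = number of items rater A put in category i
    and rater B put in category j.
    """
    size = len(table)
    total = sum(sum(row) for row in table)
    # Observed agreement: proportion of items on the diagonal
    p_obs = sum(table[i][i] for i in range(size)) / total
    # Chance agreement from the row and column marginals
    p_exp = sum(
        sum(table[i]) * sum(row[i] for row in table) for i in range(size)
    ) / total**2
    return (p_obs - p_exp) / (1 - p_exp)

# Hypothetical include/exclude decisions of two raters on 50 articles
kappa = cohens_kappa([[20, 5], [10, 15]])
```

In this made-up table the raters agree on 70% of the articles while 50% agreement is expected by chance, so kappa is 0.4.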
Table S1 presents an overview of the trials that are included in the meta-analysis. Thirteen of the 22 trials included more than one intervention group, and two trials included more than one control group. Therefore, the meta-analysis was computed with a total of 49 comparisons of an experimental and a control group. These comparisons comprised 1138 participants in the experimental groups and 764 participants in the control groups. Effect size data for each subgroup within a study is presented separately for reading and spelling performance in Figure 2 and Figure 3.
Funnel plot displays treatment efficacy on reading performance for each comparison of an experimental group with a control group. ADD = Adding phonemes; CG = Control group; DI = Direct instruction; DS = Decoding skills; EXC = Exceptional; LPA = Sound-symbol correspondence and phonemic awareness; MA = Morphological awareness; OWLS = Oral and written language skills; PADS = Phonemic awareness and decoding skills; PAT = Phonological awareness training; PG = Placebo-control group; PHAB = Phonological analysis and blending; REG = Regular; SMS = Specific motor sequence; SP = Speech perception; SRT = Strategy reciprocal teaching; TCS = Text content and structure; TOD = Temporal order detection; WIST = Word identification strategy training; WAT = Word analogy training; WW = Write a word.
Funnel plot displays treatment efficacy on spelling performance for each comparison of an experimental group with a control group. ADD = Adding phonemes; CG = Control group; DI = Direct instruction; DS = Decoding skills; EXC = Exceptional; MA = Morphological awareness; OWLS = Oral and written language skills; PAT = Phonological awareness training; PG = Placebo-control group; PHAB = Phonological analysis and blending; REG = Regular; SMS = Specific motor sequence; WIST = Word identification strategy training; WAT = Word analogy training; WW = Write a word.
The comparisons were distributed across the treatment approaches as follows: five reading fluency trainings, three phonemic awareness instructions, three reading comprehension trainings, 29 phonics instructions, three auditory trainings, two medical treatments, and four coloured overlays or lenses. One trial evaluated the effectiveness of sunflower therapy and another investigated the effectiveness of specific motor sequences. These two interventions could not be allocated to a category because they pursue an entirely different treatment approach. Results of the meta-analysis are reported separately for reading and spelling performance.
All included studies reported the results of reading measures, which made it possible to estimate each intervention’s efficacy regarding reading performance. Phonics instruction was investigated most often. This approach is the only one whose effectiveness on reading performance was statistically confirmed. The mean effect size for phonics instruction was g’ = 0.322 (95% CI [0.177, 0.467]; I2 = 0%). This suggests a small but statistically significant effect of phonics instruction on reading performance. The I2 statistic describes the proportion of observed dispersion that reflects real differences rather than differences that occur by chance. As can be seen in Table 1, the mean effect sizes of the remaining treatment approaches did not reach statistical significance. Subgroup analysis revealed no statistically significant difference between treatment approaches (p = .788).
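As a brief illustration of the I2 statistic, the sketch below uses its common definition in terms of Cochran's Q and the number of comparisons k, I2 = max(0, (Q − (k − 1))/Q) × 100. The text does not show how the software computed it, so this is the standard formula and not necessarily the exact routine used. The function name and inputs are hypothetical.

```python
def i_squared(q: float, k: int) -> float:
    """I2 in percent from Cochran's Q and the number of comparisons k."""
    df = k - 1
    if q <= 0:
        return 0.0
    # Share of the observed dispersion that exceeds what chance would produce
    return max(0.0, (q - df) / q) * 100

# Hypothetical values: Q = 40 across 29 comparisons
example = i_squared(40.0, 29)
```

An I2 of 0% arises whenever Q is no larger than its degrees of freedom, which means the observed spread of effect sizes is no greater than sampling error alone would produce.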
In addition, subgroup analyses were conducted to explore the influence of other variables (intervention and sample characteristics) on reading improvement. Results are displayed in Table 2. Studies that did not include or did not specify a certain variable were excluded from the subgroup analysis in question. In addition, it was not possible to define subgroups of age or grade level because children’s age and grade level showed considerable overlap between studies. Therefore, it was not possible to perform subgroup analyses with these variables.
The analyses revealed that intervention studies with mildly reading-disabled children and adolescents report a slightly higher mean effect size (g’ = 0.449; 95% CI [0.239, 0.659]; I2 = 0%) compared with studies that included moderately reading-disabled (g’ = 0.228; 95% CI [0.113, 0.342]; I2 = 31%) or severely reading-disabled (g’ = 0.305; 95% CI [0.033, 0.576]; I2 = 0%) study subjects. However, this difference did not reach statistical significance (p = .188).
Studies were allocated into three distinct subgroups depending on the amount of intervention that was provided. No significant difference (p = .250) was found between the mean effect size of interventions that lasted up to 14 hours (g’ = 0.351; 95% CI [0.181, 0.520]; I2 = 0%), interventions that lasted between 15 hours and 34 hours (g’ = 0.113; 95% CI [−0.148, 0.374]; I2 = 0%), and interventions with more than 35 hours (g’ = 0.371; 95% CI [0.172, 0.570]; I2 = 0%).
To compare the effects of interventions with short- and long-term duration, the studies were divided into two subgroups: (a) up to 12 weeks; and (b) more than 12 weeks. The cut-off value of 12 weeks was chosen because it results in two subgroups of equal size, making a statistical comparison between the two groups more appropriate. Interventions with a maximum duration of 12 weeks showed a small mean effect size of g’ = 0.261 (95% CI [0.155, 0.368]; I2 = 0%). Interventions that lasted more than 12 weeks tended to show higher effect sizes (g’ = 0.353; 95% CI [0.151, 0.554]; I2 = 12%). Again, this difference did not reach statistical significance (p = .432).
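A comparison of two subgroups can be approximated from the reported summary values alone. The sketch below recovers each standard error from its 95% confidence interval and applies a z-test to the difference. With two subgroups this is equivalent to a between-groups Q test with one degree of freedom. It is a rough check under a normal approximation, not a reconstruction of the software's computation, and the function names are hypothetical.

```python
import math

Z_95 = 1.959964  # two-sided 95% quantile of the standard normal distribution

def se_from_ci(lower, upper):
    """Standard error implied by a symmetric 95% confidence interval."""
    return (upper - lower) / (2 * Z_95)

def subgroup_difference(g1, ci1, g2, ci2):
    """z statistic and two-sided p value for the difference g2 - g1."""
    se_diff = math.sqrt(se_from_ci(*ci1) ** 2 + se_from_ci(*ci2) ** 2)
    z = (g2 - g1) / se_diff
    p = math.erfc(abs(z) / math.sqrt(2))  # two-sided normal tail probability
    return z, p

# Values reported in the text: up to 12 weeks vs. more than 12 weeks
z, p = subgroup_difference(0.261, (0.155, 0.368), 0.353, (0.151, 0.554))
```

With these inputs the sketch yields a p value of roughly 0.43, close to the reported p = .432. The small discrepancy is expected because the inputs are rounded to three decimals.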
To detect the impact of the setting on the success of an intervention three subgroups could be differentiated: (a) computer with teacher; (b) individual intervention; and (c) group intervention. The mean effect sizes of these subgroups did not differ significantly from each other (p = .403). The studies in the computer with teacher subgroup reached a mean effect size of g’ = 0.364 (95% CI [0.085, 0.643]; I2 = 0%), which was comparable to the mean effect size of group interventions (g’ = 0.379; 95% CI [0.211, 0.549]; I2 = 0%). Single subject interventions showed a small but significant mean effect size of g’ = 0.205 (95% CI [0.003, 0.407]; I2 = 57%).
Interventions that were conducted by the study author showed a high mean effect size (g’ = 0.806; 95% CI [0.397, 1.215]; I2 = 38%), whereas interventions that were conducted by teachers (g’ = 0.247; 95% CI [0.046, 0.449]; I2 = 0%) or special education therapists (g’ = 0.256; 95% CI [0.090, 0.422]; I2 = 0%) led to negligible mean effect sizes. Interventions that were conducted by students reached a small mean effect size (g’ = 0.400; 95% CI [−0.109, 0.909]; I2 = 0%). Although a trend could be identified, there was no significant difference between these subgroups (p = .088).
In addition, subgroup analysis showed that the mean effect size of studies that did not include spelling/writing activities is moderate and significantly greater than zero (g’ = 0.331; 95% CI [0.195, 0.467]; I2 = 0%). Interventions that included spelling/writing exercises showed a small effect on reading improvement that did not reach statistical significance (g’ = 0.152; 95% CI [−0.157, 0.451]; I2 = 32%). This difference did not reach statistical significance (p = .286).
Ten trials (containing 18 comparisons) conducted spelling tests before and after treatment. It was, therefore, possible to calculate 18 effect sizes for spelling. Only in case of phonics instruction was it possible to compute a mean effect size. The other treatment approach categories included only one study that assessed spelling performance. Ten studies evaluated the effect of phonics instruction on spelling performance. These revealed a small but statistically significant mean effect size (g’ = 0.336; 95% CI [0.062, 0.610]; I2 = 22%).
Again, subgroup analyses were conducted to explore the influence of other variables (intervention and sample characteristics) on the improvement of spelling performance. Because only a few studies were available, some subgroups comprised fewer categories than in the case of reading performance (see Table 3).
Studies with participants considered mildly reading-disabled (g’ = 0.415; 95% CI [0.089, 0.741]; I2 = 0%) showed a statistically significant mean effect size on spelling performance, whereas the effectiveness of studies with moderately reading-disabled study subjects (g’ = 0.157; 95% CI [−0.027, 0.340]; I2 = 28%) could not be statistically confirmed. However, the analysis revealed no statistically significant difference between these two categories of severity (p = .176).
Significant differences (p = .010) were found between the mean effect sizes of interventions that lasted up to 14 hours (g’ = 0.432; 95% CI [0.114, 0.749]; I2 = 14%), interventions that lasted between 15 hours and 34 hours (g’ = 1.140; 95% CI [0.404, 1.875]; I2 = 0%), and interventions with more than 35 hours (g’ = 0.059; 95% CI [−0.181, 0.300]; I2 = 0%). In contrast, it was found that interventions that lasted more than 12 weeks have a higher mean effect size (g’ = 0.314; 95% CI [−0.015, 0.643]; I2 = 0%) than interventions with a maximum duration of 12 weeks (g’ = 0.176; 95% CI [0.011, 0.341]; I2 = 13%). However, this difference failed to reach statistical significance (p = .462).
Interventions that were conducted by teachers (g’ = 0.099; 95% CI [−0.412, 0.610]; I2 = 0%) or special education therapists (g’ = 0.148; 95% CI [−0.082, 0.378]; I2 = 23%) led to negligible mean effect sizes. Interventions that were conducted by students reached a large mean effect size (g’ = 0.945; 95% CI [0.417, 1.474]; I2 = 0%). This difference reached statistical significance (p = .021).
The mean effect sizes of studies that investigated individually administered interventions and studies that investigated group interventions did not differ significantly from each other (p = .476). Single subject interventions showed a mean effect size of g’ = 0.488, which was not statistically greater than zero (95% CI [−0.061, 1.038]; I2 = 48%). Group interventions showed a mean effect size of g’ = 0.266 (95% CI [0.000, 0.532]; I2 = 14%).
The mean effect size of studies that did not include spelling/writing activities (g’ = 0.337; 95% CI [−0.038, 0.713]; I2 = 14%) did not differ significantly (p = .908) from the mean effect size of interventions that included spelling/writing exercises (g’ = 0.371; 95% CI [−0.067, 0.809]; I2 = 49%).
In the vast majority of studies (19 out of 22), the effect size calculation was based on standardized measures. Only three trials used non-standardized measures of learning transfer. These studies had evaluated phonics instructions, reading comprehension trainings, and a reading fluency training. Because the inclusion of studies with non-standardized measures might introduce an artifact (outlined above), the main analyses were rerun after these three studies were excluded.
Since only one study remained in the category ‘reading comprehension training’, it was not possible to calculate a mean effect size for this treatment approach. In the category ‘reading fluency training’ the exclusion of studies with non-standardized measures led to a minor change in the magnitude of the effect (Reading: g’ = 0.280; 95% CI [−0.072, 0.322]; n = 4). Interestingly, the mean effect sizes for phonics instruction are higher if trials using non-standardized measures are excluded from the analysis (Reading: g’ = 0.424; 95% CI [0.246, 0.601]; n = 25; Spelling: g’ = 0.376; 95% CI [0.065, 0.686]; n = 9). These findings demonstrate that the inclusion of studies with non-standardized measures in the present meta-analysis did not lead to an overestimation of the effect sizes and, therefore, does not confound the results.
A common problem in meta-analytic reviews across all disciplines is the phenomenon of publication bias. Publication bias occurs because statistically significant results are more likely to be published than non-significant results.
Only a small number of included studies assessed spelling performance. In addition, phonics instruction is the only treatment approach whose positive effect on reading performance is statistically confirmed. Therefore, publication bias was explored, by way of example, only for those studies that evaluated phonics instruction and used reading performance as the dependent variable. A funnel plot was used to explore the presence of publication bias. The shape of the funnel plot displayed asymmetry with a gap on the left of the graph. Using Duval and Tweedie’s trim and fill, the extent of publication bias was assessed and an unbiased effect size was estimated. This procedure added 10 imputed studies to the plot and led to an estimated unbiased effect size of g’ = 0.198 (95% CI [0.039, 0.357]) (see Figures 4 and 5, Table 4).
Funnel plot displays observed comparisons evaluating the efficacy of phonics instructions on reading performance.
Funnel plot displays observed and imputed comparisons evaluating the efficacy of phonics instructions on reading performance.
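Why selective publication produces a gap on the left of a funnel plot can be illustrated with a small simulation. This is a sketch with simulated data only, not a re-analysis of the included trials and not an implementation of trim and fill; the true effect, the range of standard errors, and the selection rule (significant results are always published, others with probability 0.2) are all hypothetical assumptions.

```python
import random
from statistics import mean

def simulate_published_effects(true_g=0.2, n_studies=2000, seed=1):
    """Simulate studies and apply a hypothetical publication filter."""
    rng = random.Random(seed)
    all_studies, published = [], []
    for _ in range(n_studies):
        se = rng.uniform(0.05, 0.40)   # small SE corresponds to a large trial
        g = rng.gauss(true_g, se)      # observed effect scatters around the true effect
        all_studies.append((g, se))
        significant = g / se > 1.96    # positive and statistically significant
        # Assumed selection rule: non-significant results are published 20% of the time
        if significant or rng.random() < 0.2:
            published.append((g, se))
    return all_studies, published

def small_study_mean(studies, se_cutoff=0.25):
    """Unweighted mean effect among imprecise studies (bottom of the funnel)."""
    return mean(g for g, se in studies if se > se_cutoff)

all_studies, published = simulate_published_effects()
print("imprecise studies, all:      ", round(small_study_mean(all_studies), 3))
print("imprecise studies, published:", round(small_study_mean(published), 3))
```

In this simulation the imprecise studies that survive the filter show a clearly larger mean effect than the full set of imprecise studies, because small or negative effects from small trials are rarely published. In a funnel plot, these missing results correspond to the gap on the lower left, which trim and fill attempts to compensate for by imputing mirror-image studies.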
The first aim of this meta-analysis was to determine the effectiveness of different treatment approaches on reading and spelling performance of children and adolescents with reading disabilities. The results reveal that phonics instruction is the most intensively investigated treatment approach. In addition, it is the only approach whose effectiveness on reading and spelling performance in children and adolescents with reading disabilities is statistically confirmed. This finding is consistent with those reported in previous meta-analyses. At the current state of knowledge, it is reasonable to conclude that the systematic instruction of letter-sound correspondences and decoding strategies, and the application of these skills in reading and writing activities, is the most effective method for improving literacy skills of children and adolescents with reading disabilities. Phonics instruction has been evaluated not only in English-speaking countries, but also in studies conducted in Spain, Finland, and Italy. Despite the widespread use of this approach, it is not yet clear whether these interventions are equally effective across languages. This question could not be addressed in the present analysis and needs to be addressed by further research.
Phonics instruction combines elements of reading fluency training and phonemic awareness training. Reading fluency trainings emphasize repeated word or text reading practice. The results of the present meta-analysis suggest that reading fluency training alone is not an effective way to enhance the reading and spelling skills of children and adolescents with reading disabilities, as was reported in a previous meta-analysis.
Phonemic awareness trainings are widely recognised as being effective for the remediation of preschool children at risk for reading disabilities. The present results demonstrate that when phonemic awareness interventions are provided to school-aged children and adolescents with reading difficulties, they do not have a significant effect on a child’s reading or spelling performance. This indicates that phonemic awareness and reading fluency trainings alone are not sufficient to achieve substantial improvements. However, the combination of these two treatment approaches, represented by phonics instruction, has the potential to increase the reading and spelling performance of children and adolescents with reading disabilities.
In terms of reading comprehension training, it was not possible to confirm a significant influence of this approach on literacy achievement. This result should be interpreted with caution because the present meta-analysis included only three comparisons that evaluated reading comprehension training. All three comparisons were conducted by the same author and they demonstrated negligible to small effect sizes. There is a clear need to complement these studies with further research.
The mean effect size of coloured lenses (Irlen lenses) did not reach statistical significance. Some studies compared the effect of coloured lenses to a placebo control group; other studies used an untreated control group instead. An interesting observation is that Irlen lenses showed small effect sizes if the experimental group was compared to an untreated control group. If the experimental group was compared to a placebo control group, effect sizes were negligible. This finding is in line with earlier systematic reviews that could not prove any positive effect of coloured lenses on literacy achievement, and suggests that results are mainly due to placebo effects.
Studies that tried to enhance reading and spelling skills of children and adolescents with reading disabilities by medication with the nootropic piracetam showed only minor effects, and the mean effect size for reading performance did not reach statistical significance. With the possibility of side effects in mind, the risks of medication seem to outweigh its benefits.
Auditory trainings intend to foster reading and spelling by focussing on the underlying causes of the poor performance. At first glance, this approach seems plausible, but the results of the present meta-analysis demonstrate that auditory trainings do not significantly improve children’s reading and spelling skills. Based on the results of the present meta-analysis and those reported by other systematic reviews and non-randomized trials, it can be concluded that focussing directly on literacy skills is effective, but the efficacy of interventions focussing on the underlying causes could not be confirmed to date.
The second aim of this meta-analysis was to explore the impact of various factors on the efficacy of interventions. The results of subgroup analyses do not allow clear conclusions about what makes an intervention successful. This may be caused by mutual confounding in the subgroup analyses, which means that each moderator could be confounded by any of the other moderators. This influences the observed association between moderator and outcomes and distorts the true magnitude of effects. As a consequence, the results of the performed subgroup analyses should be interpreted with caution. However, some findings are worth noting. First, subgroup analyses demonstrated that children and adolescents with mild reading disabilities show more improvement in literacy skills than more severely impaired participants. Second, interventions with higher amounts of treatment or longer durations of treatment seem to be more effective in improving literacy skills than therapies with small amounts of treatment or short-term interventions. Third, consistent with previous meta-analyses, it was found that interventions that were conducted by the study author tend to show higher effect sizes than interventions that were implemented by other persons. This suggests that solid and professional knowledge about reading disability in children and adolescents might enhance treatment efficacy. Meta-regression or hierarchical linear methods can be helpful to identify specific variables that influence the efficacy of an intervention. Due to the small number of included studies that distinguished or evaluated each variable, these statistical methods could not be applied in the present meta-analysis.
Unfortunately, it could not be assessed which intervention is particularly effective for a specific age or grade level. This was because many of the included trials comprised participants of a wide age span. Ever since the meta-analyses of the NRP in the year 2000, it has been apparent that interventions are not equally effective for different age groups or grade levels. Providing children of a wide age span with the same interventions is therefore not a recommended option for research settings and clinical practice.
The influence of publication bias was examined with funnel plots. Publication bias refers to the phenomenon that many studies remain unpublished because of negligible effect sizes or non-significant findings. This is presumably the case in this research domain. We examined publication bias, as an example, for the treatment approach of phonics instruction, but it can be assumed that this phenomenon is present in the other treatment approaches as well. Duval and Tweedie’s trim and fill analysis estimated the unbiased effect size to be small, but still statistically significant.
Consistent with prior research, this analysis demonstrated that severe reading and spelling difficulties can be ameliorated with appropriate treatment. The need for evidence-based interventions is obvious given the emotional and academic consequences for children with persistent reading disorders. To increase the informative value of studies, research in this domain should improve its methodological quality. Studies were often excluded from this analysis because of the absence of randomized allocation concealment. Randomization aims to ensure that known and unknown determining factors are spread equally across groups. Research has shown that when meta-analyses include studies whose allocation concealment is inadequate, effects of interventions can be misjudged. Each study that was included in our analysis was randomized, but due to missing methodological specifications the quality of randomization procedures could not be determined. An equally important aspect is the assessment of the dependent variables by a blinded person. It has been demonstrated that effects of interventions are exaggerated if the relevant outcome measures are not assessed in a blinded test situation. Therefore, effects can only be attributed to the conducted intervention if they are observed in a blinded randomized controlled trial with an adequate concealment technique. Unfortunately, most of the studies included in the present meta-analysis did not specify whether the dependent variable was assessed by a blinded person.
This meta-analysis comprises studies from various English-speaking and non-English-speaking countries such as Finland, Italy, Spain, and Brazil. To conduct a meaningful meta-analysis with an adequate number of comparisons, these studies could not be analyzed separately for different languages or groups of languages. The transferability of research findings from English-speaking countries to languages with more consistent orthographies and less syllabic complexity, and vice versa, is widely debated. It has been demonstrated that differences between languages affect children’s literacy acquisition and, therefore, it cannot be generally assumed that symptom-based treatment approaches are equally effective in each language.
The Anglo-American region far outweighs other countries in quantity and quality of the published work in this research domain. In order to be able to support children and adolescents with reading disabilities in different languages with evidence-based interventions, research in every country has to align with high quality standards. This refers in particular to the intensified application of blinded randomized controlled trials. Moreover, in order to resolve the question of the transferability of research findings across languages, cross-linguistic studies are required.
We would like to thank the authors of the studies included in this review for responding so willingly to repeated requests for information.
Analyzed the data: KG. Contributed reagents/materials/analysis tools: KG KK. Wrote the paper: KG EI. Conceived and designed the meta-analysis: KG EI KK GSK. Literature search and data extraction: KG. Contributed critical input: KG EI KK GSK.
- 1. Lyon GR, Shaywitz SE, Shaywitz BA (2003) A definition of dyslexia. Ann Dyslexia 53: 1–14. doi: 10.1007/s11881-003-0001-9
- 2. Scerri TS, Schulte-Körne G (2010) Genetics of developmental dyslexia. Eur Child Adolesc Psychiatry 19: 179–197. doi: 10.1007/s00787-009-0081-0
- 3. Schulte-Korne G, Bruder J (2010) Clinical neurophysiology of visual and auditory processing in dyslexia: a review. Clin Neurophysiol 121: 1794–1809. doi: 10.1016/j.clinph.2010.04.028
- 4. Goswami U, Fosker T, Huss M, Mead N, Szucs D (2011) Rise time and formant transition duration in the discrimination of speech sounds: the Ba-Wa distinction in developmental dyslexia. Dev Sci 14: 34–43. doi: 10.1111/j.1467-7687.2010.00955.x
- 5. Ziegler JC, Pech-Georgel C, Dufau S, Grainger J (2010) Rapid processing of letters, digits and symbols: what purely visual-attentional deficit in developmental dyslexia? Dev Sci 13: F8–F14. doi: 10.1111/j.1467-7687.2010.00983.x
- 6. Carroll JM, Maughan B, Goodman R, Meltzer H (2005) Literacy difficulties and psychiatric disorders: Evidence for comorbidity. J Child Psychol Psychiatry 46: 524–532. doi: 10.1111/j.1469-7610.2004.00366.x
- 7. Willcutt EG, Pennington BF (2000) Psychiatric comorbidity in children and adolescents with reading disability. J Child Psychol Psychiatry 41: 1039–1048. doi: 10.1111/1469-7610.00691
- 8. National Institute of Child Health and Human Development (2000) Report of the National Reading Panel. Teaching children to read: An evidence-based assessment of the scientific research literature on reading and its implications for reading instruction (NIH Publication No. 00–4769). Washington, DC: US Government Printing Office.
- 9. McArthur G, Eve PM, Jones K, Banales E, Kohnen S, et al. (2012) Phonics training for English-speaking poor readers. Cochrane Database Syst Rev 12: CD009115. doi: 10.1002/14651858.cd009115.pub2
- 10. Loo JHY, Bamiou DE, Campbell N, Luxon LM (2010) Computer-based auditory training (CBAT): benefits for children with language- and reading-related learning difficulties. Dev Med Child Neurol 52: 708–717. doi: 10.1111/j.1469-8749.2010.03654.x
- 11. Goodwin AP, Ahn S (2010) A meta-analysis of morphological interventions: Effects on literacy achievement of children with literacy difficulties. Ann Dyslexia 60: 183–208. doi: 10.1007/s11881-010-0041-x
- 12. Suggate SP (2010) Why what we teach depends on when: grade and reading intervention modality moderate effect size. Dev Psychol 46: 1556–1579. doi: 10.1037/a0020612
- 13. Elbaum B, Vaughn S, Hughes MT, Moody SW (2000) How Effective Are One-to-One Tutoring Programs in Reading for Elementary Students at Risk for Reading Failure? A Meta-Analysis of the Intervention Research. J Educ Psychol 92: 605–619. doi: 10.1037//0022-0663.92.4.605
- 14. Scammacca N, Roberts G, Vaughn S, Edmonds M, Wexler J, et al. (2007) Interventions for Adolescent Struggling Readers: A Meta-Analysis with Implications for Practice. Center on Instruction.
- 15. Swanson HL, Swanson HL (2001) Research on Interventions for Adolescents with Learning Disabilities: A Meta-Analysis of outcomes related to higher order processing. Elem School J 101: 331. doi: 10.1086/499671
- 16. Wanzek J, Vaughn S, Scammacca NK, Metz K, Murray CS, et al. (2013) Extensive Reading Interventions for Students With Reading Difficulties After Grade 3. Rev Educ Res 83: 163–195. doi: 10.3102/0034654313477212
- 17. Hedges LV (1981) Distribution theory for Glass’s estimator of effect size and related estimators. J Educ Behav Stat 6: 107–128. doi: 10.2307/1164588
- 18. Hedges LV (1982) Estimation of effect size from a series of independent experiments. Psychol Bull 92: 490–499. doi: 10.1037/0033-2909.92.2.490
- 19. Swanson HL (1999) Reading research for students with LD: a meta-analysis of intervention outcomes. J Learn Disabil 32: 504–532. doi: 10.1177/002221949903200605
- 20. Borenstein M, Hedges LV, Higgins JPT, Rothstein HR (2005) Comprehensive Meta-Analysis Version 2. Englewood, NJ: Biostat.
- 21. Bull L (2007) Sunflower therapy for children with specific learning difficulties (dyslexia): a randomised, controlled trial. Complement Ther Clin Pract 13: 15–24. doi: 10.1016/j.ctcp.2006.07.003
- 22. del Rosario Ortiz González M, Espinel AI, Rosquete RG (2002) Remedial interventions for children with reading disabilities: speech perception-an effective component in phonological training? J Learn Disabil 35: 334–342. doi: 10.1177/00222194020350040401
- 23. Heikkilä R, Aro M, Närhi V, Westerholm J, Ahonen T (2013) Does training in syllable recognition improve reading speed? A computer-based trial with poor readers from second and third grade. Sci Stud Read 17: 398–414. doi: 10.1080/10888438.2012.753452
- 24. Jiménez JE, Hernández-Valle I, Ramírez G, Ortiz Mdel R, Rodrigo M, et al. (2007) Computer speech-based remediation for reading disabilities: the size of spelling-to-sound unit in a transparent orthography. Span J Psychol 10: 52–67. doi: 10.1017/s1138741600006314
- 25. Kirk C, Gillon GT (2009) Integrated morphological awareness intervention as a tool for improving literacy. Lang Speech Hear Serv Sch 40: 341–351. doi: 10.1044/0161-1461(2008/08-0009)
- 26. Lovett MW, Borden SL, Warren-Chaplin PM, Lacerenza L, DeLuca T, et al. (1996) Text comprehension training for disabled readers: An evaluation of reciprocal teaching and text analysis training programs. Brain Lang 54: 447–480. doi: 10.1006/brln.1996.0085
- 27. Lovett MW, Lacerenza L, Borden SL, Frijters JC, Steinbach KA, et al. (2000) Components of effective remediation for developmental reading disabilities: Combining phonological and strategy-based instruction to improve outcomes. J Educ Psychol 92: 263–283. doi: 10.1037/0022-0663.92.2.263
- 28. Lovett MW, Ransby MJ, Hardwick N, Johns MS, Donaldson SA (1989) Can dyslexia be treated? Treatment-specific and generalized treatment effects in dyslexic children’s response to remediation. Brain Lang 37: 90–121. doi: 10.1016/0093-934x(89)90103-x
- 29. Lovett MW, Steinbach KA (1997) The effectiveness of remedial programs for reading disabled children of different ages: Does the benefit decrease for older children? Learn Disabil Q 20: 189–210. doi: 10.2307/1511308
- 30. Lovett MW, Warren-Chaplin PM, Ransby MJ, Borden SL (1990) Training the word recognition skills of reading disabled children: Treatment and transfer effects. J Educ Psychol 82: 769–780. doi: 10.1037/0022-0663.82.4.769
- 31. Murphy CF, Schochat E (2011) Effect of nonlinguistic auditory training on phonological and reading skills. Folia Phoniatr Logop 63: 147–153. doi: 10.1159/000316327
- 32. O’Shaughnessy TE, Swanson HL (2000) A comparison of two reading interventions for children with reading disabilities. J Learn Disabil 33: 257–277. doi: 10.1177/002221940003300304
- 33. Robinson GL, Foreman PJ (1999) Scotopic sensitivity/Irlen syndrome and the use of coloured filters: a long-term placebo-controlled study of reading strategies using analysis of miscue. Percept mot skills 88: 35–52. doi: 10.2466/pms.1999.88.1.35
- 34. Ryder JF, Tunmer WE, Greaney KT (2008) Explicit instruction in phonemic awareness and phonemically based decoding skills as an intervention strategy for struggling readers in whole language classrooms. Read Writ 21: 349–369. doi: 10.1007/s11145-007-9080-z
- 35. Sanchez E, Rueda MI (1991) Segmental awareness and dyslexia: Is it possible to learn to segment well and yet continue to read and write poorly? Read Writ 3: 11–18. doi: 10.1007/bf00554561
- 36. Tressoldi PE, Lonciari I, Vio C (2000) Treatment of specific developmental reading disorders, derived from single- and dual-route models. J Learn Disabil 33: 278–285. doi: 10.1177/002221940003300305
- 37. Bhattacharya A, Ehri L (2004) Graphosyllabic analysis helps adolescent struggling readers read and spell words. J Learn Disabil 37: 331–348. doi: 10.1177/00222194040370040501
- 38. Törmänen MRK, Takala M (2009) Auditory processing in developmental dyslexia: An exploratory study of an auditory and visual matching training program with Swedish children with developmental dyslexia. Scand J Psychol 50: 277–285. doi: 10.1111/j.1467-9450.2009.00708.x
- 39. DiIanni M, Wilsher CR, Blank MS, Conners CK, Chase CH, et al. (1985) The effects of piracetam in children with dyslexia. J Clin Psychopharmacol 5: 272–278. doi: 10.1097/00004714-198510000-00004
- 40. Wilsher CR, Bennett D, Chase CH, Conners CK, DiIanni M, et al. (1987) Piracetam and dyslexia: effects on reading tests. J Clin Psychopharmacol 7: 230–237. doi: 10.1097/00004714-198708000-00004
- 41. Mitchell C, Mansfield D, Rautenbach S (2008) Coloured filters and reading accuracy, comprehension and rate: a placebo-controlled study. Percept mot skills 106: 517–532. doi: 10.2466/pms.106.2.517-532
- 42. McPhillips M, Hepper PG, Mulhern G (2000) Effects of replicating primary-reflex movements on specific reading difficulties in children: a randomised, double-blind, controlled trial. Lancet 12: 537–541. doi: 10.1016/s0140-6736(99)02179-0
- 43. Dickersin K, Min YI (1993) Publication bias: the problem that won’t go away. Ann N Y Acad Sci 703: 135–146. doi: 10.1111/j.1749-6632.1993.tb26343.x
- 44. Duval S, Tweedie R (2000) Trim and fill: A simple funnel-plot-based method of testing and adjusting for publication bias in meta-analysis. Biometrics 56: 455–463. doi: 10.1111/j.0006-341x.2000.00455.x
- 45. Ehri LC, Nunes SR, Stahl SA, Willows DM (2001) Systematic phonics instruction helps students learn to read: Evidence from the National Reading Panel’s meta-analysis. Rev Educ Res 71: 393–447. doi: 10.3102/00346543071003393
- 46. Ehri LC, Nunes SR, Willows DM, Schuster BV, Yaghoub-Zadeh Z, et al. (2001) Phonemic awareness instruction helps children learn to read: Evidence from the National Reading Panel’s meta-analysis. Read Res Q 36: 250–287. doi: 10.3102/00346543071003393
- 47. Bus AG, van Ijzendoorn MH (1999) Phonological awareness and early reading: A meta-analysis of experimental training studies. J Educ Psychol 91: 403–414. doi: 10.1037//0022-0663.91.3.403
- 48. Döhnert M, Englert ED (2003) The Irlen syndrome-are there pathophysiologic correlates and scientific evidence for “reading with colors”? Z Kinder Jugendpsychiatr Psychother 31: 305–309.
- 49. Evans BJ, Drasdo N (1991) Tinted lenses and related therapies for learning disabilities: A review. Ophthalmic Physiol Opt 11: 206–217. doi: 10.1111/j.1475-1313.1991.tb00535.x
- 50. Wilsher CR, Taylor EA (1994) Piracetam in developmental reading disorders: A review. Eur Child Adolesc Psychiatry 3: 59–71. doi: 10.1007/bf01977668
- 51. Ise E, Engel RR, Schulte-Körne G (2012) Effective treatment of dyslexia: a meta-analysis of intervention studies. Kindh Entwickl 21: 122–136.
- 52. Berwanger D, von Suchodoletz W (2004) Trial of time processing training in children with reading and spelling disorders. Z Kinder Jugendpsychiatr Psychother 32: 77–84.
- 53. Thornton A, Lee P (2000) Publication bias in meta-analysis: its causes and consequences. J Clin Epidemiol 53: 207–216. doi: 10.1016/s0895-4356(99)00161-4
- 54. Pildal J, Hrobjartsson A, Jorgensen KJ, Hilden J, Altman DG, et al. (2007) Impact of allocation concealment on conclusions drawn from meta-analyses of randomized trials. Int J Epidemiol 36: 847–857. doi: 10.1093/ije/dym087
- 55. Schulz KF, Chalmers I, Hayes RJ, Altman DG (1995) Empirical evidence of bias. Dimensions of methodological quality associated with estimates of treatment effects in controlled trials. JAMA 273: 408–412. doi: 10.1001/jama.1995.03520290060030
- 56. Balk EM, Bonis PA, Moskowitz H, Schmid CH, Ioannidis JP, et al. (2002) Correlation of quality measures with estimates of treatment effect in meta-analyses of randomized controlled trials. JAMA 287: 2973–2982. doi: 10.1001/jama.287.22.2973
- 57. Seymour PH, Aro M, Erskine JM (2003) Foundation literacy acquisition in European orthographies. Br J Psychol 94: 143–174. doi: 10.1348/000712603321661859
- 58. Erickson K, Sachse S (2010) Reading acquisition, AAC and the transferability of English research to languages with more consistent or transparent orthographies. Augm Altern Commun 26: 177–190. doi: 10.3109/07434618.2010.505606
- 59. Ziegler JC, Bertrand D, Toth D, Csepe V, Reis A, et al. (2010) Orthographic depth and its impact on universal predictors of reading: a cross-language investigation. Psychol Sci 21: 551–559. doi: 10.1177/0956797610363406
- 60. Landerl K, Ramus F, Moll K, Lyytinen H, Leppanen PH, et al. (2013) Predictors of developmental dyslexia in European orthographies with varying complexity. J Child Psychol Psychiatry 54: 686–694. doi: 10.1111/jcpp.12029