Annualized relapse rates (ARR) in the placebo cohorts of phase-3 randomized controlled trials (RCT) of new treatments for relapsing remitting multiple sclerosis (RRMS) have decreased substantially during the last two decades. The causes of these changes are not clear. We consider a better understanding of this phenomenon essential for valuing the effects of new drugs and by designing new trials.
ARR, rate of relapse-free patients, trial start dates, baseline demographics, relapse definitions and the use of McDonald criteria were retrieved by literature research of the placebo cohorts from RRMS phase-3 trials. Predictors were estimated by univariate and multivariate regression analyses and random-effects meta-regression. In addition, regression models were calculated by the Sylvia Lawry Centre's (SLC), including individual case data from clinical trials performed until 2000. The most reliable meta-analytic results can be gained from pooled individual case data. In lack of this, random-effects meta-analyses are recommended.
Data from 12 published and one unpublished trial show a decrease of ARR from 1988 to 2012 (adjR2 = 0.807, p<0.0001). Regression models identified McDonald criteria followed by baseline mean age and the pre-study relapse rate as predictors of the ARR. The pooled individual case data (n = 505) confirmed a decrease of ARR over time. The pre-study relapse rate was the best predictor for on-study relapses. Lacking individual case data after implementation of the McDonald criteria excludes a direct comparison concerning McDonald criteria.
Pre-study relapse rate was the best predictor for on-study relapse rate but failed to explain the decrease of the ARR over time alone. Higher age at baseline and the implementation of McDonald criteria were associated as well with a lowered relapse rate in the random-effects meta-regression. These findings need further clarification based on individual case data.
Citation: Stellmann J-P, Neuhaus A, Herich L, Schippling S, Roeckel M, Daumer M, et al. (2012) Placebo Cohorts in Phase-3 MS Treatment Trials – Predictors for On-Trial Disease Activity 1990-2010 Based on a Meta-Analysis and Individual Case Data. PLoS ONE 7(11): e50347. https://doi.org/10.1371/journal.pone.0050347
Editor: Steven Jacobson, National Institutes of Health, United States of America
Received: May 4, 2012; Accepted: October 19, 2012; Published: November 29, 2012
Copyright: © 2012 Stellmann et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: The work was funded by the NEU2 consortium, which is supported by the Federal Ministry of Education and Research (BMBF) (Förderkennzeichen 0315610–0315620). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: J-PS, Christoph Heesen, SS and RM have received payment for lectures including service on speakers bureaus as well as travel/accomodation/meeting expenses (unrelated to the activities listed) from GlaxoSmithKline, Merck, Novartis, Biogen Idec, Teva. MD, Scientific Director of the Sylvia Lawry Centre for Multiple Sclerosis Research e.V. (SLC) and one of the two Managing Directors of Trium Analysis Online GmbH, serves on the Editorial Board of MedNous and holds Patent 10 2007 044 705.3–35, German patent and trademark office 307 19 449.3/09. The Sylvia Lawry Centre received honoraria for interviews of MD with Propagate Pharma Limited and Deerfield Research LLC; is Medical Advisor of the German Multiple Sclerosis Society; received honoraria for consultancy, statistical analysis and use of actibeltH technology from the following entities: Bayer Schering, Biopartners, Biogen Idec, Biogenerix, Böhringer-Ingelheim, EISAI Limited, Heron Evidence Development Ltd, Hoffmann-La Roche, Johnson & Johnson Pharmaceutical Research & Development LLC, Sanofi-Aventis U.S. INC., Novartis Pharma GmbH, University of Oxford, Imperial College London, University of Southampton, Charite Berlin, University of Vienna, Greencoat Ltd, University Medical Center Hamburg-Eppendorf; serves on the advisory board for EPOSA study and received research grants from the following governmental entities: Federal Ministry of Education and Research Grant No 01GI0904, 01GI0920, Mayo Clinic Rochester, European Union (for SLC and Trium) Grant No 215820, European Union (for SLC) Grant No 223865, Federal Ministry of Economics and Technology Grant No KF0564001KF7, University of Oxford, Technical University of Munich, Hertie Foundation Grant No 1.01.1/07/015, Bavarian Research Foundation, National Multiple Sclerosis Society (NMSS), Porticus Foundation Grant No 900.50578, European Union Grant No LSHM-CT-2006–03759, University of Rochester, European Union Grant No LSHM-CT-2004–503485. AN is employee of the Sylvia Lawry Centre for Multiple Sclerosis Research. MR is employee of Teva Pharma. This does not alter the authors' adherence to all the PLOS ONE policies on sharing data and materials.
Relapses have been used widely as primary outcome in clinical phase-3 trials in relapsing-remitting Multiple Sclerosis (RRMS). Their future role in MS trial design is under discussion as annualized relapse rates (ARR) in the placebo cohorts of the 2010 published Fingolimod and Cladribine trials (ARR = 0.4 respectively 0.33) were substantially lower than in comparable trials of the 1990 s (e.g Interferon-ß-1a i.m. ARR = 0.9 and s.c. ARR = 1.28). – This observation leads to a major design problem for upcoming RRMS trials: with existing treatment options as comparator larger trials compensating for overall low event rates are necessary. Trials will become far more expensive and difficult to conduct. The reason, why placebo groups of recent trials show a less active disease course, is unclear.
A meta-analysis of demographic data and key points of study protocols as predictors for relapse rates during the last 20 years appears feasible.
In 2005, when reduction of relapse rates over time was not yet under discussion, Held et al.  used multiple regression models to identify predictors of the ARR. Data for this study were obtained from the Sylvia Lawry Centre for MS Research (SLC) individual case data base consisting of data from several placebo cohorts of phase-2 and 3 studies. The ARR of placebo patients in RCTs up to the year 1999 correlated well with the pre-study relapse rate and inversely with the disease duration at baseline. A possible change of the behaviour of placebo cohorts over time was not assessed by this approach. MRI variables at baseline did not show an additional predictive value on relapse and disability outcomes in RRMS populations. ,  Inusah et al. were the first to analyse the reduction of the ARR over time (1981 till 2008) in placebo groups of phase-2 and 3 trials by univariate and multivariate regression analysis in 2010. They found a predictive value of lower baseline mean age and of the definition of relapses either with a minimal duration of 24 or 48 hours for higher ARR on study.  Their analysis was limited in including only ARR as disease activity criterion in the multivariate regression. Furthermore, they analysed RRMS as well as secondary-progressive MS (SPMS) cohorts in a heterogenous mixture of phase-2 and 3 studies, whereas recent large phase-3 trials were not included.
To the best of our knowledge, the phase-3 studies published till 2012 have not been included in a meta-analysis yet. Furthermore, a random-effects meta-regression has never been performed in this dataset. Random-effects meta-regression is recommended and widely used to analyse the heterogeneity of studies in a meta-analysis. – Heterogeneity derives from different trial designs or different demographic characteristics of study populations. As the variance of the ARR may reflect heterogeneity of studies, the above mentioned meta-regression methodology seems the most appropriate statistical method to evaluate different variables as possible moderators of this heterogeneity .
Based on this concept, we aimed at covering the widest possible time-frame of high quality phase-3 trials in RRMS to asses 1.) The predictive value of baseline demographic factors and 2.) The impact of the McDonald criteria (2001, revised 2005) for clinical disease activity.,  This is especially relevant as the actual revision of McDonald criteria has just been published allowing a MS diagnosis at the time of first symptoms.  By use of the SLC database, we wanted to 3.) test possible predictors from the meta-analysis based on individual case data.
Our statistical analysis plan was designed to combine three possible ways of analysing predictors for relapse outcomes of randomized placebo controlled phase-3 clinical trials of relapsing-remitting MS: The results of (1) univariate and multivariate regression analyses and (2) a random-effects meta-analysis were compared. Additionally, we (3) evaluated cases in the open part of the SLC database. Exclusion of CIS, SPMS and primary-progressive MS (PPMS) patients seemed reasonable to avoid a bias towards MS courses with lower relapse activity. Phase-2 studies were not included in the analysis for the following reasons: relapses are not the usual primary outcome in MS phase-2 trials and there are no available data on relapse-free patients from most of these studies. Finally short durations between 6 and 12 months make estimates of the ARR less reliable. The ARR and the rate of relapse-free patients after 2 years (RRF) within the placebo cohorts were focussed as outcomes of interest. Based on the difficulties of relapse diagnosis and the on-going debate on the relevance of relapse numbers for prognosis the free-of-relapse criterion seems the more robust relapse outcome as it less influenced by number of relapses per patient and simply differentiates between missing or on-going relapse activity.
SLC has been collecting anonymised data from various data donors of natural history studies and randomized controlled clinical trials. The data it selves have been collected by the data donors with appropriate consent forms. Without explicit permission of the data donor even the data donor is anonymised.
Selection of studies
At first, a comprehensive literature research within the Pubmed database was performed (last access in March 2012) by one reviewer (JPS) with the following keywords: ‘randomized placebo controlled trial in relapsing remitting multiple sclerosis’. The results of the electronic search (n = 253) was screened by headers and abstracts if available. Only full-length original English journal publications were reviewed to identify studies that met the following criteria (n = 11): (1) placebo-controlled double-blinded phase-3 trials in MS with a follow up of at least 18 months, (2) relapse related outcomes, (3) exclusion of SPMS, PPMS and CIS patients and (4) published between 1980 and March 2012. Record selection, exclusions and inclusion of studies according to the PRISMA guidelines are presented in figure 1. JPS and CH. performed exclusions.
From of the publications following data was extracted: the name of the first author, year of publication and number of patients in the placebo cohort; the trial start date, the ARR and the RRF; the baseline characteristics of these cohorts including mean age, mean disease duration, rate of females, mean pre-study relapse rate and mean EDSS; definition of disease duration (time since first symptoms or time since diagnosis), relapse definition (24 or 48 hours of minimal symptom duration) and whether the McDonald criteria were applied. Throughout this paper the ARR and the RRF are labelled as “outcomes” while trial start dates, baseline values and definitions are mentioned as variables. For all means the standard deviations (SD) were also extracted if published. Available confidence intervals or standard errors were converted to SD. Missing values and definitions were obtained from data on file by contacting authors and sponsors of the publications and trials. To avoid data copying mistakes, the final dataset was reviewed by two collaborators (JPS and SR).
Standard errors (SE) for the rate of relapse free patients were calculated as proposed by Gelmann and Hill. Missing SD of ARR means were estimated under the assumption, that relapses follow a Poisson distribution. For all continuous variables a descriptive analysis with median, range, mean and SD was conducted. Scatter plots and regression analyses were used to determine if ARR and RRF did show a significant change over time defined by the trial start dates. Adjusted coefficients of determination (adjR2) between ARR/RRF and baseline values as well as between trial start date and baseline values were calculated. Besides these univariate models, multivariate regression models were applied as well. The included variables were selected by stepwise backward selection on base of the Akaike information criterion (AIC) that indicates the relative goodness of fit.,  All regression models were weighted for the number of patients.
Within random-effect meta-analyses the observed outcomes are understood as a random selection of the normally distributed outcomes. Different designs and populations of these studies lead to an additional heterogeneity or variability of the true outcome. This can be taken into account by adding a random effect allowing an estimation of the heterogeneity with tau2 as the between-study variance. Within mixed effect models, variables may be added and analysed in their ability in reducing the heterogeneity of the pure random effects model.,  The reduction of tau2 indicates the association between outcome and variables.
Within our analysis we calculated I2 (proportion of heterogeneity among true effects of total variability) and tau2 (between study variance) for the pure random effects models. A detection of outliers was implemented. For the mixed-effect models we included each variable separately and calculated tau2 and its relative change compared to the pure random effect model as a measure of association between variable and outcome. Finally, we modelled a mixed-effects analysis with the two variables which reduced the heterogeneity most effectively as recommended by Houwelingen et al..
Besides descriptive statistics of the pooled placebo population of the SLC database (open part), Poisson regression models were used to examine the correlation of possible predictors with the ARR. Logistic regressions were calculated for the models concerning the outcome relapse free after 2 years. Date of study entry was used equivalent to the trial start date. To ensure the blinding of the SLC database and to avoid conclusions about a single study, date of study entry was constructed as ordinal variable with each rank including at least two studies. Other variables were selected according to the meta-regression analyses. The final multivariate models based on variable selection by AIC.
All analyses were performed with the open-source software R including the Hmisc and the metafor package., ,  The results of the random-/ mixed effect meta-analyses were revised with STATA®.
We identified 11 published trials –, – that met our inclusion criteria. One further placebo cohort from an unpublished trial (ORIGIMS – A randomised, parallel group, placebo-controlled, double-blind phase-3 study of Bioferon® (interferon beta-1a) in the treatment of relapsing-remitting multiple sclerosis) could be added to the simple and meta-regression analyses from the SLC database, since the study sponsor had given its explicit consent to allow a separated re-analysis of the placebo cohort data. A dataset from a 13th study could be added based on 2011 conference proceedings.,  The available data is presented in table 1. For all trials we could determine the RRF and calculate the corresponding SE. The ARR with corresponding SD was available from all studies except three. 9 Studies used the time since diagnosis as definition for the baseline disease duration while 4 trials used the time since first symptoms. These were labelled as missing for the analysis. All trials conducted before 2001 (n = 6) included only clinically definite MS-cases while the 7 most recent trials implemented the McDonald criteria (6 studies with the 2001 criteria, 1 study with the revised criteria from 2005). This dataset was used for the simple regression and the random-effects meta-regression. The SLC dataset of individual case data consisted of a pool of 505 placebo patients. The two datasets overlapped concerning pivotal trials from the 1990ies, the ORIGIMS data was only included in the first dataset and excluded from the individual case data due to blinding reasons. Individual case data from more recent trials was not available due to restrictive data sharing policies of the involved companies. Therefore, findings from the meta-regression could not be confirmed in a comparable individual case dataset.
Univariate and multivariate regression
There was a significant reduction of the ARR over time (adjR2 = 0.807, p<0.0001, Figure 2) while, inversely, the RRF increased (adjR2 = 0.728, p = 0.0001, Figure 3) with a high association between both outcome measures (adjR2 = 0.815, p<0.0001). In the subgroup of studies not using McDonald criteria, i.e. before 2001, change over time was not significant. Univariate regression models showed a correlation of lower ARR and higher RRF with higher age, lower pre-study relapse rate and the use of the McDonald criteria. Definition of relapse duration was not significantly associated with the ARR but tended to show more relapses in the subgroup of studies using 48 hours as minimal relapse duration (data not shown). Further on, baseline mean age increased significantly over time while the pre-study relapse rate decreased significantly (Table 2 and Figure 4). By stepwise reverse exclusion of variables based on AIC we finally modelled a multivariate regression with baseline mean age and trial start dates as best predictors for the ARR (adjR2 = 0.821, p<0.0001). The RRF was best predicted by McDonald criteria and baseline mean age (adjR2 = 0.831, p<0.0001).
n = 13. Circle size corresponds to weighting by patient numbers of each cohort. R2 is the proportion of variability in the data that is accounted for by the weighted univariate regression model.
N = 13. Circle size corresponds to weighting by patient numbers of each cohort. R2 is the proportion of variability in the data that is accounted for by the weighted univariate regression model.
N = 13. Circle size corresponds to weighting by patient numbers of each cohort. R2 is the proportion of variability in the data that is accounted for by the weighted univariate regression model.
In a second step we performed the random-effect meta-analysis. Models without predictive variables showed that the variance of outcomes alone could not explain the between-study variability. For ARR we calculated as follows: I2 = 98.9% (CI 97.8–99.7%) of total variability due to heterogeneity (tau2 = 0.16, p<0.0001). The corresponding value for the RRF was I2 = 97.4% (CI 95.0–99.1%, tau2 = 0.03, p<0.0001). No outliers could be detected by the “leave one out” method. Using start dates, baseline variables as well as the categorical definitions as single variables in each mixed-effect model for ARR and RRF, we found trial start dates, baseline mean age, rate of females, pre-study relapse rate and the use of McDonald criteria associated with both outcomes. Other variables failed to reduce tau2 significantly showing that there is no association with the outcomes in our analyses. Baseline mean EDSS, the definition of relapses and definition of disease duration even increased tau2 in some models. Results of all models calculated and the association of variables with outcomes, measured by their ability in reducing tau2, are presented in table 3. Models with two variables did not show a significant higher association with relapse outcomes than the model with the best single predictor, which were trial start dates for both outcomes ARR and RRF.
In the third step of our analysis plan the regression models were evaluated based on 505 pooled placebo patients of the SLC database with a follow-up of at least 2 years. Due to blinding reasons data from patients entering the trials after 1999 could not be included. Descriptive statistics are presented in table 4 and did not show obvious deviation from other RRMS populations. The list of variables included in the multivariate models was reduced compared to the models applied to the cohorts from the literature for the following reasons: (1) definition of relapse was not available; (2) definition of disease duration was always the same (since diagnosis) and (3) McDonald criteria could not be analysed as all patients had clinically definite MS. Within the Poisson regression model the pre-study relapse rate in the year before entry (coefficient estimate = 0.12, p<0.001) could be identified as significant and best predictor of the ARR besides the date of entry in trial by year (coefficient estimate = −0.05, p<0.001). Analysing the RRF lead to the same results and showed an interestingly rapid decrease in the risk of relapses in studies after 1994 as indicated by the coefficients (Table 5). Baseline disease duration, age and gender had no further predictive value within multivariate models.
Our results of the yet largest meta-analysis of placebo cohorts in MS phase-3 trials confirm the notion of a reduction of relapse rates from 1990 to 2012 published studies. Individual data models from the SLC database between 1980 and 1999 also supported those findings. The most reliable meta-analytic results can be achieved by the analysis of pooled individual patient data as simulation studies indicated. Interestingly our re-evaluation of the SLC data including the moderator “entry to trial date” lead to a final multivariate regression model without the pre-study disease duration as independent predictor. This extended the analysis of Held et al. from 2005 whose analysis did not consider “entry to trial date” and whose final model included the pre-study relapse rate and the pre-study disease duration. In fact, the actual analysis demonstrates that the date of study entry has a higher predictive value for on-study relapse rates than the pre-study disease duration.
Mixed-effect models applied on the 12 placebo cohorts from the literature and the 13th study from the SLC database suggested the McDonald criteria, baseline mean age and pre-study relapse rate as well as the rate of females as predictors for the on-study relapse activity. These models are more robust than simple regressions as mixed-effects models take unavoidable differences between the included trials into account.,  In our study, the congruence between the analysis of both relapse outcomes was higher in the mixed-effect models than in simple regression models.
The mixed-effect models showed a clear association between less relapses and the inauguration of the McDonald criteria that could be shown for the first time in a meta-analytic approach. Unfortunately, we could not test this finding on an individual case data base. To ensure blinding of individual placebo cohorts of RCTs only studies before 2000 could be considered in this analysis. This led to a data pool lacking RRMS cohorts diagnosed with McDonald criteria and a discrepancy between the study populations used for the meta-regression and the individual case data analyses. In how far the observed effects are based mainly on the fact that the use of McDonald criteria might be understood as a categorical time variable differentiating between older and younger studies, cannot be clarified based on the available data. Start dates and the use of McDonald criteria showed nearly the same predictive value in classic and random-effect regression models. Models with both variables did not further increase the predictive value of the univariate models. All other variables failed to neutralize these high correlations. A differentiation between a continuous or stepwise decrease of the ARR cannot be performed with the available data and prohibits estimating the ARR for future trials.
The correlation between new diagnostic criteria and disease activity refers to two major problems. First, defining more and more patients with a chronic-inflammatory but benign condition as MS patients leads to a further reduction of relapse rates in the continuously expanding MS population. This phenomenon was described by Sormani et al. as Will Rogers phenomenon in an cohort of 309 CIS patients in 2008. The increasingly diagnosed so-called ‘radiological isolated syndrome’ (RIS) highlights this problem as real relapse activity is already known to be rare in this group. Within an US RIS cohort of 41 MS-symptom-free patients with a MRI suggestive of MS only 59% showed MRI activity and only 10 patients (25%) developed at least a clinically isolated syndrome (CIS) after a median of 5.4 years. Secondly, results from treatment studies and prognostic factors within natural cohorts from the pre-McDonald era cannot be transferred to contemporary MS-cohorts. Therefore it is more pressing to find new prognostic criteria for disability than more sensitive, sub-clinical diagnostic criteria. Expanding the MS population towards RIS with more sensitive diagnostic criteria disseminates the MS stigma with a vague benefit for patients and physicians being highly ambivalent to recommend treatments at this stage.
The pre-study relapse rate was less predictive than McDonald criteria in the mixed-effect models. This may at least partially be affected by recruitment strategies between older and newer studies: In consequence of accepting uncertain or unspecific symptoms in the disease history to fulfil a required number of relapses for study inclusion the true pre-study relapse rate may be overestimated in the newer studies. Probably a stricter definition of pre-study relapses e.g. steroid treatment, documented motor, visual or brainstem symptoms might be useful to have a more robust estimate for historical relapse frequencies.
Besides the relevance of pre-study relapses, McDonald criteria and trial start dates in our mixed-effect models give additional statistically reliable and clinically meaningful results: Young female patients have a higher short-term risk for further relapses.
The correlation of age and on-study relapses within the literature cohorts confirm the analysis of Inusah et al. but stays in contrast to the SLC data, where age did not increase the predictive value of multivariate models including pre-study relapses and year of study entry. This finding might be affected by the lower mean age of the SLC patients. As in the case of McDonald criteria, the restricted availability of individual case data denied a clarification of this discrepancy. Possible explanations might be higher sensitivity of McDonald criteria in older patients with lower relapse activity or previous treatment with approved drugs. In addition, the availability of approved therapies since 1993 may have lead to a selection towards older and less active patients for studies. This might be as well the reason for the nearly stepwise decrease of the relapse risk after 1994 observed within the logistic regression of the individual case data. Future cohorts with younger patients might help to clarify the relevance of age and disease duration for onstudy disease activity. The finding by Inusah et al., that different definition of relapses by minimal duration (24 or 48 hours) influences the ARR, could not be confirmed by our data. A small effect may be missed by our reduced data set, as only 4 studies used the 48 hours definition.  But, as our data even show a tendency towards more relapses in the subgroup with the more rigorous definition, we do not identify real implications for future trial design in this.
According to our findings future RRMS trials need a careful definition of inclusion criteria and novel trial designs have to be discussed as solution for an unpredictable ARR. Recently, Nicholas et al. published simulation studies that proposed adaptive designs to control sample sizes and power..
Meta-analyses are always restricted by the available studies. Only 12 phase-3 RCTs in RRMS with a follow-up of at least 18 months have been published in the last 20 years. Negative MS trials showing no effect of an investigational drug are difficult to retrieve. In fact, low event rates in placebo groups might be a major reason for not showing treatment effects. This was the case of the ORIGIMS trial, which we could include in our analysis. As our systematic literature research did not reveal one single negative trial, a substantial publication bias must be stated. Therefore our data set probably even overestimates the true ARR. The low number of studies restricted our mixed-effect models to a maximum of two moderators per model and inhibited a direct and statistically meaningful comparison of different moderators. Due to missing data the MRI criteria and prior treatments could not be included in our analyses but might have impacted on the disease activity and therefore relapse rates as well.
Up to now all analysed predictors failed to satisfactorily explain the lowering of relapse rates in phase-3 trials over the last decades. Inclusion criteria for future phase-3 trials in RRMS with relapse-based outcomes should be adjusted based on the results of our analysis showing that higher age and lower pre-study relapse rates are associated with a lower on-study ARR. The relevance of prior disease modifying treatments, the change of relapse rates over time and even more important the effect of new diagnostic criteria must be further analysed on an individual case data base. We emphasize the importance to continue gathering and sharing the data of placebo cohorts – and treatment data – as it has been started in the SLC in 2001.
We are grateful for the data verification by Susan Reh. We thank Volker Heeschen from Novartis, Norbert Zessack from Merck Serono and Martin Suhr from Biogen Idec for the kind support in filling the data gaps of the phase-3 studies. Further on we appreciate the explicit consent of TEVA Pharma and biopartners to reanalyse the individual case data of selected placebo cohorts.
Conceived and designed the experiments: J-PS LH AN CH MD SS RM. Analyzed the data: J-PS AN LH. Contributed reagents/materials/analysis tools: MR. Wrote the paper: J-PS LH AN CH MD SS MR.
- 1. Kappos L, Radue EW, O’Connor P, Polman C, Hohlfeld R, et al.. (2010) A placebo-controlled trial of oral fingolimod in relapsing multiple sclerosis. New England Journal of Medicine: 387–401.
- 2. Giovannoni G, Comi G, Cook S, Rammohan K, Rieckmann P, et al. (2010) A placebo-controlled trial of oral cladribine for relapsing multiple sclerosis. The New England journal of medicine 362: 416–426 .
- 3. Jacobs LD, Cookfair DL, Rudick R a, Herndon RM, Richert JR, et al. (1996) Intramuscular interferon beta-1a for disease progression in relapsing multiple sclerosis. The Multiple Sclerosis Collaborative Research Group (MSCRG). Annals of neurology 39: 285–294 .
- 4. PRISMS (Prevention of Relapses and Disability by Interferon beta-1a Subcutaneously in Multiple Sclerosis) Study Group (1998) Randomised double-blind placebo-controlled study of interferon beta-1a in relapsing/remitting multiple sclerosis. PRISMS (Prevention of Relapses and Disability by Interferon beta-1a Subcutaneously in Multiple Sclerosis) Study Group. Lancet 352: 1498–1504 .
- 5. Held U, Heigenhauser L, Shang C, Kappos L, Polman C (2005) Predictors of relapse rate in MS clinical trials. Neurology 65: 1769–1773 .
- 6. Barkhof F, Held U, Simon JH, Daumer M, Fazekas F, et al. (2005) Predicting gadolinium enhancement status in MS patients eligible for randomized clinical trials. Neurology 65: 1447–1454 .
- 7. Daumer M, Neuhaus a, Morrissey S, Hintzen R, Ebers GC (2009) MRI as an outcome in multiple sclerosis clinical trials. Neurology 72: 705–711 .
- 8. Inusah S, Sormani MP, Cofield SS, Aban IB, Musani SK, et al.. (2010) Assessing changes in relapse rates in multiple sclerosis. Multiple sclerosis (Houndmills, Basingstoke, England). doi:10.1177/1352458510379246.
- 9. Lambert PC, Sutton a J, Abrams KR, Jones DR (2002) A comparison of summary patient-level covariates in meta-regression with individual patient data meta-analysis. Journal of clinical epidemiology 55: 86–94.
- 10. Higgins J, Green S, editors (2011) Cochrane Handbook for Systematic Reviews of Interventions Version 5.1.0 [updated March 2011]. Available . The Cochrane Collaboration. p.
- 11. Thompson SG, Sharp SJ (1999) Explaining heterogeneity in meta-analysis: a comparison of methods. Statistics in medicine 18: 2693–2708.
- 12. McDonald WI, Compston a, Edan G, Goodkin D, Hartung HP, et al. (2001) Recommended diagnostic criteria for multiple sclerosis: guidelines from the International Panel on the diagnosis of multiple sclerosis. Annals of neurology 50: 121–127.
- 13. Polman CH, Reingold SC, Edan G, Filippi M, Hartung H-peter, et al. (2005) Diagnostic Criteria for Multiple Sclerosis/: 2005 Revisions to the “ McDonald Criteria .”. Annals of Neurology 11: 840–846 .
- 14. Polman CH, Reingold SC, Banwell B, Clanet M, Cohen JA, et al.. (2011) Diagnostic criteria for multiple sclerosis: 2010 revisions to the “McDonald criteria.” Annals of Neurology: n/a-n/a. doi:10.1002/ana.22366.
- 15. Moher D, Liberati A, Tetzlaff J, Altman DG (2009) Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement. PLoS medicine 6: e1000097 .
- 16. Gelman A, Hill J (2009) Data Analysis Using Regression and Multilevel/Hierarchical Models. Cambridge: Cambridge University Press. p.
- 17. Fahrmeir L, Tutz G (2001) Multivariate statistical modelling based on generalized linear models. 2nd ed. New York: Springer. p.
- 18. Viechtbauer W (2010) Conducting meta-analyses in R with the metafor package. Journal of 36.
- 19. Houwelingen HCV, Arends LR, Stijnen T (2002) TUTORIAL IN BIOSTATISTICS Advanced methods in meta-analysis/: multivariate approach and meta-regression. Statistics in Medicine 624: 589–624 .
- 20. R Development Core Team (2010) R: A language and environment for statistical computing.
- 21. Harrell Jr FE (2010) Hmisc: Harrell Miscellaneous. R package version 3.8–3.
- 22. Polman CH, O Connor PW, Havrdova E, Hutchinson M, Kappos L, et al. (2006) A randomized, placebo-controlled trial of natalizumab for relapsing multiple sclerosis. New England Journal of Medicine 354: 899.
- 23. Achiron A, Gabbay U, Gilad R, Hassin-Baer S, Barak Y, et al. (1998) Intravenous immunoglobulin treatment in multiple sclerosis. Effect on relapses. Neurology 50: 398–402.
- 24. IFNB Multiple Sclerosis Study Group (1993) Interferon beta-lb is effective in relapsing-remitting multiple sclerosis. Neurology: 655–661.
- 25. Johnson KP, Brooks BR, Cohen JA, Ford CC, Goldstein J, et al. (1995) Copolymer 1 reduces relapse rate and improves disability in relapsing-remitting multiple sclerosis: results of a phase III multicenter, double-blind placebo-controlled trial. The Copolymer 1 Multiple Sclerosis Study Group. Neurology 45: 1268–1276.
- 26. Fazekas F, Deisenhammer F, Strasser-Fuchs S, Nahler G, Mamoli B (1997) Randomised placebo-controlled trial of monthly intravenous immunoglobulin therapy in relapsing-remitting multiple sclerosis. Austrian Immunoglobulin in Multiple Sclerosis Study Group. Lancet 349: 589–593.
- 27. Comi G, Jeffery D, Kappos L, Montalban X, Boyko A, et al. (2012) Placebo-controlled trial of oral laquinimod for multiple sclerosis. The New England journal of medicine 366: 1000–1009 .
- 28. O'Connor P, Wolinsky JS, Confavreux C, Comi G, Kappos L, et al. (2011) Randomized trial of oral teriflunomide for relapsing multiple sclerosis. The New England journal of medicine 365: 1293–1303 .
- 29. Gold R, Kappos L, Bar-Or A, Arnold D, Giovannoni G, et al.. (2011) Clinical efficacy of BG-12, an oral therapy, in relapsing-remitting multiple sclerosis: data from the phase 3 DEFINE trial. In: ECTRIMS. Amsterdam.
- 30. Selmaj K, Gold R, Kappos L, Dl A, Giovannoni G, et al.. (2011) P994 Safety and Tolerability of BG-12 in the Phase 3 DEFINE Trial in Patients with Relapsing-Remitting Multiple Sclerosis. Multiple Sclerosis: 994.
- 31. Sormani MP, Tintorè M, Rovaris M, Rovira A, Vidal X, et al. (2008) Will Rogers phenomenon in multiple sclerosis. Annals of neurology 64: 428–433 .
- 32. Okuda DT, Mowry EM, Beheshtian a, Waubant E, Baranzini SE, et al. (2008) Incidental MRI anomalies suggestive of multiple sclerosis: The radiologically isolated syndrome. Neurology 72: 800–805 .
- 33. Nicholas R, Straube S, Schmidli H, Schneider S, Friede T (2011) Trends in annualized relapse rates in relapsing-remitting multiple sclerosis and consequences for clinical trial design. Multiple sclerosis (Houndmills, Basingstoke, England). doi:10.1177/1352458511406309.
- 34. Giovannoni G, Comi G, Cook S, Rammohan K, Rieckmann P, et al. (2010) A placebo-controlled trial of oral cladribine for relapsing multiple sclerosis. The New England journal of medicine 362: 416–426 .