Figures
Abstract
The 32-item Motor Function Measure (MFM32) is an assessment of motor function, and its measurement properties were established in a broad neuromuscular disease population. This study sought to investigate the reliability, validity, and ability to detect change of MFM32 in individuals with Type 2 and non-ambulant Type 3 spinal muscular atrophy (SMA). Data were used from the Phase 2 study assessing the efficacy and safety of olesoxime. A total of 110 individuals with Type 2 or 3 SMA were included in the analyses. Test-retest reliability (intraclass-correlation coefficient in global impression-defined stable individuals), internal consistency (Cronbach’s alpha), convergent validity (Spearman rank order correlations with other measures), known-groups validity (analysis of covariance comparing Hammersmith Functional Motor Scale -defined groups), and ability to detect change (analysis of covariance comparing global impression-defined groups) were calculated. Strong evidence of test-retest reliability (intraclass-correlation coefficient = 0.93–0.95), internal consistency (Cronbach’s alpha = 0.89), convergent validity (Hammersmith Functional Motor Scale: rho = 0.87; forced vital capacity: rho = 0.61), known-groups validity (all p<0.0001), and ability to detect change (all p<0.001) were demonstrated. These results provide evidence of the MFM32’s measurement properties, supporting its use in longitudinal research in individuals with Type 2 and non-ambulant Type 3 SMA.
Citation: Trundell D, Le Scouiller S, Le Goff L, Gorni K, Vuillerot C (2020) Assessment of the validity and reliability of the 32-item Motor Function Measure in individuals with Type 2 or non-ambulant Type 3 spinal muscular atrophy. PLoS ONE 15(9): e0238786. https://doi.org/10.1371/journal.pone.0238786
Editor: Andrea Martinuzzi, IRCCS E. Medea, ITALY
Received: January 21, 2020; Accepted: August 21, 2020; Published: September 18, 2020
Copyright: © 2020 Trundell et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: As employees of Roche, DT, KG, and SS had direct access to the data at the time of the study. However, Roche is committed to supporting scientific research and have processes in place to ensure that patient level data are shared and used responsibly. Qualified researchers may request access to individual patient level data through the clinical study data request platform (www.vivli.org). Further details on Roche's criteria for eligible studies are available here (https://vivli.org/members/ourmembers/). For further details on Roche's Global Policy on the Sharing of Clinical Information and how to request access to related clinical study documents, see here (https://www.roche.com/research_and_development/who_we_are_how_we_work/clinical_trials/our_commitment_to_data_sharing).
Funding: This study is sponsored by F. Hoffmann-La Roche Ltd. The funder provided support in the form of salaries for authors [DT, KG and SS], but did not have any additional role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript. The specific roles of these authors are articulated in the ‘author contributions’ section. Writing and editorial assistance was provided by Megan Allen, PhD, of MediTech Media, UK (funded by F. Hoffmann-La Roche Ltd) in accordance with Good Publication Practice (GPP3) guidelines (http://www.ismpp.org/gpp3).”
Competing interests: The authors have read the journal's policy and have the following competing interests: DT is a shareholder and employee of Roche Products Ltd. SS is an employee of Roche Products Ltd. CV is a PI for Trophos and Roche clinical trials and has received consultancy fees from Roche, Biogen, and Avexis. KG is an employee and a shareholder of F. Hoffmann-La Roche Ltd. LG has accepted a congress invitation from Biogen. This does not alter the author’s adherence to PLOS ONE policies on sharing data and materials. The authors would like to declare the following with this research: Roche is developing risdiplam (Evrysdi™) for SMA, and the MFM was used as the primary endpoint.
Introduction
Spinal muscular atrophy (SMA) is a rare and severe progressive neuromuscular disease that causes muscle atrophy and disease-related complications which affect the whole body [1, 2]. SMA is caused by reduced levels of the survival of motor neuron (SMN) protein due to deletions and/or mutations of the SMN1 gene [3]. A second paralogue gene, SMN2, produces low levels of functional SMN protein that are not sufficient to fully compensate deficiency due to the lack of an SMN1 gene [4–6]. SMA manifests in various degrees of severity, all of which exhibit progressive muscle weakness and mobility impairment [7].
The clinical presentation of SMA ranges in disease severity and affects infants through adults. Rather than one single, well-delineated phenotype, SMA represents a spectrum of motor and functional disabilities, characterized by age of onset and highest motor milestone achieved. Patients are typically categorized into four main subtypes [8, 9]. SMA Types 2 and 3, the focus of this study, reflect a continuum of motor function impairment from individuals with poor fine motor function to individuals who are still ambulant, including overlap between stronger Type 2 and non-ambulant Type 3 individuals. Type 2 SMA usually manifests between 6 and 18 months of age with patients unable to walk without support [8, 10, 11]. Type 3 SMA is less severe, typically manifesting after 18 months of age, with patients achieving the ability to stand and walk without support until the disease progresses [10, 11]. The level of gross motor function development achieved varies across Type 2 and 3 SMA individuals, and is followed by progression, removing and limiting previously acquired function.
Valid and reliable measures of SMA-related outcomes allow for a better understanding of the natural history of all SMA types and of subgroups of disease trajectory [12–14]. Additionally, assessments which are sensitive to change and measure symptoms and outcomes that are meaningful to individuals with SMA are essential for assessing therapeutic strategies to treat SMA. Due to a heterogenous patient population, in terms of age and clinical phenotype, measures of treatment outcomes in SMA must be able to detect changes across the disease spectrum for which they are intended to be used [15].
Among them, the 32-item Motor Function Measure (MFM32) is an assessment of motor function for use in individuals with neuromuscular disease. Reliability and validity of the MFM32 have been demonstrated in a group of individuals with neuromuscular disorders [16]. The MFM32 has been assessed in an SMA population, with evidence of convergent validity (strongest associations with measures of strength and upper limb function) and known-groups validity (discriminating between sitters and non-sitters, and between ambulant and non-ambulant individuals) demonstrated [17]. Additionally, preliminary evidence of responsiveness was demonstrated in a small sample of SMA Type 2 (n = 12) and Type 3 (n = 19) individuals [18]. Furthermore, the MFM32 has been shown to be targeted to assessing individuals across the non-ambulant spectrum [19]. Although fit to the Rasch model was suboptimal in that study, this is due to the multidimensionality of the MFM32. This was highlighted by a more recent analysis in patients with neuromuscular disorders, which demonstrated good fit to a three-dimension confirmatory factor analysis model [20].
Despite these studies, there remains a gap in understanding the measurement properties of the MFM32 in an SMA population (e.g., reliability and ability to detect change). This is particularly important in the context of ongoing clinical research. For example, MFM32 score is the primary efficacy outcome assessment in SUNFISH (NCT02908685), a placebo-controlled, randomized study assessing the efficacy and safety of risdiplam in individuals with Type 2 and non-ambulant Type 3 SMA. Understanding the measurement properties of the MFM32 is critical for interpreting the results of this and future clinical trials that use the assessment scale.
The aim of this study is to assess the measurement properties of the MFM32 in a Type 2 and non-ambulant Type 3 SMA population. A population similar to SUNFISH was studied in the Phase 2, adaptive, parallel-group, double- blind, randomized, placebo-controlled, multicenter, multinational study designed to assess the efficacy and safety of olesoxime over 24 months in 3 to 25-year-old individuals with Type 2 and non-ambulatory Type 3 SMA (NCT01302600). Such datasets provide convenient samples for assessing measurement properties of the scales included. The purpose of this analysis was to retrospectively investigate the validity, reliability, and ability to detect change, of the MFM32 in children, adolescents and young adults with Type 2 and non-ambulant Type 3 SMA, using data from the Phase 2 study.
Material and methods
Ethics
For this study, all patients or their parent or guardian provided written informed consent before screening. The study was approved by local institutional review boards and ethics committees including Federal Institute for Drugs and Medical Devices (BfArM; Germany), Rejestracji Produktów Leczniczych Wyrobów Medycznych i Produktów Biobójczych (Poland), Division R&D Federal Agency for Medicines and Health Products (FAMHP; Belgium), Central Committee on Research Involving Human Subjects (CCMO; Netherlands), Medicines and Healthcare Products Regulatory Agency (United Kingdom), The French Health Product Safety Agency (AFSSAPS; France), and the Ethics Committee for Medical Experimentation at the Ospedale Pediatrico Bambino Gesù (Italy). Informed consent was obtained prior to any study-related procedure.
Analysis population
A total of 165 patients (randomized 2:1 to daily, oral 10 mg/kg olesoxime or placebo) were recruited in Belgium, France, Germany, Italy, the Netherlands, Poland, and the UK. The screening visit included a standard clinical examination, biological tests and SMA status assessments which were part of the routine care. The double-blind treatment period lasted 104 weeks [21]. Key inclusion criteria [21] were weakness and hypotonia consistent with a clinical diagnosis of Types 2 or 3 SMA, genetic confirmation of SMA, an MFM32 relative score (percentage of the maximum sum of both dimensions) ≥15% (D [domain] 1+D2 score), a Hammersmith Functional Motor Scale (HFMS) score at baseline between 3 and 38, age at onset of symptoms ≤3 years of age, and the ability to take the study treatment (tested at screening after informed consent). Key exclusion criteria [21] included diagnosis of a neurodegenerative or neuromuscular disease other than SMA and participation in any other investigational drug or therapy study within the previous 3 months. Further details of the Phase 2 study are reported by Bertini, et al. (2017) [21]. For this analysis, only patients with available MFM32 data (as appropriate for each analysis) were included. As participants less than 6 years old were administered an abbreviated version of the MFM 20 items (the MFM20), the analysis only included individuals aged 6 years and older. This resulted in an analysis sample of 110 patients.
Outcome assessments
The following outcome assessments were included in the analysis. Further details can be found in Bertini, et al. (2017) [21].
MFM
The MFM32 was used to assess motor function in patients. The 32 items of the MFM32 are distributed across three domains: D1 measures standing, transfers and ambulation, D2 measures proximal and axial function and D3 measures distal function. A 4-point Likert scale was used to rate the participant’s maximal ability without assistance: 0, does not initiate movement or starting position cannot be maintained; 1, partially completes the exercise; 2, completes the exercise with compensations, slowness or obvious clumsiness; 3, completes the exercise with a standard pattern. Scoring was conducted in line with the MFM 2nd edition user manual (2009). The raw sum score of the 32 items was converted to a 0–100 scale (by dividing by the total possible raw score of 96 and multiplying by 100). The developers created this scoring to aid interpretation of the score as a ‘percentage’ of normal function. The MFM32 was completed at baseline, and at Weeks 26, 52, 78 and 104. Physiotherapists were trained and certified in the administration of the MFM32.
HFMS
Motor function was also measured using the HFMS scale of 20 items. Each activity (item) was scored on a 3-point scoring system, with a score of 2 for unaided, 1 for assistance and 0 for inability [22]. Total scores, achieved by summing the scores for all the individual items, range from 0 to 40, with lower scores indicating more severe motor impairment (as defined in the user manual). The HFMS was completed at screening, and Weeks 13, 39, 65, and 91. Physiotherapists were trained and certified in the administration of the HFMS.
A major source of variation between the MFM32 and the HFMS is the presence of distal function items in the MFM32, permitting gradation of severity in weak Type 2 patients. Indeed, a floor effect has been noted on the HFMS [23]. Data from screening were used in the analysis.
Forced Vital Capacity (FVC)
Pulmonary function was assessed by measuring FVC as a percentage of that which was predicted for age and height. To adjust the FVC according to individual height, weight and gender, the FVC results were divided by the theoretical capacity, and calculated as a percentage. Theoretical capacity was calculated using the following approach (age: years; height: meters):
- For females aged <18 years: 1.4507 + (1.48 + 0.0127*Age)*Height [24].
- For males aged <18 years: 1.2782 + (1.3731 + 0.0164*Age)*Height [24].
- For females aged >18 years: 4.43*Height—0.026*Age—4.34 [25].
- For males aged >18 years: 5.76*Height—0.026*Age—4.34 [25].
Where height could not be measured (e.g., with scoliosis or contractures), ulna length was used to calculate a surrogate height measure [26]. FVC was assessed at baseline and Weeks 13, 26, 39, 52, 78, and 104.
Muscle atrophy is known to impact both motor function and respiratory function, making FVC a useful assessment for convergent validity analysis. Data from baseline were used in the analysis.
Global impression of change (GI-C)
The GI-C scale was used to assess change from baseline in the patient’s global health using a 7-point ordinal scale: 1-very much improved; 2-Much improved; 3-Minimally improved; 4-No change; 5-Minimally worse; 6-Much worse; 7-Very much worse [27]. A version was completed by the clinician (Clinician GI-C [CGI-C]) and another version by either the patient or a parent (Patient/Parent GI-C [PGI-C]) at each post-baseline visit.
Analyses
The analyses to assess the measurement properties of the MFM32 were conducted in patients ≥6 years old who were administered the MFM32. Data were pooled across treatment arms for all analyses. Some analyses used a subset of patients, as defined by the analysis (e.g., test-retest reliability) or by the availability of data (e.g., patients without follow-up data were not included in ability to detect change analyses). Item level missing data for study endpoints were handled according to the respective manuals (for example, missing item scores were set to 0 [unable to perform the task] prior to the calculation of the total score. SAS v9.2 was used for all analyses. For analyses that require a statistical comparison of groups (e.g., known-groups validity), the threshold for statistical significance was p<0.05. No adjustments for multiplicity were made. For analyses identifying relationships between variables (e.g., internal consistency), suggested values of acceptability were used to aid interpretation. Tests for normality were conducted (e.g., Shapriro–Wilk and Kolmogonov–Smirnov). All results were non-significant supporting the use of the parametric tests described below.
Sociodemographic descriptive statistics
Sociodemographic data were collected in order to characterize the patient sample. Baseline descriptive statistics (frequency and percentage for categorical variables; mean, standard deviation [SD], minimum and maximum for continuous variables) were calculated for gender, age (at informed consent), country, SMA type, and MFM32 score at baseline.
Reliability
Test-retest reliability.
Test-retest reliability assesses the stability of a score in patients who are not expected to have experienced a change in the construct of interest. Whilst test-retest reliability is often assessed over short durations, the approach does not give confidence in the reliability of a measure that is used over long-duration studies. Test-retest reliability of the MFM32 total score was assessed by calculating the intraclass correlation coefficient-model 2,1 (ICC2,1): a two-way random effects analysis of variance (patient by visit), in a subset of individuals classified as stable. Stable was defined using two different anchors (with separate analyses conducted for each group): 1) patients classified as “no change” on the CGI-C at Week 26; and 2) patients classified as “no change” on the PGI-C at Week 26. This methodology is commonly used, and is recommended by COnsensus‐based Standards for the selection of health Measurement Instruments (COSMIN) initiative [28, 29]. ICCs≥0.7 were considered acceptable [30].
Validity
Convergent validity.
Convergent validity is an assessment of the relationship between the target measure (in this case the MFM32) and related measures. Stronger correlations should be demonstrated with more related constructs. Convergent validity of MFM32 total score was assessed via Spearman rank order correlations with HFMS score (HFMS at screening, MFM32 at baseline) and with FVC (at baseline). To aid interpretation, the following thresholds were used [32, 33]: <0.2: Weak; ≥0.2 to <0.4: Modest; ≥0.4 to <0.6: Moderate; ≥0.6 to 0.8: Strong; ≥0.8: Very strong. Conceptually, the HFMS measures the most related construct to the MFM32 (i.e., both the measures assess motor function ability), and thus the strongest correlation was expected with this measure. Respiratory function is known to be impacted in this population, so the relationship was expected to be relatively strong between FVC and the MFM32; however, it was not expected to be as strong as the correlation between the MFM32 and the HFMS.
Known-groups validity.
The validity of a measure can be assessed by its ability to discriminate between two or more groups that are known to be different. For example, a measure of motor function should produce statistically significantly different scores for patients that are known to have greater function than those who are known to have lower levels of function. The level of function can be defined by a related measure. In this study, groups were defined by:
- HFMS score at screening: ≥ median vs < median
- ability to stand: able to stand vs unable to stand
- ability to sit with legs straight and back unsupported: able to sit vs unable to sit.
Ability to stand was assessed by Items 18 and 19 of the HFMS at screening. Unsupported standers, at screening, had an HFMS score of:
- 2 on Q18 (stand with support of one hand), AND
- 1 or 2 on Q19 (unsupported for count of 3, or unsupported for count >3).
Supported standers, at screening, had an HFMS score of:
- 1 (stand with minimal trunk support) or 2 on Q18, AND
- 0 on Q19 (stand <3 or unable).
Non-standers, at screening, had an HFMS score of:
- 0 on Q18 (requires knee/hip support or unable), AND
- 0 on Q19.
Supported and unsupported standers were grouped together due to small sample sizes.
Ability to sit was assessed by Item 2 of the HFMS at screening.
- Able to sit without support is defined by HFMS item 2 score = 2.
- Unable to sit without support is defined by HFMS Item 2 score <2.
Known-groups validity was assessed by comparing mean MFM32 total scores for each group via analysis of covariance (ANCOVA; controlling for age and gender) at baseline. A significant difference (p<0.05) between the groups provides evidence of known-groups validity.
Ability to detect change
Ability to detect change can be assessed by evaluating differences in the target outcome assessment’s change in scores between groups that are known to have had a different longitudinal outcome. It is an evaluation of the measure’s ability to detect differences, not an evaluation of the measure’s ability to detect a treatment effect (the cause of the groups’ difference is not what is being assessed but rather the responsiveness of the scale). Thus, even natural history datasets with untreated individuals could be used (i.e., the outcome of the Phase 2 study is largely irrelevant for this analysis). Ability of the MFM32 to detect change was evaluated comparing change from baseline to Week 104 scores in ‘responders’ vs ‘non-responders’ via ANCOVA (controlling for baseline MFM32 score, age and gender). Analyses were conducted using two different anchors to define responders and non-responders (separate analyses). For both CGI-C and PGI-C, responders were defined as those scored as no change, minimally improved, much improved or very much improved, and non-responders were those scored as minimally worse, much worse, or very much worse. A significant difference (p<0.05) between the groups provides evidence of ability to detect change.
Results
Patient demographics
Baseline patient demographics are listed in Table 1. Despite the intended upper age cap in the Phase 2 study of 25 years, one individual aged 27 years was enrolled in the study. As the purpose of our analysis was to assess the measurement properties in children, adolescents and young adults, this individual was included in the analysis, resulting in a mean age of 12 years (6–27 years). More than half of the sample were male (56%), and most had SMA Type 2 (62%). Baseline MFM32 scores ranged from 21.9–69.8, providing a broad range of functional ability.
Evaluation of the impact of missing MFM32 item scores at baseline indicated that majority of patients (97%) had no missing data. The three remaining patients (3%) missed 1, 9 and 11 items respectively. For the two patients with multiple missing item data, the items missing were the more challenging tasks including ambulation assessments. Imputing a score of 0 for these items was consistent with expectations based on other item scores. Similar levels of missing data were seen at Weeks 26 (3%) and 104 (6%). Thus, any impact of missing data and imputation is limited due to the high completion rate.
Test-retest and internal consistency reliability
All reliability analyses achieved the acceptable threshold of >0.7 for the respective tests. Specifically, an ICC of 0.93 was found in patients classified as “no change” on CGI-C (N = 92), and an ICC of 0.95 in patients classified as “no change” on PGI-C (N = 74), and Cronbach’s α = 0.89.
Convergent validity
As expected, the correlation between the MFM32 and the HFMS (Spearman’s ρ = 0.87, p<0.0001, N = 110) was greater than that between the MFM32 and the FVC (Spearman’s ρ = 0.61, p<0.0001, N = 104).
Known-groups validity
Table 2 reports the results of the known-groups validity analyses. Least square (LS) means followed expected patterns (i.e., larger MFM32 total scores for less severe patients) for HFMS score at screening (≥11 vs <11), ability to stand, and ability to sit (LS mean difference = 17.4, 16.5 and 11.9, respectively). All analyses demonstrated significant differences (p<0.0001) between groups.
Ability to detect change
As shown in Table 3, the MFM32 was able to detect a change in global condition as assessed by both clinician-rated and patient-/parent-rated change in global health. Significant differences were found between groups’ LS mean change scores (p<0.001), using both anchors, with MFM32 change scores following logical patterns (i.e., worsening in the “minimally worse” or worse groups, and small improvements in the “no change” or better groups). Of note, similar LS mean values were similar for both CGI-C- and PGI-C-defined groups, with a decline of approximately 5 points in the worsening groups, and an improvement just below 1 point in the stable/improving groups.
Discussion
This study provides evidence that the MFM32 total score is a valid, reliable and responsive assessment of motor function ability in individuals with Type 2 and non-ambulant Type 3 SMA. Specifically, supportive evidence of reliability was demonstrated by high ICCs (test-retest reliability) and a high Cronbach’s α (internal consistency). The magnitude and pattern of Spearman rank order correlation coefficients were consistent with expectations. While MFM32 scores at baseline exhibited strong correlations with both HFMS and FVC scores, correlation with HFMS was stronger than with FVC, providing evidence of convergent validity. Of note is the high correlation between the MFM32 and HFMS scores, consistent with the strong conceptual overlap between these scales. Furthermore, the MFM32 was able to discriminate between groups that were categorized by HFMS total or pre-specified item scores. Comparing baseline MFM scores between groups defined by HFMS scores (non-standers vs standers and sitters vs non-sitters) demonstrated a significant difference between groups, thus providing evidence of known-groups validity.
This study also provides evidence of the ability of the MFM32 to detect change. Despite the majority of patients being rated as “no change” on both the CGI-C and PGI-C, the MFM32 was still able to detect significant differences between those classified as stable or improving and those classified as worsening, as defined by these anchor scales.
Due to the retrospective nature of the study design, there are several limitations to consider when interpreting these findings. If prospectively designing a validation study, the selection and timing of assessments would differ to better fit the needs of psychometric analyses. For example, the HFMS was administered at screening rather than baseline, resulting in a variable gap, ranging from 1 to 4 weeks, between the rating of the HFMS and MFM32. Whilst this period of time is unlikely to reflect much difference in motor function ability, small changes cannot be discounted. This impacts the interpretation of both the convergent and known-groups validity results. For FVC, the height was estimated rather than measured, thus the FVC values should be considered estimates. The population includes a broad age range and, due to sample size limitations, no subgroup analyses were performed. Age effects are possible, given the inclusion of children, adolescents and adults. Additionally, these results do not provide evidence for children below 6 years or adults above 27 years. Finally, the PGI-C mixes patient and parent ratings. Differences between patients and parents have been reported on other scales in SMA [34]. Despite the limitations with the retrospective use of this data sample, the results still provide useful evidence of the MFM’s measurement properties.
This analysis complements the existing evidence, with findings consistent with those in a broader neuromuscular disease population, and prior analyses of convergent and known-groups validity in SMA [16, 17]. The MFM32 correlation with FVC in this study (ρ = 0.61) was a similar magnitude but lower than that identified by in a baseline analysis of the NatHis SMA study (ρ = 0.70). In this European, prospective, multicenter, longitudinal natural history study of Type 2 and 3 SMA, correlations with Myogrip and Myopinch (measures of upper arm strength) were similar to the correlation between MFM32 and HFMS identified in this analysis. These consistent findings provide additional confidence in use of the MFM32 to assess motor function in individuals with Types 2 and 3 SMA.
Future research gaps include establishing content validity and estimating meaningful within-patient change. Content validity is established by demonstrating that the items measure function that is meaningful to patients and their families. For example, in a study assessing the content validity of a related measure, the Expanded Hammersmith Functional Motor Scale, 55 individuals (30 parents and 25 patients) were able to relate all of the 33 items to activities of daily living [35]. Estimating meaningful within-patient change thresholds is critical for the interpretation of clinical outcome assessment data from interventional studies. A recent qualitative survey involving 822 respondents (including 436 patients and 370 parents), indicated that stabilization would be an important outcome of treatment for 96.5% of participants [36]. This finding is supported by qualitative research with patients and parents who reported that maintaining existing abilities is important, and that even small changes in function are meaningful [37]. In support of this qualitative evidence, in a suitable sample (e.g., in a dataset that includes both a suitable anchor measure, and a broad range of change in motor function), anchor- and distribution-based analyses should be conducted to estimate a meaningful within-patient change threshold to support interpretation of MFM32 data.
Conclusions
Overall, these analyses provide robust supportive evidence of the validity, reliability and ability to detect change of the MFM32 in patients with Type 2 and non-ambulant Type 3 SMA. These results support the use of the MFM32 total score in longitudinal studies involving individuals with Type 2 and non-ambulant Type 3 SMA.
Acknowledgments
The authors would like to thank all individuals, and their families, who participated in the olesoxime Phase 2 study, and the study investigators (Enrico Bertini, Eugenio Mercuri, Francesco Muntoni, Janbernd Kirschner, Anna Lusakowska, Giacomo Comi, Jean-Marie Cuisset, Bruno Scherrer, Ludo van der Pol, Carole André, Claudio Bruno, Brigitte Chabrol, Nicolas Deconinck, Brigitte Estournet, Stephanie Fontaine-Carbonnel, Nathalie Goemans, Alessandra Govoni, Michela Guglieri, Hanns Lochmuller, Michele Mayer, Wolfgang Müller-Felber, François Rivier, Helen Roper, Ulrike Schara, Mariacristina Scoto, Leonard van den Berg, Giuseppe Vita, and Maggie Walter). The authors would also like to thank Hannah Staunton, Roche, for her review of the draft manuscript.
References
- 1. Farrar MA, Kiernan MC. The genetics of spinal muscular atrophy: Progress and challenges. Neurotherapeutics. 2015;12(2):290–302. pmid:25413156
- 2. Mercuri E, Bertini E, Iannaccone ST. Childhood spinal muscular atrophy: controversies and challenges. Lancet Neurol. 2012;11(5):443–52. pmid:22516079
- 3. Lefebvre S, Burglen L, Reboullet S, Clermont O, Burlet P, Viollet L, et al. Identification and characterization of a spinal muscular atrophy-determining gene. Cell. 1995;80(1):155–65. pmid:7813012
- 4. Lorson CL, Hahnen E, Androphy EJ, Wirth B. A single nucleotide in the SMN gene regulates splicing and is responsible for spinal muscular atrophy. Proc Natl Acad Sci U S A. 1999;96(11):6307–11. pmid:10339583
- 5. Coovert DD, Le TT, McAndrew PE, Strasswimmer J, Crawford TO, Mendell JR, et al. The survival motor neuron protein in spinal muscular atrophy. Hum Mol Genet. 1997;6(8):1205–14. pmid:9259265
- 6. Burnett BG, Munoz E, Tandon A, Kwon DY, Sumner CJ, Fischbeck KH. Regulation of SMN protein stability. Mol Cell Biol. 2009;29(5):1107–15. pmid:19103745
- 7. D'Amico A, Mercuri E, Tiziano FD, Bertini E. Spinal muscular atrophy. Orphanet J Rare Dis. 2011;6:71. pmid:22047105
- 8. Munsat TL, Davies KE. International SMA consortium meeting. (26–28 June 1992, Bonn, Germany). Neuromuscul Disord 1992;2(5–6):423–8. pmid:1300191
- 9. Mercuri E, Finkel RS, Muntoni F, Wirth B, Montes J, Main M, et al. Diagnosis and management of spinal muscular atrophy: Part 1: Recommendations for diagnosis, rehabilitation, orthopedic and nutritional care. Neuromuscul Disord. 2018;28(2):103–15. pmid:29290580
- 10. Munsat T. Workshop Report: International SMA Collaboration. Neuromuscul Disord. 1991;1:81.
- 11. Zerres K, Rudnik-Schoneborn S, Forrest E, Lusakowska A, Borkowska J, Hausmanowa-Petrusewicz I. A collaborative study on the natural history of childhood and juvenile onset proximal spinal muscular atrophy (type II and III SMA): 569 patients. J Neurol Sci. 1997;146(1):67–72. pmid:9077498
- 12. Mercuri E, Finkel R, Montes J, Mazzone ES, Sormani MP, Main M, et al. Patterns of disease progression in type 2 and 3 SMA: Implications for clinical trials. Neuromuscul Disord. 2016;26:126–31. pmid:26776503
- 13. Pera MC, Coratti G, Mazzone ES, Montes J, Scoto M, De Sanctis R, et al. Revised upper limb module for spinal muscular atrophy: 12 month changes. Muscle Nerve. 2019;59(4):426–30. pmid:30677148
- 14. Mercuri E, Mayhew A, Muntoni F, Messina S, Straub V, Van Ommen GJ, et al. Towards harmonisation of outcome measures for DMD and SMA within TREAT-NMD; report of three expert workshops: TREAT-NMD/ENMC workshop on outcome measures, 12th—13th May 2007, Naarden, The Netherlands; TREAT-NMD workshop on outcome measures in experimental trials for DMD, 30th June—1st July 2007, Naarden, The Netherlands; conjoint Institute of Myology TREAT-NMD meeting on physical activity monitoring in neuromuscular disorders, 11th July 2007, Paris, France. Neuromuscul Disord. 2008;18(11):894–903. pmid:18818076
- 15. Swoboda KJ, Kissel JT, Crawford TO, Bromberg MB, Acsadi G, D'Anjou G, et al. Perspectives on clinical trials in spinal muscular atrophy. J Child Neurol. 2007;22(8):957–66. pmid:17761650
- 16. Berard C, Payan C, Hodgkinson I, Fermanian J, Group MFMCS. A motor function measure for neuromuscular diseases. Construction and validation study. Neuromuscul Disord 2005;15(7):463–70. pmid:16106528
- 17. Chabanon A, Seferian AM, Daron A, Pereon Y, Cances C, Vuillerot C, et al. Prospective and longitudinal natural history study of patients with Type 2 and 3 spinal muscular atrophy: Baseline data NatHis-SMA study. PLoS One. 2018;13(7):e0201004. pmid:30048507
- 18. Vuillerot C, Payan C, Iwaz J, Ecochard R, Berard C, Group MFMS. Responsiveness of the motor function measure in patients with spinal muscular atrophy. Arch Phys Med Rehabil. 2013;94(8):1555–61. pmid:23380348
- 19. Cano SJ, Mayhew A, Glanzman AM, Krosschell KJ, Swoboda KJ, Main M, et al. Rasch analysis of clinical outcome measures in spinal muscular atrophy. Muscle Nerve. 2014;49(3):422–30. pmid:23836324
- 20. Guillot T, Roche S, Rippert P, Hamroun D, Iwaz J, Ecochard R, et al. Is Going Beyond Rasch Analysis Necessary to Assess the Construct Validity of a Motor Function Scale? Arch Phys Med Rehabil. 2018;99(9):1776–82 e9. pmid:29625093
- 21. Bertini E, Dessaud E, Mercuri E, Muntoni F, Kirschner J, Reid C, et al. Safety and efficacy of olesoxime in patients with type 2 or non-ambulatory type 3 spinal muscular atrophy: a randomised, double-blind, placebo-controlled phase 2 trial. Lancet Neurol. 2017;16(7):513–22. pmid:28460889
- 22. Main M, Kairon H, Mercuri E, Muntoni F. The hammersmith functional motor scale for children with spinal muscular atrophy: A scale to test ability and monitor progress in children with limited ambulation. Eur J Paediatr Neurol. 2003;7:155–9. pmid:12865054
- 23. Mazzone E, Bianco F, Martinelli D, Glanzman AM, Messina S, De Sanctis R, et al. Assessing upper limb function in nonambulant SMA patients: development of a new module. Neuromuscul Disord. 2011;21(6):406–12. pmid:21421316
- 24. Quanjer PH, Borsboom GJ, Brunekreef B, Zach M, Forche G, Cotes JE, et al. Spirometric reference values for white European children and adolescents: Polgar revisited. Pediatr Pulmonol. 1995;19(2):135–42. pmid:7659469
- 25. Quanjer PH, Tammeling GJ, Cotes JE, Pedersen OF, Peslin R, Yernault JC. Lung volumes and forced ventilatory flows. Report Working Party Standardization of Lung Function Tests, European Community for Steel and Coal. Official Statement of the European Respiratory Society. Eur Respir J Suppl. 1993;16:5–40. pmid:8499054
- 26. Cheng JC, Leung SS, Chiu BS, Tse PW, Lee CW, Chan AK, et al. Can we predict body height from segmental bone length measurements? A study of 3,647 children. J Pediatr Orthop. 1998;18:387–93. pmid:9600569
- 27. Busner J, Targum SD. The Clinical Global Impressions Scale: Applying a Research Tool in Clinical Practice. Pyschiatry. 2007;4(7):28–37.
- 28. Matza LS, Thompson CL, Krasnow J, Brewster-Jordan J, Zyczynski T, Coyne KS. Test-retest reliability of four questionnaires for patients with overactive bladder: the overactive bladder questionnaire (OAB-q), patient perception of bladder condition (PPBC), urgency questionnaire (UQ), and the primary OAB symptom questionnaire (POSQ). Neurourol Urodyn. 2005;24(3):215–25. pmid:15747340
- 29. Mokkink LB, Prinsen CA, Patrick DL, Alonso J, Bouter LM, De Vet HC, et al. COSMIN methodology for systematic reviews of Patient-Reported Outcome Measures (PROMs); user manual. 2018. Available from: http://www.cosmin.nl. Accessed July 2020.
- 30.
Nunnally JC, Bernstein IH. The assessment of reliability. Psychometric Theory. 3rd ed. New York, NY: McGraw-Hill, Inc; 1994.
- 31. Cronbach LJ. Coefficient alpha and the internal structure of tests. Psychometrika. 1951;16:297–334.
- 32.
Bartz AE. Basic statistical concepts. 4th ed. Upper Saddle River, NJ: Prentice Hall; 1999.
- 33.
Swinscow DTV. Correlation and regression. In: Statistics at square one. 9th ed. BMJ.[Internet]. 1997;75–85. Available from: http://publish.uwo.ca/~gzou2/Stats_at_Square1.pdf. Accessed July 2020.
- 34. Iannaccone ST, Hynan LS, Morton A, Buchanan R, Limbers CA, Varni JW. The PedsQL™ in pediatric patients with spinal muscular atrophy: Feasibility, reliability, and validity of the pediatric quality of life inventory™ generic core scales and neuromuscular module. Neuromuscul Disord 2009;19(12):805–12. pmid:19846309
- 35. Pera MC, Coratti G, Forcina N, Mazzone ES, Scoto M, Montes J, et al. Content validity and clinical meaningfulness of the HFMSE in spinal muscular atrophy. BMC Neurol. 2017;17(1):39. pmid:28231823
- 36. Rouault F, Christie-Brown V, Broekgaarden R, Gusset N, Henderson D, Marczuk P, et al. Disease impact on general well-being and therapeutic expectations of European Type II and Type III spinal muscular atrophy patients. Neuromuscul Disord. 2017;27(5):428–38. pmid:28237437
- 37. McGraw S, Qian Y, Henne J, Jarecki J, Hobby K, Yeh WS. A qualitative study of perceptions of meaningful change in spinal muscular atrophy. BMC Neurol. 2017;17(1):68. pmid:28376816