Degenerative cervical myelopathy [DCM] is a disabling and increasingly prevalent condition. Variable reporting in interventional trials of study design and sample characteristics limits the interpretation of pooled outcomes. This is pertinent in DCM where baseline characteristics are known to influence outcome. The present study aims to assess the reporting of the study design and baseline characteristics in DCM as the premise for the development of a standardised reporting set.
A systematic review of MEDLINE and EMBASE databases, registered with PROSPERO (CRD42015025497), was conducted in accordance with PRISMA guidelines. Full text articles in English, with >50 patients (prospective) or >200 patients (retrospective), reporting outcomes of DCM were deemed to be eligible.
A total of 108 studies involving 23,876 patients, conducted world-wide, were identified. 33 (31%) specified a clear primary objective. Study populations often included radiculopathy (51, 47%) but excluded patients who had undergone previous surgery (42, 39%). Diagnostic criteria for myelopathy were often uncertain; MRI assessment was specified in only 67 (62%) of studies. Patient comorbidities were referenced by 37 (34%) studies. Symptom duration was reported by 46 (43%) studies. Multivariate analysis was used to control for baseline characteristics in 33 (31%) of studies.
Citation: Davies BM, McHugh M, Elgheriani A, Kolias AG, Tetreault L, Hutchinson PJA, et al. (2017) The reporting of study and population characteristics in degenerative cervical myelopathy: A systematic review. PLoS ONE 12(3): e0172564. https://doi.org/10.1371/journal.pone.0172564
Editor: Giovanni Grasso, Universita degli Studi di Palermo, ITALY
Received: September 5, 2016; Accepted: February 7, 2017; Published: March 1, 2017
Copyright: © 2017 Davies et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the paper and its Supporting Information files.
Funding: Research in the senior author’s laboratory is supported by a core support grant from the Wellcome Trust and MRC to the Wellcome Trust-Medical Research Council Cambridge Stem Cell Institute. MRNK is supported by a NIHR Clinician Scientist Award (CS-2015-15-023). PJAH holds a NIHR research professorship and is supported by the NIHR Cambridge Biomedical Research Centre. MGF acknowledges support from the Halbert Chair in Neural Repair and Regeneration and the Dezwirek Foundation.
Competing interests: The authors have declared that no competing interests exist.
Chronic compression of the cervical spinal cord due to degenerative processes, including disc herniation, spondylosis, and ligament hypertrophy or ossification, has been collectively referred to as degenerative cervical myelopathy [DCM]. Disability ranges from mild pain to severe sensorimotor deficits including quadriplegia. DCM is estimated to be the most common spinal cord disorder, and is expected to have an increasing incidence with the aging population in the industrial world.
Presently, surgical decompression is the mainstay of treatment, although the type and timing of surgery remain controversial. Defining optimal treatment strategies has been challenging due to difficulties in research synthesis and the heterogeneous reporting of outcome variables. This is a recognized problem in many fields of healthcare and has led to the establishment of consensus-based, core outcome sets.
Effective pooled analysis and its accurate interpretation require common outcome measures as well as an understanding of important study characteristics. This is particularly pertinent in DCM, where a number of baseline factors have been found to influence outcome [5,6]. Much like outcomes, these study components are often heterogeneous and not consistently reported. Pioneered by organizations such as the National Institute of Neurological Disorders and Stroke [NINDS], this has led to an extension of standardization from outcomes to other study characteristics. The nomenclature for this is inconsistent and includes ‘common data elements’ or ‘minimum reporting sets’.
Various methods have been proposed for the development of a minimum reporting set. One method is to map existing reporting practice by performing a systematic review of the literature. This information is then used to inform a Delphi consensus process that includes relevant stakeholders such as clinicians, academics, allied care professionals, patients and care givers. Organisations such as COMET [Core Outcome Measures in Effectiveness Trials] have been set up to facilitate this process.
The benefit of collaborative study in DCM is recognized. For example, the systematic and standardized approach of the AOSpine network has provided unique prospective datasets for advancing our understanding of DCM [9–12]. These have the potential to accelerate the development of optimal treatment for DCM, especially if future studies are designed on common grounds, and supported by a minimum reporting set.
Our objectives therefore were to describe the reporting of baseline characteristics in studies of DCM in order to inform a subsequent consensus process. This study complements and extends existing work on ancillary outcome measures in DCM and is referred to as CODE-DCM [Core outcomes and data elements in degenerative cervical myelopathy] [13,14].
A systematic review was conducted in accordance with the PRISMA guidelines (S1 Table) and registered with the PROSPERO (CRD42015025497) prospective register of systematic reviews. MEDLINE [Ovid] and Embase [Ovid] databases were searched on the 12th August 2015 using the search strategy [“Cervical”] AND [“Myelopathy”] for articles focused on myelopathy secondary to chronic compression of the spinal cord. The search was conducted using the OVID Basic Search function. Related search terms were not included. Animal studies, case reports and letters/editorials were excluded.
Titles and abstracts were screened for relevance. Full text articles were subsequently screened for eligibility according to the following criteria:
- English, full text
- Prospective study with >50 patients or retrospective study with >200 patients
- Assessment of clinical outcomes in response to a treatment strategy (conservative or interventional)
- Articles published since 1st January 1995
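The eligibility criteria above can be read as a single screening rule. The following is a minimal sketch of that rule, not the procedure the authors used: the `Article` record and its field names are hypothetical, and treating "since 1st January 1995" as inclusive of that date is an assumption.

```python
from dataclasses import dataclass
from datetime import date


@dataclass
class Article:
    # Hypothetical record layout, for illustration only.
    language: str
    full_text: bool
    design: str  # "prospective" or "retrospective"
    n_patients: int
    reports_treatment_outcome: bool
    published: date


def is_eligible(a: Article) -> bool:
    """Apply the four eligibility criteria listed above."""
    # Sample size must be strictly greater than these thresholds.
    min_exclusive = {"prospective": 50, "retrospective": 200}
    if a.design not in min_exclusive:
        return False
    return (
        a.language == "English"
        and a.full_text
        and a.n_patients > min_exclusive[a.design]
        and a.reports_treatment_outcome
        # Assumption: the cut-off date itself counts as eligible.
        and a.published >= date(1995, 1, 1)
    )
```

Because the thresholds are strict inequalities, a prospective study with exactly 50 patients or a retrospective study with exactly 200 patients would not pass this check.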
Articles were screened by two authors [BMD, AE] and data were extracted independently by two authors [BMD, MM] using a piloted proforma (S2 and S3 Tables). Discrepancies were settled by discussion and mutual agreement. A retrospective review of prospectively collected data was considered a prospective study.
Descriptive statistics were used to report frequency and proportion of measured data elements. Statistical comparisons were made using the Chi-Squared test, with significance set at p = 0.05.
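As an illustration of the two calculations described above, the sketch below formats a frequency with its proportion and runs a Pearson chi-squared test on a 2x2 table using only the Python standard library. It is not the authors' analysis code: the function names are hypothetical, and the absence of a continuity correction is an assumption, since the text does not say whether one was applied.

```python
import math


def proportion(count: int, total: int) -> str:
    """Format a frequency as 'count (percent%)' with a whole-number percentage."""
    return f"{count} ({round(100 * count / total)}%)"


def chi_squared_2x2(a: int, b: int, c: int, d: int) -> tuple[float, float]:
    """Pearson chi-squared test for the table [[a, b], [c, d]].

    Assumption: no continuity correction is applied.
    Returns (statistic, p_value). With 1 degree of freedom the upper-tail
    probability equals erfc(sqrt(statistic / 2)).
    """
    n = a + b + c + d
    row1, row2 = a + b, c + d
    col1, col2 = a + c, b + d
    if 0 in (row1, row2, col1, col2):
        raise ValueError("every row and column total must be non-zero")
    stat = n * (a * d - b * c) ** 2 / (row1 * row2 * col1 * col2)
    return stat, math.erfc(math.sqrt(stat / 2))
```

For example, `proportion(33, 108)` gives `"33 (31%)"`, matching the way counts are presented in the results. A 2x2 comparison would place one study design in each row and "reported" versus "not reported" in each column; any counts passed to `chi_squared_2x2` for demonstration should be understood as illustrative rather than taken from this review.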
The search strategy returned 6894 articles. Following application of inclusion and exclusion criteria, 108 articles were considered [Fig 1]. There were 91 prospective studies and 17 randomised controlled trials [RCT]. Further details about the shortlisted studies are available in our previous publication.
Study design and patient selection
Of the 108 studies, 53 (49%) recorded whether ethical approval was obtained, including one study which cited that it was not required. Overall study objectives were outlined in 103 (95%) of studies; however, they were rarely specific. Thirty-three (31%) clearly specified a primary objective, including the timing of outcome assessment, and 36 (33%) included secondary objectives. The investigation time period was specified in 93 (86%) studies and measured outcomes were defined in 96 (89%).
Clear inclusion and exclusion criteria were described in 98 (91%) and 75 (69%) studies, respectively. Patients who had previous cervical surgery were excluded by 42 (39%) studies. The diagnostic criteria for myelopathy were often unclear, with MRI assessment specified in only 67 (62%) studies. Neurophysiology for diagnosis was reported in two studies. Many studies included patients with myelopathy and radiculopathy; only 57 (53%) studies considered myelopathy-only patients. The frequency of causative pathology (e.g. disc herniation, OPLL) was specified in 87 (81%) of studies.
Most articles reported disease severity (97, 90%) using one or more functional assessment tools, including the Japanese Orthopaedic Association assessment [JOA] (50, 46%), Nurick score (25, 23%), modified JOA (20, 19%) or the Oswestry Neck Disability Index [NDI] scales (20, 19%).
Imaging, distinct from that required for assessment of radiological outcomes or diagnosis, was reported by 59 (55%) studies. Typically this was MRI (58, 54%).
Imaging was used to report the disease level (46, 43%), number of treated levels (72, 67%) or putative prognostic factors (28, 26%) such as cord signal change (22, 20%) or cord compression measures (18, 17%).
Patient age (107, 99%) and gender (105, 97%) were typically recorded in studies. Race was recorded by 4 (4%) studies. General health status was referenced by 37 (34%) studies, typically by reporting on study-specific subcategories such as BMI (13, 12%), smoking status (23, 21%), diabetes (9, 8%) or atherosclerotic disease (3, 3%). Only 9 studies used a recognized grading system: ASA [American Society of Anesthesiologists] (4, 4%), CCI [Charlson Comorbidity Index] (3, 3%) or CIRS [Cumulative Illness Rating Scale] (1, 1%). Other, less frequently reported patient information included employment status, workers’ compensation, mental health and medication burden. Symptom duration was reported by 46 (43%) studies. Multivariate analysis was used to control for baseline characteristics when evaluating outcomes in 33 (31%) of studies.
Operative and post-operative course
The technical details of the intervention were described in 74 (69%) studies, two of which reported the use of intraoperative electrophysiology. Follow-up timing was outlined by 74 (69%) studies. Mean follow-up was reported by 48 (44%) studies. Identification of the chosen time points for outcome comparison was often ambiguous. Of the prospective studies, only 41 (45%) reported follow-up rates, or the data from which they could be calculated. Many studies (19, 18%) used outcomes from ‘final follow up’ to assess their primary objectives.
Reporting differences between study designs
Reporting differences were noted when comparing prospective with retrospective studies, and RCTs with other clinical trials (Table 1). When compared with retrospective studies, prospective studies were more likely to define the timing of follow up (p<0.01) and at which interval endpoints would be compared (p = 0.04). When compared with all other clinical trials, RCTs were more likely to define inclusion and exclusion criteria (p = 0.01) and follow up timing (p = 0.05). However, they were less likely to report symptom duration (p = 0.02) or the cause of myelopathy (p<0.01) in their series. Reporting consistency did not therefore improve in all domains with higher levels of study type.
Significant results (p<0.05) are denoted by *.
Summary of findings
Heterogeneity of reported study and sample characteristics exists in DCM clinical research, even amongst studies of a higher level of evidence. This included study design characteristics, such as the requirement for ethics, clearly defined objectives and inclusion/exclusion criteria, and population characteristics, such as general health status, symptom duration and disease parameters (e.g. disease level and pathology subtype). Baseline severity, whilst commonly reported, was described using a variety of grading systems. This is not an unexpected finding, given its prevalence in other fields and in DCM outcome reporting. However, it provides a challenge to effective pooled analysis and interpretation of study results. As certain baseline characteristics of DCM patients are known to influence outcome, failure to report these entails a risk of reporting bias.
Importance of reporting baseline characteristics
Reporting of baseline characteristics is important for the understanding of sample groups. It is also fundamental for research synthesis, as these data elements indicate how much confidence can be placed in pooled outcomes (e.g. whether patient selection was appropriate) and support the appraisal of methodological quality [17,18].
A key aspect of this is preventing selection bias, defined by Cochrane as the “systematic differences between study groups.” Baseline data is required to make assessments of outcome bias [19,20]. The variability identified in the present study therefore poses a limitation in DCM research, in particular:
1) Incomplete recording of baseline characteristics.
A recent systematic review by Tetreault et al (2015) considered prognostic factors in DCM. The review identified excellent evidence suggesting that symptom duration and baseline severity are important predictors of outcomes; these factors were only reported by 46 (43%) and 97 (90%) of studies included in this review, respectively. Similarly, age may impact outcome after surgery and was reported by 107 (99%) of studies here. In addition, other markers of general health status, such as diabetes, smoking and psychological factors, may also influence outcomes. This highlights not only the importance of reporting such factors, but also perhaps a greater role for multivariate analysis, poorly used in the studies we reviewed.
The recording of general health status is not straightforward, as on an individual basis, diseases may be poorly represented for analysis whilst studies may focus on different co-morbidities, or use different grouping terms. Co-morbidity indexes are helpful tools to standardize this, but the bundled data may obscure the significance of key individual predictors. The significance of these indexes in DCM is not yet clearly defined.
2) Incomplete recording of symptoms.
The overlap of treatments for compressive cervical myelopathy and radiculopathy, in addition to their possible coexistence, has led to their combined consideration in many studies. However, newer research indicates that the presence of radiculopathy in non-myelopathic patients with imaging evidence of cord compression is associated with higher risk of disease. The impact of this on outcomes has not yet been studied, but one would expect their disease profiles to differ, and as such, their commonly unspecified combination is an obstacle for DCM pooled analysis. The distinction of radiculopathy from myelopathy can be difficult. This is an area in which electrophysiology could have a significant role, yet it was only specified in two studies. Of note, one of these studies identified electrophysiological markers of myelopathy severity, corroborated elsewhere.
3) Incomplete recording of the pathology type.
DCM is a recently proposed umbrella term to encompass cervical myelopathy due to cervical stenosis of degenerative aetiology. Unification of the common clinical phenotype under a new index term will require future studies to better clarify the types of pathologies included. In addition, if the field adopted such a term, future studies would be easier to identify. Whilst these ambitions are helpful, it is important not to overlook that each pathology is distinct, particularly when considered without myelopathy, and that their long-term disease profiles may differ.
Challenges for standardisation
The development of consensus-derived reporting standards has helped homogenize reporting and obviate many of the aforementioned limitations [7,26]. We intend to apply these processes to the field of DCM, to define the core outcomes and common data elements in degenerative cervical myelopathy [CODE-DCM]. The results of this systematic review, alongside further planned work, will be used to inform a Delphi process, made up of key stakeholders including patients, care givers, professionals and industry. This project has been registered with the COMET initiative.
The challenge for CODE-DCM stakeholders when interpreting the findings of this systematic review will be to distinguish variables reported by convention or chance from those that make a meaningful contribution. Gender, for example, is almost ubiquitously reported and not known to influence outcome, whereas symptom duration is a significant predictor of outcome and was reported by less than half of studies. The selection of pertinent components is key to ensure the resultant framework is concise and not an inflexible burden that could impede novel research.
Attempts to future-proof guidelines would threaten their succinctness. As already mentioned, the ambiguity of co-morbidities is one example; another is the inclusion of promising new imaging techniques not captured in this systematic review, such as PET and Diffusion Tensor Imaging. Future-proofing is extremely difficult and may risk overcomplicating reporting at this time. Instead, careful consideration of future research with subsequent updates may be more appropriate.
Further perspectives for DCM research
Some additional findings from this review are worth mentioning as they may represent knowledge gaps in the field of DCM. The limited use of electrophysiology and the significance of radiculomyelopathy compared with myelopathy on outcome improvements have already been mentioned. An additional area of interest is the common exclusion of patients with previous surgery. When assessing an intervention, it is understandable that potential confounders are excluded, but given that this group of patients represents a significant proportion of our practice, a better understanding of their response to repeat surgery is a clinical need.
This series reports on the articles selected by its search strategy, which has inherent limitations addressed in our previous publication. 
An additional limitation of the common data elements review, compared with our previous core outcomes review, was a greater discrepancy between authors during data extraction. This likely relates to the requirement for many elements to be interpreted from the text rather than simply copied. For example, whether a study was prospective or retrospective was not always recorded, and therefore on some occasions had to be interpreted from the methodology. This risks some errors in the reporting of findings. However, the use of two authors to extract data, and the use of over 100 studies, should prevent any such error impacting the overall findings. This observation would suggest a greater need for the use of reporting guidelines such as STROBE and CONSORT [19,20].
Heterogeneity in the reporting of study and sample characteristics exists, even when considering higher levels of evidence. These findings echo those of outcome reporting in DCM, and further exemplify the need for the establishment of a common reporting set.
S1 Table. PRISMA Checklist for Systematic Reviews.
The PRISMA Checklist, including page references to the location of components in this article.
S2 Table. Shortlisted Articles.
Spreadsheet providing the initially shortlisted articles.
S3 Table. Included articles and Extracted Data.
Spreadsheet containing the extracted data (agreed by authors BMD and MM) for all included articles.
Disclaimer: This report is independent research arising from a Clinician Scientist Award, CS-2015-15-023, supported by the National Institute for Health Research. The views expressed in this publication are those of the authors and not necessarily those of the NHS, the National Institute for Health Research or the Department of Health.
- Conceptualization: BMD MRK.
- Formal analysis: BMD MRK.
- Funding acquisition: MRK.
- Investigation: BMD MM AE.
- Methodology: BMD AK MRK.
- Project administration: BMD.
- Supervision: MGF MRK.
- Visualization: BMD AK PJH MGF MRK LT.
- Writing – original draft: BMD.
- Writing – review & editing: BMD AK PJH MGF MRK LT.
- 1. Nouri A, Tetreault L, Singh A, Karadimas SK, Fehlings MG. Degenerative Cervical Myelopathy: Epidemiology, Genetics, and Pathogenesis. Spine. 2015;40: E675–93. pmid:25839387
- 2. Karadimas SK, Gatzounis G, Fehlings MG. Pathobiology of cervical spondylotic myelopathy. Eur Spine J. 2015;24 Suppl 2: 132–138.
- 3. Davies BM, McHugh M, Elgheriani A, Kolias AG, Tetreault LA, Hutchinson PJA, et al. Reported Outcome Measures in Degenerative Cervical Myelopathy: A Systematic Review. PLoS ONE. 2016;11: e0157263. pmid:27482710
- 4. Boers M, Kirwan JR, Wells G, Beaton D, Gossec L, d'Agostino M-A, et al. Developing core outcome measurement sets for clinical trials: OMERACT filter 2.0. J Clin Epidemiol. 2014;67: 745–753. pmid:24582946
- 5. Tetreault LA, Karpova A, Fehlings MG. Predictors of outcome in patients with degenerative cervical spondylotic myelopathy undergoing surgical treatment: results of a systematic review. Eur Spine J. Springer Berlin Heidelberg; 2013;24: 236–251.
- 6. Tetreault LA, Côté P, Kopjar B, Arnold P, Fehlings MG, AOSpine North America and International Clinical Trial Research Network. A clinical prediction model to assess surgical outcome in patients with cervical spondylotic myelopathy: internal and external validations using the prospective multicenter AOSpine North American and international datasets of 743 patients. Spine J. 2015;15: 388–397. pmid:25549860
- 7. Saver JL, Warach S, Janis S, Odenkirchen J, Becker K, Benavente O, et al. Standardizing the Structure of Stroke Clinical and Epidemiologic Research Data. Stroke. 2012;43: 967–973. pmid:22308239
- 8. Williamson P, Clarke M. The COMET (Core Outcome Measures in Effectiveness Trials) Initiative: Its Role in Improving Cochrane Reviews. Cochrane Database Syst Rev. 2012;: ED000041. pmid:22592744
- 9. Fehlings MG, Jha NK, Hewson SM, Massicotte EM, Kopjar B, Kalsi-Ryan S. Is surgery for cervical spondylotic myelopathy cost-effective? A cost-utility analysis based on data from the AOSpine North America prospective CSM study. J Neurosurg Spine. 2012;17: 89–93. pmid:22985375
- 10. Fehlings MG, Smith JS, Kopjar B, Arnold PM, Yoon ST, Vaccaro AR, et al. Perioperative and delayed complications associated with the surgical treatment of cervical spondylotic myelopathy based on 302 patients from the AOSpine North America Cervical Spondylotic Myelopathy Study. J Neurosurg Spine. 2012;16: 425–432. pmid:22324802
- 11. Fehlings MG, Wilson JR, Kopjar B, Yoon ST, Arnold PM, Massicotte EM, et al. Efficacy and safety of surgical decompression in patients with cervical spondylotic myelopathy: results of the AOSpine North America prospective multi-center study. J Bone Joint Surg Am. 2013;95: 1651–1658. pmid:24048552
- 12. Fehlings MG, Ibrahim A, Tetreault L, Albanese V, Alvarado M, Arnold P, et al. A global perspective on the outcomes of surgical decompression in patients with cervical spondylotic myelopathy: results from the prospective multicenter AOSpine international study on 479 patients. Spine. 2015;40: 1322–1328. pmid:26020847
- 13. Kalsi-Ryan S, Singh A, Massicotte EM, Arnold PM, Brodke DS, Norvell DC, et al. Ancillary outcome measures for assessment of individuals with cervical spondylotic myelopathy. Spine. 2013;38: S111–22. pmid:23963009
- 14. COMET Initiative. CODE-DCM Project. Available: http://www.comet-initiative.org/studies/details/821?result=true. [Cited 5th February 2017].
- 15. Chari A, Hocking K, Edlmann E, Turner C, Santarius T, Hutchinson PJ, et al. Core Outcomes and common Data Elements in Chronic Subdural Haematoma (CODE-CSDH): A systematic review of the literature focusing on baseline and peri-operative care data elements. J Neurotrauma. 2015.
- 16. Kirkham JJ, Gargon E, Clarke M, Williamson PR. Can a core outcome set improve the quality of systematic reviews?—a survey of the Co-ordinating Editors of Cochrane Review Groups. Trials. 2013;14: 21. pmid:23339751
- 17. Higgins JPT, Green S. Cochrane Handbook for Systematic Reviews of Interventions. Version 5.1.0. Available at: http://handbook.cochrane.org/ [Cited 5th February 2017].
- 18. Schünemann HJ. GRADE: from grading the evidence to developing recommendations. A description of the system and a proposal regarding the transferability of the results of clinical research to clinical practice. Z Evid Fortbild Qual Gesundhwes. 2009;103: 391–400. pmid:19839216
- 19. Vandenbroucke JP, Elm von E, Altman DG, Gøtzsche PC, Mulrow CD, Pocock SJ, et al. Strengthening the Reporting of Observational Studies in Epidemiology (STROBE): explanation and elaboration. PLoS Med. 2007;4: e297. pmid:17941715
- 20. Schulz KF, Altman DG, Moher D, CONSORT Group. CONSORT 2010 statement: updated guidelines for reporting parallel group randomised trials. PLoS medicine. 2010. p. e1000251. pmid:20352064
- 21. Tetreault LA, Karpova A, Fehlings MG. Predictors of outcome in patients with degenerative cervical spondylotic myelopathy undergoing surgical treatment: results of a systematic review. Eur Spine J. 2015;24 Suppl 2: 236–251.
- 22. Wilson JR, Barry S, Fischer DJ, Skelly AC, Arnold PM, Riew KD, et al. Frequency, timing, and predictors of neurological dysfunction in the nonmyelopathic patient with cervical spinal cord compression, canal stenosis, and/or ossification of the posterior longitudinal ligament. Spine. 2013;38: S37–54. pmid:23963005
- 23. Wen CY, Cui JL, Liu HS, Mak KC, Cheung WY, Luk KDK, et al. Is diffusion anisotropy a biomarker for disease severity and surgical prognosis of cervical spondylotic myelopathy? Radiology. 2014;270: 197–204. pmid:23942607
- 24. Ding Y, Hu Y, Ruan D-K, Chen B. Value of somatosensory evoked potentials in diagnosis, surgical monitoring and prognosis of cervical spondylotic myelopathy. Chin Med J. 2008;121: 1374–1378. pmid:18959112
- 25. Lyu RK, Tang LM, Chen CJ, Chen CM, Chang HS, Wu YR. The use of evoked potentials for clinical correlation and surgical outcome in cervical spondylotic myelopathy with intramedullary high signal intensity on MRI. Journal of Neurology, Neurosurgery & Psychiatry. 2004;75: 256–261.
- 26. Clarke M. Standardising outcomes for clinical trials and systematic reviews. Trials. 2007;8: 39. pmid:18039365
- 27. Tetreault L, Ibrahim A, Côté P, Singh A, Fehlings MG. A systematic review of clinical and surgical predictors of complications following surgery for degenerative cervical myelopathy. J Neurosurg Spine. 2016;24: 77–99. pmid:26407090
- 28. Floeth FW, Galldiks N, Eicker S, Stoffels G, Herdmann J, Steiger H-J, et al. Hypermetabolism in 18F-FDG PET predicts favorable outcome following decompressive surgery in patients with degenerative cervical myelopathy. J Nucl Med. 2013;54: 1577–1583. pmid:23918736
- 29. Martin AR, Aleksanderek I, Cohen-Adad J, Tarmohamed Z, Tetreault L, Smith N, et al. Translating state-of-the-art spinal cord MRI techniques to clinical use: A systematic review of clinical studies utilizing DTI, MT, MWF, MRS, and fMRI. NeuroImage: Clinical. 2016;10: 192–238.