Figures
Abstract
Background
International agreement supports physical functioning as a key domain to measure interventions effectiveness for low back pain. Patient reported outcome measures (PROMs) are commonly used in the lumbar spinal surgery population but physical functioning is multidimensional and necessitates evaluation also with physical measures.
Objective
1) To identify outcome measures (PROMs and physical) used to evaluate physical functioning in the lumbar spinal surgery population. 2) To assess measurement properties and describe the feasibility and interpretability of physical measures of physical functioning in this population.
Methods
This systematic review was conducted according to a registered and published protocol. Two stages of searching were conducted in MEDLINE, EMBASE, Health & Psychosocial Instruments, CINAHL, Web of Science, PEDro and ProQuest Dissertations & Theses. Stage one included studies to identify physical functioning outcome measures (PROMs and physical) in the lumbar spinal surgery population. Stage two (inception to 10 July 2023) included studies assessing measurement properties of stage one physical measures. Two independent reviewers determined study eligibility, extracted data and assessed risk of bias (RoB) according to COSMIN guidelines. Measurement properties were rated according to COSMIN criteria. Level of evidence was determined using a modified GRADE approach.
Results
Stage one included 1,101 reports using PROMs (n = 70 established in literature, n = 67 developed by study authors) and physical measures (n = 134). Stage two included 43 articles assessing measurement properties of 34 physical measures. Moderate-level evidence supported sufficient responsiveness of 1-minute stair climb and 50-foot walk tests, insufficient responsiveness of 5-minute walk and sufficient reliability of distance walked during the 6-minute walk. Very low/low-level evidence limits further understanding.
Conclusions
Many physical measures of physical functioning are used in lumbar spinal surgery populations. Few have investigations of measurement properties. Strongest evidence supports responsiveness of 1-minute stair climb and 50-foot walk tests and reliability of distance walked during the 6-minute walk. Further recommendations cannot be made because of very low/low-level evidence. Results highlight promise for a range of measures, but prospective, low RoB studies are required.
Citation: Kowalski KL, Mistry J, Beilin A, Goodman M, Lukacs MJ, Rushton A (2024) Physical functioning in the lumbar spinal surgery population: A systematic review and narrative synthesis of outcome measures and measurement properties of the physical measures. PLoS ONE 19(8): e0307004. https://doi.org/10.1371/journal.pone.0307004
Editor: Emad A. Aboelnasr, Cairo University Faculty of Physical Therapy, EGYPT
Received: March 19, 2024; Accepted: June 26, 2024; Published: August 29, 2024
Copyright: © 2024 Kowalski et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the manuscript and its Supporting information files. No identifying information is found within the manuscript or Supporting information files.
Funding: The author(s) received no specific funding for this work.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Musculoskeletal low back pain (LBP) persists as a leading global cause of disability from adolescence to old age [1]. It is the most prevalent condition requiring effective rehabilitation [2] with best-evidence guidelines recommending interventions focused on self-management, physical and psychological therapies [3, 4]. For appropriate clinical indications, surgical interventions are effective in reducing pain and enhancing physical functioning [3, 5]. Selecting appropriate outcome measures for the lumbar spinal surgery population is important as population-specific outcome measures are recommended for use when measuring treatment outcomes for specific clinical populations and when focusing on the individual, an important component of providing patient-centered care [6, 7].
International agreement supports physical functioning as the most important outcome domain to measure effectiveness of interventions for LBP [8]. Physical functioning is the impact of a condition on physical activities of daily living (e.g., walking ability, performance status, disability index) [9]. Use of patient reported outcome measures (PROMs) to evaluate physical functioning in LBP is common, despite low to very low quality evidence for their content validity [10]. The Oswestry Disability Index (ODI) is most commonly used/recommended in LBP and lumbar spinal surgery [11, 12]. However, previous systematic reviews have highlighted a breadth of PROMs are used to evaluate physical functioning in LBP and lumbar spinal surgery populations [12, 13]. As these systematic reviews were either conducted over 20 years ago [13] or with a limited search strategy [12], a contemporary and comprehensive search of the literature will enable wider considerations of PROMs to evaluate physical functioning in lumbar spinal surgery populations.
Physical functioning is a multidimensional construct and necessitates evaluation with physical outcome measures, including impairments (e.g., strength), performance on a standardized task (e.g., 6-minute walk) and activity in a natural environment (e.g., step count) [14, 15]. Physical measures (e.g., time to symptom onset during 6-minute walk) are the measurement unit of interest. Physical measures are also an important component of assessment in LBP, informing clinical reasoning to formulate a diagnosis, prognosis and intervention plan. Despite growing widespread use in lumbar spinal surgery [16] and recognized value in other musculoskeletal conditions [17–19], recommendations for physical measures of physical functioning in the lumbar spinal surgery population do not exist.
Selecting outcome measures with adequate measurement properties is key for accurate interpretation of information gained during a clinical assessment and for measuring effectiveness of interventions in research and clinical practice. The COnsensus-based Standards for the selection of health Measurement Instruments (COSMIN) initiative aims to facilitate selecting high-quality outcome measures through a systematic evaluation of validity, responsiveness, reliability and measurement error of PROMs and physical measures [20, 21]. Use of outcome measures with adequate measurement properties supports clinicians in their clinical reasoning for accuracy in assessment and diagnosis, monitoring patient progress, evaluating treatment outcomes and making informed decisions to optimize patient outcomes. When selecting outcome measures, COSMIN also recommends considering interpretability and feasibility, as these are important for clinical understanding of outcome measure scores and application within the local clinical or research context [20]. While use of physical measures in lumbar spinal surgery has risen exponentially [16], there is no systematic review evaluating measurement properties of physical measures of physical functioning in the lumbar spinal surgery population. While systematic reviews of PROM measurement properties exist [12, 22], there is also no contemporary comprehensive resource outlining all PROMs of physical functioning, beyond the ODI, in this population.
Objectives
- To identify outcome measures (patient reported and physical) used to evaluate physical functioning in the lumbar spinal surgery population.
- To assess the measurement properties and describe the interpretability and feasibility of physical measures of physical functioning in the lumbar spinal surgery population.
Methods
Design
Using a two-staged approach, this systematic review was conducted according to a registered (PROSPERO CRD42021293880) and published protocol [23]. Stage one identified PROMs (excluding ODI) and physical measures used to evaluate physical functioning in the lumbar spinal surgery population. Results informed the stage two search strategy to identify studies of measurement properties of the physical measures, guided by COSMIN methodology [20, 21]. Reporting aligns with the Preferred Reporting Items for Systematic Review and Meta-Analysis (PRISMA) statement [24]. Ethical approval was not required for this systematic review.
Eligibility criteria
Population.
Adults aged ≥18 years listed for or previous lumbar spinal surgery for musculoskeletal LBP and/or low back-related leg pain.
Intervention.
Lumbar spinal surgery at one or more levels, including thoracolumbar and lumbosacral. Surgery due to trauma, fracture, space occupying mass (e.g., tumor), inflammatory conditions, infection, osteoporosis, congenital scoliosis, cauda equina syndrome and extra-spinal causes of back and/or leg pain were excluded.
Outcome measures.
Stage one: Outcome measures evaluating physical functioning categorized as:
- PROMs: questionnaires, scales or subscales assessing ≥1 aspects of physical functioning. ODI was excluded as it is a well-established PROM of physical functioning in lumbar spinal surgery [11, 25].
- Impairment-based: structure or function of a specific body part or system [15] (e.g., strength)
- Performance-based: performance on a standardized task [14, 26] (e.g., 6-minute walk)
- Activity in a natural environment: remote monitoring of physical functioning in a natural environment [14, 27] (e.g., step count)
Stage two: Physical outcome measures (categorizations 2–4). Outcome measures not practical within Physical Therapy settings (clinical, hospital, community) were excluded (e.g., imaging).
Study design of included studies.
Stage one: All study designs and article types.
Stage two: Studies of measurement properties (validity, responsiveness, reliability, measurement error). Studies were excluded if data were not original (e.g., systematic review), normative only or insufficient (e.g., conference abstract).
For both stages, studies not in English were excluded.
Information sources
Searches were developed in MEDLINE (Ovid) and a librarian (MG) adapted for EMBASE (Ovid), Health and Psychosocial Instruments (Ovid), CINAHL (EBSCOhost), Web of Science Core Collection, Scopus, PEDro (stage one only), and ProQuest Dissertations and Theses. Electronic databases were searched from inception to December 15, 2021 for stage one and inception to July 10, 2023 for stage two. Reference lists of included studies in stage two were hand-searched independently by two authors (KK, JM) to identify additional potential articles.
Search strategy
Search strategies were developed in collaboration with a librarian (MG; S1 Appendix) and informed by National Institute for Health and Care Excellence (NICE) guidelines for LBP and sciatica over 16s [28]. Stage two also included physical outcome measures identified in stage one and the COSMIN sensitive search and exclusion filter [29]. An independent librarian peer-reviewed stage one search using the Peer Review of Electronic Search Strategies (PRESS) checklist.[30]
Selection process
Citations were imported into Covidence (Veritas Health Innovation, Australia) and duplicates removed. Title/abstract screening was performed independently in duplicate (KK, JM, AB). Full texts were obtained and reviewed independently in duplicate (KK, JM, AB) for articles meeting eligibility criteria or when eligibility was unclear. Disagreements at each stage were discussed, and a third author (AR) used if consensus was not achieved.
Data collection process and data items
Data were extracted independently in duplicate (KK, JM, AB) using standardized data extraction sheets. Data extraction included study characteristics, participant characteristics and outcome measures. Stage two was guided by the COSMIN scoring form, which also included measurement properties and information related to interpretability and feasibility, as recommended by COSMIN to aid selection of physical measures [20]. Differences in data extraction were resolved through discussion. One investigator [31] was contacted and responded to one email to clarify reporting during stage two, in accordance with the a priori strategy for contacting study authors [23].
Risk of bias (RoB) in individual studies
As planned [23], RoB was not assessed in stage one. For stage two, RoB was assessed independently in duplicate (KK, JM) using the COSMIN Checklist [32] and extended tool for measurement instruments [21]. For each study of a measurement property, RoB was rated as “Very good”, “Adequate”, “Doubtful”, or “Inadequate” and the overall rating was determined using the worst score counts principle [21, 32]. Studies using hypothesis testing approaches that did not define a hypothesis or no hypothesis could be derived were rated as “Inadequate” because of strong potential for selective reporting of analyses and outcomes [33, 34]. Disagreements were discussed, and if consensus not achieved, a third author (AR) was used.
Data synthesis
For stage one, PROMs and physical measures were categorized according to established frameworks of physical functioning. PROMs were categorized according to the IMMPACT/OMERACT (Initiative on Methods, Measurement, and Pain Assessment in Clinical Trials, Outcome Measures in Rheumatoid Arthritis Clinical Trials) framework: general, site-specific, disease-specific, pain-related physical functioning/activities or activities of daily living [14].Physical measures were categorized according to level-two categories of the International Classification of Functioning, Disability and Health (ICF) [15].
For stage two, results of each measurement property study were rated as sufficient (+), insufficient (-) or indeterminant (?) criteria for good measurement properties (S2 Appendix) [20]. Studies using a criterion approach were considered criterion validity or responsiveness [32]. Studies using hypothesis testing approaches that did not define a hypothesis or no hypothesis could be derived were rated as indeterminant. Standards for assessing a priori hypotheses in hypothesis testing approaches have been removed from updated COSMIN guidelines with recommendations the systematic review team formulate hypotheses to evaluate results [20, 32]. However, as most studies did not define a hypothesis or no hypothesis could be derived, formulating post-hoc hypothesis for authors of included studies would have elevated RoB in this systematic review to an unacceptable level as lack of a priori hypotheses introduces threats to the internal validity of included studies and therefore this systematic review would have provided an inaccurate representation of the quality of the literature [35, 36].
High heterogeneity and RoB directed a qualitative synthesis, in accordance with the a priori protocol [23]. Summarized results were rated as sufficient or insufficient if at least 75% of individual studies were rated as sufficient or insufficient, indeterminant if at least 75% of individual studies were rated as indeterminant or inconsistent (±) if less than 75% of the individual studies agreed. Information related to interpretability and feasibility are described, in accordance with COSMIN recommendations [20].
Reporting biases
Assessment of reporting bias was conducted through evaluating consistency between published results and study protocols, if identified in stage two.
Overall quality of evidence
Quality of evidence was evaluated for each measurement property per physical measure, using GRADE (Grading of Recommendations Assessment, Development and Evaluation) modified for measurement properties [20]. Four factors contributed to determining quality of evidence (RoB, inconsistency, imprecision, indirectness). Two reviewers (KK, JM) independently determined quality of evidence and disagreements were resolved through discussion. If consensus was not achieved, a third author (AR) was used.
Results
The PRISMA flow diagram (Fig 1) shows both stages of searching, selection and reasons for exclusion (S3 Appendix). For stage one, complete agreement was achieved between reviewers. For stage two, there was strong agreement between reviewers for title/abstract screening (κ = 0.85) and full text review (κ = 0.94) [37]. Complete agreement on eligibility was achieved through discussion. Due to unclear reporting, the third reviewer (AR) was consulted once about one study [38] to agree which measurement property was investigated.
Stage one: Identify physical functioning outcome measures
Study characteristics.
Stage one included 1,101 reports, published across 47 countries over 40 years (1982–2022) with increasing annual publications (S4 Appendix). The age of lumbar spinal surgery populations rose from early 40s in the 1980s to early 60s in the 2010s. For studies that reported sex/gender, reporting was binary and there was about equal representation of male/men and female/women. Common surgical procedures included fusion, decompression and discectomy. Physical functioning was assessed using PROMs in n = 964 reports and physical measures comprising impairments (n = 92 reports), performance (n = 198 reports) and activity in a natural environment (n = 42 reports; S4 Appendix). Most reports included >1 measure of physical functioning.
Results of synthesis.
PROMs.
70 established PROMs were identified and authors from 49 articles developed 67 of their own PROMs in physical functioning categories [14] of (S5 Appendix):
- General (Established: n = 31, 44%; Developed: n = 41, 61%), including subcategories: physical activity, health status/quality of life, functional status, disability, patient-identified functional limitations
- Site-specific (Established: n = 26, 37%; Developed: n = 3, 4.5%), including subcategories: low back and/or leg pain, back/headache/facial pain
- Disease-specific (Established: n = 9, 13%; Developed: n = 1, 1.5%), including subcategories: spinal stenosis, scoliosis
- Pain-related (Established: n = 2, 3%; Developed: n = 2, 3%), including subcategory: pain-related disability not linked to a body region or condition/disease
- Activities of daily living (Established: n = 2, 3%; Developed: n = 20, 30%), including subcategories: self-care. activities of daily living
The most frequently used PROM to evaluate physical functioning was the Short Form Health Survey (e.g., SF-36, SF-12) physical component score and physical functioning domain (S4 Appendix). The Roland Morris Disability Questionnaire was the second most frequently used PROM with several versions and modifications (24-item, 23-item, for sciatica, substitute ‘leg pain’ for ‘back pain’, modifications not further specified). The importance of walking in the lumbar spinal surgery population is highlighted by 23 different stand-alone question/response option combinations to measure walking capacity.
Physical measures
134 physical measures were identified comprising impairments (n = 35, 26%), performance (n = 77, 58%) and activity in a natural environment (n = 22, 16%; S4 and S5 Appendices). Physical measures (specific measure of a physical assessment, e.g., lumbar flexion range of movement) were categorized into 17 physical outcome measures (broad outcome that is measured by physical assessment, e.g., range of movement) and mapped to 15 level-two categories in ICF components Body Function and Activities/Participation [15]:
ICF Body Function component
- Mobility of joint functions (physical outcome measure: range of movement)
- Muscle power functions (physical outcome measure: strength)
- Control of voluntary movement functions (physical outcome measure: motor control)
- Gait pattern functions (physical outcome measure: gait parameters)
- Exercise tolerance functions (physical outcome measure: aerobic capacity)
- Muscle endurance functions (physical outcome measure: muscle endurance)
- Involuntary movement reaction functions (physical outcome measure: balance)
ICF Activities/Participation component
- Changing basic body position (physical outcome measure: functional mobility)
- Maintaining body position (physical outcome measure: sustained positions)
- Lifting and carrying objects (physical outcome measure: lifting)
- Hand and arm use (physical outcome measure: reaching)
- Walking (physical outcome measures: walking, composite gait measure)
- Going up and down stairs (physical outcome measure: stairs)
- Moving around in different locations (physical outcome measure: walking)
- Other specified mobility (physical outcome measures: multi-activity performance-based measures, functional task performance, physical activity parameters)
The most frequently used physical outcome measure was range of movement for impairment-based, walking tests for performance-based and physical activity parameters for activity in a natural environment. Similar to PROMs, the importance of walking is highlighted by 21 different walking physical measures (e.g., 6-minute walk test) across performance-based and activity in a natural environment physical outcome measures.
Stage two: Assess measurement properties of physical measures
Study characteristics.
Stage two included 43 articles (Table 1) evaluating measurement properties of 34 physical measures of impairments (n = 8), performance (n = 18) and activity in a natural environment (n = 8). Studies were published across 13 countries over 24 years. The total number of participants was 4,619 and sample size ranged from 8–375 (median = 50). Mean age of participants was 57 (range: 27–91). Reporting of sex versus gender was variable (70% sex, 30% gender) and binary with slightly more men/males (54%). Surgeries included discectomy, decompression and fusion for lumbar disc herniation, spinal stenosis, degenerative disc disease and spondylolisthesis.
RoB in individual studies.
Overall RoB in individual studies was rated as inadequate (78%, n = 76), doubtful (18%, n = 17) and very good (4%, n = 4; Tables 2–4). Key issues included lack of a priori hypotheses for hypothesis testing approaches and reporting measurement properties for comparator instruments (S6 Appendix). Complete agreement in RoB assessment was achieved through discussion (κ = 0.81).
Measurement property results per physical measure.
Investigations included assessments of validity (n = 22 physical measures; Table 2), responsiveness (n = 20 physical measures; Table 3), reliability (n = 8 physical measures; Table 4) and measurement error (n = 6 physical measures; Table 4). The strongest evidence was moderate-level, supporting sufficient responsiveness of 1-minute stair climb and 50-foot walk tests, insufficient responsiveness of the 5-minute walk test, and sufficient reliability of distance walked during the 6-minute walk test. Very low to low-level evidence limits further understanding of measurement properties. Measurement properties for all physical measures are summarized in Table 5. S6 Appendix details individual studies of measurement properties per physical measure and overall quality of evidence.
Impairment-based physical outcome measures. Active range of movement: very low-level evidence supports indeterminant construct validity and responsiveness (construct approach) of computer assisted electronic inclinometer measures of lumbar, trunk and hip flexion and extension, and dual bubble inclinometer measures of lumbopelvic flexion and extension [40, 44, 45]. Very low-level evidence supports indeterminant responsiveness (construct approach) of the Schober test[40].
Handgrip strength: low-level evidence supports indeterminant construct validity of maximum handgrip strength using a handgrip dynamometer [41–43].
Gait parameters: low level evidence supports indeterminant construct validity and sufficient criterion validity of the Two-step test [39]. Very low-level evidence supports indeterminant responsiveness (construct approach) for asymmetry of double support and stride length using a wireless gait analysis system [31].
Performance-based physical outcome measures. 1-minute stair climb: moderate-level evidence supports sufficient responsiveness (construct approach) of the 1-minute stair climb test [53].
5 repetitions sit to stand: low-level evidence supports sufficient reliability of time to complete the 5 repetitions sit-to-stand test [60, 61]. Low-level evidence supports indeterminant construct validity and measurement error of time to complete the 5 repetition sit-to-stand test [54, 57, 60–62].
5-minute walk test: moderate-level evidence supports insufficient responsiveness (construct approach) of the 5-minute walk test [53].
6-minute walk test: for distance walked during 6-minute walk test measured using the 6WT app [38, 55, 56, 59, 70] (measurement instrument not reported) [66, 67] moderate-level evidence supports sufficient test-retest reliability [38, 55], low-level evidence supports sufficient responsiveness (criterion approach), inconsistent measurement error, and indeterminant construct validity and responsiveness (construct approach) [38, 55, 56, 66, 67, 70], and very low-level evidence supports patient reported responsiveness [59]. For distance to first symptoms during the 6-minute walk test measured using the 6WT app, very low-level evidence supports sufficient test-retest reliability and responsiveness (criterion approach) [38], and indeterminant construct validity, responsiveness (construct approach) and measurement error [38]. For time to first symptoms during the 6-minute walk test measured using the 6WT app, very low-level evidence supports sufficient test-retest reliability, insufficient responsiveness (criterion approach) [38], and indeterminant construct validity, responsiveness (construct approach) and measurement error [38].
10-meter walk test: very low-level evidence supports indeterminant construct validity and responsiveness (construct approach) of the 10-meter walk test [69].
50-foot walk test: moderate-level evidence supports sufficient responsiveness (construct approach) of the 50-foot walk test [53]
Modified Sorensen test: very low-level evidence supports indeterminant construct validity of the modified Sorensen test [47, 48].
Self-paced walking test: very low-level evidence supports indeterminant responsiveness (construct approach) of distance and time walked during the self-paced walking test [68].
Timed up and go: low-level evidence supports inconsistent responsiveness (construct approach) of time to complete the TUG using the 6WT app [56] or stopwatch [51] (not reported) [53]. Very low-level evidence supports insufficient responsiveness (criterion approach) and indeterminant construct validity using the 6WT app [56, 63, 65] or stopwatch [64] (not reported) [46, 50, 57].
Treadmill test: for maximum walking time and time to first symptoms during a treadmill test, very low-level evidence supports sufficient pre-operative test-retest reliability and indeterminant post-operative test-retest reliability [49]. Low to very low-level evidence supports indeterminant construct validity for maximum walking distance, maximum walking time, time to first symptoms and distance to first symptoms during a treadmill test [52, 58].
Trunk muscle endurance: very low-level evidence supports indeterminant responsiveness (construct approach) of repetitive arch ups and sit ups until exhaustion [40].
Activity in a natural environment physical outcome measures. Step counts: very low-level evidence supports indeterminant construct validity and responsiveness (construct approach) of steps per day using a Fitbit (Flex 2, Charge, Zip) [71, 74, 75] and Mi Band [78]. At the thigh, very low-level evidence supports inconsistent criterion validity of step detection (sufficient for ActivPAL3 and Jawbone UP Move, insufficient for Fitbit Flex) [73]. At the wrist, very low-level evidence supports insufficient criterion validity of step detection using a Fitbit Flex and Jawbone UP Move [73]. At the thigh and wrist, very low-level evidence supports indeterminant measurement error of step detection using Fitbit Flex, Jawbone UP Move and ActivPAL3 (thigh only) [73].
Gait Posture Index: very low-level evidence supports indeterminant responsiveness (construct approach) of the Gait Posture Index using personal electronic devices (e.g., Garmin) or Mi Band 2 [72, 76].
Distance walked per day: very low-level evidence supports indeterminant construct validity (using personal smart phone) [79] and responsiveness (construct approach, using Fitbit Zip) [75] of distance walked per day.
Gait cycles: very low-level evidence supports indeterminant construct validity of number of gait cycles per day, gait cycles per hour and gait intensities per day using the StepWatch3 [77].
Interpretability and feasibility.
Limited information reported interpretability and feasibility (S6 Appendix). Floor and ceiling effects were generally not reported. Distribution of scores suggest floor effects in some physical measures with small scores (e.g., range of movement) [44, 45], while some walking physical measures may have floor (symptoms at test start) [49] and ceiling (no symptoms at test end) [38, 49, 68] effects. Data missingness was variable (0–69%). Minimum important or detectable change scores were rarely investigated in studies [53, 66, 67], though some values were reported from the literature [51, 56, 57, 64, 65]. Administering physical measures required small (e.g., accelerometer) or readily available equipment (e.g., goniometer), or a patient’s personal electronic device (e.g., smartwatch), enabling relatively easy set up/administration within clinical environments. Most studies did not report equipment cost, but free smartphone applications exist [38, 55, 56, 59, 65]. Several digital technologies (personal electronic devices [38, 55, 56, 59, 61, 65, 72, 76], low-cost consumer-grade wearables [71–78]) may enhance feasibility of data collection because of minimal interference with daily activities. When reported, standardized instructions and ease of score calculation appear feasible in clinical environments.
Discussion
Using a rigorous two-staged approach, this systematic review is the first to identify outcome measures (PROMs, physical) used to evaluate physical functioning in the lumbar spinal surgery population and assess measurement properties of the physical measures. Stage one generated a comprehensive list of PROMs (Established n = 70, Developed n = 67) and physical measures (n = 134). However, only 34 physical measures had investigations of measurement properties. Moderate-level evidence supported sufficient responsiveness of the 1-minute stair climb and 50-foot walk tests, insufficient responsiveness of a 5-minute walk test and sufficient reliability of distance walked during the 6-minute walk test. Very low to low-level evidence limits further understanding of measurement properties for a wide range of physical measures.
Stage one
The global importance of physical functioning in lumbar spinal surgery is emphasized by the breadth of countries represented in stage one and increasing number of publications across 40 years. This aligns with physical functioning advocated as a critical domain within core outcome sets for the past 25 years [8, 80, 81], and measurement instruments recommended for use within core outcome sets including well-established PROMs (ODI, Roland Morris Disability Questionnaire) [8, 80, 81]. However, stage one identified an extensive number of PROMs across all five categories of physical functioning [14] and physical measures across 15 level-two ICF categories [15]. Use of a range of measures aligns with previous systematic reviews highlighting limited and inconsistent implementation of recommendations for standardizing outcome measures in LBP clinical trials [82], and substantiates physical functioning as a multidimensional construct not best measured with a single PROM. Support for PROMs and physical measures to evaluate physical functioning in other musculoskeletal disease is strong, including international recommendations within clinical trials [17, 19, 83]. In lumbar spinal surgery, there is emerging evidence demonstrating the value of physical measures (important to patients [59], responsive to change [38, 53, 56, 59, 66], predictive of outcomes [84]). However, recommendations for their use in LBP populations do not exist.
Establishing consensus on appropriate physical measures of physical functioning is required to enable comparisons of interventions and outcomes. Stage one results highlight an illustrative example. Walking was frequently evaluated with PROMs and physical measures, aligning with previous research emphasizing walking as an important component of rehabilitation in lumbar spinal surgery [85–88]. However, with 23 different stand-alone PROM questions/response options to measure walking capacity and 21 different walking physical measures, it is nearly impossible to compare across studies. Action is required to standardize measurement of physical functioning outcomes.
Stage two
Sufficient measurement properties are required for recommending measures to use in research and clinical practice [89, 90]. While 134 physical measures were identified in stage one, only 34 had investigations of measurement properties. Few recommendations can be made because of indeterminant measurement properties and mostly very low to low-level evidence. The strongest evidence supports performance-based measures, with moderate-level evidence for sufficient responsiveness of 1-minute stair climb and 50-foot walk tests, insufficient responsiveness of the 5-minute walk test and sufficient reliability of distance walked during the 6-minute walk test. Very low to low-level evidence limits further understanding of measurement properties. However, results illustrate promise for a range of physical measures, for example when hypotheses are appropriately defined for construct approaches to validity and responsiveness [53, 65] or no hypothesis is required to evaluate the measurement property [38, 39, 49,55, 56, 60, 61, 66, 67, 73]. according to COSMIN [20]. While emerging evidence supports the value of physical measures, at present few clear recommendations can be made. Prospective, low risk of bias studies are required.
A challenge in conducting this systematic review was inconsistent terminology and poor reporting in included studies. Inconsistent terminology led to lack of clarity in the measurement property under investigation. For example, three studies [38, 55, 65] reported an investigation of content validity, however study designs and statistical methodologies aligned with the COSMIN definition of hypothesis testing for construct validity. Poor reporting strongly contributed to indeterminant measurement properties, high RoB and very low to low-level evidence. It precluded COSMIN recommendations to derive hypotheses for study authors in the absence of a priori hypotheses in hypothesis testing approaches because of insufficient information about the expected direction and strength of associations. COSMIN guidelines were designed to enable systematic reviews but can also be used to inform terminology and reporting of studies on measurement properties. However, this is not common, as inconsistent terminology and poor reporting are challenges identified in other systematic reviews investigating measurement properties of physical measures [91, 92]. COSMIN has published reporting guidelines for studies on measurement properties [93], however lessons learned from Consolidated Standards of Reporting Trials (CONSORT) suggest at least 15 years may be required for widespread implementation [94]. PRISMA-COSMIN reporting guidelines for systematic reviews of measurement properties[95] will hopefully hasten the process to draw clear conclusions about measurement properties of physical measures.
This systematic review demonstrates expanding use of digital technologies in remote monitoring of physical functioning. Several digital technologies were used (personal devices [38, 55, 56, 59, 61, 65, 72, 76], low-cost consumer-grade [71–78] and research-grade wearables [31, 73]) to measure impairments, performance and activity in a natural environment, with unique combinations in some studies. For example, free smartphone applications enabled digital self-assessments of performance-based measures (e.g., 6-minute walk) in a natural environment, but with limited and very low to low-level evidence supporting measurement properties [38, 56, 59, 65]. However, one unique study of patient-reported responsiveness indicated that patients perceive a smartphone application that measured 6-minute walk performance was better at detecting changes in their symptoms compared to PROMs (very low-level evidence) [59]. In selecting appropriate digital technologies in lumbar spinal surgery, important factors to consider are wear position and gait aids, as both influence validity of step count detection (very low-level evidence) [73]. Digital technologies are promising solutions to enable personalized remote monitoring, aid clinical reasoning and inform tailored interventions [96], however their measurement properties are largely unknown, necessitating low RoB studies.
Strengths and limitations
This robust systematic review used a rigorous two-staged approach to identify physical functioning outcome measures, enabling a comprehensive search for measurement properties of physical measures of physical functioning. It is limited by heterogeneity in physical measures, indeterminant measurement properties and RoB across included studies. Poor reporting of studies was a key issue. Important findings may have been missed with the exclusion of non-English full texts (108 in stage one, 1 in stage two). Limitations prevent clear recommendations for a range of physical measures of physical functioning in the lumbar spinal surgery population.
Conclusions
While many physical measures are used to evaluate physical functioning in the lumbar spinal surgery population, few have investigations of measurement properties. Research to date is overall low quality, consisting of high RoB studies with inconsistent use of terminology and poor reporting. The strongest evidence supports performance-based measures, with moderate-level evidence for sufficient responsiveness of the 1-minute stair climb and 50-foot walk tests, insufficient responsiveness of the 5-minute walk test and sufficient reliability of distance walked during the 6-minute walk test. There is promise for physical measures of physical functioning to demonstrate sufficient measurement properties, but few clear recommendations can be made. Knowledge of measurement properties is essential in establishing consensus on appropriate physical measures of physical functioning to evaluate effectiveness of interventions for lumbar spinal surgery populations and inform clinical practice. Prospective low RoB studies are required owing to emerging evidence demonstrating the value of physical measures.
Supporting information
S2 Appendix. Rating criteria for measurement properties.
https://doi.org/10.1371/journal.pone.0307004.s002
(DOCX)
S3 Appendix. Articles excluded at full text stage (stage 2).
https://doi.org/10.1371/journal.pone.0307004.s003
(DOCX)
Acknowledgments
Study authors thank Meagan Stanley for peer-reviewing the stage one electronic search strategy and Keerthana Jayaprakash for administrative support in stage one.
Patient and public involvement
The spinal pain research Patient Partner Advisory Group (PPAG) in the School of Physical Therapy at Western University has shaped our program of research methodology focused on physical measures of physical functioning. Systematic review results have been discussed with the PPAG to inform development of future research projects.
References
- 1. GBD 2019 Diseases and Injuries Collaborators. Global burden of 369 diseases and injuries in 204 countries and territories, 1990–2019: a systematic analysis for the Global Burden of Disease Study 2019. The Lancet. 2020;396: 1204–1222. pmid:33069326
- 2. Cieza A, Causey K, Kamenov K, Hanson SW, Chatterji S, Vos T. Global estimates of the need for rehabilitation based on the Global Burden of Disease study 2019: a systematic analysis for the Global Burden of Disease Study 2019. The Lancet. 2020;396: 2006–2017. pmid:33275908
- 3. Foster NE, Anema JR, Cherkin D, Chou R, Cohen SP, Gross DP, et al. Prevention and treatment of low back pain: evidence, challenges, and promising directions. The Lancet. 2018;391: 2368–2383. pmid:29573872
- 4. Buchbinder R, Underwood M, Hartvigsen J, Maher CG. The Lancet Series call to action to reduce low value care for low back pain: an update. Pain. 2020;161: S57. pmid:33090740
- 5. Chou R, Baisden J, Carragee EJ, Resnick DK, Shaffer WO, Loeser JD. Surgery for low back pain: A review of the evidence for an American pain society clinical practice guideline. Spine (Phila Pa 1976). 2009;34: 1094–1109. pmid:19363455
- 6. Black N. Patient reported outcome measures could help transform healthcare. BMJ (Online). 2013;346. pmid:23358487
- 7. Churruca K, Pomare C, Ellis LA, Long JC, Henderson SB, Murphy LED, et al. Patient-reported outcome measures (PROMs): A review of generic and condition-specific measures and a discussion of trends and issues. Health Expectations. 2021. pmid:33949755
- 8. Chiarotto A, Deyo RA, Terwee CB, Boers M, Buchbinder R, Corbin TP, et al. Core outcome domains for clinical trials in non-specific low back pain. Eur Spine J. 2015;24: 1127–1142. pmid:25841358
- 9. Dodd S, Clarke M, Becker L, Mavergames C, Fish R, Williamson PR. A taxonomy has been developed for outcomes in medical research to help improve knowledge discovery. J Clin Epidemiol. 2018;96: 84–92. pmid:29288712
- 10. Chiarotto A, Ostelo RW, Boers M, Terwee CB. A systematic review highlights the need to investigate the content validity of patient-reported outcome measures for physical functioning in patients with low back pain. J Clin Epidemiol. 2018;95: 73–93. pmid:29154811
- 11. Chiarotto A, Boers M, Deyo RA, Buchbinder R, Corbin TP, Costa LOP, et al. Core outcome measurement instruments for clinical trials in nonspecific low back pain. Pain. 2018;159: 495. pmid:29194127
- 12. Stokes O, Cole A, Breakwell L, Lloyd A, Leonard C, Grevitt M. Do we have the right PROMs for measuring outcomes in lumbar spinal surgery? European Spinal Journal. 2017;26: 816–824. pmid:28070683
- 13. Grotle M, Brox JI, Vøllestad NK. Functional Status and Disability Questionnaires: What Do They Assess? Spine (Phila Pa 1976). 2005;30: 130–140.
- 14.
Taylor AM, Phillips K, Patel K V., Turk DC, Dworkin RH, Beaton D, et al. Assessment of physical function and participation in chronic pain clinical trials: IMMPACT/OMERACT recommendations. Pain. Lippincott Williams and Wilkins; 2016. pp. 1836–1850.
- 15.
World Health Organization. Towards a Common Language for Functioning, Disability and Health ICF. 2002.
- 16. Stienen M, Ho AL, Staartjes VE, Maldaner N, Veeravagu A, Desai A, et al. Objective measures of functional impairment for degenerative diseases of the lumbar spine: a systematic review of the literature. The Spine Journal. 2019;19: 1276–1293. pmid:30831316
- 17. Dobson F, Hinman RS, Roos EM, Abbott JH, Stratford P, Davis AM, et al. OARSI recommended performance-based tests to assess physical function in people diagnosed with hip or knee osteoarthritis. Osteoarthritis Cartilage. 2013;21. pmid:23680877
- 18. Burgi CR, Peters S, Ardern CL, Magill JR, Gomez CD, Sylvain J, et al. Which criteria are used to clear patients to return to sport after primary ACL reconstruction? A scoping review. Br J Sports Med. 2019;53: 1154–1161. pmid:30712009
- 19. Bateman M, Evans JP, Vuvan V, Jones V, Watts AC, Phadnis J, et al. Development of a core outcome set for lateral elbow tendinopathy (COS-LET) using best available evidence and an international consensus process. Br J Sports Med. 2022;56: 657–666. pmid:35135827
- 20. Prinsen CAC, Mokkink LB, Bouter LM, Alonso J, Patrick DL, de Vet HCW, et al. COSMIN guideline for systematic reviews of patient-reported outcome measures. Quality of Life Research. 2018;27: 1147–1157. pmid:29435801
- 21. Mokkink LB, Boers M, van der Vleuten CPM, Bouter LM, Alonso J, Patrick DL, et al. COSMIN Risk of Bias tool to assess the quality of studies on reliability or measurement error of outcome measurement instruments: a Delphi study. BMC Med Res Methodol. 2020;20: 1–13. pmid:33267819
- 22. Ziedas A, Abed V, Bench C, Rahman T, Makhni MC. Patient-Reported Outcomes Measurement Information System physical function instruments compare favorably to legacy patient-reported outcome measures in spine patients: a systematic review of the literature. Spine Journal. 2022. pmid:34699997
- 23. Kowalski KL, Lukacs MJ, Mistry J, Goodman M, Rushton AB. Physical functioning outcome measures in the lumbar spinal surgery population and measurement properties of the physical outcome measures: protocol for a systematic review. BMJ Open. BMJ Publishing Group; 2022. pmid:35667717
- 24. Page MJ, Mckenzie JE, Bossuyt PM, Boutron I, Hoffmann TC, Mulrow CD, et al. The PRISMA 2020 statement: an updated guideline for reporting systematic reviews. BMJ Open. 2021;372: n71. pmid:33782057
- 25. van Hooff ML, Jacobs WC, Willems PC, Wouters MW, de Kleuver M, Peul WC, et al. Evidence and practice in spine registries: A systematic review, and recommendations for future design of registries. Acta Orthop. 2015;86: 534–544. pmid:25909475
- 26. Dobson F, Hinman RS, Hall M, Terwee CB, Roos EM, Bennell KL. Measurement properties of performance-based measures to assess physical function in hip and knee osteoarthritis: A systematic review. Osteoarthritis Cartilage. 2012;20: 1548–1562. pmid:22944525
- 27. Block VJ, Bove R, Zhao C, Garcha P, Graves J, Romeo AR, et al. Association of Continuous Assessment of Step Count by Remote Monitoring With Disability Progression Among Adults With Multiple Sclerosis. JAMA Netw Open. 2019;2: e190570. pmid:30874777
- 28.
NICE guideline. Low back pain and sciatica in over 16s: assessment and management NICE guideline. 2016. www.nice.org.uk/guidance/ng59
- 29. Terwee CB, Jansma EP, Riphagen II, de Vet HCW. Development of a methodological PubMed search filter for finding studies on measurement properties of measurement instruments. Quality of Life Research. 2009;18: 1115–1123. pmid:19711195
- 30. McGowan J, Sampson M, Salzwedel DM, Cogo E, Foerster V, Lefebvre C. PRESS Peer Review of Electronic Search Strategies: 2015 Guideline Statement. J Clin Epidemiol. 2016;75: 40–46. pmid:27005575
- 31. Loske S, Nüesch C, Byrnes KS, Fiebig O, Schären S, Mündermann A, et al. Decompression surgery improves gait quality in patients with symptomatic lumbar spinal stenosis. Spine Journal. 2018;18: 2195–2204. pmid:29709554
- 32. Mokkink LB, de Vet HCW, Prinsen CAC, Patrick DL, Alonso J, Bouter LM, et al. COSMIN Risk of Bias checklist for systematic reviews of Patient-Reported Outcome Measures. Quality of Life Research. 2018;27: 1171–1179. pmid:29260445
- 33. Terwee CB, Bot SDM, de Boer MR, van der Windt DAWM, Knol DL, Dekker J, et al. Quality criteria were proposed for measurement properties of health status questionnaires. J Clin Epidemiol. 2007;60: 34–42. pmid:17161752
- 34. Shea BJ, Reeves BC, Wells G, Thuku1 M, Hamel C, Moran J, et al. AMSTAR 2: a critical appraisal tool for systematic reviews that include randomised or non-randomised studies of healthcare interventions, or both. BMJ. 2017;358. pmid:28935701
- 35. Glasziou P, Altman DG, Bossuyt P, Boutron I, Clarke M, Julious S, et al. Reducing waste from incomplete or unusable reports of biomedical research. The Lancet. 2014;383: 267–76. pmid:24411647
- 36. Ioannidis JPA, Greenland S, Hlatky MA, Khoury MJ, Macleod MR, Moher D, et al. Research: increasing value, reducing waste 2: Increasing value and reducing waste in research design, conduct, and analysis. Lancet. 2014;383: 166. pmid:24411645
- 37. McHugh ML. Lessons in biostatistics interrater reliability: the kappa statistic. Biochemica Medica. 2012;22: 276–282. Available: https://hrcak.srce.hr/89395
- 38. Zeitlberger AM, Sosnova M, Ziga M, Gautschi OP, Regli L, Bozinov O, et al. Distance to first symptoms measured by the 6-min walking test differentiates between treatment success and failure in patients with degenerative lumbar disorders. European Spine Journal. 2022;31: 596–603. pmid:35015137
- 39. Fujita N, Sakurai A, Miyamoto A, Michikawa T, Otaka Y, Suzuki S, et al. Stride length of elderly patients with lumbar spinal stenosis: Multi-center study using the Two-Step test. Journal of Orthopaedic Science. 2019;24: 787–792. pmid:30737067
- 40. Häkkinen A, Kautiainen H, Sintonen H, Ylinen J. Health related quality of life after lumbar disc surgery: A prospective study of 145 patients. Disabil Rehabil. 2005;27: 94–100. pmid:15823989
- 41. Inoue H, Watanabe H, Okami H, Shiraishi Y, Kimura A, Takeshita K. Handgrip strength correlates with walking in lumbar spinal stenosis. European Spine Journal. 2020;29: 2198–2204. pmid:32651633
- 42. Kwon JW, Lee BH, Sung S, Lee S bin, Park MS, Cho JH, et al. Hand grip strength as a surrogate marker for postoperative changes in spinopelvic alignment in patients with lumbar spinal stenosis. Scientific Reports 2020 10:1. 2020;10: 1–6. pmid:32778718
- 43. Kwon JW, Lee BH, Lee S bin, Sung S, Lee CU, Yang JH, et al. Hand grip strength can predict clinical outcomes and risk of falls after decompression and instrumented posterolateral fusion for lumbar spinal stenosis. Spine Journal. 2020;20: 1960–1967. pmid:32622937
- 44. Mannion AF, Dvorak J, Müntener M, Grob D. A prospective study of the interrelationship between subjective and objective measures of disability before and 2 months after lumbar decompression surgery for disc herniation. European Spine Journal. 2005;14: 454–465. pmid:15830214
- 45.
Pitino T. The impairment and disability relationship in patients who have undergone lumbar discectomy. Northeastern University. 2000.
- 46. Corniola M v., Stienen MN, Joswig H, Smoll NR, Schaller K, Hildebrandt G, et al. Correlation of pain, functional impairment, and health-related quality of life with radiological grading scales of lumbar degenerative disc disease. Acta Neurochir (Wien). 2016;158: 499–505. pmid:26783024
- 47. Dedering Å, Harms-Ringdahl K, Nèmeth G. Back extensor muscle fatigue in patients with lumbar disc herniation: Pre-operative and post-operative analysis of electromyography, endurance time and subjective factors. European Spine Journal. 2006;15: 559–569. pmid:15940475
- 48. Dedering Å. Lumbar muscle fatigue and subjective health measurements in patients with lumbar disc herniation 2 years after surgery. European Spine Journal. 2012;21: 646–654.
- 49. Deen HGJ, Zimmerman RS, Lyons MK, McPhee MC, Verheijde JL, Lemens SM. Test-retest reproducibility of the exercise treadmill examination in lumbar spinal stenosis. Mayo Clin Proc. 2000;75: 1002–1007. pmid:11040847
- 50. Gautschi OP, Smoll NR, Corniola M v., Joswig H, Chau I, Hildebrandt G, et al. Validity and reliability of a measurement of objective functional impairment in lumbar degenerative disc disease: The Timed Up and Go (TUG) test. Neurosurgery. 2016;79: 270–278. pmid:26702840
- 51. Gautschi OP, Joswig H, Corniola M v., Smoll NR, Schaller K, Hildebrandt G, et al. Pre- and postoperative correlation of patient-reported outcome measures with standardized Timed Up and Go (TUG) test results in lumbar degenerative disc disease. Acta Neurochir (Wien). 2016;158: 1875–1881. pmid:27488842
- 52. Herno A, Partanen K, Talaslahti T, Kaukanen E, Turunen V, Suomalainen O, et al. Long-term clinical and magnetic resonance imaging follow-up assessment of patients with lumbar spinal stenosis after laminectomy. Spine (Phila Pa 1976). 1999;24. pmid:10457572
- 53. Jakobsson M, Brisby H, Gutke A, Lundberg M, Smeets R. One-minute stair climbing, 50-foot walk, and timed up-and-go were responsive measures for patients with chronic low back pain undergoing lumbar fusion surgery. BMC Musculoskelet Disord. 2019;20: 137. pmid:30927913
- 54. Klukowska AM, Schröder ML, Stienen MN, Staartjes VE. Objective functional impairment in lumbar degenerative disease: concurrent validity of the baseline severity stratification for the five-repetition sit-to-stand test. J Neurosurg Spine. 2020;33: 4–11. pmid:32084632
- 55. Maldaner N, Sosnova M, Zeitlberger AM, Ziga M, Gautschi OP, Regli L, et al. Evaluation of the 6-minute walking test as a smartphone app-based self-measurement of objective functional impairment in patients with lumbar degenerative disc disease. J Neurosurg Spine. 2020;33: 779–788. pmid:32764182
- 56. Maldaner N, Sosnova M, Zeitlberger AM, Ziga M, Gautschi OP, Regli L, et al. Responsiveness of the self-measured 6-minute walking test and the Timed up and Go test in patients with degenerative lumbar disorders. J Neurosurg Spine. 2021;35: 52–59. pmid:33974372
- 57. Master H, Pennings JS, Coronado RA, Henry AL, O’Brien MT, Haug CM, et al. Physical Performance Tests Provide Distinct Information in Both Predicting and Assessing Patient-Reported Outcomes Following Lumbar Spine Surgery. Spine (Phila Pa 1976). 2020;45: E1556–E1563. pmid:32890302
- 58. Prasad BCM, Chandra VVR, Devi BV, Chivukula SS, Pundarikakshaiah K. Clinical, radiological, and functional evaluation of surgical treatment in degenerative lumbar canal stenosis. Neurol India. 2016;64: 683. pmid:27381113
- 59. Sosnova M, Zeitlberger AM, Ziga M, Gautschi OP, Regli L, Weyerbrock A, et al. Patients undergoing surgery for lumbar degenerative spinal disorders favor smartphone-based objective self-assessment over paper-based patient-reported outcome measures. The Spine Journal. 2021;21: 610–617. pmid:33346155
- 60. Staartjes VE, Schröder ML. The five-repetition sit-to-stand test: Evaluation of a simple and objective tool for the assessment of degenerative pathologies of the lumbar spine. J Neurosurg Spine. 2018;29: 380–387. pmid:29957147
- 61. Staartjes VE, Beusekamp F, Schröder ML. Can objective functional impairment in lumbar degenerative disease be reliably assessed at home using the five-repetition sit-to-stand test? A prospective study. European Spine Journal. 2019;28: 665–673. pmid:30680635
- 62. Staartjes VE, Klukowska AM, Schröder ML. Association of maximum back and leg pain severity with objective functional impairment as assessed by five-repetition sit-to-stand testing: analysis of two prospective studies. Neurosurg Rev. 2020;43: 1331–1338. pmid:31451936
- 63. Staartjes VE, Joswig H, Corniola M V., Schaller K, Gautschi OP, Stienen MN. Association of Medical Comorbidities With Objective Functional Impairment in Lumbar Degenerative Disc Disease. Global Spine J. 2022;12. pmid:33334183
- 64. Stienen MN, Smoll NR, Joswig H, Corniola M v., Schaller K, Hildebrandt G, et al. Validation of the baseline severity stratification of objective functional impairment in lumbar degenerative disc disease. J Neurosurg Spine. 2017;26. pmid:28291406
- 65. Stienen MN, Maldaner N, Sosnova M, Zeitlberger AM, Ziga M, Weyerbrock A, et al. External Validation of the Timed up and Go Test as Measure of Objective Functional Impairment in Patients with Lumbar Degenerative Disc Disease. Neurosurgery. 2021;88: E142–E149. pmid:33040156
- 66. Takenaka H, Kamiya M, Sugiura H, Nishihama K, Ito A, Suzuki J, et al. Responsiveness and Minimal Clinically Important Difference of the 6-minute Walk Distance in Patients Undergoing Lumbar Spinal Canal Stenosis Surgery. Clin Spine Surg. 2022;35: E345–E350. pmid:34039891
- 67. Takenaka H, Kamiya M, Sugiura H, Nishihama K, Suzuki J, Hanamura S. Minimal Clinically Important Difference of the 6-Minute Walk Distance in Patients Undergoing Lumbar Spinal Canal Stenosis Surgery: 12 Months Follow-Up. Spine (Phila Pa 1976). 2023;48. pmid:36728803
- 68. Tomkins-Lane C, Zheng PZ, Sun R, Smuck M. Variability among methods and timing of pain assessment tools for tracking improvement of lumbar stenosis patients after surgery. Spine Journal. 2020;20: 1826–1831. pmid:32534134
- 69. Wada T, Tanishima S, Kitsuda Y, Osaki M, Nagashima H, Noma H, et al. Walking speed is associated with postoperative pain catastrophizing in patients with lumbar spinal stenosis: a prospective observational study. BMC Musculoskelet Disord. 2022;23. pmid:36536351
- 70. Ziga M, Sosnova M, Zeitlberger AM, Regli L, Bozinov O, Weyerbrock A, et al. Objective outcome measures may demonstrate continued change in functional recovery in patients with ceiling effects of subjective patient-reported outcome measures after surgery for lumbar degenerative disorders. The Spine Journal. 2023. pmid:37182704
- 71. Bienstock DM, Shankar D, Kim J, Gao M, Srivastava K, Bronson WH, et al. Accelerometry Data Delineate Phases of Recovery and Supplement Patient-Reported Outcome Measures Following Lumbar Laminectomy. World Neurosurg. 2022;160: e608–e615. pmid:35104658
- 72. Ghent F, Mobbs RJ, Mobbs RR, Sy L, Betteridge C, Choy WJ. Assessment and Post-Intervention Recovery After Surgery for Lumbar Disk Herniation Based on Objective Gait Metrics from Wearable Devices Using the Gait Posture Index. World Neurosurg. 2020;142: e111–e116. pmid:32585383
- 73. Gilmore SJ, Davidson M, Hahne AJ, McClelland JA. The validity of using activity monitors to detect step count after lumbar fusion surgery. Disabil Rehabil. 2020;42: 863–868. pmid:30326754
- 74. Kim DH, Nam KH, Choi BK, Han IH, Jeon TJ, Park SY. The Usefulness of a Wearable Device in Daily Physical Activity Monitoring for the Hospitalized Patients Undergo-ing Lumbar Surgery. J Korean Neurosurg Soc. 2019;62: 561–566. pmid:31337197
- 75. Mobbs RJ, Phan K, Maharaj M, Rao PJ. Physical Activity Measured with Accelerometer and Self-Rated Disability in Lumbar Spine Surgery: A Prospective Study. Global Spine J. 2016;6: 459–464. pmid:27433430
- 76. Mobbs RJ, Mobbs RR, Choy WJ. Proposed objective scoring algorithm for assessment and intervention recovery following surgery for lumbar spinal stenosis based on relevant gait metrics from wearable devices: the Gait Posture index (GPi). Journal of Spine Surgery. 2019;5: 300–309. pmid:31663040
- 77. Schulte TL, Schubert T, Winter C, Brandes M, Hackenberg L, Wassmann H, et al. Step activity monitoring in lumbar stenosis patients undergoing decompressive surgery. European Spine Journal. 2010;19: 1855–1864. pmid:20186442
- 78. Stienen MN, Rezaii PG, Ho AL, Veeravagu A, Zygourakis CC, Tomkins-Lane C, et al. Objective activity tracking in spine surgery: a prospective feasibility study with a low-cost consumer grade wearable accelerometer. Sci Rep. 2020;10: 1–11. pmid:32188895
- 79. Voglis S, Ziga M, Zeitlberger AM, Sosnova M, Bozinov O, Regli L, et al. Smartphone-based real-life activity data for physical performance outcome in comparison to conventional subjective and objective outcome measures after degenerative lumbar spine surgery. Brain and Spine. 2022;2. pmid:36248147
- 80. Deyo RA, Battie M, Beurskens AJHM, Bombardier C, Croft P, Koes B, et al. Outcome measures for low back pain research: A proposal for standardized use. Spine (Phila Pa 1976). 1998;23. pmid:9779535
- 81. Clement RC, Welander A, Stowell C, Cha TD, Chen JL, Davies M, et al. A proposed set of metrics for standardized outcome reporting in the management of low back pain. Acta Orthop. 2015;86: 523–533. pmid:25828191
- 82. Froud R, Patel S, Rajendran D, Bright P, Bjørkli T, Buchbinder R, et al. A Systematic Review of Outcome Measures Use, Analytical Approaches, Reporting Methods, and Publication Volume by Year in Low Back Pain Trials Published between 1980 and 2012: Respice, adspice, et prospice. PLoS One. 2016;11: e0164573. pmid:27776141
- 83. Fitzgerald GK, Hinman RS, Zeni J, Risberg MA, Snyder-Mackler L, Bennell KL. OARSI Clinical Trials Recommendations: Design and conduct of clinical trials of rehabilitation interventions for osteoarthritis. Osteoarthritis Cartilage. 2015;23: 803–814. pmid:25952351
- 84. Master H, Pennings JS, Coronado RA, Bley J, Robinette PE, Haug CM, et al. How Many Steps Per Day During the Early Postoperative Period are Associated With Patient-Reported Outcomes of Disability, Pain, and Opioid Use After Lumbar Spine Surgery? Archives of Physical Medicine and Rehabilitation. Arch Phys Med Rehabil. 2021;102: 1873–1882. pmid:34175276
- 85. Gilmore SJ, McClelland JA, Davidson M. Physiotherapy management of patients undergoing lumbar spinal surgery: A survey of Australian physiotherapists. New Zealand Journal of Physiotherapy. 2016;44.
- 86. Rushton A, Wright C, Heap A, White L, Eveleigh G, Heneghan N. Survey of current physiotherapy practice for patients undergoing lumbar spinal fusion in the United Kingdom. Spine (Phila Pa 1976). 2014;39. pmid:25188595
- 87. Williamson E, White L, Rushton A. A survey of post-operative management for patients following first time lumbar discectomy. European Spine Journal. 2007;16. pmid:16944223
- 88. Gilmore SJ, Hahne AJ, Davidson M, Mcclelland JA. Predictors of substantial improvement in physical function six months after lumbar surgery: is early post-operative walking important? A prospective cohort study. BMC Musculoskelet Disord. 2019;20. pmid:31506099
- 89. Chiarotto A, Ostelo RW, Turk DC, Buchbinder R, Boers M. Core outcome sets for research and clinical practice. Braz J Phys Ther. 2017;21: 77–84. pmid:28460714
- 90. Boers M, Kirwan JR, Wells G, Beaton D, Gossec L, D’Agostino MA, et al. Developing Core Outcome Measurement Sets for Clinical Trials: OMERACT Filter 2.0. J Clin Epidemiol. 2014;67: 745–753. pmid:24582946
- 91. Middlebrook N, Rushton AB, Abichandani D, Kuithan P, Heneghan NR, Falla D. Measures of central sensitization and their measurement properties in musculoskeletal trauma: A systematic review. European Journal of Pain (United Kingdom). 2021;25: 71–87. pmid:33034137
- 92. Jakobsson M, Gutke A, Mokkink LB, Smeets R, Lundberg M. Level of Evidence for Reliability, Validity, and Responsiveness of Physical Capacity Tasks Designed to Assess Functioning in Patients With Low Back Pain: A Systematic Review Using the COSMIN Standards. Phys Ther. 2019;99: 457–477. pmid:30566577
- 93. Gagnier JJ, Lai J, Lidwine ·, Mokkink B, Terwee CB. COSMIN reporting guideline for studies on measurement properties of patient-reported outcome measures. Quality of Life Research. 2021;30: 2197–2218. pmid:33818733
- 94. Turner L, Shamseer L, Altman DG, Schulz KF, Moher D. Does use of the CONSORT Statement impact the completeness of reporting of randomised controlled trials published in medical journals? A Cochrane reviewa. Syst Rev. 2012;1: 1–7.
- 95. Elsman EBM, Butcher NJ, Mokkink LB, Terwee CB, Tricco A, Gagnier JJ, et al. Study protocol for developing, piloting and disseminating the PRISMA-COSMIN guideline: a new reporting guideline for systematic reviews of outcome measurement instruments. Syst Rev. 2022;11: 1–13.
- 96. Chehade MJ, Yadav L, Kopansky-Giles D, Merolli M, Palmer E, Jayatilaka A, et al. Innovations to improve access to musculoskeletal care. Best Pract Res Clin Rheumatol. 2020;34: 101559. pmid:32718885