Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Core outcome sets for spinal and associated limb, trunk, abdomen or pelvic pain: A systematic review

  • Tim Noblet ,

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Project administration, Writing – original draft, Writing – review & editing

    timnoblet@hotmail.com

    Affiliations St George’s University Hospitals NHS Foundation Trust, London, United Kingdom, School of Physical Therapy, Western University, London, Ontario, Canada, Faculty of Medicine, Health and Human Sciences, Lifespan Health and Wellbeing Research Centre, Macquarie University, Australia

  • Carol Li,

    Roles Data curation, Methodology, Writing – review & editing

    Affiliation Faculty of Health Sciences, Western University, London, Ontario, Canada

  • Richard Newsham-West,

    Roles Data curation

    Affiliation Australian and New Zealand College of Anaesthetists (ANZCA), Melbourne, Australia

  • David Walton,

    Roles Conceptualization, Data curation, Methodology, Writing – review & editing

    Affiliation School of Physical Therapy, Western University, London, Ontario, Canada

  • Alison Rushton

    Roles Conceptualization, Data curation, Formal analysis, Methodology, Writing – review & editing

    Affiliations St George’s University Hospitals NHS Foundation Trust, London, United Kingdom, School of Physical Therapy, Western University, London, Ontario, Canada

Abstract

Background

Spinal pain is a significant global health issue, affecting millions and ranking as one of the leading causes of disability worldwide. Despite the wide scope of research conducted on spinal and associated pain, the lack of standardised core outcome measures poses challenges for comparing and synthesising research data. Core Outcome Sets (COSs) are intended to harmonise assessment and facilitate comparison across studies. This review aimed to identify, map, and examine published core outcome sets (COSs) designed for the assessment of spinal pain—including cervical, thoracic, lumbar—and spinal-related limb, trunk, abdomen, or pelvic pain. It also sought to synthesise consistent outcome domains across these COSs, categorising them by anatomical region and measurement type, including patient-reported, physical, biological, psychological, social, and environmental measures.

Methods

This systematic review followed PRISMA guidelines and was registered with PROSPERO. A comprehensive literature search of 13 electronic databases and grey literature sources was conducted from 2000 to April 2025. Two independent reviewers assessed study eligibility and quality using predefined criteria. Data extraction was performed to identify core outcome domains, and a thematic analysis was conducted to categorise domains based on anatomical regions, patient-reported outcomes, performance measures, and biopsychosocial factors.

Results

Thirteen studies met inclusion criteria, addressing core outcome sets for cervical (n = 4), thoracolumbar (n = 1), and lumbar (n = 8) spinal regions. Patient-reported outcome measures were the most frequently recommended outcome type. The most commonly endorsed domains were physical function n = 9 (100%), pain intensity n = 8 (88.9%), participation in work or daily activities n = 7 (77.8%), and disability n = 6 (66.7%). However, few studies incorporated psychological, social, environmental, or physiological domains, highlighting critical gaps in the multidimensional assessment of spinal pain.

Conclusion

This systematic review identified key domains in current use and significant gaps in biopsychosocial and biological measurement. Findings will support researchers, clinicians, and policymakers in selecting appropriate outcomes for spinal pain research and practice. A Delphi study to develop an internationally agreed “Essential Universal Set” for spinal pain, inclusive of multidimensional biopsychosocial domains, is a sound next step.

Introduction

Spinal pain is a complex and multifaceted health issue, with diverse aetiologies, clinical presentations, and treatment approaches [1]. On a global scale, approximately 790 million people experience spinal pain along with associated pain in the limbs, trunk, abdomen, or pelvis at any given time [2]. Spinal pain has persistently ranked as the leading cause of disability spanning from adolescence to older adulthood, with the number of disability-affected years rising worldwide and has consequently been identified as the leading need for rehabilitation intervention [25]. The costly consequences of spinal pain, both in terms of health and economics, are exacerbated by low-value care, inefficient and fragmented care pathways, and the absence of a person-centred approach [68]. These costs are projected to escalate further in the coming decades [2,6,8].

The research investigating spinal pain displays remarkable diversity in its approaches, methodologies, and areas of focus [6]. Researchers and practitioners from various disciplines, including medicine, physical therapies, psychology, neuroscience and public health, have contributed to the extensive body of knowledge surrounding spinal pain [1,6,810]. This research encompasses a broad range of topics, such as the aetiology, epidemiology, diagnostic methods, treatment modalities, rehabilitation techniques, and psychosocial factors influencing pain experience and management [8,9,1113]. Moreover, studies explore different populations, including adults, adolescents, and older adults, as well as specific subgroups like athletes or those with specific spinal conditions [1,6]. Research methods vary, including clinical trials and observational studies, systematic reviews, qualitative interviews, and other designs [1,14,15]. This diversity in research can be helpful for providing a comprehensive understanding of spinal pain, enabling the development of tailored interventions and improved outcomes for people affected by this complex condition [1,2,6].

The diversity of research also poses challenges in comparing and synthesising data, thereby limiting the ability to draw robust conclusions and make evidence-based decisions [6,1618]. The lack of standardisation in outcomes particularly hampers meaningful comparison, data pooling, and the development of clinical guidelines. In an effort to harmonize outcomes across studies, several Core Outcome Sets (COSs) have been published intended for use in research on spinal pain. COSs consist of standardised measurements that capture the essential domains of interest in a specific health condition, such as spinal pain [1619]. These COSs enable consistent evaluation of interventions and facilitate comparison across studies of, for example, treatment effectiveness. By promoting research quality, reliability, and transparency, COSs stand to contribute significantly to advancements in the understanding and management of spinal pain and associated pain conditions [9,1620].

Several core outcome sets (COSs) have been developed for spinal pain and related musculoskeletal conditions over the past two decades. Notable examples include outcome sets for low back pain [21,22] whiplash-associated disorders [23] and spinal deformity surgery [24]. However, these efforts have generally focused on isolated anatomical regions or specific conditions, leading to fragmentation in measurement approaches. To date, no systematic review has comprehensively mapped and appraised COSs across the full spectrum of spinal pain and its common referred pain presentations,

Despite several relevant COSs being published, there has yet to be widespread consensus on the use of any one set. This poses a problem for both research and clinical practice in the field of spinal pain [9,1620]. To tackle these issues, this study aims to systematically collate, examine, and evaluate the existing COSs employed in assessing spinal and associated pain,

Aim

To identify what core outcome sets are currently used in the assessment of spinal pain and associated limb, trunk, abdomen, or pelvic pain, and to thematically analyze the domains to identify consistencies and gaps across them.

Objectives

  1. To identify, map and examine published core outcome sets intended for use in the assessment of: spinal (cervical, thoracic, lumbar) pain; spinal related lower limb pain; spinal related upper limb pain; and spinal related trunk, abdomen or pelvic pain.
  2. To synthesise the consistent domains across core outcome sets for spinal pain and associated limb, trunk, abdomen, or pelvic pain. For easier interpretation, domains are conceptualized and categorized according to anatomical region of relevance, mode of implementation (patient-reported, physical/performance-based, or clinician/observer-rated) and broad construct (e.g., biological, physiological, psychological, social, environmental).

Methods

This systematic review was conducted according to a pre-defined protocol with pre-specified eligibility criteria, in order to identify and analyse empirical research relating to the study’s objective [25]. This methodology was selected as a systematic review aims to critically appraise and synthesise results, providing an explicit overview and detailed map of the current evidence [2628]. To maximise the confidence of findings and minimise bias, the protocol was informed by the Cochrane handbook [25,2931] to facilitate the completion of low risk of bias systematic review by using recommendations for appropriate methodological decision-making informed by empirical evidence [25]. For clarity and transparency, this review is reported in accordance with the PRISMA statement (Preferred Reporting Items for Systematic Reviews and Meta-analyses) developed by a panel of international experts to ensure the quality of reporting [31,32].

Protocol and registration

The systematic review is registered with PROSPERO (CRD 42023473529), a register of anticipated health and social care systematic reviews held by the University of York Centre for Reviews and Dissemination. Registration ensures transparency, reducing bias caused by potential selective reporting and was essential to avoid duplication [29,33].

Eligibility criteria

An adapted PICOS framework ((P) patient population or the disease being addressed, (I) interventions or exposure, (C) comparator/control group, (O) outcome measure, (S) study design)) was used to inform eligibility criteria [25,34]. PICOS enables specific elements of clinical evidence to be identified in a systematic review [25,34,35]. The eligibility criteria are detailed in Fig 1.

Information sources

The literature search employed sensitive topic-based strategies designed for each of the sources identified in Fig 2. The specific databases searched were selected following discussion with subject matter experts, methodologists, and advice from specialist librarians at St George’s University of London (UK) and Western Univerity (Ontario, Canada). The information resources accessed were chosen to enable a holistic literature search [25,29,36]. Grey literature was explored using ProQuest Dissertations and Theses, along with trial registers such as clinicaltrials.gov and the International Clinical Trials Registry Platform of the World Health Organization. In addition, experts in the area were consulted to detect any further studies [25,3638].

Search strategy

Pre-defined search terms and combinations, with database specific standardised vocabulary were employed to ensure all relevant studies were retrieved [25,3638]. This was essential to ensure that no studies were missed due to different words being used to describe the same concepts [25,3638]. The search of databases encompassed the period over the 25-year window from 01.01.2000 to 01.04.2025 to ensure all contemporary COSs were captured. Fig 3 demonstrates the MEDLINE (Ovid) search strategy that was adapted to meet search terms of other included databases. Where a protocol or pilot study was identified, the definitive study was sought, or the authors were contacted to determine whether further published or unpublished research has been undertaken. The reference lists of the included studies were searched to ensure no studies were missed [36,38].

Data management

A web-based software platform (COVIDENCE, Veritas Health Innovation, Melbourne Australia) was used for managing various stages of systematic and other literature reviews. This platform facilitated the importation of all citations, assisted in eliminating duplicates arising from the search strategy, and supported the determination of eligibility throughout the screening and review phases [39].

Selection process

Two investigators searched the information sources (TN + CL) and independently assessed studies for inclusion by grading each eligibility criterion. The use of two investigators reduced the possibility of missing relevant literature or excluding relevant research where challenging judgements were necessary for selection or rejection of a study [33]. In the event of a selection disagreement a third reviewer (AR/DW, methodological experts) was consulted [33,37]. Both reviewers independently evaluated studies by title and abstract for potential eligibility. Following discussion between reviewers, if a study could not be explicitly excluded based on its title and abstract, its full text was reviewed [29,31]. All potentially relevant studies proceeded forward to full text review after which the two reviewers made independent judgements as to whether an individual study met the eligibility criteria. Any conflict was resolved with a 3rd reviewer. The numbers of studies included and excluded at the different stages was recorded using a PRISMA flow diagram [25,33,36].

Data collection process

Data extraction and charting was performed by the primary reviewer (TN) and checked and agreed by the secondary reviewer (CL). Data collection used pre-determined data collection sheets specific to the review objectives which were piloted, refined and agreed by the researchers prior to use [33,36]. Any differences were resolved at a consensus meeting of all researchers [37], and the third reviewers (AR/DW) checked for consistency and clarity.

Data items and outcomes

The following data items and outcomes were extracted from all included studies: authors, date of publication, study aim(s)/objective(s) and design, anatomical region of interest, definition of the condition the COS was used for, recommended outcome measures included within the COS, including verbatim domains or constructs (or specific tools) endorsed and recommendations for the frequency of use for patient assessment [25,27,36].

Quality (including internal validity) assessment of individual studies

As of this review there were no widely-accepted risk of bias (RoB) tools published for critical appraisal of Delphi or consensus research, which was anticipated to be the most commonly used method for development of the COSs. Accordingly, for purposes of this review we adapted the Conducting and Reporting of Delphi Studies (CREDES [40]) best practice framework to evaluate the quality (including internal validity) of the included studies regarding conduct and reporting [40].

While other frameworks exist for evaluating COS development and reporting—namely COS-STAD (Core Outcome Set–Standards for Development) [41] and COS-STAR (Core Outcome Set–Standards for Reporting) [42]—these were not used directly in this review. COS-STAD provides methodological guidance for COS design, and COS-STAR outlines standards for transparent reporting. Despite their distinct focuses, both frameworks share key domains such as clear rationale, stakeholder engagement, consensus methodology, and scope definition. The CREDES checklist was chosen over COS-STAD and COS-STAR because it integrates domains relevant to both frameworks and is specifically tailored to Delphi and consensus methodology [4042]. This made it the most suitable tool for appraising both the design and reporting quality of the included studies, capturing aspects of internal validity, transparency, and methodological consistency across the consensus-based COS literature.

CREDES was developed by methodological experts to enable quality and consistency in the conduct, reporting and appraisal of Delphi studies, using a 16-item checklist [40] specifically tailored to Delphi and consensus methodology [4042]. A modified version of the ‘Buchbinder appraisal scale’ (BAS) was used to evaluate each included study against the 16-items within the CREDES [4345]. The BAS awards one point for fully meeting a criterion, half a point for partially meeting a criterion, or zero points for not meeting a criterion or being unable to score due to a deficit in the information reported [4345]. Therefore, a quality score between 0–16 was reported of each of the included studies [40]. The BAS was originally developed for the critical appraisal of treatment based classification systems across 7 criteria with an overall Intraclass Correlation Coefficient (ICC) of 0.82 for inter-rater reliability [43,44]. The scale has now been adopted successfully across a range of studies where quality assessment tools are yet to be developed and validated [43,44].

Synthesis of results

A three-step process was used to synthesise the data.

Stage 1: Identification and mapping of COSs

An explanation of each included study’s characteristics and COS outcomes/recommendations were tabulated.

Stage 2: Synthesis of outcome measures across regions of the spine

For each anatomical region, data across the studies were synthesised to identify the assessment meaures recommended in the following categories: patient reported measures (PROMs), physical/ performance-based measures, biological measures, clinician-assessed psychological measures, social measures, environmental/ ecological measures [2628]. A narrative synthesis was used to synthesise the measures within spinal pain and spinal related limb, chest, abdominal, and pelvic pain ‘regions’ [27,29,37,46].

Stage 3: identification of recurrent outcome domains across COSs

The grouping of data across studies and spinal region into core domains, was completed by using the tabulated data from stages 1 and 2. The constructs endorsed within each core outcome measure set were grouped through thematic content analysis [47]. This process started by reviewing and re-reviewing the verbatim constructs included within outcome measures within each COS, and where specific PROMs were endorsed, returning to the literature as needed to define the construct(s) being measured by each. Domains were interpreted verbatim per the COS authors, while one-off or customized COS-specific questions or questionnaires were interrogated for the domain(s) with which they most aligned based either on the original development publications of those tools or, where the question or tool lacked adequate theoretical grounding in its initial development, interpreted against contemporary domains of health and wellness with the International Classification of Functioning, Disability, and Health (ICF) as the primary framework. Each domain, question, or PROM were therefore aligned with one or more dominant domains for thematic analysis.

Following extraction and interpretion of the most accurate construct from all measures included in the COSs, the thematic grouping moved from specific to broad, wherein groups of like constructs were clustered together to form ‘Sub-Domains’ that continue to be clustered into conceptually-distinct ‘Domains’ with consensus reached by all authors. A domain was considered to be in “common use” if it was reported in more than two thirds (66.6%) of the included core outcome sets. This threshold was selected based on its use in previous quality studies assessing the frequency of domains within patient-reported outcome measures (PROMs) [48].

Certainty of evidence

Assessment of the overall certainty or confidence in the evidence was not conducted in this systematic review. Established frameworks such as GRADE (Grading of Recommendations Assessment, Development and Evaluation) [49,50] and GRADE-CERQual (Confidence in the Evidence from Reviews of Qualitative Research) [51] were not appropriate for the included studies, as they are primarily designed for assessing certainty in intervention studies and qualitative evidence, respectively. The studies included in this review were primarily Delphi and consensus-based, which lack a suitable, validated framework for assessing certainty of evidence [4951]. As such, no formal assessment of certainty was applied.

Results

Study selection

The search returned n = 8,046 potentially relevant studies. Following screening for duplicates, n = 5,790 citations remained. One additional study was identified from the Internet searches, and one from hand-searching the citation lists of the remaining articles, both of which were subsequently screened. Reviewing by title and abstract excluded n = 5,751 studies that clearly did not meet the eligibility criteria. No relevant unpublished studies were found, and no further studies were identified from reviews of the national research register or via experts in the field. The full texts of the remaining n = 39 studies were examined in detail and evaluated against the eligibility criteria.

Two studies—White et al (2004) [52] and Rebbeck et al (2007) [53]—were ‘whiplash associated disorder’ (WAD)- and ‘mechanical neck pain’-specific adaptations of a COS developed by Deyo et al (1998) [21] for low back pain. While the Deyo publication (1998) pre-dated publication cut-off of 2000, it was retained in the final selection of sources as the two subsequent adaptations were otherwise uninterpretable without the context of the original. All three were individually rated for quality and included separately in stages 1 and 2 of the data synthesis. In stage 3, they were considered a single COS (with region-specific adaptations) for data extraction and synthesis, as their constructs were consistent. Similarly, Verburg et al (2019) [24] and Verburg et al (2021) [54] were two studies from one programme of research. Both met eligibility criteria and were retained. Again, they were rated and synthesised separately in stages 1 and 2, but treated as a single COS in stage 3 as both retained the same outcome domains although the later study introduced outcome-based quality indicators for patient-reported outcome measures (PROMs). For the same reasons Chen et al (2019) [55] and Sterling et al (2023) [23] were also considered a single study in stage 3, as Sterling et al (2023) [23] provided specific measurement instruments for the COS constructs initially defined by Chen et al (2019) [55].

Thirteen studies were included in the systematic review. 100% inter-rater agreement was achieved following open discussion at each stage. Third reviewer mediation was not required. Fig 4 presents the number of studies at each stage of the selection process.

Stage 1: Identification and mapping of core outcome measure sets

Study characteristics

Studies were published between 1998–2023. Study designs included Delphi technique (n = 5) [22,24,5557], other consensus methods (n = 6) [21,23,54,5860], and adaptations and evaluation of existing COS (produced through consensus methods) to produce region-specific COS (n = 2) [52,53].

Four studies described COSs for the cervical spine region [23,52,53,55]: three focused on whiplash-associated disorders [23,53,55] and one on mechanical neck pain [52]. Eight studies described COSs for the lumbar spine region exclusively [21,22,24,54,56,5860]. Of these, four addressed ‘nonspecific’ or ‘mechanical’ low back pain [21,24,54,58], two were intended for all low back pain excluding cases with sinister pathology, inflammatory disease, infection, or fracture [22,56], one focused exclusively on chronic low back pain [59], and one was inclusive of both acute and chronic low back pain [60]. One study investigated the thoracolumbar region (including both thoracic and lumbar regions) [57], specifically targeting adolescent and young adult patients with spinal deformity.

Six studies developed COSs specifically for research purposes [2123,55,59,60], seven for clinical practice [24,5254,56,57], and one for both research and clinical settings [58]. COSs developed for clinical practice were generally shorter, including fewer domains or measures, whereas those designed for research settings tended to be more comprehensive, incorporating a broader array of domains or measures.

In terms of scope, two studies aimed to define core outcome domains only [55,58], one focused on defining appropriate patient-reported outcome measures (PROMs) [24], and ten defined both outcome domains and the PROMs to assess them [2123,5254,56,57,59,60].

Only three studies explicitly defined the frequency with which the COS should be used [24,56,60]. One study recommended specific application strategies tailored to individual studies [59], while the remaining nine did not specify frequency of use [2123,5255,57,58].

Quality assessment

Quality assessment data can be found in Supplementary file 1. Scores ranged from 0.5 to 15 out of a maximum of 16 points. The most commonly met quality domains across studies included clear justification for using the Delphi or consensus technique, well-defined consensus procedures, and adequate interpretation of results. High-scoring studies consistently addressed these criteria. In contrast, the least commonly met domains were prevention of bias, external validation, and detailed reporting of expert panel and participant recruitment.

Results of individual studies

The data and quality assessment outcome from individual studies (COSs) are presented in Table 1.

Results of syntheses

Stage 2: Synthesis of outcome measures.

Table 2 presents a synthesis of outcome measures across spinal regions, structured according to the outcome domains reported within each included study. Each outcome is categorised under the domain as defined by the original authors of the respective core outcome set.

thumbnail
Table 2. Synthesis of outcome measures across regions of the spine.

https://doi.org/10.1371/journal.pone.0338184.t002

Cervical spine COSs

Three studies [23,52,53] with a quality rating of 0.5 to 14/16 reported outcome meaures in COSs for the cervical spine. All three COSs endorsed standardized PROMs. None included physical/performance-based, biological, non-patient-reported psychological, social, or environmental/ecological measures.

Pain was the most commonly included domain, appearing in all three COSs [23,52,53]. The Visual Analogue Scale (VAS) was included for use in all three. Two [52,53] (quality 0.5/16) included a custom symptom scale, while one [23] (quality 14/16) included the Numeric Pain Rating Scale (NPRS).

Disability was included in two COSs [52,53] (both quality 0.5/16) using tailored questions about daily activity limitation and time lost from work or school due to symptoms.

Percieved Recovery (global percieved change) was included in one COS [23] (quality 14/16) using a 7- or 11-point Global Rating of Change (GROC) scale.

Physical function was included in one COS [23] (quality 14/16) using both the Neck Disability Index (NDI) and the Whiplash Disability Questionnaire (WDQ).

Psychological functioning was included in one COS [23] (quality 14/16), which recommended the use of validated instruments including the Pictorial Fear of Activity Scale-Cervical (PFActS.-C.), the Pain Self-Efficacy Questionnaire, and the Pain Catastrophising Scale.

Satisfaction with care was included in one COS [52] (quality 0.5/16) using a single-item measure regarding the participant’s overall impression of their medical care.

Social function was included in all three COSs [23,52,53]. Two C.O.S. [52,53] (both quality 0.5/16) included a ‘PROM question’ that asked how pain interfered with work and home responsibilities. One C.O.S. [23] (quality 14/16), included two ‘PROM questions’, asking about employment status (e.g., modified duties, disability pension) and the percentage of usual work hours performed.

Wellbeing assessment was included in two COSs [52,53] (quality 0.5/16) using a single global item asking participants how they would feel if their current symptoms persisted for the rest of their lives.

Thoracic spine COSs

One study [57] with a quality rating of 12.5/16 reported a COS for the thoracic spine comprised entirely of standardized PROMs. The Scoliosis Research Society–22 revised questionnaire (SRS-22r) was the primary PROM included, capturing multiple outcome domains relevant to thoracic spine conditions. These included pain, physical function, recreation and leisure, self-image, and overall outcome of surgery. In addition, satisfaction with the cosmetic result was assessed post-operatively using the same instrument. The COS also included the EuroQoL-5D (EQ-5D) to assess overall quality of life. An extended version of this COS specific to surgical outcomes was also presented that included clinician-assessed outcomes such as re-operation rates, post-operative complications, and changes in spinal deformity. No other outcome types—such as physical or performance-based measures, non-patient-reported psychological or social measures, or environmental/ecological measures—were included,

Lumbar spine COSs

Nine studies (exclusively lumbar n=8, thoracolumbar n=1) [21,22,24,54,5660] with a quality rating range of 2 to 15/16 reported COSs for the lumbar spine. All nine COSs included PROMs to assess multiple outcome domains. One COS [61] (quality 5/16) included physical/performance-based measures and two COSs [60,61] (both with quality rating 5/16) included biological measures (blood tests, MRIs and radiographs). None included non-patient-reported psychological or social outcomes, or environmental and/or ecological measures.

Pain was the most frequently reported outcome domain, included in seven COSs (quality 2.5-15/16). The Numeric Pain Rating Scale (NPRS) was included in four COSs [22,24,54,56] with quality scores ranging from 2.5 to 15/16. One C.O.S. [21] (quality 2/16) included the Visual Analogue Scale (VAS), one COS [60](quality 5/16) included the Numeric Pain Scale (NPS), and two COSs [21,60] (quality 2-5/16) included custom symptom-specific questions. One thorocolumbar scoliosis-specific COS [57] (quality 12.5/16) included the SRS-22r which includes a pain subscale as part of a multidimensional PROM.

Disability was included in five COSs [21,24,54,56,60] (quality 2 to 13/16). These included: The Oswestry Disability Index (ODI) in two [24,56] (quality 7.5-13/16), the Quebec Back Pain Disability Scale (QBPDS) in two [24,54] (quality 2.5-13/16), and the Roland-Morris Disability Questionnaire (RMDQ) in one COS [60] (quality 5/16). One C.O.S. [21] (quality 2/16) included single-item disability questions, asking about days spent in bed, missed work, or restricted normal activity due to back or leg pain.

Physical function assessment was included in six COSs [21,22,24,54,57,61] (quality 2-15/16), using both validated PROMs and COS-specific structured questions. The O.D.I. 2.1a was included in three COSs [21,22,61] (quality 2-15/16). The Patient-Specific Functional Scale (PSFS) was included in two COS [24,54] (quality 2.5-13/16), while the RMDQ was included in three [21,22,61] (quality 2-15/16). One COS [57] (quality 12.5/16) included the SRS-22r to assess physical functioning as part of a multidimensional PROM. One COS [21] (quality 2/16) also included a single question on pain interference with work and home responsibilities.

Quality of life was included in four COS [22,56,57,61] (quality 5-15/16). The E.Q.-5D. was included in three COSs [22,56,57] (quality7.5-15/16), while the EuroQol visual analogue scale (E.Q.-VAS) was included for use by one [56] (quality 7.5/16). The S.F.-12 was recommended by one COS [22] (quality 15/16). Two C.O.S.s [22,61] (quality 5-15/16) included the P.R.O.M.I.S.-GH-10.

Wellbeing was included in two COSs [21,60] (quality 2-5/16). One C.O.S. [21] (quality 2/16) included the EuroQol, the S.F.-12, and included a question asking how patients would feel if their current symptoms persisted for life. One C.O.S. [60] (quality 5/16) included the S.F.-36.

Patient-reported Psychological Function were included in two COSs. Both the Pain Catastrophising Scale and Fear-Avoidance Beliefs Questionnaire (FABQ) were included in one COS [60] (quality 5/16). Two C.O.S.s [60,61] (quality- both studies 5/16) included measures to evaluate depression: one [61] included the Beck Depsression Inventory (BDI-II) and Center for Epidemiological Studies – Depression scale (CES-D), while the other [60] included the Patient Health Questionnaire-9 (PHQ-9).

Satisfaction with care was included in four COSs [21,24,54,60] (quality 2.5-13/16). Two C.O.S.s [21,60] (quality 2-5/16) included a single-item satisfaction question. Two others [24,54] (quality 2.5-13/16) included the Global Perceived Effect scale.

Work status was recommended for inclusion in two COSs [56,60] (quality 5-7.5/16). One [56] (quality 7.5/16) included a general work status question, while one [60] (quality 5/16) included more detailed questions about return-to-work outcomes and days missed due to back or leg pain.

Pre and post-operative outcomes were included in one COS [57] (quality 12.5/16) recommending the SRS-22r to assess the overall outcome of surgery, satisfaction with cosmetic results, recreation and leisure, and self-image.

Physical and performance-based measures were included in one COS [61] (quality 5/16), being clinician-assessed hip medial rotation, lower limb muscle strength, deep tendon reflexes, and straight leg raise (SLR) for patients with radicular symptoms.

Biological measures were included in two COSs [61]. Both included metrics extracted from diagnostic testing including MRI. One [60] (quality 5/16) also included use of blood tests and radiographs where indicated.

Reporting of adverse events was included in one [61] (quality 5/16).

No included studies reported non-patient-reported psychological, social, or environmental/ecological measures.

Stage 3: identification of recurrent outcome domains.

A total of n=82 domains, questions, or PROMs were extracted from across all included COSs detailed in Table 1. Thematic content analysis categorised the 82 recommendations into 22 conceptual domains (Table 3). Of those, four domains were endorsed by >6 (66.66%) of the 9 COSs meeting the threshold for ‘common use’ status: patient rated physical function (100%), pain symptoms/Pain intensity (88.9%), participation - work or school (77.8%), and participation - activities of daily living(66.7%).

thumbnail
Table 3. Presence of each of the 22 thematically-classified Domains endorsed by each of the COSs.

https://doi.org/10.1371/journal.pone.0338184.t003

Reporting biases

For all eligible studies included in this review, complete data were available from the published reports. No additional data were required, and no contact with study authors was necessary to obtain missing information. As such, the risk of reporting bias related to unavailable or selectively reported data was considered minimal.

Discussion

This systematic review comprehensively identified and synthesised core outcome sets for spinal pain and associated pain in the limbs, trunk, abdomen, or pelvis. While previous literature has explored specific outcome measures for specific spinal regions or clinical contexts, this review uniquely brings together COSs across multiple anatomical regions and domains of assessment, offering a pan-spinal overview.

This review identified 13 eligible manuscripts describing COSs (and derivatives thereof) intended for use in the cervical, thoracic, or lumbar spinal regions. Across these, PROMs were the most frequently endorsed outcome type. Core domains, most notably pain intensity, physical function, and participation in work and daily activities, were each recommended in at least two-thirds of the identified COSs suggesting that these are ‘essential’ domains as endorsed by experts and, in some cases, representatives of the patient population.

These findings are closely aligned with those reported by Sabet et al. (2025), whose systematic review of 40 COSs for musculoskeletal conditions similarly highlighted pain and function as the most commonly endorsed domains [62]. This alignment reflects longstanding clinical and research priorities and is consistent with recommendations from established initiatives such as OMERACT and the COMET Initiative, which advocate for the standardisation of core domains in health outcomes research [18,19,63]. The concordance observed reinforces the central importance of these domains in musculoskeletal and spinal research and underscores the necessity of their inclusion in any future COS developed for spinal pain. The consistent prioritisation of domains such as pain intensity and physical function across COSs likely reflects the long-standing biomedical orientation of spinal pain research and the preference for outcomes that are easily measured, clinically interpretable, and responsive to intervention. Conversely, the limited inclusion of psychological, social, and biological domains may result from practical constraints, disciplinary silos in COS development, and concerns regarding feasibility in clinical settings. These patterns suggest that while core domains remain stable, broader biopsychosocial factors, essential to precision and person-centred care, are underrepresented, limiting the comprehensiveness of existing COSs.

Variation in the validated outcome tools used across the included COSs was evident, with several studies employing the same instrument to measure different domains. This was particularly apparent between the domains of physical function and disability, where tools were often applied beyond the scope for which they were originally developed or psychometrically validated. Such inconsistencies raise concerns regarding construct validity and comparability across studies.

The prominence of domains such as pain and physical function across both musculoskeletal and spinal COSs suggests a shared, cross-disciplinary recognition of their value. These domains not only reflect key patient priorities but also provide meaningful indicators of clinical progress and treatment efficacy. However, the review also identified significant gaps. Despite an increasing appreciation of spinal pain as a multifactorial and biopsychosocial condition, few COSs extended beyond traditional domains. Domains such as sleep disturbance, psychological wellbeing, social and environmental determinants, and physical, biological or physiological markers were infrequently represented. These omissions are important, as they suggest that many existing COSs may fall short of supporting the aims of precision rehabilitation or personalised medicine in spinal pain research and practice—a concern amplified by accumulating evidence that factors such as sleep disturbance, psychological wellbeing, and environmental influences significantly impact pain persistence, variability in treatment response, and long-term functional outcomes [9,6467].

A growing body of evidence underscores the need to expand the scope of outcome measurement in spinal pain research to encompass currently underrepresented dimensions. Increasingly, studies have demonstrated that factors such as sleep quality, neuroimmune interactions, and psychological constructs—including resilience, catastrophising, and self-efficacy—influence the onset, progression, and treatment response of spinal pain [9,6472]. Likewise, social and environmental determinants—such as levels of social support, histories of trauma, loneliness, access to healthcare services, and broader ecological exposures—are recognised as pivotal in shaping the pain experience and recovery trajectory, yet these remain largely absent from existing core outcome sets [6,12,20,7376].

Beyond psychosocial and contextual domains, the integration of biological and physical metrics into COS development represents a sound direction for future development of holistic outcome sets. Biomarkers, neuroimaging findings, genetic and epigenetic indicators, and physical performance measures offer the potential to better characterise patient subgroups and tailor therapeutic approaches more precisely. Recent advances in omics technologies and wearable sensor data further enhance this potential by allowing real-time, individualised monitoring of health status and functional capacity [7781]. Owing perhaps to many of the COSs in this review pre-dating the wide and growing accessibility of advanced performance and biological sensors, their absence appears to signal a need for planned regular updates to core sets. In this context, the results of our review can expedite such updates. It appears widely accepted that the four common domains reported herein should remain part of such COSs, enabling development groups to dedicate energies on appraising new technologies for their usefulness in updates sets.

To establish such comprehensive COSs, interdisciplinary collaboration is essential. Researchers from across clinical, social, psychological, and biological sciences must work collaboratively to ensure that COSs are sufficiently multidimensional to support the complexity of spinal pain assessment. Crucially, this includes not only healthcare professionals and academic researchers, but also patients and members of the public [82,83]. Patient and public involvement (PPI) is critical to ensure that the outcomes valued by those living with spinal pain are meaningfully represented [84]. Without this, outcome sets risk being overly researcher-centric, potentially overlooking aspects of health and well-being that patients consider important [83].

To enhance adoption and relevance across settings, COS implementation must be accompanied by a staged, evidence-informed process. First, robust psychometric appraisal of candidate instruments is needed to ensure reliability and validity across spinal populations. Second, feasibility testing in routine clinical environments (across primary, secondary, and tertiary care) is required to determine acceptability and integration potential. Finally, pilot implementation studies should be conducted to evaluate the COS’s impact on clinical decision-making, patient experience, and research data quality. Without this staged approach, there is a risk that even well-constructed COSs may fail to achieve meaningful uptake or influence practice.

The findings of this review demonstrate that COSs developed for clinical practice were generally shorter and included fewer domains—primarily focusing on pain, function, and disability—to reduce burden during consultations. In contrast, COSs designed for research settings tended to incorporate a wider array of domains, including psychosocial, work-related, and quality-of-life measures, reflecting the need for more comprehensive data collection. This clear distinction in scope and content highlights the importance of differentiating COSs by intended use to ensure both feasibility in clinical settings and rigour in research contexts [85]. While overlap between the two is desirable, their purposes differ. Research COSs must enable standardisation and comparison across trials, support meta-analyses, and permit the detection of treatment effects. Clinical COSs, in contrast, must be feasible, efficient, and sensitive to the needs of individual patients. To date, few COSs have clearly articulated whether their intended use is in research, clinical care, or both [85]. This is an area requiring further clarification in future work.

Strengths and limitations

A key strength of this systematic review lies in its methodological rigour and comprehensive scope, incorporating a wide range of core outcome sets (COSs) across multiple spinal regions and related pain presentations. The use of a clearly defined protocol, adherence to PRISMA guidelines, and robust data extraction and synthesis procedures contribute to the overall reliability and transparency of the review. However, an important limitation concerns the quality appraisal of the included studies. Many of the identified studies employed consensus-based methodologies, such as Delphi techniques, for which no universally accepted risk-of-bias tool currently exists. To address this challenge, we used the CREDES framework in combination with a modified version of the Buchbinder Appraisal Scale to evaluate methodological quality [40,4345]. Nonetheless, a number of studies scored poorly on quality assessment, not necessarily due to poor conduct, but because they were undertaken prior to the development of formalised COS reporting and development standards, such as the core outcomes sets standards for design (COS-STAD) and reporting (COS-STAR) [41,42]. As a result, several studies lacked comprehensive reporting on key aspects of methodology, including the rationale for consensus methods, recruitment of expert panels, and stakeholder engagement. This limits the ability to fully appraise internal validity and may have introduced heterogeneity in study quality that could affect the interpretation of findings. Another methodological limitation of this review is the absence of a formal assessment of confidence in the findings. Frameworks such as GRADE and GRADE-CERQual, although robust in intervention and qualitative research settings respectively, were not applicable to the consensus-based studies included in this review [49,51]. This limits the ability to make strong recommendations about which domains or outcome measures should be universally adopted in future COS development and underscores the pressing need for a validated approach to assessing the certainty of evidence in COS development studies.

Conclusion

This systematic review offers the first comprehensive synthesis of COSs currently used to assess spinal pain. It identifies consistent emphasis on core domains such as pain intensity, physical function, and participation in daily and work-related activities, reinforcing their relevance to both clinical practice and research. However, it also highlights fragmentation in outcome measurement and a lack of integration of broader biopsychosocial and biological domains. The existence of several COSs for a single body region (i.e., neck, low back) also poses challenges to the reason such sets exist, intended as a set of core outcomes that are consistently collected across all relevant research studies. With no one COS for these regions being widely implemented, the intention of having a COS has yet to be realized in spinal pain. The common domains across sets may be a sound point of departure for the next iteration of spinal COSs towards widespread adoption.

The findings provide critical insights for researchers, policymakers, and healthcare leaders in selecting outcome measures that support rigorous, clinically safe, and economically sound research and evaluation. Importantly, this review underscores the need for a consensus-based, internationally recognised COS that reflects the multidimensional nature of spinal pain. Developing such a COS will improve study comparability, reduce research waste, enable robust meta-analyses, and guide personalised, high-quality care.

References

  1. 1. Hartvigsen J, Hancock MJ, Kongsted A, Louw Q, Ferreira ML, Genevay S, et al. What low back pain is and why we need to pay attention. Lancet. 2018;391(10137):2356–67. pmid:29573870
  2. 2. Cieza A, Causey K, Kamenov K, Hanson SW, Chatterji S, Vos T. Global estimates of the need for rehabilitation based on the global burden of disease study 2019: a systematic analysis for the global burden of disease study 2019. Lancet. 2021;396(10267):2006–17. pmid:33275908
  3. 3. Vos T, Lim SS, Abbafati C, Abbas KM, Abbasi M, Abbasifard M. Global burden of 369 diseases and injuries in 204 countries and territories, 1990–2019: a systematic analysis for the global burden of disease study 2019. The Lancet. 2020;396(10258):1204–22.
  4. 4. Hoy D, Brooks P, Blyth F, Buchbinder R. The Epidemiology of low back pain. Best Pract Res Clin Rheumatol. 2010;24(6):769–81. pmid:21665125
  5. 5. Hoy D, March L, Brooks P, Blyth F, Woolf A, Bain C, et al. The global burden of low back pain: estimates from the global burden of disease 2010 study. Ann Rheum Dis. 2014;73(6):968–74. pmid:24665116
  6. 6. Buchbinder R, van Tulder M, Öberg B, Costa LM, Woolf A, Schoene M, et al. Low back pain: a call for action. Lancet. 2018;391(10137):2384–8. pmid:29573871
  7. 7. Maher C, Underwood M, Buchbinder R. Non-specific low back pain. Lancet. 2017;389(10070):736–47. pmid:27745712
  8. 8. Foster NE, Anema JR, Cherkin D, Chou R, Cohen SP, Gross DP, et al. Prevention and treatment of low back pain: evidence, challenges, and promising directions. Lancet. 2018;391(10137):2368–83. pmid:29573872
  9. 9. Noblet T, Marriott J, Hensman-Crook A, O’Shea S, Friel S, Rushton A. Independent prescribing by advanced physiotherapists for patients with low back pain in primary care: A feasibility trial with an embedded qualitative component. PLoS One. 2020;15(3):e0229792. pmid:32182243
  10. 10. Noblet T, Jadhakhan F, Bennett A, McCrum C, O’Shea S, Hensman-Crook A, et al. A multi-site prospective, observational study of physiotherapist independent prescribing activity across musculoskeletal clinics in the United Kingdom. Musculoskelet Sci Pract. 2022;61:102616. pmid:35785573
  11. 11. Mistry J, Falla D, Noblet T, Heneghan NR, Rushton A. Clinical indicators to identify neuropathic pain in low back related leg pain: a modified Delphi study. BMC Musculoskelet Disord. 2020;21(1):601. pmid:32900367
  12. 12. Mistry J, Heneghan NR, Noblet T, Falla D, Rushton A. Diagnostic utility of patient history, clinical examination and screening tool data to identify neuropathic pain in low back related leg pain: a systematic review and narrative synthesis. BMC Musculoskelet Disord. 2020;21(1):532. pmid:32778086
  13. 13. Achttien RJ, Powell A, Zoulas K, Staal JB, Rushton A. Prognostic factors for outcome following lumbar spine fusion surgery: a systematic review and narrative synthesis. Eur Spine J. 2022;31(3):623–68. pmid:34705106
  14. 14. Beyera GK, O’Brien J, Campbell S. Health-care utilisation for low back pain: a systematic review and meta-analysis of population-based observational studies. Rheumatol Int. 2019;39(10):1663–79. pmid:31463608
  15. 15. Rushton A, Jadhakhan F, Masson A, Athey V, Staal JB, Verra ML, et al. Patient journey following lumbar spinal fusion surgery (FuJourn): A multicentre exploration of the immediate post-operative period using qualitative patient diaries. PLoS One. 2020;15(12):e0241931. pmid:33259481
  16. 16. Gargon E, Gorst SL, Harman NL, Smith V, Matvienko-Sikar K, Williamson PR. Choosing important health outcomes for comparative effectiveness. 2018.
  17. 17. Hughes KL, Clarke M, Williamson PR. A systematic review finds Core Outcome Set uptake varies widely across different areas of health. J Clin Epidemiol. 2021;129:114–23. pmid:32987162
  18. 18. Williamson PR, Altman DG, Bagley H, Barnes KL, Blazeby JM, Brookes ST, et al. The COMET handbook: version 1.0. Trials. 2017;18:1–50.
  19. 19. Prinsen CAC, Vohra S, Rose MR, Boers M, Tugwell P, Clarke M, et al. How to select outcome measurement instruments for outcomes included in a “Core Outcome Set” - a practical guideline. Trials. 2016;17(1):449. pmid:27618914
  20. 20. Noblet TD, Marriott JF, Rushton AB. Independent prescribing by advanced physiotherapists for patients with low back pain in primary care: protocol for a feasibility trial with an embedded qualitative component. BMJ Open. 2019;9(4):e027745. pmid:31048447
  21. 21. Deyo RA, Battie M, Beurskens AJ, Bombardier C, Croft P, Koes B, et al. Outcome measures for low back pain research. A proposal for standardized use. Spine (Phila Pa 1976). 1998;23(18):2003–13. pmid:9779535
  22. 22. Chiarotto A, Boers M, Deyo RA, Buchbinder R, Corbin TP, Costa LOP, et al. Core outcome measurement instruments for clinical trials in nonspecific low back pain. Pain. 2018;159(3):481–95. pmid:29194127
  23. 23. Sterling M, Andersen T, Carroll L, Connelly L, Côté P, Curatolo M, et al. Recommendations for a core outcome measurement set for clinical trials in whiplash associated disorders. Pain. 2023;164(10):2265–72. pmid:37171189
  24. 24. Verburg AC, van Dulmen SA, Kiers H, Nijhuis-van der Sanden MWG, van der Wees PJ. Development of a standard set of outcome measures for non-specific low back pain in Dutch primary care physiotherapy practices: a Delphi study. Eur Spine J. 2019;28(7):1550–64. pmid:31004195
  25. 25. Higgins JG. Cochrane Handbook for Systematic Reviews of Interventions. The Cochrane Collaboration. 2011.
  26. 26. Munn Z, Peters MDJ, Stern C, Tufanaru C, McArthur A, Aromataris E. Systematic review or scoping review? Guidance for authors when choosing between a systematic or scoping review approach. BMC Med Res Methodol. 2018;18(1):143. pmid:30453902
  27. 27. Peters MDJ, Marnie C, Tricco AC, Pollock D, Munn Z, Alexander L, et al. Updated methodological guidance for the conduct of scoping reviews. JBI Evid Synth. 2020;18(10):2119–26. pmid:33038124
  28. 28. Lockwood C, Dos Santos KB, Pap R. Practical Guidance for Knowledge Synthesis: Scoping Review Methods. Asian Nurs Res (Korean Soc Nurs Sci). 2019;13(5):287–94. pmid:31756513
  29. 29. Akers JA-IR, Baba-Akbari SA. CRD’s Guidance for Undertaking Reviews in Health Care.: York: Centre for Reviews and Dissemination (CRD). 2009.
  30. 30. Shamseer L, Moher D, Clarke M, Ghersi D, Liberati A, Petticrew M, et al. Preferred reporting items for systematic review and meta-analysis protocols (PRISMA-P) 2015: elaboration and explanation. BMJ. 2015;350:g7647. pmid:25555855
  31. 31. Moher D, Shamseer L, Clarke M, Ghersi D, Liberati A, Petticrew M, et al. Preferred reporting items for systematic review and meta-analysis protocols (PRISMA-P) 2015 statement. Syst Rev. 2015;4(1):1. pmid:25554246
  32. 32. Page MJ, McKenzie JE, Bossuyt PM, Boutron I, Hoffmann TC, Mulrow CD, et al. The PRISMA 2020 statement: an updated guideline for reporting systematic reviews. BMJ. 2021;372:n71. pmid:33782057
  33. 33. Liberati A, Altman DG, Tetzlaff J, Mulrow C, Gøtzsche PC, Ioannidis JPA, et al. The PRISMA statement for reporting systematic reviews and meta-analyses of studies that evaluate health care interventions: explanation and elaboration. Ann Intern Med. 2009;151(4):W65-94. pmid:19622512
  34. 34. Methley AM, Campbell S, Chew-Graham C, McNally R, Cheraghi-Sohi S. PICO, PICOS and SPIDER: a comparison study of specificity and sensitivity in three search tools for qualitative systematic reviews. BMC Health Serv Res. 2014;14:579. pmid:25413154
  35. 35. Robinson KA, Saldanha IJ, McKoy NA. Development of a framework to identify research gaps from systematic reviews. J Clin Epidemiol. 2011;64(12):1325–30. pmid:21937195
  36. 36. Hicks C. Research Methods for Clinical Therapists: Applied Project Design and Analysis. 4th ed ed. London, UK: Churchill Livingstone. 1999.
  37. 37. Furlan AD, Pennick V, Bombardier C, van Tulder M, Editorial Board, Cochrane Back Review Group. 2009 updated method guidelines for systematic reviews in the Cochrane Back Review Group. Spine (Phila Pa 1976). 2009;34(18):1929–41. pmid:19680101
  38. 38. Bowling A. Research Methods in Health: Investigating Health & Health Sciences. 2nd ed. Maidenhead, UK: Open University Press. 2001.
  39. 39. Hicks CM. Research methods for clinical therapists: applied project design and analysis. Elsevier Health Sciences. 2009.
  40. 40. Jünger S, Payne SA, Brine J, Radbruch L, Brearley SG. Guidance on Conducting and REporting DElphi Studies (CREDES) in palliative care: Recommendations based on a methodological systematic review. Palliat Med. 2017;31(8):684–706. pmid:28190381
  41. 41. Kirkham JJ, Davis K, Altman DG, Blazeby JM, Clarke M, Tunis S, et al. Core Outcome Set-STAndards for Development: The COS-STAD recommendations. PLoS Med. 2017;14(11):e1002447. pmid:29145404
  42. 42. Kirkham JJ, Gorst S, Altman DG, Blazeby JM, Clarke M, Devane D, et al. Core Outcome Set-STAndards for Reporting: The COS-STAR Statement. PLoS Med. 2016;13(10):e1002148. pmid:27755541
  43. 43. Maissan F, Pool J, de Raaij E, Wittink H, Ostelo R. Treatment based classification systems for patients with non-specific neck pain. A systematic review. Musculoskelet Sci Pract. 2020;47:102133. pmid:32148328
  44. 44. Lam KN, Rushton A, Thoomes E, Thoomes-de Graaf M, Heneghan NR, Falla D. Neck pain with radiculopathy: A systematic review of classification systems. Musculoskelet Sci Pract. 2021;54:102389. pmid:33992884
  45. 45. Buchbinder R, Goel V, Bombardier C, Hogg-Johnson S. Classification systems of soft tissue disorders of the neck and upper limb: do they satisfy methodological guidelines?. J Clin Epidemiol. 1996;49(2):141–9. pmid:8606315
  46. 46. Popay J, Roberts H, Sowden A, Petticrew M, Arai L, Rodgers M. Guidance on the Conduct of Narrative Synthesis in Systematic Reviews. ESRC Methods Programme. 2006.
  47. 47. Braun V, Clarke V. Using thematic analysis in psychology. Qualitative Research in Psychology. 2006;3(2):77–101.
  48. 48. Sabet TS, Anderson DB, Stubbs PW, Buchbinder R, Terwee CB, Chiarotto A, et al. Identifying common core outcome domains from core outcome sets of musculoskeletal conditions: protocol for a systematic review. Syst Rev. 2022;11(1):248. pmid:36403060
  49. 49. Balshem H, Helfand M, Schünemann HJ, Oxman AD, Kunz R, Brozek J, et al. GRADE guidelines: 3. Rating the quality of evidence. J Clin Epidemiol. 2011;64(4):401–6. pmid:21208779
  50. 50. Guyatt GH, Oxman AD, Vist G, Kunz R, Brozek J, Alonso-Coello P, et al. GRADE guidelines: 4. Rating the quality of evidence--study limitations (risk of bias). J Clin Epidemiol. 2011;64(4):407–15. pmid:21247734
  51. 51. Lewin S, Glenton C, Munthe-Kaas H, Carlsen B, Colvin CJ, Gülmezoglu M, et al. Using qualitative evidence in decision making for health and social interventions: an approach to assess confidence in findings from qualitative evidence syntheses (GRADE-CERQual). PLoS Med. 2015;12(10):e1001895. pmid:26506244
  52. 52. White P, Lewith G, Prescott P. The core outcomes for neck pain: validation of a new outcome measure. Spine (Phila Pa 1976). 2004;29(17):1923–30. pmid:15534418
  53. 53. Rebbeck TJ, Refshauge KM, Maher CG, Stewart M. Evaluation of the core outcome measure in whiplash. Spine (Phila Pa 1976). 2007;32(6):696–702. pmid:17413477
  54. 54. Verburg AC, van Dulmen SA, Kiers H, Nijhuis-van der Sanden MWG, van der Wees PJ. Patient-Reported Outcome-Based Quality Indicators in Dutch Primary Care Physical Therapy for Patients With Nonspecific Low Back Pain: A Cohort Study. Phys Ther. 2021;101(8):pzab118. pmid:33929546
  55. 55. Chen K, Andersen T, Carroll L, Connelly L, Côté P, Curatolo M, et al. Recommendations For Core Outcome Domain Set For Whiplash-Associated Disorders (CATWAD). Clin J Pain. 2019;35(9):727–36. pmid:31188173
  56. 56. Clement RC, Welander A, Stowell C, Cha TD, Chen JL, Davies M, et al. A proposed set of metrics for standardized outcome reporting in the management of low back pain. Acta Orthop. 2015;86(5):523–33. pmid:25828191
  57. 57. de Kleuver M, Faraj SSA, Holewijn RM, Germscheid NM, Adobor RD, Andersen M, et al. Defining a core outcome set for adolescent and young adult patients with a spinal deformity. Acta Orthop. 2017;88(6):612–8. pmid:28914116
  58. 58. Cieza A, Stucki G, Weigl M, Disler P, Jackel W, van Der Linden S. ICF core sets for low back pain. J Rehabil Med. 2004;36(s44):69–74.
  59. 59. Deyo RA, Dworkin SF, Amtmann D, Andersson G, Borenstein D, Carragee E, et al. Report of the NIH Task Force on research standards for chronic low back pain. Phys Ther. 2015;95(2):e1–18. pmid:25639530
  60. 60. Pincus T, Santos R, Breen A, Burton AK, Underwood M, Multinational Musculoskeletal Inception Cohort Study Collaboration. A review and proposal for a core set of factors for prospective cohorts in low back pain: a consensus statement. Arthritis Rheum. 2008;59(1):14–24. pmid:18163411
  61. 61. Deyo RA, Dworkin SF, Amtmann D, Andersson G, Borenstein D, Carragee E, et al. Report of the National Institutes of Health task force on research standards for chronic low back pain. J Manipulative Physiol Ther. 2014;37(7):449–67. pmid:25127996
  62. 62. Sabet TS, Anderson DB, Stubbs PW, Buchbinder R, Terwee CB, Chiarotto A, et al. Pain and physical function are common core domains across 40 core outcome sets of musculoskeletal conditions: a systematic review. J Clin Epidemiol. 2025;180:111687. pmid:39864671
  63. 63. Beaton DE, Maxwell LJ, Shea BJ, Wells GA, Boers M, Grosskleg S, et al. Instrument Selection Using the OMERACT Filter 2.1: The OMERACT Methodology. J Rheumatol. 2019;46(8):1028–35. pmid:30709952
  64. 64. Finan PH, Goodin BR, Smith MT. The association of sleep and pain: an update and a path forward. J Pain. 2013;14(12):1539–52. pmid:24290442
  65. 65. Batley S, Aartun E, Boyle E, Hartvigsen J, Stern PJ, Hestbæk L. The association between psychological and social factors and spinal pain in adolescents. Eur J Pediatr. 2019;178(3):275–86. pmid:30465273
  66. 66. Hruschak V Msw Rsw, Cochran G. Psychosocial and environmental factors in the prognosis of individuals with chronic pain and comorbid mental health. Soc Work Health Care. 2017;56(7):573–87. pmid:28594600
  67. 67. Tran ST, Koven ML, Castro AS, Goya Arce AB, Carter JS. Sociodemographic and Environmental Factors are Associated with Adolescents’ Pain and Longitudinal Health Outcomes. J Pain. 2020;21(1–2):170–81. pmid:31255798
  68. 68. Leccese A, Severo M, Ventriglio A, Petrocchi S, Limone P, Petito A. Psychological interventions in patients with physical pain: A focus on catastrophizing and resilience—a systematic review. MDPI. 2025.
  69. 69. Ahmed SA, Shantharam G, Eltorai AEM, Hartnett DA, Goodman A, Daniels AH. The effect of psychosocial measures of resilience and self-efficacy in patients with neck and lower back pain. Spine J. 2019;19(2):232–7. pmid:29906617
  70. 70. Gonzalez-Alvarez ME, Sanchez-Romero EA, Turroni S, Fernandez-Carnero J, Villafañe JH. Correlation between the Altered Gut Microbiome and Lifestyle Interventions in Chronic Widespread Pain Patients: A Systematic Review. Medicina (Kaunas). 2023;59(2):256. pmid:36837458
  71. 71. Tonelli Enrico V, Vo N, Methe B, Morris A, Sowa G. An unexpected connection: A narrative review of the associations between Gut Microbiome and Musculoskeletal Pain. Eur Spine J. 2022;31(12):3603–15. pmid:36308543
  72. 72. Ustianowska K, Ustianowski Ł, Machaj F, Gorący A, Rosik J, Szostak B, et al. The Role of the Human Microbiome in the Pathogenesis of Pain. Int J Mol Sci. 2022;23(21):13267. pmid:36362056
  73. 73. Stanhope J, Breed MF, Weinstein P. Exposure to greenspaces could reduce the high global burden of pain. Environ Res. 2020;187:109641. pmid:32447087
  74. 74. Mistry J, Heneghan NR, Noblet T, Falla D, Rushton A. Diagnostic utility of patient history, clinical examination and screening tool data to identify neuropathic pain in low back-related leg pain: protocol for a systematic review. BMJ Open. 2019;9(11):e033187. pmid:31767596
  75. 75. Karran EL, Fryer CE, Middleton JW, Moseley GL. Exploring the Social Determinants of Health Outcomes for Adults with Low Back Pain or Spinal Cord Injury and Persistent Pain: A Mixed Methods Study. J Pain. 2022;23(9):1461–79. pmid:35429673
  76. 76. Karran EL, Grant AR, Moseley GL. Low back pain and the social determinants of health: a systematic review and narrative synthesis. Pain. 2020;161(11):2476–93. pmid:32910100
  77. 77. Firdous A, Gopalakrishnan V, Vo N, Sowa G. Challenges and opportunities for omics-based precision medicine in chronic low back pain. Eur Spine J. 2025;34(7):2522–36. pmid:36565345
  78. 78. Bjorland S, Moen A, Schistad E, Gjerstad J, Røe C. Genes associated with persistent lumbar radicular pain; a systematic review. BMC Musculoskelet Disord. 2016;17(1):500. pmid:27964712
  79. 79. Matheve T, Janssens L, Goossens N, Danneels L, Willems T, Van Oosterwijck J, et al. The Relationship Between Pain-Related Psychological Factors and Maximal Physical Performance in Low Back Pain: A Systematic Review and Meta-Analysis. J Pain. 2022;23(12):2036–51. pmid:36057387
  80. 80. Hodges PW, van den Hoorn W. A vision for the future of wearable sensors in spine care and its challenges: narrative review. J Spine Surg. 2022;8(1):103–16. pmid:35441093
  81. 81. Mistry J, Walton DM, Noblet T, Bowling B, Heneghan NR, Rushton AB. Diagnostic utility of diagnostic investigations to identify neuropathic pain in low back-related leg pain: protocol for a systematic review. BMJ Open. 2024;14(1):e078392. pmid:38286684
  82. 82. Brett J, Staniszewska S, Mockford C, Herron-Marx S, Hughes J, Tysall C, et al. A systematic review of the impact of patient and public involvement on service users, researchers and communities. Patient. 2014;7(4):387–95. pmid:25034612
  83. 83. Baines RL, Regan de Bere S. Optimizing patient and public involvement (PPI): Identifying its “essential” and “desirable” principles using a systematic review and modified Delphi methodology. Health Expect. 2018;21(1):327–35. pmid:28929554
  84. 84. Staniszewska S, Brett J, Simera I, Seers K, Mockford C, Goodlad S. GRIPP2 reporting checklists: tools to improve reporting of patient and public involvement in research. BMJ. 2017;358.
  85. 85. Kearney A, Williamson PR, Dodd S. A review of core outcome sets (COS) developed for different settings finds there is a subset of outcomes relevant for both research and routine care. J Clinical Epidemiol. 2024:111440.