Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Comparative analysis of patient-reported outcomes in joint arthroplasty surgeries

  • Ville Äärimaa ,

    Contributed equally to this work with: Ville Äärimaa, Karita Kohtala, Inari Laaksonen

    Roles Conceptualization, Methodology, Project administration, Supervision, Writing – original draft, Writing – review & editing

    Affiliations Department of Orthopedics and Traumatology, Turku University Hospital, Turku, Finland, University of Turku, Turku, Finland

  • Karita Kohtala ,

    Contributed equally to this work with: Ville Äärimaa, Karita Kohtala, Inari Laaksonen

    Roles Methodology, Writing – review & editing

    kshkoh@utu.fi

    Affiliation University of Turku, Turku, Finland

  • Keijo Mäkelä ,

    Roles Writing – review & editing

    ‡ KM, MK, AA and AR also contributed equally to this work.

    Affiliations Department of Orthopedics and Traumatology, Turku University Hospital, Turku, Finland, University of Turku, Turku, Finland

  • Mikko Karvonen ,

    Roles Writing – review & editing

    ‡ KM, MK, AA and AR also contributed equally to this work.

    Affiliations Department of Orthopedics and Traumatology, Turku University Hospital, Turku, Finland, University of Turku, Turku, Finland

  • Anssi Arimaa ,

    Roles Writing – review & editing

    ‡ KM, MK, AA and AR also contributed equally to this work.

    Affiliations Department of Orthopedics and Traumatology, Turku University Hospital, Turku, Finland, University of Turku, Turku, Finland

  • Anssi Ryösä ,

    Roles Writing – review & editing

    ‡ KM, MK, AA and AR also contributed equally to this work.

    Affiliations Department of Orthopedics and Traumatology, Turku University Hospital, Turku, Finland, University of Turku, Turku, Finland

  • Joel Kostensalo,

    Roles Data curation, Formal analysis, Visualization

    Affiliation Natural Resources Institute Finland, Joensuu, Finland

  • Fanny Kaivonen,

    Roles Conceptualization, Data curation, Investigation, Methodology

    Affiliation Department of Orthopedics and Traumatology, Turku University Hospital, Turku, Finland

  • Inari Laaksonen

    Contributed equally to this work with: Ville Äärimaa, Karita Kohtala, Inari Laaksonen

    Roles Conceptualization, Methodology, Project administration, Supervision, Writing – original draft, Writing – review & editing

    Affiliations Department of Orthopedics and Traumatology, Turku University Hospital, Turku, Finland, University of Turku, Turku, Finland

Abstract

Background

This study aims to report and analyze disease-specific patient-reported outcome measure (PROM) effect size (ES) variations, in patients undergoing major arthroplasty surgery.

Material and methods

All institution-based data of primary knee, hip, or shoulder arthroplasty patients at Turku University hospital (Finland) between January 2020 –December 2022 were collected, and treatment outcome assessed as a PROM difference between baseline and one-year follow-up. PROM ES were calculated for each patient and patient group separately, and patients with ES >0.5, were considered responders. Factors contributing to patient outcome and differences between patient groups were investigated using linear models and non-parametric methods.

Results

2580 patients were operated (complete follow-up data on 1828 patients). 1110 (61%) of the patients were female, and mean age was 69 years (SD 10). The mean ES across all patient groups was 2.64 (SD 1.29) and the biggest ES was observed in shoulder patients and the smallest in knee patients. Smaller ES was statistically significantly associated with higher preoperative PROM, higher ASA class, and old age. The percentage of responders was highest for shoulder patients (97.7%), followed by hip patients (96.8%), and lowest for knee patients (92.5%).

Conclusion

The observed ES for joint arthroplasty surgeries is high. However, there are significant disparities among patients with primary knee, hip, and shoulder joint arthroplasty surgery. These variations are mainly due to differences in preoperative PROM score and may be attributed to differences in patient selection. We recommend that prior to shared decision-making, preoperative scores are thoroughly reviewed with the patient, along with other patient specific factors that may influence the end result of the treatment.

Introduction

Over 500 million people worldwide suffer from osteoarthritis with varying and progressing symptoms of pain and locomotive impairment [1]. With the ageing population, rising demand for treatment, and the prevalence of obesity, there is a noticeable increase in the number of knee, hip, and shoulder joint arthroplasty surgeries performed [2].

Patient-reported outcome measures (PROMs) have emerged as important tools for evaluating patient health, monitoring changes, and assessing treatment outcomes, particularly in degenerative musculoskeletal diseases [3]. A clinically relevant change in a validated disease-specific PROM offers the most precise and responsive indication of a treatment effect detectable by patients themselves [4]. Despite the increasing number of studies reporting PROM outcomes in specific arthroplasty procedures, there remains limited understanding of how these outcomes compare across different anatomical regions of arthroplasty surgery. Moreover, the comparative patient-reported treatment effects and factors affecting it are obscure. Ultimately, it is in the interest of every patient and physician to comprehend the expected treatment outcome before shared decision making on elective operative treatment [3, 5].

In this study, our goal was to examine and report the outcomes and effect sizes (ES) of disease-specific PROMs in primary knee, hip, and shoulder arthroplasty surgery for osteoarthritis within a comprehensive institutional PROM registry. Our specific objective was to examine differences, variation, and reasons for variation in PROM ES in these patient groups.

Materials and methods

An institution-based anatomically divided systematic PROM registry for all patients undergoing joint arthroplasty surgery was established in 2019 at Turku University Hospital (Finland) with a catchment area of ca. 500,000 people. The registry was comprised of anatomic subregistries with disease-specific PROM outcome instruments as presented in Table 1. The scores were gathered preoperatively at baseline and at one year postoperatively using an electronic application (Omavointi, BCB Medical, Turku, Finland). For this study, all PROM data were scaled to range from 0 to 100 and harmonized in a way that a positive change in the score represented an improvement in the perceived quality of life. Patient history data were gathered regarding age, sex, American Society of Anesthesiology (ASA) score, and body mass index (BMI). Surgery related data on the main diagnosis code (according to the International Classification of Diseases (ICD) 10 classification), and the main operation (according to the Classification of Surgical Procedures Version 1.14 by the Nordic Medico-Statistical Committee (NOMESCO)) were also gathered (see S1 Table). The data were accessed 2.4.2024.

thumbnail
Table 1. Patient groups and disease-specific patient-reported outcome measures (PROM).

https://doi.org/10.1371/journal.pone.0314818.t001

Patients with end stage osteoarthritis and primary joint arthroplasty surgery for knee, hip, or shoulder were included using operative coding and International Classification of Diseases (ICD-10) codes. All patient data between January 1st, 2020 and December 31st, 2022 were collected, and the treatment outcome assessed as a difference between baseline and one-year follow-up PROM values. To compare results between groups we adopted a unified measure of outcome which we refer hereafter as ES. This measure can be understood as a standardized outcome measure. The effect size corresponding to observation i is given here by where PROM12 months,i is the PROM response 12 months after the operation for the patient i, PROMpreop,i is the preoperative PROM-score of the patient, and SD(PROMpreop) is the standard deviation of all the PROM-scores in the registry. The patients with ESi>0.5 are referred to as responders, i.e., individuals for whom the surgical treatment has led to an improvement over the distribution-based estimate of minimally clinically important difference (MCID) [9]. Statistical differences in the fractions of positive responders between anatomical groups were tested using Fisher’s exact test. Statistical significance of the differences in ES and unnormalized change in PROM between anatomical groups were tested using Kruskal’s test.

The differences between groups were analyzed using a linear model, in which ASA class, anatomical group, age, BMI, and preoperative PROM were covariates. Based on graphical checks, the effect of age was modeled as constant up to 65 and linearly for older patients. Six BMI outliers suspected of being erroneous (BMI = 53–2700, with the distribution being continuous up to 47) were removed from the linear model fit. An alternative model with sex included as covariate was explored, but the association of sex with effect size was not statistically significant and based on the Akaike Information Criterion (AIC) a model without it was preferred. As a sensitivity analysis we also fitted an identical model with the unnormalized change in PROM as the response variable. Factors contributing to a patient being a responder (ES>0.5) were tested using an analogous generalized linear model with a binary response variable with a logit link function. The regression coefficients for the latter two models are given in the Supplementary material (S2 and S3 Tables).

All statistical analyses were carried out using the statistical software R [10].

Permission for this study was obtained from the Internal Review Board of Turku University Hospital (TurkuCRC). The study has been performed in accordance with the ethical standards of 1964 Declaration of Helsinki. Our research permit number is T283/2020 and institutional approval was granted 18.2.2021. Ethical approval was not required in our institute as this study is registry based retrospective analysis and the study had no effect on patients’ treatment. All patient data was deidentified during analyses.

Results

The demographic variables are presented in Table 2. In total, 2580 patients were operated (complete follow-up data on 1828 patients). 1110 (61%) of the patients were female, and mean age was 69 years (SD 10). The ES results are presented in Figs 1 and 2 and Tables 3 and 4. The mean ES across all patient groups was 2.64 (SD 1.29) and the biggest mean ES was observed in shoulder patients (4.17, 95% C.I. [3.85, 4.49]), followed by hip patients (2.76, 95% C.I. [2.67, 2.84]), and the smallest in knee patients (2.42, 95% C.I. [2.34, 2.50]). The ES variation was highest among shoulder patients (SD 1.52), followed by knee patients (SD 1.27), and smallest for hip patients (SD 1.15). The ES was smaller for patients with higher preoperative PROM, with average ES being -0.051 (95% C.I. [-0.050, -0.052], p<0.0001) smaller per each additional preoperative PROM percentage point. Patients with higher ASA classes and BMI had on average lower ES, as did older patients. Patients’ sex did not statistically significantly correlate with the ES. Hip and knee patients had lower ES than shoulder patients with the same preoperative PROM percentage scores.

thumbnail
Fig 1. PROM measurements for all patients included in the study before the operation (baseline) and at the 1-year follow-up mark.

https://doi.org/10.1371/journal.pone.0314818.g001

thumbnail
Fig 2. Surgical outcomes measured as effect sizes for the patient groups included in the study.

Left plot for ASA = 2.

https://doi.org/10.1371/journal.pone.0314818.g002

thumbnail
Table 3. PROM scores (SD), mean changes and effect sizes (ES), their 95% confidence intervals and number of responders (%).

MCID from this data set: 6.8 (WOOS), 8.2 (OHS), and 7.4 (OKS).

https://doi.org/10.1371/journal.pone.0314818.t003

thumbnail
Table 4. Regression coefficients for a linear model with effect size (ES) as the response variable.

https://doi.org/10.1371/journal.pone.0314818.t004

The ES = 0.5 limits for responders based on this dataset were 6.8 (WOOS), 8.2 (OHS), and 7.4 (OKS). The percentage of responders was highest for shoulder patients (97.7%), followed by hip patients (96.8%), and lowest for knee patients (92.5%). There was a statistically significant difference between the number of responders for hip and knee surgeries (p<0.0001). However, when controlling for the preoperative PROM (%), the in between differences were not statistically significant (see S2 Table).

The differences in mean ES and unnormalized change in PROM were both highly statistically significant (p<0.0001) between the anatomical groups. However, all ended up with high average one-year PROMs: 83 (WOOS), 88 (OHS), and 82 (OKS).

Discussion

The primary finding of this study was a statistically significant difference in treatment ES among patients undergoing primary joint arthroplasty surgery for osteoarthritis in knee, hip, or shoulder. Accordingly, the number of responders and non-responders between patient groups was also significantly different between knee and hip patients. The lack of statistical significance when comparing to shoulder patients relates to the much smaller sample size, with 84 responders and only two non-responders. The mean ES were high and well above the estimated clinically significant levels in all joint arthroplasty groups.

Hip and knee arthroplasties are renowned for their high patient satisfaction [3], and although shoulder arthroplasties are less frequent, previous studies have reported similarly high satisfaction [11, 12]. A widely accepted threshold for a large ES in clinical medicine is 0.8 [13]. Within our study, we observed that the average ES among shoulder and hip patients was notably higher when compared to knee patients. Furthermore, the ES negatively correlated with ASA class, age, and especially with the preoperative PROM-scores. Interestingly, the one-year follow-up scores were essentially similar in all groups and the differences in ES were mostly due to variation in the baseline score. Accordingly, there seems to be a mean attainable PROM end-result limit (around 85%) for joint arthroplasty surgery. In order for a patient to subjectively benefit from the operation the baseline score has to be appropriately low, and our results allude to looser indications of knee arthroplasty surgery in terms of patient symptoms. It is noteworthy that all operations were performed due to end stage i.e. bone to bone osteoarthritis, and the radiographic findings do not always correlate with the clinical presentation [1416].

Osteoarthritis represents a significant global disease burden [17]. Therefore, transparent reporting of health care outcomes may be regarded as a responsibility of all care providers [18]. PROMs can also be viewed as quality indicators for healthcare, given their ability to capture also patient-perceived adverse events and care-related complications. Nonetheless, one must bear in mind that changes in subjective health after a surgical intervention comprise of both specific and non-specific effects [19] and sometimes the latter might be of greater significance. Therefore, the ES in our study do not represent the differences in the effect of surgical intervention per se, but rather differences in the overall perceptions of our treated patients. Psychological factors are reported to play an important role in treatment outcome [19, 20].

We employed three disease-specific PROMs, selected by our specialists based on international joint arthroplasty collaboration [21, 22]. However, despite a shared denominator of primary osteoarthritis the PROMs are likely not fully comparable due to different psychometric properties [23]. The concepts of responder and MCID are also theoretical, in addition to being context and patient group dependent [24]. Despite the best efforts of, e.g., International Consortium of Health Outcome Measures (ICHOM) and consensus-based standards for the selection of health measurement instruments (COSMIN) there is a lack of uniform inclusive commitment [4, 25]. Furthermore, PROM validation should proposedly be an iterative cross-cultural process instead of a static designation that a PROM is valid [26]. Only 2 of our PROM questionnaires have been properly cross culturally validated in Finnish thus far, and the third (WOOS) is used as an institutionally translated version, yet without published data on psychometrics.

We acknowledge that in addition to potential criticism related to PROM selection, there are other weaknesses in our study. First, despite the analyses being controlled for ASA class and age [27], we could not control for all potential confounders, such as detailed radiographic findings, operative technique, and comorbidities. Second, PROMs are known to suffer from floor and ceiling effects [28]. Although, a marked ceiling effect was detected in the hip patients in our study (25%), this mostly had the effect of suppressing the high ES to some degree. Third, we lacked a fully comparable generic PROM or a uniform global score for all patients. Fourth, large divergence in same PROM MCID estimates between different patient populations and also statistical methods has been reported [29], and our responder analysis was based on distribution only.

As a strength, this was a systematic and comprehensive tertiary hospital-based registry overview with a large number of consecutive patients. We included fully completed data only, and given the number of patients, it is likely that the detected differences represent genuine variations between patient groups rather than individual fluctuations. One-year follow-up is also likely sufficient in capturing the biggest treatment effect [19].

After analyzing the results, we would like to emphasize the following points. Careful patient selection is essential for all patients undergoing artificial joint replacement, particularly those receiving knee arthroplasty. Our results suggest that knee arthroplasty outcomes are comparatively weaker than those in other major joint arthroplasties, potentially due to more lenient criteria for surgery in some cases. Preoperative PROM scores should be included in the decision making process. It is recommended to review both the patient’s initial condition and the expected outcome prior to deciding on surgery, allowing any unrealistic expectations to be addressed. Additionally, patients should be informed that factors such as age, comorbidities and excess weight significantly impact the expected outcomes, regardless of the technical success of the procedure.

Conclusion

The observed PROM ES for major joint arthroplasty surgeries is high. However, there are significant disparities among patients with primary knee, hip, and shoulder joint arthroplasty surgery. Based on our observation the shoulder patients experience the largest treatment effect, and the knee arthroplasty patients the lowest. This variation underscores the importance of patient selection criteria, emphasizing also the need for thorough consideration of preoperative PROMs in the shared decision making regarding joint arthroplasty surgery. The expected results should be openly communicated to all stakeholders and further research targeted in improving PROM implementation, and treatment outcomes throughout orthopedics.

Supporting information

S1 Table. Most common diagnoses and operations in patient groups.

Note that both numbers refer to the entire registry and thus the operation percentages do not refer only to the diagnosis on the same row of the table.

https://doi.org/10.1371/journal.pone.0314818.s003

(DOCX)

S2 Table. Regression coefficients for a generalized linear model with responder-status as a binary response variable.

https://doi.org/10.1371/journal.pone.0314818.s004

(DOCX)

S3 Table. Regression coefficients for a linear model with change in PROM (%) as the response variable.

https://doi.org/10.1371/journal.pone.0314818.s005

(DOCX)

Acknowledgments

The authors want to give thanks to Maria Suuripää (BCB Medical, Turku, Finland) for helping us with the data search.

References

  1. 1. Steinmetz JD, Culbreth GT, Haile LM, Rafferty Q, Lo J, Fukutaki KG, et al. Global, regional, and national burden of osteoarthritis, 1990–2020 and projections to 2050: a systematic analysis for the Global Burden of Disease Study 2021. Lancet Rheumatol. 2023;5(9).
  2. 2. Sloan M, Premkumar A, Sheth NP. Projected volume of primary total joint arthroplasty in the u.s., 2014 to 2030. Journal of Bone and Joint Surgery—American Volume. 2018;100(17). pmid:30180053
  3. 3. Wilson I, Bohm E, Lübbeke A, Lyman S, Overgaard S, Rolfson O, et al. Orthopaedic registries with patient-reported outcome measures. EFORT Open Rev. 2019;4(6). pmid:31210973
  4. 4. Harris K, Dawson J, Gibbons E, Lim C, Beard D, Fitzpatrick R, et al. Systematic review of measurement properties of patient-reported outcome measures used in patients undergoing hip and knee arthroplasty. Patient Relat Outcome Meas. 2016;Volume 7. pmid:27524925
  5. 5. Dunbar MJ, Richardson G, Robertsson O. I can’t get no satisfaction after my total knee replacement: rhymes and reasons. Vols. 95-B, The bone & joint journal. 2013. pmid:24187375
  6. 6. Nyring MRK, Olsen BS, Amundsen A, Rasmussen JV. Minimal Clinically Important Differences (MCID) for the Western Ontario Osteoarthritis of the Shoulder Index (WOOS) and the Oxford Shoulder Score (OSS). Patient Relat Outcome Meas. 2021;Volume 12. pmid:34588833
  7. 7. Yeo MGH, Goh GS, Chen JY, Lo NN, Yeo SJ, Liow MHL. Are Oxford Hip Score and Western Ontario and McMaster Universities Osteoarthritis Index Useful Predictors of Clinical Meaningful Improvement and Satisfaction After Total Hip Arthroplasty? Journal of Arthroplasty. 2020;35(9). pmid:32416955
  8. 8. Maredupaka S, Meshram P, Chatte M, Kim WH, Kim TK. Minimal clinically important difference of commonly used patient-reported outcome measures in total knee arthroplasty: review of terminologies, methods and proposed values. Vol. 32, Knee Surgery and Related Research. 2020.
  9. 9. Jaeschke R, Singer J, Guyatt GH. Measurement of health status. Ascertaining the minimal clinically important difference. Control Clin Trials. 1989;10(4).
  10. 10. R Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. 2023 [cited 2023 Apr 2]; Available from: https://www.R-project.org.
  11. 11. Rasmussen J, Jakobsen J, Olsen BS, Brorson S. Translation and validation of the Western Ontario Osteoarthritis of the Shoulder (WOOS) index &ndash; the Danish version. Patient Relat Outcome Meas. 2013;
  12. 12. Rasmussen J V., Hole R, Metlie T, Brorson S, Äärimaa V, Demir Y, et al. Anatomical total shoulder arthroplasty used for glenohumeral osteoarthritis has higher survival rates than hemiarthroplasty: a Nordic registry-based study. Osteoarthritis Cartilage. 2018;26(5).
  13. 13. Cohen J. A power primer. Psychol Bull. 1992;112(1).
  14. 14. Abdelaziz H, Balde OM, Citak M, Gehrke T, Magan A, Haasper C. Kellgren–Lawrence scoring system underestimates cartilage damage when indicating TKA: preoperative radiograph versus intraoperative photograph. Arch Orthop Trauma Surg. 2019;139(9).
  15. 15. Bedson J, Croft PR. The discordance between clinical and radiographic knee osteoarthritis: A systematic search and summary of the literature. BMC Musculoskelet Disord. 2008;9.
  16. 16. Chang CB, Han I, Kim SJ, Seong SC, Kim TK. Association between radiological findings and symptoms at the patellofemoral joint in advanced knee osteoarthritis. Journal of Bone and Joint Surgery—Series B. 2007;89(10). pmid:17957071
  17. 17. Vos T, Abajobir AA, Abbafati C, Abbas KM, Abate KH, Abd-Allah F, et al. Global, regional, and national incidence, prevalence, and years lived with disability for 328 diseases and injuries for 195 countries, 1990–2016: A systematic analysis for the Global Burden of Disease Study 2016. The Lancet. 2017;390(10100). pmid:28919117
  18. 18. World Health Organization (WHO). WHO web page [Internet]. [cited 2023 May 25]. Available from: https://www.who.int.
  19. 19. Galea VP, Rojanasopondist P, Ingelsrud LH, Troelsen A, Rubash HE, Bragdon C, et al. Longitudinal changes in patient-reported outcome measures following total hip arthroplasty and predictors of deterioration during follow-up: A seven-year prospective international multicentre study. Bone and Joint Journal. 2019;101 B(7). pmid:31256661
  20. 20. Berliner JL, Brodke DJ, Chan V, SooHoo NF, Bozic KJ. Can Preoperative Patient-reported Outcome Measures Be Used to Predict Meaningful Improvement in Function After TKA? Clin Orthop Relat Res. 2017;475(1). pmid:26956248
  21. 21. Moeini S, V. Rasmussen J, Wirenfelt Klausen T, Brorson S. Rasch analysis of the Western Ontario Osteoarthritis of the Shoulder index &ndash; the Danish version. Patient Relat Outcome Meas. 2016;Volume 7.
  22. 22. Rolfson O, Eresian Chenok K, Bohm E, Lübbeke A, Denissen G, Dunn J, et al. Patient-reported outcome measures in arthroplasty registries: Report of the Patient-Reported Outcome Measures Working Group of the International Society of Arthroplasty Registries: Part I. Overview and rationale for patient-reported outcome measures. Acta Orthop. 2016;87.
  23. 23. Gagnier JJ, Johnston BC. Poor quality patient reported outcome measures bias effect estimates in orthopaedic randomized studies. J Clin Epidemiol. 2019;116. pmid:31374331
  24. 24. Angst F, Aeschlimann A, Angst J. The minimal clinically important difference raised the significance of outcome effects above the statistical level, with methodological implications for future studies. J Clin Epidemiol. 2017;82. pmid:27986610
  25. 25. International Consortium for Health Outcomes Measurement (ICHOM). ICHOM Web Site [Internet]. [cited 2023 May 7]. Available from: https://www.ichom.org/.
  26. 26. Rolfson O, Bohm E, Franklin P, Lyman S, Denissen G, Dawson J, et al. Patient-reported outcome measures in arthroplasty registries: Report of the Patient-Reported Outcome Measures Working Group of the International Society of Arthroplasty RegistriesPart II. Recommendations for selection, administration, and analysis. Acta Orthop. 2016;87.
  27. 27. Ingelsrud LH, Wilkinson JM, Overgaard S, Rolfson O, Hallstrom B, Navarro RA, et al. How do Patient-reported Outcome Scores in International Hip and Knee Arthroplasty Registries Compare? Clin Orthop Relat Res. 2022 Oct;480(10):1884–96. pmid:35901444
  28. 28. Wamper KE, Sierevelt IN, Poolman RW, Bhandari M, Haverkamp D. The Harris hip score: Do ceiling effects limit its usefulness in orthopedics? A systematic review. Acta Orthop. 2010;81(6).
  29. 29. Frahm Olsen M, Bjerre E, Hansen MD, Tendal B, Hilden J, Hróbjartsson A. Minimum clinically important differences in chronic pain vary considerably by baseline pain and methodological factors: systematic review of empirical studies. Vol. 101, Journal of Clinical Epidemiology. 2018.