Skip to main content
Advertisement
  • Loading metrics

Association between surgeon training grade and the risk of revision following unicompartmental knee replacement: An analysis of National Joint Registry data

  • Timothy J. Fowler ,

    Roles Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Validation, Visualization, Writing – original draft, Writing – review & editing

    t.j.fowler@bristol.ac.uk

    Affiliation Musculoskeletal Research Unit, Translational Health Sciences, Bristol Medical School, Southmead Hospital, Bristol, United Kingdom

  • Nicholas R. Howells,

    Roles Writing – review & editing

    Affiliation Musculoskeletal Research Unit, Translational Health Sciences, Bristol Medical School, Southmead Hospital, Bristol, United Kingdom

  • Ashley W. Blom,

    Roles Conceptualization, Funding acquisition, Methodology, Resources, Supervision, Visualization, Writing – review & editing

    Affiliation Faculty of Health, The University of Sheffield, Sheffield, United Kingdom

  • Adrian Sayers ,

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Resources, Software, Supervision, Visualization, Writing – review & editing

    ‡ These authors are joint senior authors on this work.

    Affiliation Musculoskeletal Research Unit, Translational Health Sciences, Bristol Medical School, Southmead Hospital, Bristol, United Kingdom

  • Michael R. Whitehouse

    Roles Conceptualization, Funding acquisition, Investigation, Methodology, Project administration, Resources, Supervision, Validation, Visualization, Writing – review & editing

    ‡ These authors are joint senior authors on this work.

    Affiliations Musculoskeletal Research Unit, Translational Health Sciences, Bristol Medical School, Southmead Hospital, Bristol, United Kingdom, National Institute for Health Research Bristol Biomedical Research Centre, University Hospitals Bristol NHS Foundation Trust, University of Bristol, Bristol, United Kingdom

Abstract

Background

Unicompartmental knee replacements (UKRs) are performed by surgeons at various stages in training with varying levels of supervision, but we do not know if this is a safe practice with comparable outcomes to consultant-performed UKR. The aim of this study was to use registry data for England and Wales to investigate the association between surgeon grade (consultant, or trainee), the senior supervision of trainees (supervised by a scrubbed consultant, or not), and the risk of revision surgery following UKR.

Methods and findings

We conducted an observational study using prospectively collected data from the National Joint Registry for England and Wales (NJR). We included adult patients who underwent primary UKR for osteoarthritis (n = 106,206), recorded in the NJR between 2003 and 2019. Exposures were the grade of the operating surgeon (consultant, or trainee) and whether or not trainees were directly supervised by a consultant during the procedure (referred to as “supervised by a scrubbed consultant”). The primary outcome was all-cause revision surgery. The secondary outcome was the number of procedures revised for the following specific indications: aseptic loosening/lysis, infection, progression of osteoarthritis, unexplained pain, and instability. Flexible parametric survival models were adjusted for patient, operation, and healthcare setting factors.

We included 106,206 UKRs in 91,626 patients, of which 4,382 (4.1%) procedures were performed by a trainee. The unadjusted cumulative probability of failure at 15 years was 17.13% (95% CI [16.44, 17.85]) for consultants, 16.42% (95% CI [14.09, 19.08]) for trainees overall, 15.98% (95% CI [13.36, 19.07]) for trainees supervised by a scrubbed consultant, and 17.32% (95% CI [13.24, 22.50]) for trainees not supervised by a scrubbed consultant. There was no association between surgeon grade and all-cause revision in either crude or adjusted models (adjusted HR = 1.01, 95% CI [0.90, 1.13]; p = 0.88). Trainees achieved comparable all-cause survival to consultants, regardless of the level of scrubbed consultant supervision (supervised: adjusted HR = 0.99, 95% CI [0.87, 1.14]; p = 0.94; unsupervised: adjusted HR = 1.03, 95% CI [0.87, 1.22]; p = 0.74).

Limitations of this study relate to its observational design and include: the potential for nonrandom allocation of cases by consultants to trainees; residual confounding; and the use of the binary variable “surgeon grade,” which does not capture variations in the level of experience between trainees.

Conclusions

This nationwide study of UKRs with over 16 years’ follow up demonstrates that trainees within the current training system in England and Wales achieve comparable all-cause implant survival to consultants. These findings support the current methods by which surgeons in England and Wales are trained to perform UKR.

Author summary

Why was this study done?

  • Unicompartmental knee replacement (UKR) is an alternative to total knee replacement (TKR) in patients with symptomatic osteoarthritis. The National Institute for Health and Care Excellence (NICE) recommends that patients with isolated medial compartment OA should be offered a choice of UKR or TKR.
  • Proposed advantages of UKR over TKR include superior functional outcomes, reduced length of stay, fewer medical complications, greater cost-effectiveness, and lower mortality. However, UKR revision rates are considerably higher than TKR revision rates.
  • The British Association for Surgery of the Knee (BASK) and European Knee Society (EKS) have recommended that knee surgeons should have exposure to and training in UKR.
  • UKRs are performed by surgeons at different stages in training with varying levels of supervision. However, we do not know if UKRs performed by trainees last as long as those performed by fully trained consultant surgeons.

What did the researchers do and find?

  • We analysed data from the National Joint Registry for England and Wales (NJR), which is the largest joint replacement registry in the world. We included over 100,000 primary UKRs performed between 2003 and 2019.
  • We were interested in whether the procedure was performed by a fully trained consultant surgeon, or a trainee. We were also interested in whether or not trainees were directly supervised by a consultant during the operation. The primary outcome was all-cause revision surgery. We used a specialist statistical method called “flexible parametric survival modelling” to analyse the data.
  • We found no association between surgeon grade and all-cause revision. Trainees achieved comparable outcomes to consultants, regardless of the level of consultant supervision.

What do these findings mean?

  • These data suggest that within the current training system in the England and Wales, UKRs performed by trainee surgeons last as long as those performed by fully trained consultant surgeons.
  • The findings of this study are reassuring and support the current methods by which surgeons are trained to perform UKR in England and Wales.
  • Limitations of this study relate to its observational design. A notable limitation is that we used a binary exposure (consultant, or trainee), which does not capture variations in the level of experience between trainees.

Introduction

Unicompartmental knee replacement (UKR) is an alternative to total knee replacement (TKR) in patients with symptomatic osteoarthritis (OA) isolated to a single compartment [1]. The National Institute for Health and Care Excellence (NICE) recommends that patients in England and Wales with isolated medial compartment OA should be offered a choice of UKR or TKR [2]. The British Association for Surgery of the Knee (BASK) and European Knee Society (EKS) have recommended that knee surgeons should have exposure to and training in UKR [3]. Proposed advantages of UKR over TKR include superior functional outcomes, reduced length of stay, fewer medical complications, greater cost-effectiveness, and lower mortality [4,5]. However, UKR revision rates are considerably higher than primary TKR revision rates [6,7]. Previous studies have suggested that UKRs performed by low-volume surgeons are associated with an increased risk of revision compared to UKRs performed by experienced higher volume surgeons [8,9]. This raises the question of whether or not it is safe for these procedures to be performed by trainees.

The survival of a joint replacement, defined as the absence of revision surgery over time, is the principal metric used for comparing the longevity of implant components and is a commonly used measure of surgical performance. Our current understanding of the survival of UKRs in the context of surgical training is based on a small number of observational studies, which are discussed in our recent systematic review on this subject [10]. Bottomley and colleagues conducted an observational study of 1,084 UKRs, of which 673 (62.1%) were performed by trainees. They reported no significant difference in implant survival between the groups, with 9-year cumulative survival estimates of 93.9% and 93.0% for consultants and trainees, respectively [11]. A New Zealand Joint Registry (NZJR) study of 8,854 UKRs, of which 304 (3.4%) were performed by trainees, reported no difference in the revision rates of supervised senior trainees compared to attending surgeons [12]. This study did not report survival estimates and the overall number of trainee cases in the cohort was insufficient to facilitate meaningful comparison between the supervised and unsupervised trainee groups. The survival of UKRs according to surgeon grade and supervision remains poorly understood. It is not clear if current training practices are safe, or whether trainees achieve comparable outcomes to consultants.

The aim of this research was to use National Joint Registry (NJR) data from England and Wales to investigate the association between surgeon grade, the supervision of trainees, and the risk of revision following UKR.

Methods

Patients and data sources

We performed an observational study using prospectively collected data recorded in the NJR. The initial NJR data set was 1,502,564 linked knee procedures performed between 1 April 2003 and 31 December 2019. We included primary UKRs in adult patients (aged ≥18 years) performed for an indication of OA only. Patellofemoral joint replacements were excluded. Cases were included if the operating surgeon grade was recorded as any of the following: Foundation Year 1 (F1) to Specialty Trainee Year 2 (ST2); ST3-ST8; fellow; or consultant. The process of mapping grade classifications to account for variations in terminology used in different versions of the NJR Minimum Data Set (MDS) form is outlined in S1 Appendix.

Analysis plan

The study protocol was designed prior to commencing the study, including defining the study population, exposures, outcomes of interest, and statistical methods. The main analyses were planned prior to commencing the study and these are documented throughout this methods section. However, data-driven changes to the analysis took place and these are summarised in detail in S2 Appendix, which justifies the model selection and construction.

Data processing

The base data set used in the current study is based on the same cut of NJR data that is used in the 17th Annual Report [13]. NJR data are annually linked to other healthcare system data sets, including Civil Registration Authority data, using unique patient identifiers. This linkage, which was carried out by the NJR prior to us obtaining the data set, is approved by the Health Research Authority under Section 251 of the NHS act 2006 [13]. The steps taken in data processing and are summarised in the study flow diagram in Fig 1 and illustrated in greater detail in S1 Fig. All exclusions are consistent with the exclusion criteria of this study and the stage at which these occurred is clearly documented.

Exposures

The primary exposure (exposure A) was surgeon grade. This is a binary variable, which was categorised according to the grade of the operating surgeon: (1) consultant; or (2) trainee. Procedures performed by surgeons of the following grades were categorised under the variable “trainee”: F1-ST2; ST3-ST8; and fellow. Consultants have completed their formal training in orthopaedic surgery and been appointed to a senior position in which they can practice independently and supervise trainees.

F1-ST2 represents the first 4 years of postgraduate training after graduating from medical school (F1, F2, ST1, and ST2). ST2 doctors who have completed the Membership of the Royal College of Surgeons (MRCS) examination are eligible to apply to Specialty Training. Specialty Training in Trauma and Orthopaedic Surgery is typically a six-year programme (ST3-8). ST3-ST8 trainees are referred to as “specialty trainees,” or “registrars.” Progression through training levels is dependent on the successful completion of training requirements and competencies. Trainees who have completed ST6 are eligible to sit the examination for Fellowship of the Royal College of Surgeons (FRCS), which is mandatory for Certification of Completion of Training (CCT). Trainees subsequently progress to post-CCT fellowship training prior to applying for a consultant position. The term “consultant” is synonymous with “attending”, and the term “registrar” is synonymous with “resident” in many healthcare settings including the United States of America. A schematic summary of the stages of surgical training in the United Kingdom is included in S3 Appendix [14].

The secondary exposure was whether or not trainees were directly supervised by a consultant during the procedure (exposure B). We refer to direct consultant supervision as “supervised by a scrubbed consultant” throughout this paper. Therefore, trainee cases were subcategorised as follows: (1) trainee supervised by a scrubbed consultant; or (2) trainee not supervised by a scrubbed consultant. Cases were categorised as “supervised by a scrubbed consultant” if the first assistant was recorded as a consultant.

Given the variability in the level of experience between individual trainees, we performed a sensitivity analysis by recategorising cases according to the specific training grade of the operating surgeon (exposure C: consultant; F1-ST2; ST3-ST8; or fellow). Cases were further subcategorised according to the level of scrubbed consultant supervision.

Outcomes of interest

The primary outcome was all-cause revision, which was defined as any procedure to add, remove, or modify one or more components of an implant construct for any reason [13]. The secondary outcome measure was the number of procedures revised for the following specific indications, which are listed as the 5 most common indications for knee replacement revision by the NJR: aseptic loosening/lysis, infection, progression of OA, unexplained pain, and instability [13].

Statistical analysis

Frequencies and percentages were used to describe categorical variables. The mean, standard deviation (SD), and interquartile range (IQR) were used to describe continuous variables. Unrevised cases were either administratively censored on 31 December 2019, or the date of death, depending on which was earliest. Unadjusted estimates of net implant failure were calculated using the Kaplan–Meier (KM) method.

We performed a comprehensive exploratory analysis using Cox regression. A combination of graphical plots, Schoenfeld residuals, and likelihood ratio testing (comparing proportional and non-proportional hazards models) were used to assess the proportional hazards (PH) assumption at each level of adjustment and to assess the time-dependent effects of each confounding variable [15]. Adjusted analyses did not satisfy the PH assumption, which was due to the time-dependent effects of multiple confounding variables included in the models (age, sex, IMD, approach, fixation, bearing mobility, year of operation, and funding source). Surgeon grade (exposures A) did not demonstrate a time-dependent effect.

To account for non-proportionality, we used flexible parametric survival modelling (FPM) [15,16], which has been used in previous NJR analyses [1720]. This method uses restricted cubic spline functions to model the baseline hazard and account for the time-dependent effects of specified variables. Graphical assessment, AIC, BIC, and likelihood ratio testing were used to optimise the fit and complexity of the final model [15]. This process of model selection, construction, and justification is described in greater detail in S2 Appendix.

Analyses were adjusted for categorical confounding variables in the following manner. Model 1 was unadjusted. Model 2 was adjusted for patient-level factors (age, sex, American Society of Anaesthesiologists (ASA) grade, and index of multiple deprivation (IMD) decile). Model 3 was further adjusted for operation-level factors (anaesthetic, approach, fixation, and bearing mobility). Model 4 was further adjusted for healthcare setting factors (funding source and year of operation). In each case, the baseline category was the most frequently occurring (as detailed in S4 Appendix).

Body mass index (BMI) is missing in a large proportion of NJR records. It has been reported that approximately 40% of patients did not have a BMI recorded in the NJR in 2009, compared to approximately 18% in 2016 [21]. Due to the significant proportion of records with missing values, BMI was not included as a confounding variable in the analyses. This is consistent with the approach used in previous NJR studies and this decision was made prior to initiating the study based on the known pattern of missing data [22,23].

We performed separate analyses for all-cause revision and the 5 specific indications for revision (aseptic loosening/lysis, infection, progression of OA, unexplained pain, and instability), which were examined as separate survival endpoints. Separate FPM analyses were performed for each exposure and analyses were incrementally adjusted for confounding variables.

In response to the peer review process, we conducted an additional sensitivity analysis to explore the lack of independence between observations in patients who underwent bilateral procedures (on the same day, or on different days). We examined the primary outcome measure using a fully adjusted (Model 4) multilevel mixed effects parametric survival model to introduce a frailty term and account for time-dependent effects [24]. All analyses were performed using Stata (Version SE 15.1; StataCorp LP, USA). This study is reported as per the Reporting of studies Conducted using Observational Routinely collected health Data (RECORD) Statement (S1 RECORD Checklist) [25].

Patient and public involvement

Patient representatives sit on the committee structure of the NJR. The research priorities of the NJR are identified by this committee and approved by the patient representatives. Patients were not involved in setting the research question or the outcome measures nor were they involved in the design, implementation, or interpretation of the results of this study. We are unable to disseminate the results of this study directly to study participants due to the anonymous nature of the data. We plan to disseminate our findings through the NJR communications team to relevant individuals who determine the provision of joint replacement and to the general population through local and national press.

Ethics statement

The NJR supports public health surveillance and wider clinical decision-making and holds data that are anonymous to the researchers who use it. NHS Health Research Authority guidance dictates that the secondary use of such data for research does not require approval by a research ethics committee. Therefore, separate research ethics committee approval was not required for this study. Patients are consented for inclusion in the NJR according to standard practice, with permission under the Health Service (Control of Patient Information) Regulations, otherwise referred to as Section 251 support [26].

Results

Descriptive analysis

We included 106,206 UKR procedures in 91,626 patients, of which 4,382 (4.1%) were performed by trainees. Trainees were supervised by a scrubbed consultant in 66.1% (n = 2,898) of trainee-performed cases (Table 1 and Fig 1).

thumbnail
Table 1. Descriptive statistics for patient, operation, and healthcare setting factors for included UKRs.

https://doi.org/10.1371/journal.pmed.1004445.t001

The mean age of patients operated on by trainees was 1.7 years older than patients operated on by consultants (65.5 versus 63.8 years). Trainees operated on a lower proportion of ASA I patients (15.7% versus 21.3%) and a higher proportion of ASA ≥III patients (13.4% versus 8.4%). A higher proportion of trainee procedures utilised uncemented implants (23.6% versus 19.9%) and a mobile bearing (72.0% versus 60.9%) (Table 1).

The maximum duration of follow up was 16.8 years. Mean follow up was 6.5 years (SD 4.3; IQR 2.6 to 10.1 years) for trainee UKRs and 5.6 years (SD 4.00; IQR 2.2 to 8.6 years) for consultant UKRs. A total of 6,920 UKRs were revised at a mean of 4.3 years (SD 3.5; IQR 1.5 to 6.5 years).

Missing data

Details of missing data are documented in S2 Fig. Fewer than 5% of cases (n = 5,120) had missing data. Complete-case analysis was used in all analyses and records with missing data in any confounding variable field used in subsequent statistical models were excluded from the relevant model. This is based on the assumption that the pattern of missingness of NJR data is independent of the primary exposure and the outcome. Considering the large data sets, the small proportion of incomplete cases and the assumed pattern of missingness, any potential improvement in efficiency from using multiple imputation compared to complete-case analysis is likely to be negligible [22,27].

All-cause revision

The unadjusted cumulative probability of failure at 15 years was 17.13% (95% CI [16.44, 17.85]) for consultants, 16.42% (95% CI [14.09, 19.08]) for trainees overall, 15.98% (95% CI [13.36, 19.07]) for trainees supervised by a scrubbed consultant, and 17.32% (95% CI [13.24, 22.50]) for trainees not supervised by a scrubbed consultant. Failure estimates (one minus survival) for all intervals of follow up are presented in Table 2, and graphically displayed as a one minus KM plot in Fig 2.

thumbnail
Fig 2.

Kaplan–Meier plot (one minus survival) demonstrating the cumulative probability of UKR failure (i.e., all-cause revision) according to surgeon grade (exposure A).

https://doi.org/10.1371/journal.pmed.1004445.g002

thumbnail
Table 2. The unadjusted cumulative probability of all-cause failure of UKRs according to surgeon grade (exposure A) and supervision (exposure B).

https://doi.org/10.1371/journal.pmed.1004445.t002

Unadjusted FPM analysis comparing UKRs performed by consultants and trainees (exposure A), indicated that surgeon grade was not associated with the risk of all-cause revision (Model 1: HR = 1.05, 95% CI [0.94, 1.17]; p = 0.40). This finding, which is documented in Table 3, persisted despite incremental adjustment for patient, operation, and healthcare setting factors (Model 4: HR = 1.01, 95% CI [0.90, 1.13]; p = 0.88). Further analysis was performed according to the level of senior supervision (exposure B). Neither crude nor adjusted models demonstrated an association between the level of supervision of trainees and the risk of all-cause revision (Table 3).

thumbnail
Table 3. Results of flexible parametric models (FPMs) according to surgeon grade (exposure A) and supervision (exposure B).

https://doi.org/10.1371/journal.pmed.1004445.t003

Sensitivity analysis was performed following further subcategorisation of cases according to specific training grade (exposure C) and supervision. There was no evidence of an association between any specific training grade (F1-ST2, ST3-ST8, or fellow) and an increased risk of all-cause revision, regardless of the level of supervision (Table 4). It should be noted that very few UKRs were performed by surgeons in the most junior category (F1-ST2).

thumbnail
Table 4. Sensitivity analysis: Results of flexible parametric models (FPMs) for all-cause revision according to the specific training grade (exposure C) and supervision.

https://doi.org/10.1371/journal.pmed.1004445.t004

An additional sensitivity analysis was performed to explore the lack of independence between observations in patients who underwent bilateral procedures. The results were very similar, and we found no evidence of an association between surgeon grade (Model 4: HR = 1.01, 95% CI [0.90, 1.13]; p = 0.89) and the risk of all-cause revision.

Indication for revision

The 3 most common indications for revision in this cohort were progression of OA (n = 2,271), aseptic loosening/lysis (n = 1,972), and unexplained pain (n = 1,308). Crude and adjusted analyses demonstrated no evidence of an association between surgeon grade (exposure A) and an increased risk of revision for any indication, including aseptic loosening/lysis, infection, progression of OA, unexplained pain, or instability (Table 3).

Further analysis was performed according to the level of trainee supervision (exposure B). We found no evidence of an increased risk of revision for any indication when trainees were supervised by a scrubbed consultant. However, both crude and adjusted analyses (Models 1–3) demonstrated that procedures performed by trainees without scrubbed consultant supervision were associated with an increased risk of revision for unexplained pain, compared to procedures performed by consultants (Model 1: HR = 1.62, 95% CI [1.15, 2.28]; p = 0.01). This was not observed in the fully adjusted model (Model 4: HR = 1.34, 95% CI [0.95, 1.89]; p = 0.09) (Table 3).

Discussion

This analysis of 106,206 primary UKRs with over 16 years’ follow up represents the largest study to date of UKR outcomes in the context of surgical training. We have demonstrated that when comparing UKRs performed by consultants and trainees, there was no evidence of an association between surgeon grade and the risk of all-cause revision. Trainees achieved comparable outcomes to consultants regardless of the level of scrubbed supervision. There was no evidence that UKRs performed by trainees who were supervised by a scrubbed consultant were associated with an increased risk of revision for any specific indication (including aseptic loosening/lysis, infection, progression of OA, unexplained pain, and instability) compared to consultant-performed UKRs. We found evidence that UKRs performed by trainees who were not supervised by a scrubbed consultant were more likely to be revised for unexplained pain compared to consultant-performed UKRs. However, this was not observed in the fully adjusted model. Revision for unexplained pain following UKR has previously been attributed to low-volume surgeons, but not unsupervised trainees [8]. The most common indication for revision was progression of OA. The NJR defines revision as any procedure to add, remove, or modify one or more components of an implant construct for any reason [13]. Revision for progression of OA in the context of previous UKR implies progression of arthritis in previously unreplaced compartments of the knee. This includes procedures such as revising the UKR to a TKR, or the addition of another UKR (medial, lateral, or patellofemoral) to a previously unreplaced compartment. It does not necessarily imply failure of the individual implant components but is recorded by the NJR as a failure of the construct. We found no evidence of an association between surgeon grade, or the level of supervision and the risk of revision for progression of OA, which suggests that trainers are selecting appropriate cases for their trainees.

We included over 100,000 UKRs, which makes this significantly larger than any previous study of the association between surgeon grade and UKR outcomes [10,28]. Despite limiting our study period to predate the anomalous period of elective orthopaedic practice during the COVID-19 pandemic, our findings are current and represent UKRs with over 16 years of follow up. The data were recorded in a mandatory, nationwide prospective register, which improves the external validity and generalisability of our findings by reducing sampling bias. We employed FPM to model the time-dependent effects of confounding variables and account for non-proportionality. Furthermore, our incremental approach to confounding adjustment increases transparency by demonstrating the relative contribution of patient, operation, and healthcare setting factors to the adjusted results.

Despite these strengths, our study has limitations. This is an observational study and there is likely to be a nonrandom allocation of cases by consultants to trainees. We have attempted to account for this by adjusting for a comprehensive range of confounding variables. However, we acknowledge that there may be residual confounding and confirm that, to our knowledge, there are no further steps to take to adjust for factors that might have influenced the allocation of cases. While this may make it difficult to understand what the true training effect is, our results suggest that the current process of allocating UKR cases to trainees in England and Wales is safe and effective. Implant survival is an important objective metric of success. However, we did not consider other measures that may be relevant when evaluating the success of a joint replacement, such as patient-reported outcome measures, or postoperative complications other than failure, as they are not currently reported by the NJR. OA was the only indication, which along with adjustment for confounding variables, accounts for measurable variations in case complexity between the groups. However, our findings remain susceptible to residual confounding. For example, we did not adjust for BMI which, consistent with other NJR studies, was missing in a high proportion of records [21]. We performed multiple testing for various reasons for revision which may account for the association between unsupervised trainees and revision for unexplained pain that attenuated with adjustment. The distinction between medial and lateral UKRs is not routinely reported by the NJR. The NJR data collection process did not distinguish between medial and lateral UKRs until the introduction of MDS version 7 in 2018 and this information was not available within the data set [13].

The binary variable “surgeon grade” does not capture variations in the level of experience between individual trainees. We have attempted to address this through sensitivity analysis, by categorising cases according to the specific training grade of the surgeon; however, this categorical variable has similar limitations. Furthermore, supervision is recorded by the NJR as a binary variable according to the grade of the first assistant, which does not capture the spectrum of supervision that is necessary in the training process [29]. Thus, these categorical variables do not account for procedures that may have been part-performed by a trainee, or in which a trainee was supervised by an unscrubbed consultant.

A recent systematic review identified a small number of observational studies relating to this subject [10]. In their NZJR study, Storey and colleagues found no significant difference in the revision rate of UKRs performed by supervised senior trainees (n = 276) compared to attending surgeons (n = 8,550). They also reported that supervised senior trainees achieved comparable functional outcomes (Oxford Knee Score) to attending surgeons at 6 months. With only 14 cases in each group, the authors acknowledge that they had insufficient data for any meaningful analysis of the outcomes of UKRs performed by supervised junior trainees and unsupervised senior trainees. Furthermore, the indication for revision was not reported, and the description of the statistical methodology employed is limited [12]. Of note, a similarly low proportion of UKRs are recorded as performed by trainees in the NZJR (3.3%) and NJR (4.1%).

Bottomley and colleagues conducted a single-centre observational study of 1,084 Oxford medial UKRs (Zimmer Biomet, Swindon, UK). Trainees performed 673 UKRs (62.1%) and were supervised by a scrubbed consultant in 48% of cases. They reported no difference in implant survival between the groups, with 9-year cumulative survival estimates of 93.9% (95% CI [90.2, 97.6]) and 93.0% (95% CI [90.3, 95.7]) for consultants and trainees, respectively. In a subgroup analysis, they showed that trainees who had performed fewer than 10 UKRs had a failure rate of 5.1% compared to a failure rate of 4.7% in those who had undertaken more than 10 UKRs; a difference that was not statistically significant [11].

In comparison to the existing literature, the current study is significantly larger, has methodological advantages, longer follow up, and provides novel insight into the importance of scrubbed consultant supervision. Our findings are generally concordant with published data from another national joint registry [12], which suggests that our findings might be generalisable to other countries.

Our findings suggest that current training practices for UKR in England and Wales are safe, when defined by equivalence of survival outcomes. However, only a small proportion of UKRs in these countries are performed by trainees and it should be noted that very few UKRs were performed by surgeons of the most junior specific training grade (F1-ST2). It is likely that UKRs are typically performed by more experienced, senior trainees. However, we were unable to quantify this in the current study, due to the broad categories used by the NJR to record the grade of the operating surgeon.

It is presumed that trainers select appropriate cases for their trainees and permit trainees to operate without scrubbed supervision only when they have reached a subjective threshold of expertise commensurate with safe independent surgical practice. Our study suggests that in this context, trainees achieve comparable all-cause UKR survival to consultant surgeons. In terms of revision for unexplained pain, trainees might achieve their best outcomes when supervised by a scrubbed consultant. However, this association was not observed in the fully adjusted analysis. We propose that trainees should ideally be supervised by a scrubbed consultant when performing UKR, particularly during the early stages of training. When experienced senior trainees operate without scrubbed supervision, careful case selection is required, and scrubbed consultant supervision should be readily available.

The findings of this study are reassuring and support the current methods by which surgeons are trained to perform UKR in England and Wales. This is of particular importance in the context of current NICE guidelines, which recommend that patients with isolated medial compartment OA should be offered a choice of UKR or TKR [2]. This requires future generations of surgeons to be trained in both procedures, or for there to be easily accessible referral networks in place to allow surgeons that do not perform UKR to refer appropriate patients on to surgeons that do.

Conclusion

This nationwide study of UKRs with over 16 years’ follow up demonstrates that trainees in England and Wales achieve comparable all-cause implant survival to consultants. Our findings support the current methods by which surgeons in England and Wales are trained to perform UKR.

Supporting information

S1 RECORD Checklist. Reporting of studies Conducted using Observational Routinely collected health Data (RECORD) Checklist.

https://doi.org/10.1371/journal.pmed.1004445.s001

(DOCX)

S1 Fig. Detailed study flow diagram showing sequential exclusions.

https://doi.org/10.1371/journal.pmed.1004445.s002

(TIF)

S2 Fig. Detailed study flow diagram showing exclusion of missing data.

https://doi.org/10.1371/journal.pmed.1004445.s003

(TIF)

S1 Appendix. Process of accounting for changes in NJR operating surgeon grade categories.

https://doi.org/10.1371/journal.pmed.1004445.s004

(DOCX)

S2 Appendix. Model selection, construction and justification.

https://doi.org/10.1371/journal.pmed.1004445.s005

(DOCX)

S3 Appendix. Schematic summary of surgical training in the UK.

https://doi.org/10.1371/journal.pmed.1004445.s006

(DOCX)

S4 Appendix. Model specification summarising the exposures and confounding variables used in the analyses.

https://doi.org/10.1371/journal.pmed.1004445.s007

(DOCX)

Acknowledgments

We thank the patients and staff of all the hospitals who have contributed data to the National Joint Registry. We are grateful to the Healthcare Quality Improvement Partnership (HQIP), the National Joint Registry Steering Committee (NJRSC), and staff at the NJR Centre for facilitating this work.

The views expressed in this publication are those of the authors and do not necessarily reflect those of the NHS, the NIHR, the UK Department of Health and Social Care, the NJRSC, or the HQIP.

References

  1. 1. Price AJ, Alvand A, Troelsen A, Katz JN, Hooper G, Gray A, et al. Knee replacement. Lancet. 2018;392(10158):1672–82. pmid:30496082
  2. 2. NICE: Joint replacement (primary): hip, knee and shoulder [NG157] 2023 [cited 24/06/2024]. Available from: https://www.nice.org.uk/guidance/ng157/chapter/Recommendations#procedures-for-primary-elective-knee-replacement.
  3. 3. Price A, Thienpont E, Catani F, Abram S, Troelsen A, Bask, et al. Consensus statement on unicompartmental knee replacement: A collaboration between BASK and EKS. Knee. 2023;41:391–6. pmid:37088518
  4. 4. Wilson HA, Middleton R, Abram SGF, Smith S, Alvand A, Jackson WF, et al. Patient relevant outcomes of unicompartmental versus total knee replacement: systematic review and meta-analysis. BMJ. 2019;364:l352. pmid:30792179
  5. 5. Beard DJ, Davies LJ, Cook JA, MacLennan G, Price A, Kent S, et al. The clinical and cost-effectiveness of total versus partial knee replacement in patients with medial compartment osteoarthritis (TOPKAT): 5-year outcomes of a randomised controlled trial. Lancet. 2019;394(10200):746–56. pmid:31326135
  6. 6. Evans JT, Walker RW, Evans JP, Blom AW, Sayers A, Whitehouse MR. How long does a knee replacement last? A systematic review and meta-analysis of case series and national registry reports with more than 15 years of follow-up. Lancet. 2019;393(10172):655–63. pmid:30782341
  7. 7. Hunt LP, Blom AW, Matharu GS, Kunutsor SK, Beswick AD, Wilkinson JM, et al. Patients Receiving a Primary Unicompartmental Knee Replacement Have a Higher Risk of Revision but a Lower Risk of Mortality Than Predicted Had They Received a Total Knee Replacement: Data From the National Joint Registry for England, Wales, Northern Ireland, and the Isle of Man. J Arthroplasty. 2021;36(2):471–7 e6. pmid:33011013
  8. 8. Baker P, Jameson S, Critchley R, Reed M, Gregg P, Deehan D. Center and surgeon volume influence the revision rate following unicondylar knee replacement: an analysis of 23,400 medial cemented unicondylar knee replacements. J Bone Joint Surg Am. 2013;95(8):702–9. pmid:23595068
  9. 9. Liddle AD, Pandit H, Judge A, Murray DW. Effect of Surgical Caseload on Revision Rate Following Total and Unicompartmental Knee Replacement. J Bone Joint Surg Am. 2016;98(1):1–8. pmid:26738897
  10. 10. Fowler TJ, Aquilina AL, Blom AW, Sayers A, Whitehouse MR. Association between surgeon grade and implant survival following hip and knee replacement: a systematic review and meta-analysis. BMJ Open. 2021;11(11):e047882. pmid:34758989
  11. 11. Bottomley N, Jones LD, Rout R, Alvand A, Rombach I, Evans T, et al. A survival analysis of 1084 knees of the Oxford unicompartmental knee arthroplasty: a comparison between consultant and trainee surgeons. Bone Joint J. 2016;98-B(10 Supple B):22–7. pmid:27694512
  12. 12. Storey R, Frampton C, Kieser D, Ailabouni R, Hooper G. Does Orthopaedic Training Compromise the Outcome in Knee Joint Arthroplasty? J Surg Educ. 2018;75(5):1292–8. pmid:29574018
  13. 13. National Joint Registry: 17th Annual Report 2020 [cited 24/06/2024]. Available from: https://reports.njrcentre.org.uk/Portals/10/PDFdownloads/NJR%2017th%20Annual%20Report%202020.pdf.
  14. 14. Fitzgerald JE, Giddings CE, Khera G, Marron CD. Improving the future of surgical training and education: consensus recommendations from the Association of Surgeons in Training. Int J Surg. 2012;10(8):389–92. pmid:22449833
  15. 15. Royston P, Lambert PC. Flexible parametric survival analysis using stata: beyond the Cox model. College Station, Tex.: Stata; 2011.
  16. 16. Lambert PC, Royston P. Further development of flexible parametric models for survival analysis. Stata J. 2009;9(2):265–90.
  17. 17. Smith AJ, Dieppe P, Vernon K, Porter M, Blom AW, National Joint Registry. Failure rates of stemmed metal-on-metal hip replacements: analysis of data from the National Joint Registry of England and Wales. Lancet. 2012;379(9822):1199–204.
  18. 18. Blom AW, Hunt LP, Matharu GS, Reed M, Whitehouse MR. The effect of surgical approach in total knee replacement on outcomes. An analysis of 875,166 elective operations from the National Joint Registry for England, Wales, Northern Ireland and the Isle of Man. Knee. 2021;31:144–57. pmid:34175672
  19. 19. Blom AW, Hunt LP, Matharu GS, Reed MR, Whitehouse MR. The effect of surgical approach in total hip replacement on outcomes: an analysis of 723,904 elective operations from the National Joint Registry for England, Wales, Northern Ireland and the Isle of Man. BMC Med. 2020;18(1):242. pmid:32758226
  20. 20. Hunt LP, Whitehouse MR, Beswick A, Porter ML, Howard P, Blom AW. Implications of Introducing New Technology: Comparative Survivorship Modeling of Metal-on-Metal Hip Replacements and Contemporary Alternatives in the National Joint Registry. J Bone Joint Surg Am. 2018;100(3):189–96. pmid:29406339
  21. 21. Sayers A, Whitehouse MR, Judge A, MacGregor AJ, Blom AW, Ben-Shlomo Y. Analysis of change in patient-reported outcome measures with floor and ceiling effects using the multilevel Tobit model: a simulation study and an example from a National Joint Register using body mass index and the Oxford Hip Score. BMJ Open. 2020;10(8):e033646. pmid:32859657
  22. 22. Sayers A, Steele F, Whitehouse MR, Price A, Ben-Shlomo Y, Blom AW. Association between surgical volume and failure of primary total hip replacement in England and Wales: findings from a prospective national joint replacement register. BMJ Open. 2020;10:e033045. pmid:32928843
  23. 23. Fowler TJ, Aquilina AL, Reed MR, Blom AW, Sayers A, Whitehouse MR. The association between surgeon grade and risk of revision following total hip arthroplasty: an analysis of National Joint Registry data. Bone Joint J. 2022;104-B(3):341–51. pmid:35227094
  24. 24. Crowther MJ, Look MP, Riley RD. Multilevel mixed effects parametric survival models using adaptive Gauss-Hermite quadrature with application to recurrent events and individual participant data meta-analysis. Stat Med. 2014;33(22):3844–58. pmid:24789760
  25. 25. Benchimol EI, Smeeth L, Guttmann A, Harron K, Moher D, Petersen I, et al. The Reporting of studies Conducted using Observational Routinely-collected health Data (RECORD) Statement. PLoS Med. 2015;12(10):e1001885. pmid:26440803
  26. 26. National Joint Registry: Privacy Notice & GDPR 2024 [cited 24/06/2024]. Available from: https://www.njrcentre.org.uk/privacy-notice-gdpr/.
  27. 27. White IR, Carlin JB. Bias and efficiency of multiple imputation compared with complete-case analysis for missing covariate values. Stat Med. 2010;29:2920–31. pmid:20842622
  28. 28. Madanipour S, Singh P, Karia M, Bhamra JS, Abdul-Jabar HB. Trainee performed total knee arthroplasty is safe and effective: A systematic review and meta-analysis comparing outcomes between trainees and consultants. Knee. 2021;30:291–304. pmid:33984748
  29. 29. Intercollegiate Surgical Curriculum Programme: Trauma and Orthopaedic Surgery Curriculum 2021 [cited 24/06/2024]. Available from: https://www.iscp.ac.uk/iscp/curriculum/trauma-orthopaedic-surgery-curriculum/1-introduction/.