Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Application of artificial intelligence in X-ray imaging analysis for knee arthroplasty: A systematic review

  • Zhihong Zhang ,

    Contributed equally to this work with: Zhihong Zhang, Xu Hui

    Roles Data curation, Investigation, Methodology, Project administration, Validation, Writing – original draft, Writing – review & editing

    Affiliations Department of The First Clinical Medical College of Gansu, University of Chinese Medicine, Lanzhou, Gansu, China, Department of Evidence-Based Medicine Centre, School of Basic Medical Science, Lanzhou University, Lanzhou, Gansu, China

  • Xu Hui ,

    Contributed equally to this work with: Zhihong Zhang, Xu Hui

    Roles Data curation, Investigation, Methodology, Software, Supervision, Validation, Writing – original draft, Writing – review & editing

    Affiliations Department of Evidence-Based Medicine Centre, School of Basic Medical Science, Lanzhou University, Lanzhou, Gansu, China, Department of Centre for Evidence-Based Social Science/Center for Health Technology Assessment, School of Public Health, Lanzhou University, Lanzhou, Gansu, China, Department of Gansu Key Laboratory of Evidence-Based Medicine, Lanzhou University, Lanzhou, Gansu, China

  • Huimin Tao,

    Roles Data curation, Methodology, Resources, Validation, Visualization, Writing – review & editing

    Affiliation Department of The First Clinical Medical College of Gansu, University of Chinese Medicine, Lanzhou, Gansu, China

  • Zhenjiang Fu,

    Roles Conceptualization, Formal analysis, Resources, Visualization, Writing – review & editing

    Affiliation Department of The First Clinical Medical College of Gansu, University of Chinese Medicine, Lanzhou, Gansu, China

  • Zaili Cai,

    Roles Conceptualization, Formal analysis, Resources, Validation, Visualization, Writing – review & editing

    Affiliation Department of Radiology, Renhuai People’s Hospital, Zuiyi, Guizhou, China

  • Sheng Zhou ,

    Roles Funding acquisition, Methodology, Validation, Writing – review & editing

    Lzzs@sina.com (SZ); yangkh-ebm@lzu.edu.cn (KY)

    Affiliations Department of The First Clinical Medical College of Gansu, University of Chinese Medicine, Lanzhou, Gansu, China, Department of Radiology, Gansu Provincial Hospital, Lanzhou, Gansu, China

  • Kehu Yang

    Roles Conceptualization, Funding acquisition, Investigation, Supervision, Validation, Writing – review & editing

    Lzzs@sina.com (SZ); yangkh-ebm@lzu.edu.cn (KY)

    Affiliations Department of Evidence-Based Medicine Centre, School of Basic Medical Science, Lanzhou University, Lanzhou, Gansu, China, Department of Centre for Evidence-Based Social Science/Center for Health Technology Assessment, School of Public Health, Lanzhou University, Lanzhou, Gansu, China, Department of Gansu Key Laboratory of Evidence-Based Medicine, Lanzhou University, Lanzhou, Gansu, China

Abstract

Background

Artificial intelligence (AI) is a promising and powerful technology with increasing use in orthopedics. The global morbidity of knee arthroplasty is expanding. This study investigated the use of AI algorithms to review radiographs of knee arthroplasty.

Methods

The Ovid-Embase, Web of Science, Cochrane Library, PubMed, China National Knowledge Infrastructure (CNKI), WeiPu (VIP), WanFang, and China Biology Medicine (CBM) databases were systematically screened from inception to March 2024 (PROSPERO study protocol registration: CRD42024507549). The quality assessment of the diagnostic accuracy studies tool assessed the risk of bias.

Results

A total of 21 studies were included in the analysis. Of these, 10 studies identified and classified implant brands, 6 measured implant size and component alignment, 3 detected implant loosening, and 2 diagnosed prosthetic joint infections (PJI). For classifying and identifying implant brands, 5 studies demonstrated near-perfect prediction with an area under the curve (AUC) ranging from 0.98 to 1.0, and 10 achieved accuracy (ACC) between 96–100%. Regarding implant measurement, one study showed an AUC of 0.62, and two others exhibited over 80% ACC in determining component sizes. Moreover, Artificial intelligence showed good to excellent reliability across all angles in three separate studies (Intraclass Correlation Coefficient > 0.78). In predicting PJI, one study achieved an AUC of 0.91 with a corresponding ACC of 90.5%, while another reported a positive predictive value ranging from 75% to 85%. For detecting implant loosening, the AUC was found to be at least as high as 0.976 with ACC ranging from 85.8% to 97.5%.

Conclusions

These studies show that AI is promising in recognizing implants in knee arthroplasty. Future research should follow a rigorous approach to AI development, with comprehensive and transparent reporting of methods and the creation of open-source software programs and commercial tools that can provide clinicians with objective clinical decisions.

Introduction

Knee arthroplasty surgeries were initially widely performed in the 1,970s and 1,980s and are now considered an effective and cost-efficient treatment for end-stage knee arthritis [1]. It is projected that total knee arthroplasty (TKA) alone will exceed 1.26 million annual cases in the United States [2]. As the population of arthroplasty patients continues to grow, healthcare systems will face an increasing burden [3]. Therefore, understanding the causes and risk factors of revision knee arthroplasty to enhance surgery longevity has become increasingly important. Consequently, conducting postoperative imaging examinations to evaluate prosthesis characteristics and detect abnormalities is crucial. Various imaging techniques like X-ray, MRI, arthrography, scintigraphy, and fluorodeoxyglucose-positron emission tomography (FDG-PET) have been extensively used [4,5]. Among these methods, X-ray examination is considered primary due to advantages such as shorter duration of examination, cost-effectiveness, and enhanced diagnostic accuracy [6,7].

Due to recent developments, AI diagnostic capabilities have gained significant attention as an efficient and standardized tool in medicine and orthopedics [8]. Machine learning (ML) and deep learning (DL) models, known as convolutional neural networks (CNNs), are subsets of AI modeled after the human brain to identify rules and patterns in images [810]. AI algorithms have been reported to automatically diagnose stroke, retinopathy, and cancer histology with the same accuracy as experts in related fields [1114]. Due to its clear operational criteria and internal logic involved in knee prosthesis selection, AI techniques can effectively model this process. In recent years, there has been a significant increase in AI research for analyzing various aspects of knee arthroplasty, including implant identification, implant failure, precise measurement of dimensions, hospitalization duration, potential complications, cost-effectiveness analysis, functional recovery outcomes, and optimal surgical techniques [1520]. However, many of these algorithms were either underpowered or included misleading parameters. Importantly, medical imaging plays a crucial role in providing clinicians with convenient access to patient information in real healthcare settings. By utilizing labeled radiographs from medical experts, AI has emerged as a powerful and effective method for musculoskeletal imaging [21].

Consequently, we conducted a comprehensive systematic review of all relevant studies. This study focused on the application of X-ray imaging-based AI in prosthesis-related studies of knee arthroplasty and explored the diagnostic performance and analysis of AI algorithms and models for knee prostheses.

Materials and methods

This study was conducted following the guidelines in the Preferred Reporting Items for Systematic Reviews and Meta-Analyses of Diagnostic Test Accuracy (PRISMA-DTA) statement [22], and was prospectively registered on the International Prospective Register of Systematic Reviews (PROSPERO, CRD42024507549) [23].

Search strategy

From their inception until July 2023, a comprehensive search was conducted across the electronic databases of CNKI, VIP, Wan Fang, CBM, PubMed, Cochrane Library, Ovid-Embase, and Web of Science, with subsequent updates conducted until March 2024. The keywords used in the search strategy included (“knee arthroplasty” OR “knee prosthesis” OR “knee replacement”) (“prosthesis” OR “implants”) and (“artificial intelligence” OR “deep learning” OR “machine intelligence” OR “machine learning” OR “computational intelligence” OR “AI” OR “algorithms” OR “random forest” OR “support vector machine” OR “decision trees” OR “artificial neural network” OR “radiomics”). Furthermore, we manually screened the reference lists of the included studies to identify any publications that may be included in our systematic search. No limitations were imposed on the studies’ source or the publications’ language. The detailed search strategy is provided in S1 Table.

Study selection

Studies were imported into EndNote 20 and Rayyan (https://www.rayyan.ai/) for screening. Title and abstract screening, and full-text screening were performed independently by two reviewers, and discrepancies were resolved through discussion. Studies included met these criteria: 1) Study types: cohort studies and cross-sectional studies; 2) Participants: with or without knee prosthesis; 3) Index tests: diagnosis of knee arthroplasty based on quantitative imaging data (X-rays) using AI algorithms; 4) Reference standards: patients with a diagnosis confirmed by medical guidelines, clinical surgical records; 5) Outcomes: ultimately, we identified the currently established use four cases for AI in knee arthroplasty implant analysis. We excluded inaccessible studies, conference abstracts, reviews, editorials, consensus, guidelines, meta-analyses, duplicated studies, or review articles.

Data extraction

Two reviewers independently performed data extraction (ZZH and THM), from August to September 2023, and disagreements were resolved through discussion with a third reviewer(HX). When data were insufficient or missing from the articles, we attempted to contact the study authors for additional information. The following variables were included: 1) study characteristics: first author, publication year, country, prediction type; 2) patient clinical characteristics: number of samples, demographics; 3) model characteristics: index tests, reference standards, data sources, radiographs samples, implant designs, the proportion of data sets, external verification, AI device, AI method, AI techniques; 4) diagnostic performance measures such as the area under the curve (AUC), accuracy (ACC), sensitivity (SEN), specificity (SPE), positive predictive value (PPV), negative predictive value (NPV), or intraclass correlation coefficient (ICC).

Bias assessment and certainty of evidence

The Quality Assessment of Diagnostic Accuracy Studies-AI (QUADAS-AI) tool is an expanded instrument that incorporates the characteristics of AI research based on the QUADAS-2 [24] and QUADAS-C tools [25], which supplement the bias risk assessment in four relevant domains: case selection, diagnostic tests, reference standards, research processes, and timing. However, it should be noted that the current version of the QUADAS-AI tool is still in draft form and has not yet reached maturity regarding signal problems. Therefore, the QUADAS-2 tool was chosen as the preferred method for assessing bias risk. The tool includes four domains for risk of bias assessment and three for applicability concerns. Each section is classified as having a high, low, or unclear risk of bias. Two reviewers independently conducted a risk of bias assessment using the QUADAS-2 tool [24], and a third reviewer consulted to achieve consensus on each item. Furthermore, Review Manager 5.3 software aided in visually portraying the risk of bias assessment, providing a concise and intuitive representation of the results [26].

Data analyses

Descriptive statistics were employed to succinctly summarize the noteworthy findings and outcomes of the selected studies, as well as to delineate prevailing trends in AI technology, clinical applications, and their associated discoveries. Summary data were presented employing elementary means, frequencies, and proportions. The performance of AI models in the assessed studies was summarized utilizing diverse metrics encompassing AUC, ACC, SEN, SPE, PPV, NPV, and ICC.

Result

Search results

A study-flow diagram shows the process for study identification, inclusion, and exclusion (Fig 1). A comprehensive search yielded 7083 studies, which were subsequently screened for duplicates resulting in 4937 unique studies. After evaluating titles and abstracts, we excluded 4884 studies that did not meet the predefined inclusion/exclusion criteria. Fifty-three full-text articles underwent eligibility assessment, leading to the exclusion of 32 studies. Ultimately, a total of 21 studies met the criteria and were included in this systematic review.

thumbnail
Fig 1. PRISMA flow chart of the literature retrieval.

https://doi.org/10.1371/journal.pone.0321104.g001

Characteristics of thṭe included studies

Of the 21 studies evaluating range from tens to over ten thousand radiographs were included, with 10 studies explored AI application in classification of manufacturers and brands [2736], and 6 studies focused on implant measurements, including implant size and component position, alignment [3742], 3 studies were concerned with the detection of implant loosening [4345], and 2 studies related to the prediction and diagnosis of prosthetic joint infections (PJI) [46,47]. All studies were published during 2020–2023. All studies were retrospective studies. There were 6 studies involving various AI models [30,31,33,35,43,47], and others used a single AI model. There were 13 image-related studies used CNN models [2732,34], 4 used ML models [33,37,38,45], 3 used AI tools [36,40,42], and 1 used algorithms [46]. Also 7 studies used widely learned CNN models from Imagenet [27,28,31,4345,47]. 2 studies utilized ResNet [27,28], 4 studies employed Inception V3 [29,32,34,44]. 4 studies analyzed various AI models and selected those with the highest accuracy, namely DenseNet [31,43], Efficient Net & U-Net [30], MobileNetV2 [35], and the VGG-16 transfer learning Model [33]. Additionally, focusing on the model’s architecture methods like dropout regularization [45], batch normalization [2830,32,34,45], transfer learning [27,28,31,33,41,43,45], pretraining [27,43], and access to a large number of high-quality image datasets [27,28,31,4345,47] in this study. Thirteen studies data originated from a single center [27,30,32,3639,4144,47,48], while 8 studies drew from multiple centers [28,29,31,3335,40,46]. Seven studies were based on US populations [27,29,32,34,37,43,46]; 3 studies originated from China [41,44,47], India [31,33,35], and the UK [28,30,38], respectively; 2 studies from Germany [39,42]; and one from France [36], Australia [40], and Korea [45], respectively. The number of implants brand each algorithm could classify varied between the studies, with certain studies failing to disclose the number of prostheses [39,4247], while the highest reported count was 14 prosthesis manufacturers [32]. We also observed that the most commonly used X-ray imaging positions among the included studies were anteroposterior (AP) and lateral (LAT) views (86%). The detailed features of the studies are summarized in Table 1.

thumbnail
Table 1. Characteristics of the included studies.

https://doi.org/10.1371/journal.pone.0321104.t001

Identification and classification of implants

These included 10 studies [2736] (Table 2). The first study by Yi et al [27] in 2020 successfully demonstrated CNNs’ ability to accurately identify the presence of TKA, distinguishing between TKA and UKA, and two different primary TKA models on knee radiographs. All achieved an AUC of 1 and a SEN of 100%. In 2021, Belete et al [28] proposed a comparable CNN that effectively distinguished between TKA implants and no prosthesis, achieving an impressive ACC of 100% and an AUC score of 100%. Additionally, they developed a deep CNN for automatic classification of 7 types of knee arthroplasties on radiographs, demonstrating a remarkable ACC rate of 100%. Patel et al [30] conducted a study where they developed classification networks to identify 12 designs of hip and knee implants from radiographs. The final network achieved an impressive accuracy rate of 98.9%, outperforming all five specialists (median ACC rate of 76.1% and best ACC rate of 85.6%). Another study by Klemt et al [32] demonstrated that their AI model successfully distinguished between 24 THA and 14 TKA designs and became the first to identify primary and revision TKA designs automatically. Furthermore, this study had the largest sample size among all types of TKA implants examined. Their CNN model exhibited excellent discrimination in identifying eight primary TKA designs with an AUC value of 0.97 and six revision TKA designs with an AUC value of 0.96. Sharma et al [31] utilized a DL network trained on 1078 radiographs to accurately distinguish among six knee arthroplasty implants, with an internally validated AUC value of 0.98. Moreover, two studies [35,36] achieved similar precision in identifying automated knee implants, with ACCs reaching 96.7% and 99.9%, respectively. Through internal validation on plain radiographs, Tiwari et al [35] reported the development of an algorithm capable of identifying seven distinct knee arthroplasty models with an ACC rate of 95%. The study further highlighted that ML outperformed two human experts who achieved only an average ACC of 78%. The internally validated study by Karnuta et al [29] in 2021 effectively discriminated nine different knee arthroplasty implant models using 682 radiographs, yielding near-perfect AUC values ranging from UKAs to TKAs to distal femoral replacements (DFRs). Karnuta et al [36] conducted external validation in 2023 to enhance clinical applicability and ensure generalizability, significantly expanding the dataset to include 4724 radiographs classified based on a previous study [29]. The externally validated model exhibited outstanding performance with an AUC of 0.989 while maintaining an impressive average processing speed of 0.02 seconds per image for implant identification. Karnuta’s study [36] also identified nine knee prosthesis system designs from four manufacturers, representing contemporary and commonly used models.

thumbnail
Table 2. Performance of AI algorithms iin identifying implant in knee arthroplasty.

https://doi.org/10.1371/journal.pone.0321104.t002

Implant measurement

These included 6 AI studies to assess knee prosthesis angle alignment, position, and size (Table 3).

thumbnail
Table 3. Performance of AI algorithms in measuring implant in knee arthroplasty.

https://doi.org/10.1371/journal.pone.0321104.t003

Implant measurement-alignment

Among 4 studies employed AI to evaluate component alignment accuracy in TKA [37,39,40,42]. Farooq et al [37] used ML to predict clinical knee improvement based on sagittal implant location. ML models showed high satisfaction and normal knee function within -2° to +2° tibial slope and 0° to +7° femoral component flexion. Conversely, worse outcomes were predicted with femoral component extension, excessive flexion (>+10°), or significant alteration (>5°) of the native tibial slope. Erne et al [39] developed an innovative AI algorithm for determining lower limb alignment angles with a detection rate ranging from 92.4% to 98.9%. The excellent reliability and agreement between human vs human comparisons as well as AI vs mean human comparisons (ICC range: 0.8–1.0) highlight the outstanding performance of AI algorithms in terms of assessment speed (<60 seconds) and ACC for evaluating lower extremity alignment. Two studies [40,42] demonstrated the effectiveness and reliability of AI-powered applications (IB Lab LAMA) for automated measurement of lower limb alignment, which outperformed traditional orthopedic measurements by providing results in just 20 seconds per radiograph analysis using AI software alone. Schwarz et al.’ study [40] revealed a strong correlation between manual reads and AI software measurements for hip-knee-ankle (HKA), femoral component angle (FCA), and tibial component angle (TCA) (ICC>0.97). Additionally, the repeatability (96%) and reliability (78%) of long-leg radiographs with TKA assessed by the AI software further supported its efficacy. Pagano’s study [42], complemented by studies from Schwarz et al. [42], assessed more parameters using AI-powered software, including axial alignments, femoral and tibial angles, as well as other vital measurements such as joint-line convergence angle (JLCA), HKA, and Mikulicz line

Implant measurement-dimensions

With 2 studies evaluated the implant dimensions using AI [38,41]. Yue et al [41] employed an ECOC-based CNN model that incorporated patient gender and BMI with X-ray images to achieve prediction ACC of 88.23% and 86.27% for femoral and tibial components, respectively. Burge et al [38] evaluated the performance utility of an ML-based 2D-3D pipeline, which accurately predicted distal femur and proximal tibia sizes from X-ray images. The authors demonstrated that their model achieved an average root mean square error (RMSE) percentage of 77.9% and 80.5% for femoral and tibial component sizes while achieving accuracy rates of 71.8% for both maximum up/down hanging measurements, representing improvements by approximately 99.5% and 99.9%, respectively, within the ±1 size metric.

PJI of knee arthroplasty

Two studies predicted PJI using X-rays based on AI algorithms [46,47] (Table 4). One study [46] developed diagnostic coding algorithms using the International Classification of Diseases, 9th Revision (ICD-9), International Classification of Diseases, 10th Revision (ICD-10), and current procedural terminology (CPT) codes to identify PJI for knee arthroplasty in the US Veterans Health Administration. The study achieved PPVs of 75% (ICD-9) and 85% (ICD-10) for confirmed PJI events, which could be helpful for future pharmacoepidemiologic studies. Another study [47] developed DL models for classification networks using 1062 dual-channel X-ray images, which can assist clinicians in identifying PJI and aseptic failure with high SEN and SPE. It achieved an AUC of 0.931, a SEN of 0.905, and a SPE of 0.889 for identifying PJI in knee prostheses. Moreover, the high-risk regions predicted by the PJI deep learning model were closely correlated with intraoperative clinical and pathological findings

thumbnail
Table 4. Performance of AI Algorithms in predicting joint infection in knee arthroplasty.

https://doi.org/10.1371/journal.pone.0321104.t004

Implant loosening

Included 3 studies have monitored the implant loosening in TKA [4345] (Tabel 5). Shah et al. [45] evaluated a model for detecting loosening using 217 fixed TKA and 137 loosened TKA X-rays, employing various CNN models. The transfer learning approach utilized image modification techniques without altering the layers. The ACC achieved 70.8% when using original images with untrained models, which improved to 73.3% when segmentation and cropping tools were applied and exceeded 80% when pre-trained models with large datasets were used. When only images were considered, the ACC remained below 90%; however, incorporating clinical information increased the ACC beyond 90%. Lau et al. [44] conducted a study on TKA loosening analysis involving a dataset of 440 knee radiograph images utilizing an image-based ML model based on Xception architecture without fine-tuning. They achieved an overall ACC of 96.3%, SEN of 96.1%, SPE of 90.9%, and the AUC of 93.5%. Assessment by senior orthopedic specialists yielded an ACC range between 89.09% to 94.54%. Kim et al [45] employed a transfer learning model using VGG19 as a pre-trained CNN model to assess the accuracy in detecting TKA loosening among 100 patients with loosening and another 100 patients without it; two different degrees of freezing ranges were explored for transfer learning modeling techniques. The solely fine-tuned image models demonstrated impressive results with an ACC of 97.5%, perfect SEN of 100%, and SPE of 97.5%.

Quality assessment

The quality assessment of each included study is illustrated in Fig 2-3. In the patient selection domain, nine studies had a high or unclear risk [27,28,35,36,38,4345,47]. For the index test, 12 studies had an unclear risk [29,32,33,3540,43,45,46]. No bias was observed in the reference standard domain, while flow and timing biases were high or unclear in 4 studies [33,36,40,46]. Regarding applicability concerns, there were high or unclear applicability issues in the patient selection domain for three studies [30,42,43] and in the index test domain for three studies [38,45,46], and low applicability issues across all studies in the reference standard domain. Most included studies showed a low risk of bias and applicability concerns.

thumbnail
Figs 2. Risk of bias and applicability concerns summary.

https://doi.org/10.1371/journal.pone.0321104.g002

thumbnail
Figs 3. Risk of bias and applicability concerns graph.

https://doi.org/10.1371/journal.pone.0321104.g003

thumbnail
Table 5. Performance of AI algorithms in implant loosening in knee arthroplasty.

https://doi.org/10.1371/journal.pone.0321104.t005

Discussion

This systematic review aimed to comprehensively assess the research using AI algorithms in various aspects of X-ray-based knee arthroplasty. AI-based models offered a promising solution for identifying knee implant systems from X-radiographs, particularly in scenarios without clinical information. Leveraging object-detection algorithms, these AI algorithms could efficiently analyze images, aiding orthopedic surgeons in resolving complexities and prosthetic challenges associated with knee implants. Thus, they reduced human error and improved treatment workflows [48].

Nonetheless, due to differences in architecture and structure among the AI algorithms used in each study, it is still challenging to objectively and quantitatively compare their diagnostic performance for different types of knee arthroplasty. Therefore, developing and implementing standards for AI research in medicine, such as the machine learning-specific extension of the TRIPOD statement (TRIPOD-ML), is crucial. Despite being recently developed or modified, AI algorithms are expected to have higher accuracy. However, due to the inherent algorithmic differences, no significant variations in accuracy were observed. This suggests that while the AI model itself plays a role, the quantity and quality of the dataset currently hold greater importance [49]. Limited training data samples in some studies employ standard data augmentation methods [2731,33,34,39,45] and multicenter data sources [28,29,31,3335,40,46] to improve this situation. The “black box” phenomenon in AI diagnostics is also concerning [50], making models’ internal workings inexplicable. Visualization techniques like class activation mapping [2732,34,36,44,47,51] are commonly used to highlight essential image areas and provide a visual summary of CNN’s decision-making process. This enhances transparency and ensures immediate understanding for humans. Additionally, the QUADAS-2 tool assessment indicates that most studies in our study have low risks of bias and applicability concerns. However, certain domains like patient selection and the index test exhibit variability, with some studies showing high or unclear dangers and problems. These findings highlight the overall reliability of the included studies while emphasizing areas that may require cautious interpretation due to potential biases or applicability issues.

There are also several potential limitations. Firstly, the methodological heterogeneity of the included studies and variations in the performance reports constrained our survey to a systematic review only. Secondly, the current AI models were trained on a specific population of pre-selected cases, given the retrospective nature of this study. Further validation is imperative to ensure the accuracy and generalizability of these AI models. Thirdly, since all included studies involved predictive AI models, it should be noted that the tools for assessing these models are still under development, particularly with the ongoing extension of the Prediction Model Risk of Bias Assessment Tool for Artificial Intelligence (PROBAST-AI) guidelines. Consequently, the quality evaluation could not be further conducted.

Conclusion

In conclusion, this study highlights the promising role of AI in analyzing routine X-rays to identify and categorize knee implants, measure dimensions and alignment of knee prostheses, detect PJI, and assess implant loosening. Further research will benefit from including more extensive and diverse datasets, establishing multiple center population databases, and training the algorithms on broader implant designs. Additionally, an essential task for the future of AI is the development of open-source software programs and commercial tools that can provide clinicians with more objective clinical decisions.

References

  1. 1. Carr AJ, Robertsson O, Graves S, Price AJ, Arden NK, Judge A, et al. Knee replacement. Lancet. 2012;379(9823):1331–40. pmid:22398175
  2. 2. Sloan M, Premkumar A, Sheth NP. Projected Volume of Primary Total Joint Arthroplasty in the U.S., 2014 to 2030. J Bone Joint Surg Am. 2018;100(17):1455–60. pmid:30180053
  3. 3. Geary MB, Macknet DM, Ransone MP, Odum SD, Springer BD. Why Do Revision Total Knee Arthroplasties Fail? A Single-Center Review of 1632 Revision Total Knees Comparing Historic and Modern Cohorts. J Arthroplasty. 2020;35(10):2938–43. pmid:32561262
  4. 4. Signore A, Glaudemans AWJM. Reply to comment by J.P. Suarez Fernandez on “Consensus document for the diagnosis of prosthetic joint infections: a joint paper by the EANM, EBJIS, and ESR (with ESCMID endorsement)”. Eur J Nucl Med Mol Imaging. 2020;47(11):2503–4. pmid:32533242
  5. 5. French TH, Russell N, Pillai A. The diagnostic accuracy of radionuclide arthrography for prosthetic loosening in hip and knee arthroplasty. Biomed Res Int. 2013;2013:693436. pmid:23936835
  6. 6. Barnsley L, Barnsley L. Detection of aseptic loosening in total knee replacements: a systematic review and meta-analysis. Skeletal Radiol. 2019;48(10):1565–72. pmid:30976835
  7. 7. Temmerman OPP, Raijmakers PGHM, David EFL, Pijpers R, Molenaar MA, Hoekstra OS, et al. A comparison of radiographic and scintigraphic techniques to assess aseptic loosening of the acetabular component in a total hip replacement. J Bone Joint Surg Am. 2004;86(11):2456–63. pmid:15523018
  8. 8. Jamshidi A, Pelletier J-P, Martel-Pelletier J. Machine-learning-based patient-specific prediction models for knee osteoarthritis. Nat Rev Rheumatol. 2019;15(1):49–60. pmid:30523334
  9. 9. Rodríguez-Merchán EC. The current role of the virtual elements of artificial intelligence in total knee arthroplasty. EFORT Open Rev. 2022;7(7):491–7. pmid:35900206
  10. 10. Shen D, Wu G, Suk H-I. Deep Learning in Medical Image Analysis. Annu Rev Biomed Eng. 2017;19:221–48. pmid:28301734
  11. 11. Gayathri S, Gopi VP, Palanisamy P. Diabetic retinopathy classification based on multipath CNN and machine learning classifiers. Phys Eng Sci Med. 2021;44(3):639–53. pmid:34033015
  12. 12. Jeong S, Son D-S, Cho M, Lee N, Song W, Shin S, et al. Evaluation of Combined Cancer Markers With Lactate Dehydrogenase and Application of Machine Learning Algorithms for Differentiating Benign Disease From Malignant Ovarian Cancer. Cancer Control. 2021;28:10732748211033401. pmid:34923833
  13. 13. Mainali S, Darsie ME, Smetana KS. Machine Learning in Action: Stroke Diagnosis and Outcome Prediction. Front Neurol. 2021;12:734345. pmid:34938254
  14. 14. Du J, Yang J, Yang Q, Zhang X, Yuan L, Fu B. Comparison of machine learning models to predict the risk of breast cancer-related lymphedema among breast cancer survivors: a cross-sectional study in China. Front Oncol. 2024;14:1334082. pmid:38410115
  15. 15. Shah AK, Lavu MS, Hecht CJ 2nd, Burkhart RJ, Kamath AF. Understanding the use of artificial intelligence for implant analysis in total joint arthroplasty: a systematic review. Arthroplasty. 2023;5(1):54. pmid:37919812
  16. 16. Lopez CD, Gazgalis A, Boddapati V, Shah RP, Cooper HJ, Geller JA. Artificial Learning and Machine Learning Decision Guidance Applications in Total Hip and Knee Arthroplasty: A Systematic Review. Arthroplast Today. 2021;11:103–12. pmid:34522738
  17. 17. Kim M-S, Kim J-J, Kang K-H, Lee J-H, In Y. Detection of Prosthetic Loosening in Hip and Knee Arthroplasty Using Machine Learning: A Systematic Review and Meta-Analysis. Medicina (Kaunas). 2023;59(4):782. pmid:37109740
  18. 18. Chong YY, Chan PK, Chan VWK, Cheung A, Luk MH, Cheung MH, et al. Application of machine learning in the prevention of periprosthetic joint infection following total knee arthroplasty: a systematic review. Arthroplasty. 2023; 5(1). https://doi:10.1186/s42836-023-00195-2 pmid:531563164.
  19. 19. Salman LA, Khatkar H, Al-Ani A, Alzobi OZ, Abudalou A, Hatnouly AT, et al. Reliability of artificial intelligence in predicting total knee arthroplasty component sizes: a systematic review. Eur J Orthop Surg Traumatol. 2024;34(2):747–56. pmid:38010443
  20. 20. Hinterwimmer F, Lazic I, Suren C, Hirschmann MT, Pohlig F, Rueckert D, et al. Machine learning in knee arthroplasty: specific data are key-a systematic review. Knee Surg Sports Traumatol Arthrosc. 2022;30(2):376–88. pmid:35006281
  21. 21. Hirschmann A, Cyriac J, Stieltjes B, Kober T, Richiardi J, Omoumi P. Artificial Intelligence in Musculoskeletal Imaging: Review of Current Literature, Challenges, and Trends. Semin Musculoskelet Radiol. 2019;23(3):304–11. pmid:31163504
  22. 22. McInnes MDF, Moher D, Thombs BD, McGrath TA, Bossuyt PM, and the PRISMA-DTA Group, et al. Preferred Reporting Items for a Systematic Review and Meta-analysis of Diagnostic Test Accuracy Studies: The PRISMA-DTA Statement. JAMA. 2018;319(4):388–96. pmid:29362800
  23. 23. Ge L, Tian J-H, Li Y-N, Pan J-X, Li G, Wei D, et al. Association between prospective registration and overall reporting and methodological quality of systematic reviews: a meta-epidemiological study. J Clin Epidemiol. 2018;93:45–55. pmid:29111471
  24. 24. Whiting PF, Rutjes AWS, Westwood ME, Mallett S, Deeks JJ, Reitsma JB, et al. QUADAS-2: a revised tool for the quality assessment of diagnostic accuracy studies. Ann Intern Med. 2011;155(8):529–36. pmid:22007046
  25. 25. Yang B, Mallett S, Takwoingi Y, Davenport CF, Hyde CJ, Whiting PF, et al. QUADAS-C: A Tool for Assessing Risk of Bias in Comparative Diagnostic Accuracy Studies. Ann Intern Med. 2021;174(11):1592–9. pmid:34698503
  26. 26. Tian J, Zhang J, Ge L, Yang K, Song F. The methodological and reporting quality of systematic reviews from China and the USA are similar. J Clin Epidemiol. 2017;85:50–8. pmid:28063911
  27. 27. Yi PH, Wei J, Kim TK, Sair HI, Hui FK, Hager GD, et al. Automated detection & classification of knee arthroplasty using deep learning. Knee. 2020;27(2):535–42. pmid:31883760
  28. 28. Belete SC, Batta V, Kunz H. Automated classification of total knee replacement prosthesis on plain film radiograph using a deep convolutional neural network. Informatics in Medicine Unlocked. 2021; 25. https://doi:10.1016/j.imu.2021.100669 pmid:531563254.
  29. 29. Karnuta JM, Luu BC, Roth AL, Haeberle HS, Chen AF, Iorio R, et al. Artificial Intelligence to Identify Arthroplasty Implants From Radiographs of the Knee. Journal of Arthroplasty. 2021; 36(3):935-40. https://doi:10.1016/j.arth.2020.10.021 pmid:531563207.
  30. 30. Patel R, Thong EHE, Batta V, Bharath AA, Francis D, Howard J. Automated Identification of Orthopedic Implants on Radiographs Using Deep Learning. Radiol Artif Intell. 2021;3(4):e200183. pmid:34350407
  31. 31. Sharma S, Batta V, Chidambaranathan M, Mathialagan P, Mani G, Kiruthika M, et al. Knee Implant Identification by Fine-Tuning Deep Learning Models. Indian Journal of Orthopaedics. 2021; 55(5):1295-305. https://doi:10.1007/s43465-021-00529-9 pmid:531565198
  32. 32. Klemt C, Uzosike AC, Cohen-Levy WB, Harvey MJ, Subih MA, Kwon YM. The Ability of Deep Learning Models to Identify Total Hip and Knee Arthroplasty Implant Design From Plain Radiographs. The Journal of the American Academy of Orthopaedic Surgeons. 2022; 30(9):409-15. https://doi:10.5435/JAAOS-D-21-00771 pmid:531562954
  33. 33. Tiwari A, Yadav AK, Bagaria V. Application of deep learning algorithm in automated identification of knee arthroplasty implants from plain radiographs using transfer learning models: Are algorithms better than humans?. Journal of Orthopaedics. 2022;32: 139-45. https://doi:10.1016/j.jor.2022.05.013 pmid:531563161.
  34. 34. Karnuta JM, Shaikh HJF, Murphy MP, Brown NM, Pearle AD, Nawabi DH, et al. Artificial Intelligence for Automated Implant Identification in Knee Arthroplasty: A Multicenter External Validation Study Exceeding 3.5 Million Plain Radiographs. Journal of Arthroplasty. 2023. https://doi:10.1016/j.arth.2023.03.039 pmid:531563200.
  35. 35. Ghose S, Datta S, Batta V, Malathy C, M G. Artificial Intelligence based identification of Total Knee Arthroplasty Implants. 2020 3rd International Conference on Intelligent Sustainable Systems (ICISS). 2020:302–7.
  36. 36. Bonnin M, Müller-Fouarge F, Estienne T, Bekadar S, Pouchy C, Selmi T. Artificial Intelligence Radiographic Analysis Tool for Total Knee Arthroplasty. Journal of Arthroplasty. 2023;38(7):S199-S207.e2. https://doi:10.1016/j.arth.2023.02.053 pmid:531563205.
  37. 37. Farooq H, Deckard ER, Arnold NR, Meneghini RM. Machine Learning Algorithms Identify Optimal Sagittal Component Position in Total Knee Arthroplasty. Journal of Arthroplasty. 2021;36(7):S242-S9. https://doi:10.1016/j.arth.2021.02.063 pmid:531565299.
  38. 38. Burge TA, Jones GG, Jordan CM, Jeffers JRT, Myant CW. A computational tool for automatic selection of total knee replacement implant size using X-ray images. Front Bioeng Biotechnol. 2022;10:971096. pmid:36246387
  39. 39. Erne F, Grover P, Dreischarf M, Reumann MK, Saul D, Histing T, et al. Automated Artificial Intelligence-Based Assessment of Lower Limb Alignment Validated on Weight-Bearing Pre- and Postoperative Full-Leg Radiographs. Diagnostics. 2022;12(11). https://doi:10.3390/diagnostics12112679 pmid:531563253.
  40. 40. Schwarz GM, Simon S, Mitterer JA, Frank BJH, Aichmair A, Dominkus M, et al. Artificial intelligence enables reliable and standardized measurements of implant alignment in long leg radiographs with total knee arthroplasties. Knee surgery, sports traumatology, arthroscopy: official journal of the ESSKA. 2022;30(8):2538-47. https://doi:10.1007/s00167-022-07037-9 pmid:531563199.
  41. 41. Yue Y, Gao Q, Zhao M, Li D, Tian H. Prediction of Knee Prosthesis Using Patient Gender and BMI With Non-marked X-Ray by Deep Learning. Front Surg. 2022;9: 798761. https://doi:10.3389/fsurg.2022.798761 pmid:531566012.
  42. 42. Pagano S, Müller K, Götz J, Reinhard J, Schindler M, Grifka J, et al. The Role and Efficiency of an AI-Powered Software in the Evaluation of Lower Limb Radiographs before and after Total Knee Arthroplasty. J Clin Med. 2023;12(17):5498. pmid:37685563
  43. 43. Shah RF, Bini SA, Martinez AM, Pedoia V, Vail TP. Incremental inputs improve the automated detection of implant loosening using machine-learning algorithms. Bone Joint J. 2020;102-B(6_Supple_A):101–6. pmid:32475275
  44. 44. Lau LCM, Chui ECS, Man GCW, Xin Y, Ho KKW, Mak KKK, et al. A novel image-based machine learning model with superior accuracy and predictability for knee arthroplasty loosening detection and clinical decision making. Journal of Orthopaedic Translation. 2022;36: 177-83. https://doi:10.1016/j.jot.2022.07.004 pmid:531565686.
  45. 45. Kim MS, Cho RK, Yang SC, Hur JH, In Y. Machine Learning for Detecting Total Knee Arthroplasty Implant Loosening on Plain Radiographs. Bioengineering-Basel. 2023;10(6). https://doi:10.3390/bioengineering10060632 pmid:531565304.
  46. 46. Weinstein EJ, Stephens-Shields A, Loabile B, Yuh T, Silibovsky R, Nelson CL, et al. Development and validation of case-finding algorithms to identify prosthetic joint infections after total knee arthroplasty in Veterans Health Administration data. Pharmacoepidemiol Drug Saf. 2021;30(9):1184–91. pmid:34170057
  47. 47. Wu L, Yang F, Wu Y, Cui J, Shi H, Bin S. A deep learning framework for diagnosing periprosthetic joint infections using X-ray images: a discovery and validation study. The Journal of arthroplasty. 2022. https://doi:10.1016/j.arth.2022.08.037 pmid:531564056.
  48. 48. Do HM, Spear LG, Nikpanah M, Mirmomen SM, Machado LB, Toscano AP, et al. Augmented Radiologist Workflow Improves Report Value and Saves Time: A Potential Model for Implementation of Artificial Intelligence. Acad Radiol. 2020;27(1):96–105. pmid:31818390
  49. 49. Langlotz CP, Allen B, Erickson BJ, Kalpathy-Cramer J, Bigelow K, Cook TS, et al. A Roadmap for Foundational Research on Artificial Intelligence in Medical Imaging: From the 2018 NIH/RSNA/ACR/The Academy Workshop. Radiology. 2019;291(3):781–91. pmid:30990384
  50. 50. Prevedello LM, Erdal BS, Ryu JL, Little KJ, Demirer M, Qian S, et al. Automated Critical Test Findings Identification and Online Notification System Using Artificial Intelligence in Imaging. Radiology. 2017;285(3):923–31. pmid:28678669
  51. 51. Zhou B, Khosla A, Lapedriza A, Oliva A, Torralba A. Learning Deep Features for Discriminative Localization. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2016.