Figures
Abstract
Objective
To externally validate and compare the prognostic performance of four diabetic foot ulcer classifications—Meggitt-Wagner, University of Texas, SINBAD, and Saint Elian— for predicting major amputation in patients with diabetes treated at a public hospital in Peru.
Materials and methods
We conducted a retrospective cohort study at María Auxiliadora Hospital, one of the few referral centers in Lima with a specialized Diabetic Foot Unit. The study period was January 2015 to December 2019. Eligible patients had a lower-limb ulcer, complete clinical data recorded within 48 hours of the index healthcare encounter (hospital admission or outpatient evaluation), and at least six months of follow-up. Patients with venous ulcers or pressure ulcers related to immobilization were excluded. The primary outcome was major amputation, defined as any procedure above the ankle. Prognostic performance of the four systems was assessed using the area under the receiver operating characteristic curve (AUROC), sensitivity, specificity, predictive values, and the Youden index
Results
A total of 342 patients were included. All had type 2 diabetes mellitus, median age was 62 years (IQR 54–69), and 229 (67.0%) were male. The six-month cumulative incidence of major amputation was 11%. The AUROC values were: Saint Elian, 0.90; University of Texas (3D stage), 0.81; Meggitt-Wagner, 0.81; and SINBAD, 0.74. The most discriminative thresholds were SE ≥ 18, TU stage 3D, MW ≥ 3, and SINBAD ≥5. Patients with SE > 20 had a 32-fold higher risk of amputation compared with those with SE < 16.
Conclusion
The Saint Elian and University of Texas systems showed the best prognostic accuracy, while Meggitt-Wagner and SINBAD performed moderately. These findings provide clinicians with clear cut-off points to identify high-risk patients and support early interventions in referral settings. Their effective use depends on multidisciplinary teams trained in diabetic foot management, and selecting the classification system most suited to hospital resources and patient profiles may help reduce amputation rates.
Citation: Gallardo-Alburqueque LA, Quispe-Landeo Y, Chanamé-Marín A, Uribe-Cavero LJ, Yovera-Aldana M (2026) External validation of the Meggitt-Wagner, Texas University, SINBAD, and Saint Elian classifications for predicting major amputation in patients with diabetes at a public hospital in Peru. PLoS One 21(1): e0327601. https://doi.org/10.1371/journal.pone.0327601
Editor: Xiaoen Wei, Shuguang Hospital Affiliated to Shanghai University of Traditional Chinese Medicine, CHINA
Received: June 26, 2025; Accepted: December 12, 2025; Published: January 21, 2026
Copyright: © 2026 Gallardo-Alburqueque et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the paper and its Supporting information files.
Funding: This research received no specific funding. If accepted for publication, the article processing charge (APC) will be funded by Universidad Científica del Sur, Peru. The funder will have no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Abbreviations: SE, Saint Elian; MW, Meggitt-Wagner; TU, Texas University score; AUROC, Area under the receiver operating characteristic curve; MINSA, Peruvian Ministry of Health; SIS, Seguro Integral de Salud; IWGDF, International Working Group on the Diabetic Foot
Introduction
Diabetic foot is one of the most frequent and severe complications of diabetes mellitus. A lower limb amputation due to diabetic foot occurs every 20 seconds worldwide, making it the leading cause of non-traumatic limb loss [1]. Patients with diabetic foot ulcers have a 2.5-fold increased risk of death within five years compared to those with diabetes but without ulcers [2]. The direct costs associated with diabetic foot account for approximately one-third of total diabetes-related expenditures and are comparable to the investments made in oncological diseases [3] Given its high burden, the study of its prevalence and prognosis is essential to improve clinical outcomes and optimize healthcare resource allocation [4].
In clinical practice, prediction rules based on signs, symptoms, medical history, and laboratory findings are widely used to assess disease prognosis, guide therapeutic decisions, and enhance communication between healthcare providers and patients [5]. In the case of diabetic foot, such tools are primarily applied to predict the likelihood of major amputation or the success of limb-salvage interventions [6]. Currently, more than 25 classification systems have been developed, each incorporating a different number of criteria [7]. Their performance varies depending on the available technology, the characteristics of the reference population, and the presence of multidisciplinary care teams [8].
In the Peruvian context, diabetic foot affects approximately 5.9% of patients attending healthcare facilities [9]. It is also accounts one out of every five hospital admissions among patients with diabetes in Peruvian hospitals [10]. Limitations in healthcare personnel training, infrastructure, and funding hinder the establishment of multidisciplinary teams dedicated to diabetic foot care. [11] Furthermore, only 11.5% to 15% of these patients achieve adequate metabolic control —defined as optimal levels of blood glucose, blood pressure, and lipid profile —which further contributes to suboptimal clinical outcomes [12,13].
Most clinical scoring systems for diabetic foot were developed and validated in high-income countries, and their predictive performance may not translate directly to Latin American settings. Differences in patient profiles, healthcare pathways, and resource availability can influence prognostic accuracy, underscoring the need for external validation. [14] To address this gap, we compared the discriminative capacity of four widely used classifications—Meggitt-Wagner (MW), University of Texas score (TU), SINBAD, and Saint Elian (SE)—to predict major amputation at six months among patients managed at the Diabetic Foot Unit of a national referral hospital in Peru.
Methods
Study design
We performed an external validation study using a retrospective cohort design, analyzing prospectively recorded clinical data from the Diabetic Foot Unit at María Auxiliadora Hospital (Lima, Peru) between January 2015 and December 2019.
Study setting and clinical management
María Auxiliadora Hospital is a public referral center located in southern Metropolitan Lima, serving 13 districts (~2.5 million inhabitants). It primarily attends an insured population covered by Peru’s state-subsidized program for low-income and very low-income groups (Seguro Integral de Salud, SIS). Patients with diabetes and lower-limb ulcers, infection, or ischemia are assessed by the Diabetic Foot Unit either from outpatient services or the emergency department. The Unit applies a standardized diagnostic-therapeutic pathway: initial debridement with tissue sampling and targeted antibiotics, an intensified wound-care schedule during the acute phase (daily, Monday to Saturday), step-down to three sessions per week upon stabilization, vascular assessment with arterial Doppler waveforms, and inpatient or ambulatory follow-up until complete epithelialization. After epithelialization, patients continue monthly follow-up on an ongoing basis to assess for recurrence.
Population, sample and sampling
Eligible patients were identified from the prospectively maintained Diabetic Foot database of the Endocrinology Service, which records all patients managed by the Diabetic Foot Unit. We included patients with a lower-limb ulcer and complete baseline clinical information documented within 48 hours of the index healthcare event (hospital admission or outpatient evaluation), with at least six months of follow-up. Diabetic foot was defined as a lower-extremity ulcer in a patient with documented diabetes mellitus (medical history and laboratory records). We excluded venous ulcers (typical location and signs of chronic venous insufficiency, without neuropathic/ischemic features) and pressure ulcers related to immobilization (heel location with bedridden status). Patients with neuropathic, ischemic, or mixed diabetic ulcers were included. A non-probabilistic, convenience sampling approach was used, based on consecutive cases in the database during the study period.
Variables
Diabetic foot classifications.
We evaluated four widely used systems—Meggitt-Wagner (MW), University of Texas (TU), SINBAD, and Saint Elian (SE). MW and TU were included due to their historical and global use as reference frameworks; SINBAD is recommended by the IWGDF for international comparability; and SE, developed in Latin America, integrates multiple clinical domains relevant to resource-limited environments. Each system was applied as originally defined
- Meggitt-Wagner Classification (MW): This system is primarily based on ulcer depth, with partial consideration of ischemia. It includes six grades: grade 0, intact skin; grade 1, superficial ulcer; grade 2, ulcer reaching tendon or joint capsule; grade 3, deep ulcer with osteomyelitis or abscess; grade 4, localized gangrene (e.g., toes or forefoot); and grade 5, extensive gangrene involving the entire foot [15].
- Texas University Classification (TU): This system incorporates three dimensions: ulcer depth, infection, and ischemia. Depth is graded from 0 to 3: (0) pre- or post-ulcerative lesion with intact skin, (1) superficial wound not involving tendon, capsule or bone, (2) wound penetrating to tendon or capsule, and (3) wound penetrating to bone or joint. The presence of infection and/or ischemia is categorized as follows: (A) neither present, (B) infection, (C) ischemia, (D) both infection and ischemia. These combinations yield 16 possible grades with distinct prognostic implications for limb salvage and amputation [16].
- SINBAD Classification: Developed by Ince et al. in 2008 as a modification of the (S[AD])SAD system, it is an acronym that evaluates six parameters: Site (forefoot vs. midfoot/hindfoot), Ischemia (yes/no), Neuropathy (yes/no), Bacterial infection (yes/no), Area (<1 cm² or ≥1 cm²), and Depth (superficial or involving muscle/bone). This simplified score is intended for broad applicability in clinical and resource-limited settings [17].
- Saint Elian Classification (SE): This system is composed of ten parameters organized into three domains [18]. A cumulative score is generated, classifying severity as mild (<10 points), moderate (11–20 points), or severe (21–30 points).
- Anatomical factors: location (phalanges, metatarsal, or tarsal), topography (dorsal/plantar, lateral/medial, or involving two or more areas), and number of affected regions (one, two, or the entire foot).
- Aggravating factors: ischemia (none, mild, moderate, or severe), infection (none, mild, moderate, or severe), edema (none, perilesional, limited to the affected leg, or bilateral), and neuropathy (none, diminished sensation, complete loss of protective sensation, or presence of Charcot arthropathy).
- Contributing factors: depth (superficial, below dermis, or full-thickness), ulcer area (<10 cm², 10–40 cm², or >40 cm²), and healing phase (epithelialization, granulation, or inflammatory phase).
Outcome definition and ascertainment.
The primary outcome was major amputation, defined as any amputation above the ankle (infracondylar/supracondylar levels). Only amputations ipsilateral to the index ulcer were considered. Minor amputations (toe or transmetatarsal level) were classified within the “no major amputation” group. Outcomes were ascertained from hospital surgical records with active clinical follow-up; for missed visits, patients or relatives were contacted by telephone. Because this was a retrospective validation based on routinely collected data, predictors and outcomes were recorded in clinical practice prior to this analysis, and outcome assessors were not influenced by the classification scores. Given the absence of a national electronic medical record system during the study period, amputations performed at other institutions may have been missed; this is acknowledged as a study limitation.
Other variables.
Covariates included sociodemographic, anthropometric/clinical, medical history, ulcer characteristics (type, size, depth, location), hospitalization status (outpatient vs. inpatient), length of hospital stay, ischemia (see below), and laboratory data (HbA1c, hemoglobin, LDL cholesterol, albumin). All patients in the cohort had type 2 diabetes mellitus; no cases of type 1 diabetes were recorded.
- Sociodemographic variables: Age in years (<60; ≥ 60), sex (male/female), and educational attainment (primary or less; secondary; higher education).
- Anthropometric and clinical data: Body mass index (BMI) in kg/m² (<25; 25–29; ≥ 30), diabetes duration in years (<10; ≥ 10), and type of diabetes treatment (diet only; oral antidiabetic drugs; insulin).
- Medical history: Presence or absence of hypertension, coronary artery disease, stroke, and history of prior major amputation.
- Ulcer characteristics: Type (new vs. recurrent), size (measured 48 hours after admission using the ellipse formula, categorized as <10 cm²; 10–39 cm²; ≥ 40 cm²), and depth (based on the IWGDF PEDIS classification). Ulcer location was classified as forefoot (yes/no).
- Ischemia: Ischemia was assessed by arterial Doppler waveform analysis (monophasic, biphasic, or absent flow in the anterior tibial, posterior tibial, or popliteal arteries). Although toe–brachial index (TBI) and ankle–brachial index (ABI) provide quantitative measures, they were often not feasible due to extensive lesions, edema, or pain, and their interpretation can be unreliable in patients with medial arterial calcification (Mönckeberg sclerosis). In such contexts, qualitative Doppler waveform assessment is endorsed in the literature as a valid alternative, since monophasic or absent waveforms correlate with significant ischemia and adverse limb prognosis. [19] While operator-dependent, this method remains practical and widely applicable in diabetic-foot care when pressure indices are limited. [20]
- Laboratory data: Glycated hemoglobin (HbA1c < 7%; ≥ 7%), hemoglobin (<8 g/dL; ≥ 8 g/dL), LDL cholesterol (<100 mg/dL; ≥ 100 mg/dL), and serum albumin (<2.5 g/dL; ≥ 2.5 g/dL).
Procedures
Permission to access the diabetic foot database was obtained from the Endocrinology Service of María Auxiliadora Hospital. The dataset was provided in a fully anonymized format, with all personally identifiable information removed to ensure patient confidentiality
Statistical analysis
The authors accessed and processed the dataset between December 1, 2022 and Augut 31, 2023. All analyses were conducted using R software, version 2025.05.0 + 496. A priori, statistical power was calculated to ensure that the available sample size was adequate for the planned comparisons, using a two-sided alpha level of 5%. The detailed results of the power analysis are presented in S1 Table. Missing data were assessed for all variables. No imputation was performed; analyses were conducted using complete-case data. All variables required to compute the four classification systems and the primary outcome had complete data (100% availability).
Descriptive statistics were reported using absolute and relative frequencies for categorical variables, stratified by the presence or absence of major amputation. Differences between groups were assessed using the chi-square test for nominal variables and the Wilcoxon rank-sum test for ordinal or non-normally distributed variables. Additionally, the median and interquartile range (IQR) of length of hospital stay were calculated.
The six-month cumulative incidence of major amputation was determined according to MW, TU, SINBAD, and SE. Relative risks (RR) with 95% CIs were calculated using the lowest theoretical-risk category as reference. To ensure statistical stability, categories with sparse or no events were collapsed with adjacent groups. For the Meggitt-Wagner classification, grades 1–3 were integrated because no amputations occurred in grades 1–2 and only three amputations were observed in grade 3. For the Texas University classification, most amputations were concentrated in stage 3D, while other stages had very few or zero events; therefore, categories were collapsed into “3D” versus “other stages.” This merging approach was used to ensure sufficient sample sizes for statistical analyses and is consistent with prior validation studies evaluating diabetic foot classification systems [21].
Discrimination was assessed via AUROC with pairwise comparisons (DeLong test for correlated ROC curves) and additional sensitivity analyses were performed stratifying patients by their care setting (outpatient versus inpatient).. To facilitate interpretation of prognostic accuracy, we prespecified thresholds for the area under the ROC curve. An AUROC of 0.50 indicates chance-level discrimination; 0.51–0.59, very poor discrimination; 0.60–0.69, poor discrimination; 0.70–0.79, moderate discrimination; 0.80–0.89, good discrimination; and ≥0.90, excellent discrimination [22]. In addition to the primary outcome (major amputation), we conducted a complementary analysis in which the discriminatory performance of all classification systems was evaluated for the composite outcome “any amputation” (major or minor). This secondary analysis used the same ROC framework, with AUCs and 95% binomial exact confidence intervals estimated identically to the primary analysis.
Sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) were estimated for each classification and the category or score with the highest discriminative value was identified using the Youden index. A two-sided p-value < 0.05 was considered statistically significant.
Reporting guidelines
This study was reported in accordance with the Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) guidelines for cohort studies [23] and the Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD) statement for external validation studies [24]. The completed checklists are available in Supplementary S1 Table (STROBE) and Supplementary S2 Table (TRIPOD).
Ethical aspects
This study was approved by the Endocrinology Department of María Auxiliadora Hospital and the Institutional Ethics and Research Committee of the Universidad Científica del Sur (Approval Certificate No. 386-CIEI-CIENTÍFICA-2021). The ethical approval for the project was renewed on August 10, 2023, extending its validity for six months, until February 10, 2024. The analyzed database contained no personally identifiable information, as all data were encrypted. Only physicians from the hospital’s endocrinology service had access to the identification codes linking patient data.
Results
A total of 357 patients were recorded in the diabetic foot database. Fifteen were excluded because they had incomplete diabetic foot classification data (n = 12) or an unknown outcome (n = 3), resulting in 342 patients included in the analysis. Among these, 39 (11.4%) underwent a major amputation, while the remaining 303 did not; this latter group comprised 76 patients with a minor amputation (22.2%) and 227 with no amputation (66.4%). Overall, any amputation (major or minor) occurred in 115 patients (33.6%) (Fig 1).
Of 357 patients, 15 were excluded, leaving 342 for analysis, including 39 major amputations and 303 without major amputation (76 minor amputations and 227 without amputation).
General characteristics
The average age of the population was 60 years, with two out of three being men. Hypertension was present in 38%, 3.9% had a prior stroke, and 7% had a previous major amputation. Diabetes duration was ≥ 10 years in 62%, 41% received insulin treatment before infection, 80% had HbA1c ≥ 7%, 12% had albumin < 2.5 g/dL, and 75% had hemoglobin < 8 g/dL. Regarding foot characteristics, 77% had an ulcer extension greater than 40 cm², 97% presented a new ulcer, 59% had ischemia, and 46% had severe infection (Table 1). The proportion of missing data for each variable is shown in Supplementary S3 Table.
Differences between amputees and non-amputees
Patients with major amputation had a longer duration of illness and higher prevalence of comorbidities such as hypertension and prior stroke. Age, education level, and body mass index were similar between groups.
Regarding diabetic foot features, major amputation was associated with greater ulcer depth, severe infection, ischemia, and larger ulcer extension, but not with ulcer location or recurrence.
Laboratory variables showed an association between albumin < 2.5 g/dL and major amputation, whereas HbA1c, LDL cholesterol, and hemoglobin showed no significant association (Table 1).
When stratified by classification categories, most amputations were concentrated in the most advanced stages. For the Meggitt–Wagner system, 92% of amputations occurred in grades 4–5, where the amputation rates were 23.8% and 100%, respectively. In the Texas University classification, 87% of amputations were clustered in stage 3D, with an amputation rate of 30.9%. For SINBAD, 85% of amputations occurred in scores ≥5, with rates of 21.7% in score 5 and 20.0% in score 6. Finally, for the Saint Elian system, almost half of patients in the 21–25 category (47.6%) required major amputation, compared with only 2.7% in the 11–15 group and 16.5% in the 16–20 group (Supplementary S4 Table).
Risk of major amputation
MW grades 4 or 5 increased the risk of major amputation 14.7 times (RR 15.7; 95% CI 4.9 to 50.2; p < 0.001) compared to the combined categories 1–3. TU stage 3D increased the risk 13.3 times (RR 14.3; 95% CI 5.7 to 35.7; p < 0.001) compared to the rest of the combined stages. When analyzing the three individual components of TU, only ischemia had sufficient cases to calculate risk (RR 4.59; 95% CI 1.84 to 11.48). A SINBAD score of 5 increased the risk 6.88 times (RR 7.88; 95% CI 3.11 to 19.9) compared to scores less than 4. Finally, an SE score greater than 20 increased the risk 31.2 times (95% CI 10.0 to 103.6) compared to scores less than 16 (Table 2).
Prognostic capacity
Seven ROC curves corresponding to four classification systems—SE, MW, TU (3D stage, depth, ischemia, infection), and SINBAD—were analyzed (Fig 2).
Areas under the ROC curve (AUC) with 95% confidence intervals: Saint Elian, 0.900 (0.858–0.942); University of Texas (UT) 3D stage, 0.811 (0.752–0.869); Meggitt–Wagner, 0.805 (0.770–0.839); UT depth, 0.802 (0.774–0.829); SINBAD, 0.747 (0.693–0.802); UT ischemia, 0.655 (0.595–0.715); UT infection, 0.650 (0.624–0.676). The diagonal line represents chance discrimination.
The SE scale demonstrated the best accuracy compared to the others, with an AUROC of 0.900. Followed by TU 3D stage (AUROC = 0.811) and MW (AUROC = 0.805), both showing similar performance. Lastly, SINBAD had the lowest discriminative ability, with an AUROC of 0.747. Regarding the individual components of UT, depth showed an accuracy similar to MW (AUROC = 0.802). Meanwhile, the ischemia and infection components exhibited lower performances, with AUROCs of 0.655 and 0.650, respectively, being the lowest among all. When stratifying by admission status, prognostic performance was consistently lower among inpatients compared with the overall cohort. For inpatients (n = 109), the Saint Elian classification decreased to moderate discrimination (AUROC 0.74, 95% CI 0.65–0.82), while Meggitt–Wagner and SINBAD dropped to poor or very poor discrimination (AUROC 0.60, 95% CI 0.50–0.69, and AUROC 0.58, 95% CI 0.48–0.67, respectively). The University of Texas system also showed diminished accuracy in this subgroup (AUROC 0.66 for the 3D stage, 95% CI 0.56–0.75). By contrast, in outpatients no major amputations were observed during follow-up, preventing calculation of discrimination metrics in this group (Table 3).
Pairwise comparisons confirmed these rank orders. The Saint Elian (SE) scale discriminated significantly better than Texas University stage 3D (ΔAUC = 0.09, 95% CI 0.03–0.14; p = 0.001), Meggitt–Wagner (ΔAUC = 0.10, 0.06–0.14; p < 0.001), and SINBAD (ΔAUC = 0.16, 0.10–0.20; p < 0.001). TU 3D outperformed SINBAD (ΔAUC = 0.07, 0.02–0.10; p = 0.003) and did not differ from Meggitt–Wagner (ΔAUC ≈ 0.01; p > 0.80). Meggitt–Wagner showed only a borderline advantage over SINBAD (ΔAUC = 0.06, 0.00–0.11; p = 0.053). For the UT components, ‘depth’ performed similarly to Meggitt–Wagner (p ≈ 0.80) and showed a borderline advantage over SINBAD (p ≈ 0.05), whereas the ‘ischemia’ and ‘infection’ components performed significantly worse than SE, TU 3D, and Meggitt–Wagner (ΔAUC range −0.09 to −0.25; all p < 0.001) (Table 4).
In supplementary analyses, the statistical power of the AUROC comparisons was high. Specifically, the power exceeded 90% for the contrasts between Saint Elian versus Meggitt-Wagner and SINBAD, and reached 78% for Saint Elian versus Texas University (Supplementary S5 Table). This indicates that the study had sufficient precision to detect meaningful differences in discriminative performance between classification systems.
The performance of the classifications was informed thanks to the Youden index of each one. A Wagner score greater than or equal to 3 had an index of 52.7%, a Texas 3D stadium had an index of 62.1%, a SINBAD score greater than or equal to 5 had an index of 45.6%, and a Saint Elian score greater than or equal to 18 had an index of 69.4. In line with Youden’s index (MW ≥ 3: 60.4%; TU 3D: 62.1%; SINBAD ≥ 5: 45.6%; SE ≥ 18: 69.4%), the best operating points also showed favorable sensitivity–specificity trade-offs. SE ≥ 18 achieved 87.2% sensitivity and 82.2% specificity (PPV 38.6%, NPV 98.0%; LR + 4.89, LR– 0.15). TU 3D showed 87.2% sensitivity and 74.9% specificity (PPV 30.9%, NPV 97.8%; LR + 3.47, LR– 0.17). MW ≥ 3 provided 100% sensitivity with 60.4% specificity (PPV 24.5%, NPV 100.0%; LR + 2.52, LR– 0.00), favoring rule-out. SINBAD ≥ 5 yielded 87.2% sensitivity and 58.4% specificity (PPV 21.3%, NPV 97.3%; LR + 2.09, LR– 0.21). Collectively, these thresholds indicate that SE ≥ 18 and TU 3D offer the strongest discrimination for identifying high-risk patients, while MW ≥ 3 maximizes sensitivity for screening (Table 5). In addition to these key thresholds, Supplementary S6 Table provides the complete evaluation of all possible cut-off points for each classification system.
As a complementary analysis, we also evaluated the performance of all seven classification systems for the broader outcome of “any amputation” (including both major and minor procedures). Meggitt–Wagner and the UT depth component showed the highest discrimination in the full cohort, whereas performance decreased substantially among hospitalized patients. In non-hospitalized patients, high-performing systems retained strong accuracy (AUC > 0.80). Full detailed results for this complementary sensitivity analysis are provided in Supplementary S7 Table.
Discussion
Main findings
In this external validation study, the Saint Elian classification showed excellent discriminative ability (AUROC 0.90), outperforming all other systems. Both the University of Texas stage 3D (AUROC 0.811) and the Meggitt–Wagner system (AUROC 0.805) achieved a good level of prognostic accuracy, while the SINBAD score (AUROC 0.747) demonstrated only moderate performance. Within the UT system, depth performed similarly to Wagner (AUROC 0.802, good), but the isolated ischemia (AUROC 0.655) and infection (AUROC 0.650) components had poor discriminatory power. These findings indicate that multidimensional systems such as SE and UT provide more robust prognostic information than single-dimension assessments, highlighting their potential value in clinical decision-making for diabetic foot care.
Comparison with previous studies
Recent evidence highlights the variability and methodological limitations in the validation of diabetic foot ulcer classifications. A systematic review and meta-analysis conducted in Latin America and the Caribbean found that the Wagner system at grade 3 threshold achieved the highest sensitivity, while the Saint Elian classification showed the highest specificity, although the certainty of the evidence was rated very low due to heterogeneity and methodological weaknesses across studies [25]. Complementing these findings, a global systematic review identified 28 different systems applied in 149 studies, noting that most classifications—including Wagner, University of Texas, WIfI, PEDIS, and SINBAD—were validated in only a few studies each, with low or very low certainty of evidence, and no consensus on a universally applicable tool [26]. These results underscore the fragmented nature of the evidence base and the need for more rigorous, standardized validation efforts across diverse healthcare settings.
Saint Elian classification
At the Latin American level, our results are consistent with an Argentine study that followed patients with new-onset diabetic foot ulcers for five months. This study reported an AUC of 0.893 for major amputation using the Saint Elian classification with a cut-off point greater than 18 [27]. Similarly, a Chinese study found that a Saint Elian score greater than 17 reduced the probability of cure by 24%, although it did not report statistics for major amputation alone due to a small sample size [28]. The Saint Elian system, composed of ten variables, is theoretically better suited to predict major amputation; however, the inclusion of multiple variables increases interobserver variability and may lead to inconsistent results. This highlights the importance of training and standardized validation procedures.
Texas University classification
Regarding the UT classification, a Philippine study reported an AUC of 0.785 for grade and 0.575 for stage [29]. In contrast, in our analysis, the University of Texas classification showed the second-best performance (AUC = 0.81) for major amputations. This difference may be explained by our dichotomous analysis, focused specifically on the presence of stage 3D, due to its high prevalence in our hospital population. The Texas system does not yield a single score but instead assigns patients into one of 16 categories based on four grades and four stages. In practice, the grade assesses lesion depth, while the stage identifies infection or ischemia [16]. An expanded version, the WIfI classification, was introduced nine years ago and also incorporates infection and ischemia categorization into a single outcome [30]. However, we were unable to evaluate this system due to limitations in vascular assessment.
Wagner classification
The Wagner classification demonstrated intermediate prognostic performance. It primarily assesses ulcer depth and includes only a partial assessment of peripheral arterial disease. In cases with necrosis, it cannot distinguish whether the cause is infection or ischemia. As one of the earliest and most widely used classification systems, Wagner’s utility in research is limited by its lack of detail regarding infection severity or ischemic status [31]. Nonetheless, in our study population, which included many patients with deep lesions and necrosis, this system had the third-best performance.
SINBAD classification
The SINBAD classification, composed of six dichotomized variables, had the lowest performance (AUC = 0.74). While this system is endorsed by the International Working Group on the Diabetic Foot (IWGDF) for communication among healthcare providers and outcome comparisons across institutions. Its binary structure and fixed cut-offs may limit its discriminative power in high-risk populations such as hospitalized patients [32].
Theoretical and practical implications
The Wagner system laid the foundation for prognostic assessment in diabetic foot, and subsequent tools—University of Texas, SINBAD, and Saint Elian—incorporated additional dimensions such as depth, infection, and ischemia to improve accuracy. In our setting, Saint Elian likely performed better because its multidimensional structure captures anatomical distribution, aggravating factors (ischemia, infection, edema), and contributing factors (depth, ulcer area, healing phase), providing a more comprehensive representation of disease severity. Despite its simplicity, Wagner remains clinically useful because ulcer depth strongly correlates with necrosis and infection—key drivers of limb loss. [33].
Binary assessments of ischemia and infection, as used in components of the University of Texas classification, performed poorly because they oversimplify conditions that exist on a spectrum and fail to capture intermediate stages that influence prognosis in this sample [16]. The very high relative risks observed in advanced categories are explained by the disproportionate clustering of amputations among these patients, which magnifies effect estimates [34]. The lower discrimination seen in hospitalized patients compared with the overall cohort is consistent with spectrum bias: inpatients typically present with more severe and homogeneous disease, reducing the ability of scoring systems to separate high- from low-risk cases [35].
At the hospital level, adopting Saint Elian or the University of Texas system as primary screening tools could help clinicians promptly identify patients at highest risk for major amputation, enabling intensified surveillance, targeted vascular evaluation, and timely multidisciplinary interventions. Wagner at grade ≥3 may function as a high-sensitivity rule-out threshold at first contact, while SINBAD at score ≥5 can facilitate communication and auditing across services. Other scores such as WIfI (Wound, Ischemia, and foot Infection) could not be evaluated in this cohort because they require systolic pressure measurements (ankle–brachial index, toe–brachial index, or transcutaneous oxygen), which were not systematically available; this highlights the importance of aligning tool selection with feasible diagnostic practices [36]. Finally, training and standardized application are essential to limit interobserver variability and ensure reproducible, clinically meaningful use of any classification system.
Public health importance
The classifications can have different uses in research, auditing, clinical monitoring and prognosis. The estimates found in this study may change depending on the population evaluated, the resources of the health system and the existence of a diabetic foot management program.
At the primary level, the Peruvian diabetic foot guidelines consider only the Wagner classification for reference purposes [37]. The International Working Group on the Diabetic Foot (IWGDF 2023) guidelines recommend the use of SINBAD because it is simple and easy to use, and could be an alternative to improve the communication and statistics [38].
Our data suggest that in referral hospitals with a heavy burden of advanced disease, Saint Elian and University of Texas systems,may offer superior prognostic value, complementing—rather than replacing—tools used in primary care.
Strengths and limitations
This study has several limitations that should be acknowledged. First, ischemia was assessed using qualitative Doppler waveform analysis rather than toe–brachial or ankle–brachial indices, which were often not feasible due to extensive lesions, edema, or pain. Although this approach is supported in diabetic foot care when pressure measurements are unreliable, it may introduce operator-dependent variability. Second, because outcome ascertainment relied primarily on surgical records from the index hospital, amputations performed at other institutions may not have been captured. This incomplete external capture would most likely lead to an underestimation of the true incidence of major amputation (bias toward underascertainment). However, the likelihood of missed amputations is expected to be low in this setting. María Auxiliadora Hospital is the only high-complexity public referral center for diabetic foot care in the catchment area, serving predominantly low-income populations covered by the national subsidized insurance system. Given the limited accessibility to private surgical services and the absence of nearby public facilities with comparable capability for major limb salvage or amputation, it is unlikely that a substantial number of patients would have undergone amputations elsewhere. Therefore, although the direction of this bias would lean toward underestimation, its magnitude is likely small. Third, the study was conducted in a single high-volume referral center, which may limit generalizability to primary care settings or hospitals with different patient profiles, resource constraints, or referral patterns. Fourth, although the dataset contained complete information for all variables required to compute the four classification systems and the primary outcome, several clinical and laboratory covariates had varying degrees of missingness, which restricted the ability to explore additional adjusted or multivariable analyses. Finally, interobserver variability in applying multidimensional classification systems—particularly Saint Elian—was not formally measured and may have introduced non-differential misclassification, potentially biasing estimates toward the null..
Strengths include a consecutive real-world cohort with six-month follow-up, application of four widely used systems, prespecified handling of sparse categories to stabilize estimates, and comprehensive operating characteristics (sensitivity, specificity, predictive values, likelihood ratios, and Youden indices) across the full range of scores and cut-offs (see S3 Table).
Conclusion
In this external-validation study from a Peruvian referral hospital, the Saint Elian and University of Texas classifications demonstrated good prognostic accuracy for predicting major amputation at six months, whereas the Meggitt–Wagner and SINBAD systems showed moderate performance. For practical use, patients with a Saint Elian score of 18 or higher or a University of Texas stage 3D should be considered at high risk and referred to specialized multidisciplinary care. Similarly, a Wagner grade of 3 or more or a SINBAD score of 5 or greater should prompt close monitoring and intensive management. Incorporating these thresholds into local care pathways, while adapting the choice of classification system to available resources and patient profiles, may strengthen risk stratification and improve limb-salvage outcomes in comparable health systems.
Supporting information
S1 Table. STROBE checklist for cohort studies.
https://doi.org/10.1371/journal.pone.0327601.s001
(DOCX)
S2 Table. TRIPOD checklist for external validation studies.
https://doi.org/10.1371/journal.pone.0327601.s002
(DOCX)
S4 Table. Incidence of major amputation across detailed categories of diabetic foot classification systems.
https://doi.org/10.1371/journal.pone.0327601.s004
(DOCX)
S5 Table. Statistical power of pairwise comparisons between ROC curves of diabetic foot classifications.
https://doi.org/10.1371/journal.pone.0327601.s005
(DOCX)
S6 Table. Prognostic discriminative capacity across the full range of categories and cut-off points of diabetic foot classifications for major amputation.
https://doi.org/10.1371/journal.pone.0327601.s006
(DOCX)
S7 Table. AUROC (95% CI) of each classification system for predicting any amputation, overall and stratified by hospitalization status.
https://doi.org/10.1371/journal.pone.0327601.s007
(DOCX)
S1 Dataset. Anonymized dataset used for the external validation of the Meggitt–Wagner, University of Texas, SINBAD, and Saint Elian classifications to predict major amputation in patients with diabetes treated at a public hospital in Peru.
https://doi.org/10.1371/journal.pone.0327601.s008
(XLSX)
References
- 1. Armstrong DG, Tan T-W, Boulton AJM, Bus SA. Diabetic Foot Ulcers: A Review. JAMA. 2023;330(1):62–75. pmid:37395769
- 2. Walsh JW, Hoffstad OJ, Sullivan MO, Margolis DJ. Association of diabetic foot ulcer and death in a population-based cohort from the United Kingdom. Diabet Med. 2016;33(11):1493–8. pmid:26666583
- 3. Armstrong DG, Swerdlow MA, Armstrong AA, Conte MS, Padula WV, Bus SA. Five year mortality and direct costs of care for people with diabetic foot complications are comparable to cancer. J Foot Ankle Res. 2020;13(1):16. pmid:32209136
- 4. McDermott K, Fang M, Boulton AJM, Selvin E, Hicks CW. Etiology, Epidemiology, and Disparities in the Burden of Diabetic Foot Ulcers. Diabetes Care. 2023;46(1):209–21. pmid:36548709
- 5. van Smeden M, Reitsma JB, Riley RD, Collins GS, Moons KG. Clinical prediction models: diagnosis versus prognosis. J Clin Epidemiol. 2021;132:142–5. pmid:33775387
- 6. Armstrong DG, Peters EJ. Classification of wounds of the diabetic foot. Curr Diab Rep. 2001;1(3):233–8. pmid:12643204
- 7. Game F. Classification of diabetic foot ulcers. Diabetes Metab Res Rev. 2016;32 Suppl 1:186–94. pmid:26455509
- 8. Monteiro-Soares M, Boyko EJ, Jeffcoate W, Mills JL, Russell D, Morbach S, et al. Diabetic foot ulcer classifications: A critical review. Diabetes Metab Res Rev. 2020;36 Suppl 1:e3272. pmid:32176449
- 9. Ramos W, López T, Revilla L, More L, Huamaní M, Pozo M. Results of the epidemiological surveillance of diabetes mellitus in hospitals in Peru, 2012. Rev Peru Med Exp Salud Publica. 2014;31(1):9–15. pmid:24718521
- 10. Yovera-Aldana M, Sáenz-Bustamante S, Quispe-Landeo Y, Agüero-Zamora R, Salcedo J, Sarria C, et al. Nationwide prevalence and clinical characteristics of inpatient diabetic foot complications: A Peruvian multicenter study. Prim Care Diabetes. 2021;15(3):480–7. pmid:33664012
- 11. Campillo-Vilorio N, Aragón-Sánchez J. Latin American initiatives to improve foot care in people with diabetes: the effort to reduce diabetes-related lower extremity amputations. Diabetes Res Clin Pract. 2025;226:112385. pmid:40714067
- 12. Azañedo D, Bendezú-Quispe G, Lazo-Porras M, Cárdenas-Montero D, Thomas NJ, Ceballos RM. Quality of metabolic control in outpatients with type 2 diabetes seen in a private clinic. Acta Méd Peru. 2017;34:106–13.
- 13. Huayanay-Espinoza IE, Guerra-Castañon F, Lazo-Porras M, Castaneda-Guarderas A, Thomas NJ, Garcia-Guarniz A-L, et al. Metabolic control in patients with type 2 diabetes mellitus in a public hospital in Peru: a cross-sectional study in a low-middle income country. PeerJ. 2016;4:e2577. pmid:27761351
- 14. Beulens JWJ, Yauw JS, Elders PJM, Feenstra T, Herings R, Slieker RC, et al. Prognostic models for predicting the risk of foot ulcer or amputation in people with type 2 diabetes: a systematic review and external validation study. Diabetologia. 2021;64(7):1550–62. pmid:33904946
- 15. Wagner FW Jr. The dysvascular foot: a system for diagnosis and treatment. Foot Ankle. 1981;2(2):64–122. pmid:7319435
- 16. Armstrong DG, Lavery LA, Harkless LB. Validation of a diabetic wound classification system. The contribution of depth, infection, and ischemia to risk of amputation. Diabetes Care. 1998;21(5):855–9. pmid:9589255
- 17. Ince P, Abbas ZG, Lutale JK, Basit A, Ali SM, Chohan F, et al. Use of the SINBAD classification system and score in comparing outcome of foot ulcer management on three continents. Diabetes Care. 2008;31(5):964–7. pmid:18299441
- 18. Martínez-De Jesús FR. A checklist system to score healing progress of diabetic foot ulcers. Int J Low Extrem Wounds. 2010;9(2):74–83. pmid:20483806
- 19. Vriens B, D’Abate F, Ozdemir BA, Fenner C, Maynard W, Budge J, et al. Clinical examination and non-invasive screening tests in the diagnosis of peripheral artery disease in people with diabetes-related foot ulceration. Diabet Med. 2018;35(7):895–902. pmid:29633431
- 20. Criqui MH, Matsushita K, Aboyans V, Hess CN, Hicks CW, Kwan TW, et al. Lower Extremity Peripheral Artery Disease: Contemporary Epidemiology, Management Gaps, and Future Directions: A Scientific Statement From the American Heart Association. Circulation. 2021;144(9):e171–91. pmid:34315230
- 21. Chang Y-C, Huang Y-Y, Hung S-Y, Yeh J-T, Lin C-W, Chen I-W, et al. Are current wound classifications valid for predicting prognosis in people treated for limb-threatening diabetic foot ulcers?. Int Wound J. 2024;21(1):e14338. pmid:37555265
- 22. Schober P, Mascha EJ, Vetter TR. Statistics From A (Agreement) to Z (z Score): A Guide to Interpreting Common Measures of Association, Agreement, Diagnostic Accuracy, Effect Size, Heterogeneity, and Reliability in Medical Research. Anesth Analg. 2021;133(6):1633–41. pmid:34633993
- 23. von Elm E, Altman DG, Egger M, Pocock SJ, Gøtzsche PC, Vandenbroucke JP, et al. Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) statement: guidelines for reporting observational studies. BMJ. 2007;335(7624):806–8. pmid:17947786
- 24. Collins GS, Reitsma JB, Altman DG, Moons KGM. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD statement. BMJ. 2015;350:g7594. pmid:25569120
- 25. Tacca-Quinteros PM, Tantaleán-Gutiérrez LV, Yovera-Aldana M. Capacidad pronóstica de los sistemas de clasificación de pie diabético para amputación de miembro inferior en América Latina y el Caribe: una revisión sistemática con metaanálisis. Acta Med Peru. 2024;41(3):167–82.
- 26. Monteiro-Soares M, Hamilton EJ, Russell DA, Srisawasdi G, Boyko EJ, Mills JL, et al. Classification of foot ulcers in people with diabetes: A systematic review. Diabetes Metab Res Rev. 2024;40: e3645.
- 27. Carro GV, Saurral R, Carlucci E, Gette F, Llanos M de LÁ, Amato PS. A Comparison Between Diabetic Foot Classifications WIfI, Saint Elian, and Texas: Description of Wounds and Clinical Outcomes. Int J Low Extrem Wounds. 2022;21(2):120–30. pmid:32594809
- 28. Huang Y, Xie T, Cao Y, Wu M, Yu L, Lu S, et al. Comparison of two classification systems in predicting the outcome of diabetic foot ulcers: the Wagner grade and the Saint Elian Wound score systems. Wound Repair Regen. 2015;23(3):379–85. pmid:25817047
- 29. Vera-Cruz PN, Palmes PP, Tonogan L, Troncillo AH. Comparison of WIFi, University of Texas and Wagner Classification Systems as Major Amputation Predictors for Admitted Diabetic Foot Patients: A Prospective Cohort Study. Malays Orthop J. 2020;14(3):114–23. pmid:33403071
- 30. Mills JL Sr, Conte MS, Armstrong DG, Pomposelli FB, Schanzer A, Sidawy AN, et al. The Society for Vascular Surgery Lower Extremity Threatened Limb Classification System: risk stratification based on wound, ischemia, and foot infection (WIfI). J Vasc Surg. 2014;59(1):220-34.e1-2. pmid:24126108
- 31. Jalilian M, Shiri S. The reliability of the Wagner Scale for evaluation the diabetic wounds: A literature review. Diabetes Metab Syndr. 2022;16(1):102369. pmid:34929620
- 32. Brocklehurst JD. The Validity and Reliability of the SINBAD Classification System for Diabetic Foot Ulcers. Adv Skin Wound Care. 2023;36(11):1–5. pmid:37861669
- 33. Forsythe RO, Ozdemir BA, Chemla ES, Jones KG, Hinchliffe RJ. Interobserver Reliability of Three Validated Scoring Systems in the Assessment of Diabetic Foot Ulcers. Int J Low Extrem Wounds. 2016;15(3):213–9. pmid:27358037
- 34. Reinecke H, Unrath M, Freisinger E, Bunzemeier H, Meyborg M, Lüders F, et al. Peripheral arterial disease and critical limb ischaemia: still poor outcomes and lack of guideline adherence. Eur Heart J. 2015;36(15):932–8. pmid:25650396
- 35. Willis BH. Spectrum bias--why clinicians need to be cautious when applying diagnostic test studies. Fam Pract. 2008;25(5):390–6. pmid:18765409
- 36. Gunawan IMK, Budiarta IB, Mahadewa TGB, Suryawisesa IBM, Wibawa IGABK, Ariyanta KD. Validity of Wagner, SINBAB, PEDIS, and WIFI Scoring Systems to Assess Risk of Amputation in Patients with Diabetic Foot Ulcers. JBN. 2024;8(2):52.
- 37.
Ministerio de Salud del Perú. Guía técnica: Guía de práctica clínica para el diagnóstico, tratamiento y control del pie diabético. In: Dirección General de Intervenciones Estratégicas en Salud Pública [Internet]. 2017 [cited 16 Sep 2025] pp. 1–29. Available: https://bvs.minsa.gob.pe/local/MINSA/3971.pdf
- 38. Schaper NC, van Netten JJ, Apelqvist J, Bus SA, Fitridge R, Game F, et al. Practical guidelines on the prevention and management of diabetes-related foot disease (IWGDF 2023 update). Diabetes Metab Res Rev. 2024;40(3):e3657. pmid:37243927