Correction
12 Oct 2021: Goncharov M, Mejia OAV, Perez de Souza Arthur C, Orlandi BMM, Sousa A, et al. (2021) Correction: Mortality risk prediction in high-risk patients undergoing coronary artery bypass grafting: Are traditional risk scores accurate?. PLOS ONE 16(10): e0258706. https://doi.org/10.1371/journal.pone.0258706 View correction
Figures
Abstract
Background
The performance of traditional scores is significantly limited to predict mortality in high-risk cardiac surgery. The aim of this study was to compare the performance of STS, ESII and HiriSCORE models in predicting mortality in high-risk patients undergoing CABG.
Methods
Cross-sectional analysis in the international prospective database of high-risk patients: HiriSCORE project. We evaluated 248 patients with STS or ESII (5–10%) undergoing CABG in 8 hospitals in Brazil and China. The main outcome was mortality, defined as all deaths occurred during the hospitalization in which the operation was performed, even after 30 days. Five variables were selected as predictors of mortality in this cohort of patients. The model’s performance was evaluated through the calibration-in-the-large and the receiver operating curve (ROC) tests.
Results
The mean age was 69.90±9.45, with 52.02% being female, 25% of the patients were on New York Heart Association (NYHA) class IV and 49.6% had Canadian Cardiovascular Society (CCS) class 4 angina, and 85.5% had urgency or emergency status. The mortality observed in the sample was 13.31%. The HiriSCORE model showed better calibration (15.0%) compared to ESII (6.6%) and the STS model (2.0%). In the ROC curve, the HiriSCORE model showed better accuracy (ROC = 0.74) than the traditional models STS (ROC = 0.67) and ESII (ROC = 0.50).
Citation: Goncharov M, Mejia OAV, Perez de Souza Arthur C, Orlandi BMM, Sousa A, Praça Oliveira MA, et al. (2021) Mortality risk prediction in high-risk patients undergoing coronary artery bypass grafting: Are traditional risk scores accurate? PLoS ONE 16(8): e0255662. https://doi.org/10.1371/journal.pone.0255662
Editor: Salil Deo, Case Western Reserve University School of Medicine, UNITED STATES
Received: April 13, 2021; Accepted: July 21, 2021; Published: August 3, 2021
Copyright: © 2021 Goncharov et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the manuscript and its Supporting Information files.
Funding: There was no funding for this project.
Competing interests: The authors have no conflict of interest to declare in relation to this work.
Introduction
Over time, cardiovascular surgery results have progressively improved. One of the reasons for the improvement is associated with the risk stratification brought by risk scores and perioperative optimization [1]. In a continuous search for excellence, the application of ever more accurate score models is essential, especially in the evaluation of indications for new cost-effective procedures [2]. In addition, it is necessary to fully inform each patient about the risks associated with this procedure, which should be adjusted to the results of the hospital [3].
In this scenario, several models have been built and validated, aiming to reach more accurate predictions for specific populations. Among these models, STS [4] and ESII [5] reverberate the most and, at the same time, are supported by international guidelines. Both are recommended for patients undergoing most cardiovascular procedures. STS’ greatest advantage over ESII is probably the sample size, which is updated periodically. At the same time, one of the biggest criticisms of this voluntary registry may be related to estimated values unreachable for other populations [6]. It is known that the results of mandatory registries, which include all operated patients, can have high deviation [7]. In underdeveloped or developing countries, the evaluation provided by these tools can be impaired, due to the measurement of only part of the treatment, not the health system [8].
Even with the improvement of registry systems and refinement in formulation techniques of those tools, predictions are still impaired, especially in the high-risk subgroup [9, 10]. This may be related to the small number of high-risk patients included in the registries that originated the traditional models. In this aspect, traditional models would be important for a first categorization (approximation), but not for defining exactly what happens to patients at higher risk of death after cardiac surgery, as supported by the evidence. Therefore, this new model would be a second step and would come to a more accurate decision-making, through the recalibration and remodeling of variables for the high-risk population.
In this scenario, we evaluated the performance of STS, ESII and the HiriSCORE model derived from high-risk patients undergoing CABG (https://clinicaltrials.gov/ct2/show/NCT02560285).
Methods
Ethics and consent form
This study is a sub-analysis belonging to the project entitled "High-risk Patients in Cardiac Surgery Procedures: HiriSCORE”, registered online under number 13795, submitted and approved by the HCFMUSP Ethics Commission for Analysis of Research Projects (CAPPesq) as SDC: 4256/15/083.
Sample
The cross-sectional analysis is part of the HiriSCORE Project and coordinated by the Cardiac Surgery department of "Instituto do Coração do Hospital das Clínicas da Faculdade de Medicina da Universidade de São Paulo" (InCor-HCFMUSP).
All cases were consecutively operated from April 2016 to August 2019. Data came from 8 hospitals in Brazil (7) and China (1):
- Instituto do Coração, HCFMUSP, São Paulo, SP, Brazil.
- Hospital de Base de São José do Rio Preto, SP, Brazil.
- Instituto Nacional de Cardiologia do Rio de Janeiro, RJ, Brazil.
- Fuwai Hospital, Beijing, China.
- Hospital Beneficência Portuguesa, SP, Brazil.
- Hospital Santa Casa de Marilia, SP, Brazil.
- Hospital Samaritano Paulista, SP, Brazil.
- Instituto de Cardiologia do Distrito Federal, DF, Brazil.
The total sample consisted of 19,786 patients who underwent CABG, 11,692 of whom underwent isolated CABG. For this analysis, we have selected 248 patients considered at high risk (Fig 1). The final cohort included 248 patients (222 patients from Brazil and 26 patients from Fuwai Hospital in China) who underwent CABG with a mortality risk of 5 to 10% predicted by ESII (S1 Table).
Inclusion and exclusion criteria
Collecting, defining and organizing data
This analysis was made in the HiriSCORE Project database (https://clinicaltrials.gov/ct2/show/NCT02560285). It is a prospective multicenter and international registry. Data were collected by people trained for this purpose in each participating center and incorporated online on the REDCap platform (https://redcap.hc.fm.usp.br) through three interfaces available online: preoperative, intraoperative, and post-operative. Truthfulness and completeness of data were supervised by the registry’s executive committee. Variable definitions were adopted to respond to ESII (http://euroscore.org/calc.html) and STS (http://riskcalc.sts.org/stswebriskcalc/#/calculate) calculations. The outcome analyzed was mortality, defined as all deaths that occurred during the hospitalization in which the operation was performed, even after 30 days.
Statistical analysis
The analysis was performed using the statistical software STATA version 13.1 (StataCorp, Texas, USA) for MacOS. To assess the distribution of the data, the Shapiro-Wilk test and homogeneity between groups were performed. Continuous variables were expressed as mean ± standard deviation and categorical variables as absolute and relative frequencies. The association between mortality and the predictors was verified by the Chi-square and Fisher’s exact tests for the contingency tables. For the prediction of in-hospital mortality, stepwise multivariate logistic regression analysis was verified. The elaboration of the HiriSCORE was performed using stepwise multivariate logistic regression, with the pre- and intra-operative predictors, in which the risk value (OR) may vary according to the sum of variables that represent risk. All variables with inclusion criteria in the score were described in the results and expressed in a table with a coefficient value, 95% CI and p-value. The performance of the ESII and STS models was measured by comparing the mortality observed in the current study with the mortality predicted by the models in the established risk groups. Therefore, to assess the ability of ESII and STS to identify the risk of individuals included in the current study, the calibration-in-the-large curve and the discrimination by area under the ROC curve were performed. P-values of <0.05 were considered significant and plausible variables were identified as predictors of mortality after cardiac procedures. For internal validation of the HiriSCORE model, the method of discrimination by area under the ROC curve and calibration-in-the-large was selected.
Results
Overall, Table 1 includes data of 248 patients who underwent CABG surgery.
The average age was 69.9±9.45y (95% CI 68–70). Most patients were female (52%). There was a high prevalence of prior myocardial infarction (74%), hypertension (85%) and urgent or emergency surgery (85%). Eighty-one percent of patients had three-vessel coronary artery disease. Almost half (49%) of the patients had unstable angina (CCS 4). The observed mortality reached 13%.
In the association analysis, all variables with a focus on preoperative variables were studied. The main variables of interest are shown in Table 2.
In the association analysis in the cohort selected to prepare the HiriSCORE model, the following variables were related to deaths: diabetes (p = 0.023); insulin-dependent diabetes (p = 0.024); lesion of the left main coronary artery > 50% (p = 0.014); moderate heart valve disease (p = 0.019); body mass index > 30 kg/m2 (p = 0.02) and creatinine clearance <30 mL/min (p = 0.002) (Table 2). These variables were selected for stepwise multivariate regression analysis to create the HiriSCORE model.
Elaboration of HiriSCORE model
After association analysis with subsequent multivariate logistic regression using data of 248 patients with predicted risk by ESII 5–10% undergoing CABG, 5 significant variables were determined: body mass index, creatinine clearance, left main coronary artery stenosis, moderate heart valve disease and glucose (Table 3).
The HiriSCORE risk calculator was developed based on the variables described above. The logistical formula for calculating the mortality risk is as follows:
where RDM is the mortality risk, Exp is the exponential function, β0 is the constant with value -3.623276, βi is the tabulated coefficient of the independent variable xi.
Performance validation of ESII, STS and HiriSCORE models
Calibration of ESII, STS and HiriSCORE models.
Table 4 shows that both traditional risk models underestimate mortality in high-risk patients, when the HiriSCORE model showed good performance in all 5 subgroups.
The expected mean mortality in the established groups (quintiles) by the HiriSCORE model was 15.0%, by the STS was 2.0% and by the ESII was 6.6% (p <0.05). The expected mortality by ESII was > 3 times that of STS and < 2 times that of HiriSCORE (Fig 2).
The calibration-in-the-large showed that the expected mortality by STS (observed minus predicted [O-P] = 11.4) and ESII (O−P = 6.6) underestimated the mortality observed in high-risk patients undergoing CABG. However, the HiriSCORE model showed good calibration (O−P = −0.2).
Discrimination for ESII, STS and HiriSCORE models
As for discrimination, HiriSCORE model showed a satisfactory result of an area under the ROC curve (AUC) of 0.74 (95% CI 0.64–0.82). Analyzing traditional risk scores, we found that STS was better than ESII, obtaining a limit value of 0.67 (95% CI 0.62–0.72) against 0.50 (95% CI 0.58–0.69) of the AUC. It is important to emphasize that the literature considers good clinical models when the AUC is > 0.7 (Fig 3).
The HiriSCORE model showed a good AUC (0.74) while the STS and EuroSCORE II were poor in predicting mortality in high-risk patients undergoing CABG. Therefore, traditional scores were ineffective in discriminating mortality in high-risk patients.
Discussion
To date, there are no studies that assess the prediction of mortality risk in specific high-risk patients undergoing CABG. To estimate risk is to quantify complications that may occur after surgical procedures, allowing a better team planning, as well as decision-making and behavior regarding the procedures.
The two models with the most widespread use for cardiovascular surgery are STS and ESII. In addition, dissonant results appear throughout literature when applied in different scenarios [11–13]. However, evidence shows that, in patients considered to be at high risk for cardiac surgery, traditional scores lose accuracy in predicting mortality [14, 15].
This is probably because risk models originate from general populations of cardiac surgery patients, where most of them present low and medium risk and few are determined to be at high risk, especially in CABG surgeries [16]. For this scenario, procedure-specific models can be a solution for more accurate risk estimation for high-risk groups [17, 18]. Although the predictive variables for mortality after cardiac surgery are always the same, the most important is the weight of the coefficient given to each variable in relation to the specific outcome and group of patients. This is related to the degree of calibration of a model during the validation test. Therefore, calibration becomes the central phase for the validation of a risk prediction model, as it assesses how close the variable is to the outcome for such a scenario [18]. Thus, even with only a few variables, an instrument can be potential to predict mortality [19] and improve planning.
Perhaps the best idea for this scenario was launched by Ranucci in 2016 [20], adopting a two-stage approach for high-risk patients. Here, the high-risk population was underestimated by EuroSCORE II, therefore, patients with mortality risk above 25% were re-evaluated with more than 4 variables.
In this study, 5 predictors were defined. These predictors are partially involved in both, ESII and STS models, but with different coefficients. It shows the importance of reviewing the indication and re-stratifying patients with diabetes, overweight, renal dysfunction, significant left main coronary artery stenosis and moderate heart valve disease in patients referred for CABG.
Regarding the existing evidence in traditional models, Nashef et al [21] conducted a multicentric study that aimed to validate ESII using the STS database, and found similar observed and expected mortality, along with 0.77 in the model’s discriminative capability, reaching what the authors concluded to be an accurate result. Sergeant P. et al. [22] analyzed CABG patients using ESII, and the model reached a discriminative performance of 0.83, offering a good performance, but overestimating the risk in low-risk patients and underestimating it in high-risk patients. Yamaoka et al. [23] compared ESII to STS in the same sample and concluded that, regarding risk calibration, ESII was better, while STS overestimated risks. In this study, in the highest risk quartile, ESII showed excellent calibration and discrimination performance. In our study, calibration of the ESII was better than the STS, but discrimination was worse. One explanation can be the specific type of procedure evaluated.
Schrutka et al. reported the discriminatory power of scoring systems in patients treated with extracorporeal membrane oxygenation (ECMO) following cardiac surgery: both STS and ESII models showed low discriminatory accuracy in high-risk cohort [24]. The first Brazilian validation of ESII is one study that ratifies our findings [25], where ESII was not well calibrated and underestimated the risk in high- and low-risk patients.
In 2020, Hu et al. performed a risk model for CABG patients based on the Chinese Cardiac Surgery Registry (CCSR) [26]. Compared with ESII and SinoSCORE, the CCSR model had better discrimination and calibration. As we can see, it includes 80% of the HiriSCORE variables.
Following this trend, we have compared the performance of HiriSCORE model to predict mortality in high-risk patients undergoing CABG. In this paper, we evaluated the performance of ESII and STS and compared them with the HiriSCORE model. Here, the predicted mortality by the STS was 2% and by the ESII was 6.6%, for the overall 13.3% observed mortality. The reclassification of patients leads to a change in medical concept about the best treatment strategy for the patient, considering alternatives such as percutaneous intervention or medical treatment. The better stratified patients, the greater the impact on medical practice.
Although for CABG the use of STS is still more recommended than ESII [27], our results disagree with its use in high-risk patients. Over time, the severity of patients referred for CABG increased [27], so it is essential to build and validate more accurate prediction models for planning, in addition to an open and transparent discussion with patients and family members.
In addition, Shahian et al. in 2018 reported that a limitation of the score is that STS database does not have data on high-risk patients [28]. On the other hand, the ESII database presented problems because up to 43% of patients presented biased information related to mortality [29].
Therefore, the presence of body mass index > 30 kg/m2, glucose > 150 mg/dL, creatinine clearance <30 mL/min/m, left main coronary artery stenosis and moderate heart valve disease become the simplest and best way to predict mortality in high-risk patients undergoing CABG.
Study limitations
First, as an international multicenter observational record, HiriSCORE project database had to be organized to avoid selection or definition bias. Therefore, the consolidated data had to be evaluated in relation to the consistency, accuracy, and completeness of the information, as well as monitoring the inclusion of patients and following up on the results. Second, ESII was used as an inclusion criterion for high-risk patients in the study. This may have influenced the results. However, the influence would be more in favor than against this model, and that is not what happened, which can further reinforce the evidence found. Third, it would be the 30-day mortality data of the discharged patients, since although the list was delivered by the participating hospitals, we were unable to check with the death verification system at the reference sites. Even understanding that complications in high-risk patients usually happen immediately after surgery.
Conclusion
The HiriSCORE model for high-risk patients undergoing CABG was better than STS and ESII. We encourage external validation of this model to be used by heart teams as an aid in making better strategy decisions in patients considered to be at high risk for CABG.
Supporting information
S1 Table. Characteristics of the final sample of patients from Brazil and China submitted to isolated CABG with ESII between 5–10%.
https://doi.org/10.1371/journal.pone.0255662.s001
(DOCX)
Acknowledgments
We thank all the centers participating in this project for their commitment to the quality and safety of the patient.
References
- 1. McNeely C, Markwell S, Vassileva C. Trends in patient characteristics and outcomes of coronary artery bypass grafting in the 2000 to 2012 Medicare population. Ann Thorac Surg. 2016; 102:132–38. pmid:26941075
- 2. Mejía OA, Lisboa LA, Puig LB, Moreira LF, Dallan LA, Jatene FB. On-pump or off-pump? Impact of risk scores in coronary artery bypass surgery. Rev Bras Cir Cardiovasc. 2012;27(4):503–11. pmid:23515722
- 3. Englum BR, Saha-Chaudhuri P, Shahian DMet al. The impact of high-risk cases on hospitals’ risk-adjusted coronary artery bypass grafting mortality rankings. Ann Thorac Surg. 2015; 99:856–62. pmid:25583462
- 4. Shahian DM, O'Brien SM, Filardo G et al. The Society of Thoracic Surgeons 2008 cardiac surgery risk models: part 1—coronary artery bypass grafting surgery. Ann Thorac Surg. 2009; 88: S2–S22.
- 5. Nashef SA, Roques F, Sharples LD, Nilsson J, Smith C, Goldstone ARet al. EuroSCORE II. Eur J Cardiothorac Surg. 2012;41(4):734–44. pmid:22378855
- 6. Gutacker N, Bloor K, Cookson R et al. Comparing hospital performance within and across countries: an illustrative study of coronary artery bypass graft surgery in England and Spain. Eur J Public Health. 2015;25 Suppl 1:28–34.
- 7. Palma-Ruiz M, García de Dueñas L, Rodríguez-González A, Sarría-Santamera A. Analysis of in-hospital mortality from coronary artery bypass grafting surgery. Rev Esp Cardiol. 2003;56(7):687–94. pmid:12855152
- 8. Akkerhuis KM, Deckers JW, Boersma E, et al. Geographic variability in outcomes within an international trial of glycoprotein IIb/IIIa inhibition in patients with acute coronary syndromes. Results from PURSUIT. Eur Heart J. 2000; 21:371–81. pmid:10666351
- 9. Howell NJ, Head SJ, Freemantle N, van der Meulen TA, Senanayake E, Menon A et al. The new EuroSCORE II does not improve prediction of mortality in high-risk patients undergoing cardiac surgery: a collaborative analysis of two European centres. Eur J Cardiothorac Surg. 2013; 44:1006–11. pmid:23536616
- 10. Shih T, Paone G, Theurer PF, McDonald D, Shahian DM, Prager RL. The Society of Thoracic Surgeons Adult Cardiac Surgery Database version 2.73: more is better. Ann Thorac Surg. 2015; 100:516–21. pmid:26052059
- 11. Mejía OA, Lisboa LA, Dallan LA, Pomerantzeff PM, Moreira LF, Jatene FB et al. Validation of the 2000 Bernstein-Parsonnet and EuroSCORE at the Heart Institute—USP. Rev Bras Cir Cardiovasc. 2012;27(2):187–94. pmid:22996968
- 12. Barili F, Pacini D, Capo A, Ardemagni E, Pellicciari G, Zanobini M, et al. Reliability of new scores in predicting perioperative mortality after isolated aortic valve surgery: A comparison with the society of thoracic surgeons score and logistic EuroSCORE. Annals of Thoracic Surgery. 2013;95(5):1539–44. pmid:23473650
- 13. Barros e Silva PG, Baruzzi AC, Ramos DLet al. Improving indicators in a Brazilian hospital through quality-improvement programs based on STS database reports. Braz J Cardiovasc Surg. 2015;30(6):660–3. pmid:26934408
- 14. Ranucci M, Castelvecchio S, Menicanti LA, Scolletta S, Biagioli B, Giomarelli P. An adjusted EuroSCORE model for high-risk cardiac patients. Eur J Cardiothorac Surg. 2009; 36:791–8. pmid:19359191
- 15. Grant SW, Hickey GL, Dimarakis I et al. Performance of the EuroSCORE models in emergency cardiac surgery. Circ Cardiovasc Qual Outcomes. 2013; 6:178–185. pmid:23463809
- 16. LaPar DJ, Filardo G, Crosby IK, Speir AM, Rich JB, Kron IL, et al. The challenge of achieving 1% operative mortality for coronary artery bypass grafting: A multi-institution Society of Thoracic Surgeons Database analysis. J Thorac Cardiovasc Surg. 2014 Dec;148(6):2686–96. pmid:25152473
- 17. Howell NJ, Head SJ, Freemantle N et al. The new EuroSCORE II does not improve prediction of mortality in high-risk patients undergoing cardiac surgery: a collaborative analysis of two European centres. Eur J Cardiothorac Surg. 2013;44(6):1006–11. pmid:23536616
- 18. Kappetein AP, Head SJ. Predicting prognosis in cardiac surgery: a prophecy? Eur J Cardiothorac Surg. 2012;41(4):732–3. pmid:22378850
- 19. Tu JV, Sykora K, Naylor CD. Assessing the outcomes of coronary artery bypass graft surgery: how many risk factors are enough? Steering Committee of the Cardiac Care Network of Ontario. J Am Coll Cardiol. 1997;30(5):1317–23. pmid:9350934
- 20. Ranucci M, Di Dedda U, Castelvecchio S, La Rovere MTet al. In search of the ideal risk-scoring system for very high-risk cardiac surgical patients: a two-stage approach. J Cardiothorac Surg. 2016; 11:13. pmid:26782077
- 21. Nashef SA, Roques F, Hammill BG, Peterson ED, Michel P, Grover FL et al. EurpSCORE Project Group. Validation of European System for Cardiac Operative Risk Evaluation (EuroSCORE) in North American cardiac surgery. Eur J Cardiothorac Surg. 2002; 22:101–5. pmid:12103381
- 22. Sergeant P, de Worm E, Meyns B. Single centre, single domain validation of the EuroSCORE on a consecutive sample of primary and repeat CABG. Eur J Cardiothorac Surg. 2001;20(6):1176–82. pmid:11717024
- 23. Yamaoka H, Kuwaki K, Inaba H, Yamamoto T, Kato TSet al. Comparison of modern risk scores in predicting operative mortality for patients undergoing aortic valve replacement for aortic stenosis. J Cardiol. 2016;68(2):135–40. pmid:26411253
- 24. Schrutka L, Rohmann F, Binder C, Haberl T, Dreyfuss B, Heinz G et al. Discriminatory power of scoring systems for outcome prediction in patients with extracorporeal membrane oxygenation following cardiovascular surgery. Eur J Cardiothorac Surg. 2019; pii: ezz040. pmid:30789227
- 25. Lisboa LAF, Mejia OA, Moreira LF, Dallan LAO, Pomerantzeff PM, Dallan LRet al. EuroSCORE II e a importância de um modelo local, InsCor e o futuro SP-SCORE. Rev Bras Cir Cardiovasc. 2014;29(1):1–8. pmid:24896156
- 26. Hu Z, Chen S, Du J et al. An in-hospital mortality risk model for patients undergoing coronary artery bypass grafting in China. Ann Thorac Surg. 2020;109(4):1234–42. pmid:31541633
- 27. Neumann FJ, Sousa-Uva M, Ahlsson A et al. 2018 ESC/EACTS Guidelines on myocardial revascularization. Eur Heart J. 2019;40(2):87–165. pmid:30165437
- 28. Shahian DM, Torchiana DF, Engelman DT, Sundt TMet al. Mandatory public reporting of cardiac surgery outcomes: The 2003–2014 Massachusetts experience. J Thorac Cardiovasc Surg. Elsevier Inc 2019;158(1):110–24. pmid:30772041
- 29. Sergeant P, Meuris B, Pettinari M. EuroSCORE II, illum qui est gravitates magni observe. Eur.J Cardiothorac Surg. 2012:41(4):729–31. pmid:22378851