Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

The prognostic value of the early neutrophil-to-lymphocyte ratio for 28-day mortality in sepsis patients: A machine learning-based investigation of the MIMIC database

  • Jiyang Liao ,

    Roles Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation

    ☯ These authors contributed equally to this study and are co-first authors.

    Affiliation Department of Intensive Care Unit, Shenzhen Hospital of Integrated Traditional Chinese and Western Medicine, Shenzhen, Guangdong Province, China

  • Qianwen Xiang ,

    Roles Data curation, Investigation, Resources, Writing – original draft

    ☯ These authors contributed equally to this study and are co-first authors.

    Affiliation Department of Cardiovascular Medicine, Shenzhen Hospital of Integrated Traditional Chinese and Western Medicine, Shenzhen, Guangdong Province, China

  • Xingwang Chen ,

    Roles Conceptualization, Data curation, Formal analysis, Methodology, Project administration, Writing – original draft

    ☯ These authors contributed equally to this study and are co-first authors.

    Affiliation Department of Intensive Care Unit, Shenzhen Hospital of Integrated Traditional Chinese and Western Medicine, Shenzhen, Guangdong Province, China

  • Long Wu,

    Roles Formal analysis, Supervision, Writing – original draft

    Affiliation Department of Intensive Care Unit, Shenzhen Hospital of Integrated Traditional Chinese and Western Medicine, Shenzhen, Guangdong Province, China

  • Houwang Chen,

    Roles Conceptualization, Formal analysis, Writing – original draft

    Affiliation Department of Intensive Care Unit, Shenzhen Hospital of Integrated Traditional Chinese and Western Medicine, Shenzhen, Guangdong Province, China

  • Zhijun Yao,

    Roles Supervision, Validation

    Affiliation Department of Intensive Care Unit, Shenzhen Hospital of Integrated Traditional Chinese and Western Medicine, Shenzhen, Guangdong Province, China

  • Huachu Wu ,

    Roles Conceptualization, Methodology, Validation, Writing – original draft, Writing – review & editing

    574631282@qq.com (HW); szszxyjhyylaijianbo@outlook.com (JL)

    Affiliation Department of Intensive Care Unit, Shenzhen Hospital of Integrated Traditional Chinese and Western Medicine, Shenzhen, Guangdong Province, China

  • Jianbo Lai

    Roles Conceptualization, Formal analysis, Funding acquisition, Investigation, Methodology, Supervision, Visualization, Writing – original draft, Writing – review & editing

    574631282@qq.com (HW); szszxyjhyylaijianbo@outlook.com (JL)

    Affiliation Department of Intensive Care Unit, Shenzhen Hospital of Integrated Traditional Chinese and Western Medicine, Shenzhen, Guangdong Province, China

Abstract

Background

The neutrophil-to-lymphocyte ratio (NLR) has shown inconsistent prognostic value in individuals with sepsis. This study aimed to clarify its ability to predict 28-day mortality via a machine learning-based analysis of a large ICU database.

Methods

This retrospective analysis employed data from the MIMIC-IV database (v3.1). The Boruta algorithm combined with XGBoost was used for two-stage feature selection. Patients were stratified by NLR quartiles into three groups (low: <4.34, intermediate: 4.34–14.70, and high: >14.70). This study defined 28-day mortality as the primary outcome. Associations between the NLR and mortality were evaluated by using multivariable logistic regression (progressively adjusted for demographic, clinical, and machine learning-derived features), along with restricted cubic splines. Sensitivity analyses included quantifying NLR feature importance via machine learning and performing subgroup analyses across clinical strata.

Results

This cohort study included 4,376 patients with a 28-day mortality rate of 18.4%. Compared with the SOFA and SAPS II scores, the prediction performance of XGBoost was superior (ROC-AUC 0.875; 95% CI 0.854–0.896; PR-AUC 0.603). Although the NLR ranked 14th in SHAP-based feature importance, multivariable analysis confirmed its independent association with elevated mortality risk: 28-day (OR 1.16; 95% CI 1.06–1.27; p < 0.001), in-hospital (OR 1.13; 95% CI 1.03–1.24; p < 0.001), and ICU (OR 1.14; 95% CI 1.03–1.25; p = 0.008). Stratified analyses indicated consistent mortality associations for the NLR, with enhanced predictive value being observed in patients aged >45 years and those with SOFA scores ≤4 or SAPS II scores >29. Specifically, patients ≥65 years of age demonstrated a 17% increase in 28-day mortality risk (p = 0.019), and patients with a SOFA score ≤4 exhibited a greater than 20% elevated risk across all of the endpoints (p < 0.001), whereas no significant association was observed in the SOFA ≥9 subgroup (p = 0.369).

Conclusions

The NLR effectively identifies inflammation-driven mortality risk for early sepsis patients but fails to predict outcomes for patients with terminal organ failure. This biphasic predictive pattern highlights the unique value of the NLR in moderate sepsis risk stratification but cautions against its use in cases of advanced disease. Its value lies in dynamic monitoring rather than static risk assessment.

Introduction

Sepsis is a critical medical state involving organ failure resulting from a maladaptive systemic reaction to infection that impacts millions of patients worldwide annually [1]. Despite recent advancements in sepsis recognition and treatment strategies, mortality rates remain alarmingly high and range from 16.7% to 33.3% [2,3], imposing a substantial burden on global healthcare economics. Nevertheless, early identification and intervention have led to significant improvements in clinical outcomes.

Current assessment methods predominantly rely on manual documentation or electronic medical records, which often incorporate nursing process-dependent parameters such as vital sign monitoring [47]. Variations in measurement and recording protocols across healthcare institutions may lead to substantial discrepancies in evaluation results. Although the levels of routinely available inflammatory markers such as procalcitonin and C-reactive protein are easily determined in clinical practice, they exhibit limited prognostic value for sepsis because of their suboptimal specificity and sensitivity [1,8]. Although conventional scoring systems such as sequential organ failure assessment (SOFA) and acute physiology and chronic health assessment scoring system II (APACHE II) demonstrate high prognostic value, their complex assessment procedures and poor operational feasibility limit their widespread clinical application [911]. Consequently, the pursuit of more objective and convenient prognostic indicators remains important.

Neutrophils (NEUT) and lymphocytes (LYMPH), as the principal cellular components of innate and adaptive immunity, respectively, play indispensable roles as first-line immune defenses during systemic inflammatory responses [12,13]. Bacterial or fungal infections typically induce isolated NEUT elevation with concomitant lymphopenia, whereas viral infections or malignancies may predominantly increase LYMPH counts [14]. Consequently, the neutrophil-to-lymphocyte ratio (NLR) generally increases during inflammatory states. Although clinically favorable because of its accessibility and low cost, some alterations in the NLR are not inflammation-specific; for example, acute trauma, stroke, myocardial infarction, and postoperative complications can similarly modify this ratio, which compromises its diagnostic specificity and contributes to persistent clinical skepticism regarding its utility [15]. The COVID-19 pandemic has reinvigorated research interest in the NLR [16,17], particularly as a result of a recent meta-analysis by Wu et al. [18], which reported reliable prognostic performance of the NLR in sepsis despite considerable between-study heterogeneity (I² = 87.2%, 95% CI 79.5–92; p < 0.0001). Moreover, a retrospective study conducted by Schupp et al. [19] failed to demonstrate a discriminative capacity for 30-day mortality. These contradictory prognostic interpretations in sepsis underscore the need for machine learning (ML)-based reevaluations of the ability of the NLR to predict mortality.

Machine learning algorithms have emerged in recent years as powerful tools in clinical prediction models. The 2021 sepsis guidelines highlight the superior discriminative capacity of ML algorithms for predicting in-hospital mortality in sepsis patients [1]. These algorithms offer dual advantages—the efficient management of missing data and the ability to integrate weak predictive features into robust models through data-driven learning. Among the various ML techniques, extreme gradient boosting (XGBoost) has demonstrated exceptional data mining and predictive modeling capabilities. Its dominance is evidenced by its adoption in 17 of 29 winning solutions in the 2015 Kaggle competitions and its exclusive use by all top 10 teams that participated in the 2015 KDD Cup. Several studies have validated the clinical utility of XGBoost: Dong et al. developed a ML model that predicts pediatric acute kidney injury 48 hours in advance by analyzing premorbid physiological measurements and incorporating current guidelines; Zhang et al. established an XGBoost model that effectively distinguishes between fluid-responsive and nonresponsive sepsis patients; and Yue et al. created a ML model for the early detection of sepsis-associated acute kidney injury and identified XGBoost as the optimal predictive algorithm. These findings collectively support the theoretical potential of ML algorithms to enhance predictive model development and validation in critical care. However, no studies have yet employed XGBoost modeling with large datasets to systematically evaluate the ability of the NLR to predict 28-day mortality, which could provide valuable insights into its clinical relevance under resource-constrained conditions.

Methods

Database

The patient data utilized for this study were derived from the Medical Information Mart for Intensive Care IV database version 3.1 (MIMIC-IV v3.1) [20,21]. This dataset, which includes deidentified health data from more than 65,000 Intensive Care Unit (ICU) hospitalizations and over 200,000 emergency department visits at Beth Israel Deaconess Medical Center (Boston, MA) from 2008–2022, offers broad clinical coverage. The dataset includes demographic characteristics, physiological parameters validated by ICU nursing staff, laboratory test results, therapeutic intervention orders, standardized data dictionaries, and diagnostic codes (International Classification of Diseases, 9th and 10th Revisions; ICD-9 and ICD-10). As this study used a deidentified public database with all protected health information removed, this study was exempted from the requirement to obtain individual patient informed consent from the Institutional Review Board (IRB) of the Massachusetts Institute of Technology (MIT). Author JY Liao (certification ID: 63692240) completed the required training and was authorized to access the database for research purposes. This study was performed in accordance with the Declaration of Helsinki. Due to the deidentified nature and public availability of the MIMIC database (which contains no personally sensitive information), the study was ethically exempt from review.

Study population

Inclusion criteria for this study were as follows: (1) aged between 18 and 85 years; (2) an ICU stay duration that exceeded 24 hours; and (3) suspected or confirmed infection with a SOFA score ≥2. For patients with multiple ICU admissions, only data obtained from the first ICU stay were included in the analysis. Patients were excluded if they met any of the following conditions: (1) had missing laboratory data, including NEUT, LYMPH and platelet (PLT) data; or (2) had a documented diagnosis of malignant tumors or autoimmune diseases. The study design is schematically illustrated in Fig 1.

thumbnail
Fig 1. Overview of the study design and workflow.

BIDMC, Beth Israel Deaconess Medical Center; MIMIC-IV, Medical Information Mart for Intensive Care IV; XGBoost, extreme gradient boosting; NLR, neutrophil-to-lymphocyte ratio; RCS, restricted cubic splines; ROC, receiver operating characteristic; PR, precision-recall; SHAP, Shapley additive explanations.

https://doi.org/10.1371/journal.pone.0348676.g001

Data extraction

Using structured query language (SQL) within the Navicat Premium environment (version 16.1.12), we extracted the requisite patient data from the MIMIC-IV database. The analysis focused exclusively on each patient’s first ICU admission record. Due to the extensive temporal data entries in the MIMIC-IV database, strict time windows were applied during data extraction to minimize potential confounding from different treatment phases. Demographic characteristics and clinical variables, such as sex, age, weight, SOFA score, simplified acute physiology score II (SAPS II) score and comorbidities, were extracted from the earliest available clinical records during the first 24 hours following ICU admission. Vital signs were collected within the first 2 hours after ICU admission. The crystalloid fluid intake volume was obtained within the first 3 hours and was used to calculate the crystalloid-to-weight ratio. The first available laboratory test results obtained within the initial 6 hours of ICU admission were extracted for analysis. Cases with NEUT, LYMPH, or PLT counts of zero were excluded to ensure valid calculation of the systemic immune-inflammation index (SII), which was derived using the following formula: absolute NEUT count × PLT count/absolute LYMPH count [22]. Fluid intake and output volumes were recorded during the first 24 hours of ICU admission, after which the fluid intake-to-weight ratio was calculated. The vasoactive inotropic score (VIS) and antibiotic initiation time were concurrently determined. Upon ICU discharge, the types of mechanical ventilation (MV) and continuous renal replacement treatment (CRRT) were recorded. Patients who received noninvasive ventilation, invasive ventilation, or tracheostomy were classified as mechanically ventilated. The primary outcome was defined as all-cause mortality within 28 days following ICU admission. Secondary outcomes included in-hospital mortality, ICU mortality, length of hospital stay, and length of ICU stay. Follow-up was conducted from ICU admission until 28 days thereafter.

To address missing data, a 40% missingness threshold was set. Consequently, any parameter with missing values exceeding this percentage was excluded from analysis under the assumption that the data were missing at random (MAR). [23]. For variables where fewer than 40% of the values were missing, we performed imputation using the “missForest” package (version 1.5) in R Studio [24]. To assess the robustness of the imputation, we compared the distribution of key variables before and after imputation by using the missForest algorithm. Post-imputation values remained within clinically plausible ranges, and no substantial shifts in central tendency or dispersion were observed (S1 and S2 Tables).

Feature selection

Before the association between the NLR and 28-day mortality was assessed, we implemented a rigorous two-stage ML-driven feature selection protocol. In the first phase, we performed 10 iterative runs of the Boruta algorithm to evaluate feature selection stability. During each run, features were classified as “confirmed” if their importance significantly exceeded the maximum shadow feature importance (p < 0.05) [25]. Feature stability was quantified by calculating the selection frequency (SF), which is defined as the ratio of confirmed designations to total runs. Only features that demonstrated high stability (SF ≥ 0.9) were retained for subsequent XGBoost analysis [26]. The XGBoost feature selection phase employed Pearson correlation-based collinearity analysis to remove redundant features, followed by hyperparameter optimization via grid search with three repetitions of 5-fold cross-validation, learning curve analysis, and early stopping to prevent overfitting—all of which were implemented to enhance clinical interpretability and enable SHAP analysis. To address class imbalance in sepsis mortality outcomes, the precision-recall area under the curve (PR-AUC) was employed as a more informative metric compared to the conventional receiver operating characteristic AUC (ROC-AUC) for evaluating model performance, with particular emphasis being focused on the accurate detection of the minority class. The model’s performance was then comprehensively assessed by using a suite of metrics, including accuracy, sensitivity, specificity, and F1-score metrics, in order to ensure a robust evaluation of its clinical utility. The final model was interpreted using Shapley additive explanations (SHAP) to assess the magnitude and direction of the individual feature influences [27,28]; additionally, the top-ranking NLR-associated variables were subsequently entered into multivariable regression analysis for clinical validation.

Statistical analysis

Statistical analyses and visualizations were performed by using the R programming language (version 4.4.3), with the exception of XGBoost modeling, which was implemented in Python (version 3.12). A two-sided p value <0.05 was considered to be statistically significant. Categorical variables were analyzed by using chi-square tests or Fisher’s exact tests and are presented as frequencies (percentages). To control the false discovery rate (FDR) due to multiple comparisons across categorical covariates (e.g., antibiotic initiation, dialysis type and ventilation status), p values were adjusted by using the Benjamini–Hochberg procedure. For continuous variables, we assessed normality by histograms and skewness analysis (absolute skewness ≥2 with evident left or right skewing in histograms was considered to indicate a nonnormal distribution). Normally distributed continuous variables were compared using Student’s t tests or one-way analysis of variance, whereas nonnormally distributed variables were assessed via Wilcoxon or Kruskal‒Wallis tests, with the results being expressed as the means ± standard deviations (SDs) or medians (interquartile ranges, or IQRs). We examined potential nonlinear associations between the NLR and 28-day mortality using restricted cubic splines (RCS) regression, which tests the statistical significance of nonlinear terms. We simultaneously generated PR and ROC curves for the NLR, XGBoost model, and both the SOFA and SAPS II scoring systems and calculated the Youden index to determine the optimal cutoff values and evaluate their ability to predict 28-day mortality. Multivariate regression models were used to assess the associations between the NLR and 28-day mortality, hospital mortality, and ICU mortality. All included continuous variables were standardized using Z scores to minimize scale differences. Three sequential models were constructed: Model 1 included only the standardized NLR, Model 2 added sex, standardized age, heart rate (HR), respiratory rate (RR), weight, and mean arterial pressure (MAP) to Model 1, and Model 3 further incorporated clinically relevant variables, previously established predictors, and features selected through the Boruta algorithm and XGBoost modeling. Subgroup analyses were performed to further examine the NLR–mortality relationship, with subgroups defined according to clinical relevance and data distribution patterns. Interaction terms were included to assess subgroup heterogeneity, and Firth’s penalized likelihood regression was applied to prevent false-positive results in small sample sizes. Interaction p values were also adjusted for the FDR to comprehensively evaluate potential effect modifiers and the robustness of the NLR predictive value. Survival was analyzed by using Kaplan‒Meier curves with log-rank tests for between-group comparisons and Cox proportional hazards models, with the results being reported as hazard ratios with 95% confidence intervals.

Results

Baseline characteristics

In this study, 4,376 eligible patients from the MIMIC-IV database (with 28-day, in-hospital, and ICU mortality rates of 18.40%, 17.50%, and 14.12%, respectively) were ultimately enrolled. The demographic characteristics, vital signs, laboratory tests, organ dysfunction assessments, treatments and other outcomes are summarized in Table 1. Compared with nonsurvivors, survivors were significantly younger (59.77 ± 14.97 vs. 63.37 ± 15.10; p < 0.05), were more likely to be male (63.93% vs. 59.25%), and had lower disease severity, as reflected by SOFA (3.76 ± 1.96 vs. 4.99 ± 2.71) and SAPS II scores (36.79 ± 12.90 vs. 51.74 ± 15.76). With respect to vital signs, survivors demonstrated lower HR (89.48 ± 17.68 vs. 95.55 ± 20.99 bpm; p < 0.001), lower RR (19.07 ± 5.54 vs. 22.11 ± 6.29 breaths/min; p < 0.001), and higher percutaneous oxygen saturation (SpO₂, median: 98.57% vs. 97.00%; p < 0.001) than nonsurvivors, whereas MAP did not significantly differ between the groups (81.58 ± 15.22 vs. 81.81 ± 19.81 mmHg; p = 0.755). Although survivors demonstrated marginally better vital signs (all p < 0.05 except SpO₂), the intergroup differences were clinically modest. Laboratory analyses revealed significantly elevated inflammatory markers, including the NLR (6.97 [IQR 4.11–12.59] vs. 12.70 [IQR 6.92–22.75]; p < 0.05) and the SII, alongside more pronounced acidotic patterns on blood gas analysis, in nonsurvivors. Higher creatinine and blood urea nitrogen (BUN) levels suggested increased renal dysfunction in nonsurvivors, which is consistent with the results of the organ dysfunction assessments. Erythrocyte and hemoglobin-related parameters significantly differed between the two groups. Organ dysfunction distributions revealed greater respiratory and coagulation dysfunction in survivors, whereas nonsurvivors had greater hepatic, neurological, and renal failure rates (all p < 0.05). Comorbidity analyses revealed significant differences in hypertension, heart failure, chronic obstructive pulmonary disease, chronic kidney disease, atrial fibrillation, cerebral infarction, cerebral hemorrhage, and thrombosis. Fluid resuscitation metrics indicated minimal crystalloid administration within the first 3 ICU hours in both groups, with survivors receiving less crystalloid. Although statistically significant differences were observed in weight-adjusted crystalloid ratios, these differences lacked clinical relevance. Analysis of 24-hour fluid balance revealed higher total intake and output volumes in survivors than in nonsurvivors, whereas nonsurvivors exhibited greater net positive fluid balance, suggesting potential fluid overload in this group. Nonsurvivors required more MV, CRRT, and vasopressor support (all p < 0.05), whereas survivors had higher rates of antibiotic initiation within 1 hour of ICU admission. Although survivors had shorter ICU stays, their total hospitalization duration exceeded that of nonsurvivors.

thumbnail
Table 1. Baseline characteristics of the survivor and nonsurvivor groups.

https://doi.org/10.1371/journal.pone.0348676.t001

On the basis of the overall NLR distribution and quartile values, patients were categorized into low (<4.34), intermediate (4.34–14.70), and high (>14.70) NLR groups (as shown in Table 2). All mortality outcomes, including 28-day, hospital, and ICU mortality, significantly increased in a stepwise manner with increasing NLR (p < 0.05) until they reached approximately 30% in the high-NLR group. Similarly, both hospital and ICU length of stay increased in a concentration-dependent manner (p < 0.05). No statistically significant differences were observed in mean corpuscular hemoglobin (MCH) levels, cardiovascular failure incidence, or the incidence of comorbidities such as diabetes, atrial fibrillation, or cerebral infarction. However, all other parameters that demonstrated significant differences between the survivor and nonsurvivor groups maintained concentration-dependent statistical significance across NLR stratifications (p < 0.05).

thumbnail
Table 2. Characteristics and outcomes of patients categorized by NLR.

https://doi.org/10.1371/journal.pone.0348676.t002

Selection of features in the models

The results from a single Boruta run are presented in S1 Fig, whereas Fig 2A displays stability-selected features (SF ≥ 0.9) from 10 iterations; this process identified 37 confirmed and 2 tentative features, whereas the remaining features were rejected. These 39 features were subjected to XGBoost modeling, which was preceded by collinearity analysis (Pearson’s r ≥ 0.7 exclusion) and clinical relevance screening (S2 Fig and S3 Table), thus yielding 29 final parameters. As shown in S4 Table, the optimized XGBoost model achieved a test-set ROC-AUC of 0.875 (95% CI: 0.854–0.896) and a PR-AUC of 0.603 (95% CI: 0.534–0.671), thereby outperforming the individual NLR, as well as the SAPS II and SOFA scores, for 28-day mortality prediction (Fig 3A, 3B). The data in Fig 2B reveal 13 important parameters, and Model 3 ultimately incorporated SAPS II and SOFA scores, as well as 24-hour fluid output, VIS, 24-hour fluid balance, dialysis type, cerebral infarction, atrial fibrillation, red cell distribution width (RDW), partial pressure of oxygen (PO2), hematocrit, creatinine, LYMPH, base excess (BE), and MCH. S5 Table presents the predictive contribution ranking of the NLR in the XGBoost model for 28-day mortality prediction. The NLR ranked 8th in global importance but decreased to 14th in terms of the SHAP-based marginal contribution, which indicates its relatively low individual impact and potential synergistic interactions with other parameters.

thumbnail
Fig 2. Application of machine learning algorithm for feature selection.

A Feature stability selection for the relationship between the NLR and 28-day mortality was analyzed by using the Boruta algorithm. The x-axis lists parameters (actual and shadow features), whereas the y-axis shows noise-adjusted importance scores from 10 Boruta runs. Features are color-coded by stability: light blue (confirmed, SF ≥ 90%), green (tentative, 50% ≤ SF < 90%), red (rejected, SF < 50%), and gray (shadow features). A red dashed line marks the stability threshold. B SHAP analysis of XGBoost model predictions. Features are vertically ranked by the mean absolute SHAP value (light blue bars), with individual SHAP values (colored dots) distributed horizontally and colored by normalized feature magnitude (blue: low, red: high). The gray dashed line marks the baseline.

https://doi.org/10.1371/journal.pone.0348676.g002

thumbnail
Fig 3. Comparative analysis of predictive models and survival stratification by the NLR on 28-day mortality.

Panel A: receiver operating characteristic (ROC) curves demonstrate the discriminative performance of the NLR, SAPS II, SOFA, and XGBoost; Panel B: precision-recall (PR) curves assessing the trade-off between precision and recall; Panel C: Kaplan-Meier survival analysis stratified by NLR quartiles.

https://doi.org/10.1371/journal.pone.0348676.g003

NLR and 28-day, hospital, and ICU mortality

Multivariable logistic regression analysis demonstrated that the standardized NLR was independently associated with 28-day mortality (OR 1.16; 95% CI 1.06–1.27; p < 0.001), in-hospital mortality (OR 1.13; 95% CI 1.03–1.24; p < 0.001), and ICU mortality (OR 1.14; 95% CI 1.03–1.25; p = 0.008) after full adjustment (Model 3, Table 3). Unadjusted analysis by NLR quartiles revealed a dose‒response relationship with all of the mortality endpoints (S3A–C Fig), and stronger associations were observed with increasing NLR values. In the fully adjusted model (Model 3), patients in the highest NLR quartile exhibited significantly higher risks of 28-day mortality (OR 1.95, 95% CI 1.40–2.75), in-hospital mortality (OR 1.90, 95% CI 1.34–2.70), and ICU mortality (OR 1.79, 95% CI 1.23–2.62) compared to those in the lowest quartile, and all of the trend tests were statistically significant (p < 0.001).(Fig 3C) temporally corroborated these findings, which demonstrates early divergence among groups by day 7 (log-rank p < 0.0001), whereas the high-NLR group exhibited markedly decreased survival. Restricted cubic spline analysis (S4 Fig) demonstrated biphasic nonlinear relationships between the NLR and all of the mortality endpoints, with Wald tests confirming significant nonlinear effects across all of the models (P-nonlinear <0.05). The ROC-derived threshold (NLR = 7.3) coincided with both the median NLR value and the first inflection point, where the predicted probabilities sharply increased. This predictive probability plateaued near the second inflection point (NLR = 27.3) and increased marginally from 0.15 to 0.30, even though sample representation markedly declined in this range. Beyond the second inflection point, statistical reliability diminished because of sparse data.

thumbnail
Table 3. The association between NLR groups and 28-day mortality, hospital mortality and ICU mortality.

https://doi.org/10.1371/journal.pone.0348676.t003

Subgroup analysis

To further evaluate the independent association between the standardized NLR and mortality outcomes, including 28-day, in-hospital, and ICU mortality, we conducted stratified analyses and interaction tests on Model 3 covariates. As demonstrated in Fig 4 and S6 Table, the NLR maintained consistent associations with mortality, with increased predictive value being observed in patients aged >45 years, those with SOFA scores ≤4 and those with SAPS II scores > 29. Those individuals ≥65 years exhibited a 17% increased 28-day mortality risk (P = 0.019), and those with a SOFA score ≤4 exhibited a > 20% increased risk across all of the mortality endpoints (p < 0.001), whereas no significant association was observed in the subgroup of patients with SOFA scores ≥9 (p = 0.369). Significant effect modifications (interaction p < 0.05) were identified in patients with a 24-hour fluid output of 1475–3420 mL, a BE > -2, a HR < 100 bpm, or a RR of 12–20 breaths/min, and similar trends were observed in those with a 24-hour fluid balance ≤2493.27 mL, an MCH ≥ 27 pg, a MAP of 65–110 mmHg, or a creatinine concentration of 1.5–3.0 mg/dL. Although the standardized NLR demonstrated strong associations with all three mortality outcomes in patients with an HR < 60 bpm, the limited sample size within this stratified subgroup resulted in imprecise estimates, thus warranting cautious interpretation. The subgroup analyses consistently demonstrated significant associations between the NLR and all three mortality endpoints across most predefined strata, which confirms the robustness of the predictive value of this biomarker in diverse patient populations.

thumbnail
Fig 4. Forest plot of subgroup analyses for the association between the standardized NLR and mortality outcomes.

The plot displays odds ratios (ORs) with 95% confidence intervals (CIs) for 28-day, in-hospital, and ICU mortality across clinically relevant subgroups. Solid squares represent point estimates of ORs; horizontal lines indicate 95% CIs. The vertical dashed line denotes the null effect (OR = 1).

https://doi.org/10.1371/journal.pone.0348676.g004

Discussion

Machine learning algorithms have increasingly been employed to refine prognostic assessment in sepsis using large-scale ICU databases. For example, Lou et al. [29] and Zheng et al. [30] recently applied XGBoost based on the MIMIC database to explore the association between the triglyceride-glucose (TyG) index and risk of death in sepsis patients, thus further confirming the potential of ML in routine biomarker re-evaluation. Within this evolving landscape, as an accessible and widely recorded inflammatory marker, the NLR has garnered renewed interest; however, its independent prognostic value for short-term mortality remains debated, and systematic ML‑driven re-evaluations remain scarce. In the present study, we leveraged a rigorous two-stage feature selection pipeline combining the Boruta algorithm with XGBoost to specifically evaluate the ability of the early NLR to predict 28‑day mortality in a large, real‑world sepsis cohort. Our optimized model achieved robust discrimination (ROC‑AUC 0.875; PR‑AUC 0.603), thus outperforming conventional severity scores. More importantly, after full multivariable adjustment, each 1‑SD increase in the standardized NLR was independently associated with a 16%, 13%, and 14% higher risk of 28‑day, in‑hospital, and ICU mortality, respectively. Dose‑response analyses confirmed a progressive mortality increase with increasing NLR, whereas RCS revealed a nonlinear, biphasic pattern; specifically, predictive capacity became increased near the first inflection point (NLR = 7.3, approximating the optimal threshold) and plateaued beyond the second inflection point (NLR = 27.3). Subgroup analyses further elucidated that the NLR-mortality association was most pronounced in hemodynamically stable patients and those with low organ failure scores (SOFA ≤ 4), whereas it lost statistical significance in the setting of advanced multi‑organ dysfunction (SOFA ≥ 9). Collectively, these findings do not merely replicate prior prognostic observations; rather, they delineate (for the first time) a phenotype‑specific, biphasic predictive pattern of the NLR that distinguishes inflammation‑driven mortality risk from organ failure‑driven mortality. By integrating state‑of‑the‑art feature selection with granular clinical stratification, this study provides a methodologically rigorous and clinically nuanced re-evaluation of the NLR’s prognostic utility in sepsis.

Our XGBoost model achieved a test-set ROC-AUC of 0.875 (PR-AUC 0.603) for 28-day mortality prediction, thus representing a substantial improvement over conventional severity scores and individual biomarkers. To contextualize this performance within the broader landscape of emerging prognostic indices in sepsis, we compared our findings with recent studies evaluating the TyG index and the creatinine-to-albumin ratio (CAR) using the MIMIC database [2931]. Both TyG-based studies reported significant associations with mortality, with Lou et al. [29] identifying a nonlinear relationship (inflection point: TyG = 8.9) and Zheng et al. [30] describing a linear dose-response pattern. The CAR study by Lou et al. [31] demonstrated a J-shaped association with 30-day mortality in sepsis-associated acute kidney injury, with an inflection point at CAR = 1.2 mg/dL and an AUC of 0.68–0.75 being reported. Notably, our NLR-based XGBoost model achieved higher discriminative performance (AUC 0.875) than any of these reported models, which likely reflects the synergistic integration of multiple clinical and laboratory features through a robust two-stage feature selection pipeline. Beyond performance metrics, a key methodological parallel involves the nonlinear risk patterns shared across these biomarkers. The J-shaped CAR-mortality curve and the biphasic NLR-mortality relationship observed in our cohort both suggest threshold effects that demarcate distinct pathophysiological states, including renal dysfunction with systemic inflammation for CAR and innate-adaptive immune imbalance for the NLR. However, the NLR offers distinct practical advantages; specifically, it is derived from a complete blood count without the need for additional metabolic assays, incurs no extra cost, and directly reflects the hyperinflammation–immunosuppression disequilibrium that is a central process to sepsis immunopathology [15,17,18,32]. Thus, although the TyG and CAR capture insulin resistance and renal-inflammatory crosstalk, respectively, the NLR (particularly when embedded in a ML framework) provides complementary prognostic information with superior discrimination and unique biological interpretability. These findings demonstrate the NLR not as a replacement for existing biomarkers; rather, it can function as a phenotype-specific adjunct for early risk stratification in moderate sepsis, where its predictive value is most pronounced.

Recent five-year evidence has consistently demonstrated that an elevated NLR increases sepsis mortality risk by 10–40% across general populations [3342]. More critically, when high-NLR cohorts were compared with their low-NLR counterparts, multivariable-adjusted analyses revealed that the high-NLR group had an 80% greater mortality risk, which establishes a robust dose‒response relationship. Three studies [39,41,42] employing RCS analysis confirmed nonlinear NLR-mortality relationships and demonstrated progressive risk escalation until the NLR reached a range of 15–20, beyond which the trend plateaued. Prior evaluations of the discriminative capacity of the NLR universally relied on ROC curves, with AUROC values ranging from 0.6 to 0.7, which are consistent with our findings and indicate moderate discrimination. However, given the imbalanced outcome distribution (18.4% 28-day mortality in our cohort), ROC metrics may overestimate performance because of excessive true-negative influence, which potentially masks true positive-class identification capacity [32,43,44]. We therefore performed ROC analyses with PR curves and confirmed that the NLR, SOFA score, and SAPS II score all exhibit limited mortality discrimination under outcome imbalance. XGBoost modeling that incorporated multiple parameters significantly improved both the AUC and average precision values. While the NLR demonstrated a substantial overall model contribution, SHAP analysis revealed marginal individual predictive utility, which suggests limited direct mortality risk stratification capacity. Combined NLR stratification and subgroup analyses revealed potential interaction effects, particularly in hemodynamically stable patients for whom the correlations between the NLR and mortality remained robust.

Returning to our primary research question, the limited predictive value of the NLR for 30-day mortality observed in the study by Schupp et al. [19] may stem from two methodological factors. First, the exclusively enrolled critically ill patients demonstrated uniformly high disease severity (mean SOFA score of 11 and mean APACHE II score of 23), and no significant differences were observed in illness severity between survivors and nonsurvivors. Second, the small sample size may have compromised the stability of the results. Conversely, studies [33,35,36,38,39] that support the ability of the NLR to predict sepsis mortality but that demonstrate superior discriminatory performance for 28-day mortality (with the AUROC value being observed at 0.827 in some reports) have predominantly analyzed cohorts with extreme mortality rates (30–70%) and significant intergroup differences in illness severity scores. Notably, three additional studies reported the limited predictive value of the NLR [37,40,45] after patients with critical illness or hemodynamic instability were excluded through subgroup or sensitivity analyses. These collective observations suggest that the ability of the NLR to predict mortality may be restricted to clinically stable conditions, while its utility appears to decrease in severe cases with concomitant organ dysfunction.

The context‑dependent nature of the NLR’s prognostic value observed in our study aligns with emerging evidence on other composite biomarkers that integrate inflammatory and nutritional dimensions. Recent studies investigating the NEUT count‑to‑prognostic nutritional index ratio (NPNR) in septic patients have reported J‑ or V‑shaped associations with mortality [46,47], thus mirroring the biphasic pattern that we identified for the NLR. Both the NLR and NPNR appear to capture the delicate balance between host immune activation and physiological reserve; specifically, their predictive weight diminishes when organ failure becomes the dominant driver of the outcome. This convergence across distinct biomarker constructs reinforces the notion that inflammatory markers are not universal prognostic tools; rather, they are phenotype‑specific indicators exhibiting a utility that is most pronounced in moderate disease and wanes in the setting of terminal organ dysfunction. Therefore, future research should focus on context‑aware risk stratification that accounts for the dynamic interplay between inflammation, nutrition, and organ function.

Mechanistically, the increase in the NLR in sepsis patients precisely mirrors the dual pathological processes of hyperactivated NEUT and apoptotic LYMPH depletion; thus, the NLR serves as a dynamic biomarker of the “hyperinflammation–immunosuppression” imbalance that characterizes sepsis immunopathology [15,45]. Current evidence indicates that delayed NEUT apoptosis leads to systemic accumulation, whereas accelerated CD4+/CD8+ T-cell apoptosis and impaired NK cell function collectively contribute to this dysregulated state. Notably, the magnitude of NLR elevation may reflect the depletion of anti-inflammatory reserves, and higher values may indicate a poorer capacity to counteract inflammatory insults [45]. Clinically, the NLR demonstrates robust predictive value for moderate sepsis (SOFA scores <9), as evidenced by the finding reported by Liu et al. [37] that serial NLR measurements (AUC = 0.823) outperformed single assessments, as a day-7 NLR > 4.18 predicted significantly increased 28-day mortality. Ye et al. [40] further confirmed that an NLR > 20.25 was an independent predictor of mortality (hazard ratio = 1.22) in large cohorts. However, its ability to predict critical illness (SOFA score ≥11) is diminished, which is likely due to the predominance of multiorgan failure observed in terminal pathophysiology [40]. The primary sources of interstudy variability include discrepancies in measurement timing, divergent threshold selection criteria, and confounding effects of therapeutic interventions such as blood purification. Notably, baseline NLR values inherently differ between specific disease populations and healthy individuals. Karakonstantis et al. [48] reported that multiple factors potentially contribute to nonpathological NLR elevation, including advanced age, exogenous steroid administration, elevated endogenous hormone levels, active hematologic disorders, type 2 diabetes mellitus, and acute trauma. Future research should prioritize three objectives: standardizing measurement protocols, establishing population-specific cutoffs, and investigating combination models with emerging immune biomarkers. Fundamentally, the clinical value of the NLR lies in its noninvasive nature and ability to track immune trajectory evolution rather than as a standalone prognostic determinant.

This study implemented multiple methodological innovations in its investigation of sepsis outcomes. Given the well-documented prognostic influences of key sepsis interventions, including fluid resuscitation, antibiotic therapy, MV, and blood purification [1], we developed advanced data extraction protocols to obtain and systematically categorize relevant parameters from the MIMIC-IV database. Through stepwise feature selection, the analysis consistently identified fluid resuscitation and blood purification as clinically significant factors, while numerous other variables were excluded. A planned evaluation of crystalloid fluid administration effects within the initial three hours after ICU admission was determined to be impractical because of insufficient recorded intravenous infusion volumes. The derived weight-adjusted ratios fell below clinically meaningful thresholds, demonstrated minimal predictive value for mortality outcomes, and were consequently excluded during collinearity analysis and feature selection procedures.

Several limitations must be acknowledged. First, although the MIMIC-IV database provides a large, well-curated sample of critically ill patients, its single-center origin introduces inherent selection bias and reflects the treatment protocols and case mixture of a specific healthcare system. Thus, the absence of external validation represents a substantial limitation. Although we implemented rigorous analytical methods to ensure internal validity (particularly the proposed biphasic, phenotype‑dependent predictive pattern of the NLR), our findings must be regarded as hypothesis‑generating rather than definitive. Independent validation in prospective, multi-center cohorts is urgently required before the NLR can be considered for routine clinical risk stratification. Furthermore, external validation should specifically test the generalizability of the identified inflection points (NLR = 7.3 and 27.3) and the differential predictive utility across SOFA strata observed in our study. Second, while NLR standardization was necessary to address scale disparities in multivariable regression modeling, this processing may reduce clinical interpretability. Additionally, the exclusive calculation of the NLR from the first available measurement within six hours of ICU admission captures only a static snapshot of the inflammatory state. Given the highly dynamic nature of sepsis, this approach may underestimate the prognostic value of NLR trajectories or peak values, which could better reflect the evolving host response. Future studies incorporating serial NLR measurements or longitudinal trajectory modeling are warranted to further validate our findings. Third, the dataset lacked several potentially relevant parameters, including nutritional status indicators, metabolic markers, and established inflammatory biomarkers such as procalcitonin and C-reactive protein, which may affect the generalizability of the results. Fourth, the sepsis management guidelines were revised multiple times during the extended study period spanning 2008–2022, which may have introduced temporal confounding factors that could have influenced the outcomes. Despite these limitations, this study benefits from a substantial sample size and comprehensive statistical methodology that may partially mitigate these concerns. Future research should focus on targeted analyses of specific sepsis patient subgroups to more precisely characterize the prognostic utility of the NLR.

Conclusions

This investigation revealed a positive correlation between the magnitude of NLR elevation and sepsis-related mortality, with the strongest predictive value being observed at the highest NLR values. This correlation may primarily reflect the ability of the NLR to predict the outcomes of clinically stable patients, whereas its predictive value may be reduced for patients with severe organ failure. Thus, the utility of the NLR appears to primarily involve dynamic monitoring rather than single static measurements. However, given the absence of external validation, these conclusions remain hypothesis‑generating and require confirmation in independent, prospective, multi-center studies before clinical implementation.

Supporting information

S1 Fig. Boruta algorithm feature selection results for XGBoost modeling.

Box plots display the Z scores of each parameter, with the x-axis showing their names and the y-axis showing the Z values. Parameters are color-coded by importance (blue for important, green for tentative, and red for unimportant).

https://doi.org/10.1371/journal.pone.0348676.s001

(TIF)

S2 Fig. Feature collinearity assessment in XGBoost modeling.

https://doi.org/10.1371/journal.pone.0348676.s002

(TIF)

S3 Fig. Unadjusted associations between NLR levels and mortality outcomes.

Panels A-C correspond to 28-day mortality (A), hospital mortality (B), and ICU mortality (C), respectively. Error bars represent 95% CIs, with the intermediate NLR concentration group serving as the reference category.

https://doi.org/10.1371/journal.pone.0348676.s003

(TIF)

S4 Fig. Restricted cubic spline plots illustrating the nonlinear relationship between the NLR and mortality.

(A) 28-day mortality, (B) in-hospital mortality, and (C) ICU mortality. The solid line represents the adjusted odds ratio; the pink band indicates the 95% confidence interval. The histogram displays the distribution of the NLR values, with the median (7.6) and the ROC-derived optimal cutoff (8.08) marked. The nonlinear association was statistically significant for all of the endpoints (P for nonlinearity < 0.05).

https://doi.org/10.1371/journal.pone.0348676.s004

(TIFF)

S1 Table. Missing rate for demographics and clinical variables extracted from the database during the observation period.

https://doi.org/10.1371/journal.pone.0348676.s005

(PDF)

S2 Table. Comparison of Key Variables Before and After Imputation Using the misForest Algorithm.

https://doi.org/10.1371/journal.pone.0348676.s006

(PDF)

S3 Table. Selected features and discarded features of collinearity assessment in XGBoost modeling.

https://doi.org/10.1371/journal.pone.0348676.s007

(PDF)

S4 Table. Performance evaluation of XGBoost model across training and test datasets.

https://doi.org/10.1371/journal.pone.0348676.s008

(PDF)

S5 Table. Feature importances in the XGBoost model.

https://doi.org/10.1371/journal.pone.0348676.s009

(PDF)

S6 Table. Subgroup analysis for the association of the NLR with 28-day mortality, hospital mortality and ICU mortality.

https://doi.org/10.1371/journal.pone.0348676.s010

(PDF)

Acknowledgments

The authors appreciate all of the investigators who organized, developed, and maintained the MIMIC database.

References

  1. 1. Evans L, Rhodes A, Alhazzani W, Antonelli M, Coopersmith CM, French C, et al. Surviving sepsis campaign: international guidelines for management of sepsis and septic shock 2021. Crit Care Med. 2021;49(11):e1063–143. pmid:34605781
  2. 2. Fleischmann-Struzek C, Mellhammar L, Rose N, Cassini A, Rudd KE, Schlattmann P, et al. Incidence and mortality of hospital- and ICU-treated sepsis: results from an updated and expanded systematic review and meta-analysis. Intensive Care Med. 2020;46(8):1552–62. pmid:32572531
  3. 3. Fleischmann C, Scherag A, Adhikari NK, Hartog CS, Tsaganos T, Schlattmann P, et al. Assessment of global incidence and mortality of hospital-treated sepsis. Current estimates and limitations. Am J Respir Crit Care Med. 2016;193(3):259–72. Epub 2015/09/29. pmid:26414292
  4. 4. Alberto L, Marshall AP, Walker R, Aitken LM. Screening for sepsis in general hospitalized patients: a systematic review. J Hosp Infect. 2017;96(4):305–15. Epub 2017/05/17. pmid:28506711
  5. 5. Bhattacharjee P, Edelson DP, Churpek MM. Identifying patients with sepsis on the hospital wards. Chest. 2017;151(4):898–907. pmid:27374948
  6. 6. Warttig S, Alderson P, Evans DJ, Lewis SR, Kourbeti IS, Smith AF. Automated monitoring compared to standard care for the early detection of sepsis in critically ill patients. Cochrane Database Syst Rev. 2018;6(6):CD012404. pmid:29938790
  7. 7. Makam AN, Nguyen OK, Auerbach AD. Diagnostic accuracy and effectiveness of automated electronic sepsis alert systems: a systematic review. J Hosp Med. 2015;10(6):396–402. pmid:25758641
  8. 8. Tang A, Shi Y, Dong Q, Wang S, Ge Y, Wang C, et al. Prognostic differences in sepsis caused by gram-negative bacteria and gram-positive bacteria: a systematic review and meta-analysis. Crit Care. 2023;27(1):467. pmid:38037118
  9. 9. Raith EP, Udy AA, Bailey M, McGloughlin S, MacIsaac C, Bellomo R, et al. Prognostic accuracy of the SOFA score, SIRS criteria, and qSOFA score for in-hospital mortality among adults with suspected infection admitted to the intensive care unit. JAMA. 2017;317(3):290–300. pmid:28114553
  10. 10. Moreno-Torres V, Royuela A, Múñez E, Ortega A, Gutierrez Á, Mills P, et al. Better prognostic ability of NEWS2, SOFA and SAPS-II in septic patients. Med Clin (Barc). 2022;159(5):224–9. pmid:34949450
  11. 11. Tekin B, Kiliç J, Taşkin G, Solmaz İ, Tezel O, Başgöz BB. The comparison of scoring systems: SOFA, APACHE-II, LODS, MODS, and SAPS-II in critically ill elderly sepsis patients. J Infect Dev Ctries. 2024;18(1):122–30. pmid:38377099
  12. 12. Song M, Graubard BI, Rabkin CS, Engels EA. Neutrophil-to-lymphocyte ratio and mortality in the United States general population. Sci Rep. 2021;11(1):464. pmid:33431958
  13. 13. Li Y, Wang W, Yang F, Xu Y, Feng C, Zhao Y. The regulatory roles of neutrophils in adaptive immunity. Cell Commun Signal. 2019;17(1):147. pmid:31727175
  14. 14. Lowsby R, Gomes C, Jarman I, Lisboa P, Nee PA, Vardhan M, et al. Neutrophil to lymphocyte count ratio as an early indicator of blood stream infection in the emergency department. Emerg Med J. 2015;32(7):531–4. pmid:25183249
  15. 15. Buonacera A, Stancanelli B, Colaci M, Malatino L. Neutrophil to lymphocyte ratio: an emerging marker of the relationships between the immune system and diseases. Int J Mol Sci. 2022;23(7):3636. pmid:35408994
  16. 16. Parthasarathi A, Padukudru S, Arunachal S, Basavaraj CK, Krishna MT, Ganguly K, et al. The role of neutrophil-to-lymphocyte ratio in risk stratification and prognostication of COVID-19: a systematic review and meta-analysis. Vaccines (Basel). 2022;10(8):1233. pmid:36016121
  17. 17. Snopkowska Lesniak SW, Maschio D, Neria F, Rey-Delgado B, Moreno Cuerda V, Henriquez-Camacho C. Novel biomarkers for SARS-CoV-2 infection: a systematic review and meta-analysis. J Pers Med. 2025;15(6):225. pmid:40559088
  18. 18. Wu H, Cao T, Ji T, Luo Y, Huang J, Ma K. Predictive value of the neutrophil-to-lymphocyte ratio in the prognosis and risk of death for adult sepsis patients: a meta-analysis. Front Immunol. 2024;15:1336456. pmid:38562922
  19. 19. Schupp T, Weidner K, Rusnak J, Jawhar S, Forner J, Dulatahu F, et al. The neutrophil-to-lymphocyte-ratio as diagnostic and prognostic tool in sepsis and septic shock. Clin Lab. 2023;69(5):10.7754/Clin.Lab.2022.220812. pmid:37145065
  20. 20. Goldberger A, Amaral L, Glass L, Hausdorff J, Ivanov PC, Mark R, et al. PhysioBank, PhysioToolkit, and PhysioNet: Components of a new research resource for complex physiologic signals. Circulation. 2000;101(23):e215–20.
  21. 21. Johnson AEW, Bulgarelli L, Shen L, Gayles A, Shammout A, Horng S, et al. MIMIC-IV, a freely accessible electronic health record dataset. Sci Data. 2023;10(1):1. pmid:36596836
  22. 22. Jiang D, Bian T, Shen Y, Huang Z. Association between admission systemic immune-inflammation index and mortality in critically ill patients with sepsis: a retrospective cohort study based on MIMIC-IV database. Clin Exp Med. 2023;23(7):3641–50. pmid:36930382
  23. 23. Dziura JD, Post LA, Zhao Q, Fu Z, Peduzzi P. Strategies for dealing with missing data in clinical trials: from design to analysis. Yale J Biol Med. 2013;86(3):343–58. pmid:24058309
  24. 24. Stekhoven DJ, Bühlmann P. MissForest--non-parametric missing value imputation for mixed-type data. Bioinformatics. 2012;28(1):112–8. pmid:22039212
  25. 25. Degenhardt F, Seifert S, Szymczak S. Evaluation of variable selection methods for random forests and omics data sets. Brief Bioinform. 2019;20(2):492–503. pmid:29045534
  26. 26. Chen T, Guestrin C. XGBoost: A Scalable Tree Boosting System. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; 2016. 785–94.
  27. 27. Lundberg SM, Erion G, Chen H, DeGrave A, Prutkin JM, Nair B, et al. From local explanations to global understanding with explainable AI for trees. Nat Mach Intell. 2020;2(1):56–67. pmid:32607472
  28. 28. Nohara Y, Matsumoto K, Soejima H, Nakashima N. Explanation of machine learning models using shapley additive explanation and application for real data in hospital. Comput Methods Programs Biomed. 2022;214:106584. pmid:34942412
  29. 29. Lou J, Xiang Z, Zhu X, Fan Y, Song J, Cui S, et al. A retrospective study utilized MIMIC-IV database to explore the potential association between triglyceride-glucose index and mortality in critically ill patients with sepsis. Sci Rep. 2024;14(1):24081. pmid:39402158
  30. 30. Zheng R, Qian S, Shi Y, Lou C, Xu H, Pan J. Association between triglyceride-glucose index and in-hospital mortality in critically ill patients with sepsis: analysis of the MIMIC-IV database. Cardiovasc Diabetol. 2023;22(1):307. pmid:37940931
  31. 31. Lou J, Xiang Z, Zhu X, Song J, Cui S, Li J, et al. The non-linear association between creatinine-to-albumin ratio and medium-term mortality in patients with sepsis accompanied by acute kidney injury in the intensive care unit: a retrospective study based on the MIMIC database and external validation. Front Cell Infect Microbiol. 2025;15:1602921. pmid:41425961
  32. 32. Saito T, Rehmsmeier M. The precision-recall plot is more informative than the ROC plot when evaluating binary classifiers on imbalanced datasets. PLoS One. 2015;10(3):e0118432. pmid:25738806
  33. 33. Akilli NB, Yortanlı M, Mutlu H, Günaydın YK, Koylu R, Akca HS, et al. Prognostic importance of neutrophil-lymphocyte ratio in critically ill patients: short- and long-term outcomes. Am J Emerg Med. 2014;32(12):1476–80. pmid:25264245
  34. 34. Chebl RB, Assaf M, Kattouf N, Haidar S, Khamis M, Abdeldaem K, et al. The association between the neutrophil to lymphocyte ratio and in-hospital mortality among sepsis patients: a prospective study. Medicine (Baltimore). 2022;101(30):e29343. pmid:35905272
  35. 35. Hwang SY, Shin TG, Jo IJ, Jeon K, Suh GY, Lee TR, et al. Neutrophil-to-lymphocyte ratio as a prognostic marker in critically-ill septic patients. Am J Emerg Med. 2017;35(2):234–9. pmid:27806894
  36. 36. Li J-Y, Yao R-Q, Liu S-Q, Zhang Y-F, Yao Y-M, Tian Y-P. Efficiency of monocyte/high-density lipoprotein cholesterol ratio combined with neutrophil/lymphocyte ratio in predicting 28-day mortality in patients with sepsis. Front Med (Lausanne). 2021;8:741015. pmid:34722578
  37. 37. Liu S, Li Y, She F, Zhao X, Yao Y. Predictive value of immune cell counts and neutrophil-to-lymphocyte ratio for 28-day mortality in patients with sepsis caused by intra-abdominal infection. Burns Trauma. 2021;9:tkaa040. pmid:33768121
  38. 38. Liu S, Wang X, She F, Zhang W, Liu H, Zhao X. Effects of neutrophil-to-lymphocyte ratio combined with interleukin-6 in predicting 28-day mortality in patients with sepsis. Front Immunol. 2021;12:639735. pmid:33796105
  39. 39. Wei W, Huang X, Yang L, Li J, Liu C, Pu Y, et al. Neutrophil-to-lymphocyte ratio as a prognostic marker of mortality and disease severity in septic acute kidney injury patients: a retrospective study. Int Immunopharmacol. 2023;116:109778. pmid:36738677
  40. 40. Ye W, Chen X, Huang Y, Li Y, Xu Y, Liang Z, et al. The association between neutrophil-to-lymphocyte count ratio and mortality in septic patients: a retrospective analysis of the MIMIC-III database. J Thorac Dis. 2020;12(5):1843–55. pmid:32642088
  41. 41. Denstaedt SJ, Cano J, Wang XQ, Donnelly JP, Seelye S, Prescott HC. Blood count derangements after sepsis and association with post-hospital outcomes. Front Immunol. 2023;14:1133351. pmid:36936903
  42. 42. Di Rosa M, Sabbatinelli J, Soraci L, Corsonello A, Bonfigli AR, Cherubini A, et al. Neutrophil-to-lymphocyte ratio (NLR) predicts mortality in hospitalized geriatric patients independent of the admission diagnosis: a multicenter prospective cohort study. J Transl Med. 2023;21(1):835. pmid:37990223
  43. 43. Chiew CJ, Liu N, Wong TH, Sim YE, Abdullah HR. Utilizing machine learning methods for preoperative prediction of postsurgical mortality and intensive care unit admission. Ann Surg. 2020;272(6):1133–9. pmid:30973386
  44. 44. Keilwagen J, Grosse I, Grau J. Area under precision-recall curves for weighted and unweighted data. PLoS One. 2014;9(3):e92209. pmid:24651729
  45. 45. Salciccioli JD, Marshall DC, Pimentel MAF, Santos MD, Pollard T, Celi LA, et al. The association between the neutrophil-to-lymphocyte ratio and mortality in critical illness: an observational cohort study. Crit Care. 2015;19(1):13. pmid:25598149
  46. 46. Lou J, Kong H, Xiang Z, Zhu X, Cui S, Li J, et al. The J-shaped association between the ratio of neutrophil counts to prognostic nutritional index and mortality in ICU patients with sepsis: a retrospective study based on the MIMIC database. Front Cell Infect Microbiol. 2025;15:1603104. pmid:40766844
  47. 47. Cai J, Jin Y, Lou J, Xu B, Qi H, Li J. The V-shaped association between the ratio of neutrophil counts to prognostic nutritional index and 30-, 60-, and 90-day mortality in elderly critically ill patients aged 65 and older with sepsis: a retrospective study based on the MIMIC database. Front Nutr. 2025;12:1602016. pmid:40969605
  48. 48. Karakonstantis S, Kalemaki D, Tzagkarakis E, Lydakis C. Pitfalls in studies of eosinopenia and neutrophil-to-lymphocyte count ratio. Infect Dis (Lond). 2018;50(3):163–74. pmid:29070003