Figures
Abstract
Background
Acute Respiratory Distress Syndrome (ARDS) remains a critical condition associated with high mortality rates, prolonged hospitalization, and reduced quality of life despite advances in critical care. The albumin-corrected anion gap (ACAG), an emerging biomarker reflecting acid-base disturbances, has been linked to poor outcomes in various critical illnesses. However, its prognostic value for mortality in ARDS patients remains unexplored.
Methods
This retrospective study analyzed data from ARDS patients admitted to intensive care units (ICUs) in the MIMIC-IV database. Patients were stratified into quartiles (Q1–Q4) based on ACAG levels. The association between ACAG and 28-day all-cause mortality was comprehensively evaluated using restricted cubic splines, Kaplan–Meier survival analysis, and Cox proportional hazards regression. We employed the Boruta algorithm and LASSO (Least Absolute Shrinkage and Selection Operator) regression to identify key predictive factors. Six machine learning algorithms were used to develop predictive models, with performance assessed by the area under the ROC curve (AUC).
Results
Higher ACAG levels were significantly associated with increased 28-day mortality risk in ARDS patients (P < 0.001). ACAG remained independently associated with 28-day all-cause mortality after comprehensive adjustment for confounders, with a hazard ratio (HR) of 1.04 (95% CI 1.01–1.07, P = 0.003) Subgroup analysis demonstrated that this association persisted across most demographic and clinical subgroups, with significant interactions observed only for myocardial infarction and malignancy (P for interaction < 0.05). Feature selection using Boruta and LASSO analyses consistently identified ACAG as a key predictor. Among the six machine learning models evaluated, the random forest (RF) algorithm demonstrated superior performance with an AUC of 0.73.
Citation: Liu Q, Zhang M, Wang D (2025) Albumin-corrected anion gap as a predictor of 28-day mortality in acute respiratory distress syndrome: A machine learning-based retrospective study. PLoS One 20(11): e0336662. https://doi.org/10.1371/journal.pone.0336662
Editor: Ahmet Çağlar, University of Health Sciences, Beyhekim Training and Research Hospital, TÜRKIYE
Received: July 4, 2025; Accepted: October 29, 2025; Published: November 20, 2025
Copyright: © 2025 Liu et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: This study is a retrospective analysis based on data from the Medical Information Mart for Intensive Care IV (MIMIC-IV) database, version 2.2, which includes detailed clinical information on over 65,000 ICU admissions at Beth Israel Deaconess Medical Center (Boston,Massachusetts, USA) between 2008 and 2019. Due to data use agreements with PhysioNet, individual-level patient data cannot be publicly shared. However, the same dataset is freely available to qualified researchers who complete the required ethics training and data use agreement at PhysioNet (https://physionet.org/). Interested researchers can access MIMIC-IV v2.2 (https://physionet.org/content/mimiciv/2.2/, DOI: https://doi.org/10.13026/6mm1-ek67) by: (1) registering at PhysioNet, (2) completing the CITI “Data or Specimens Only Research” course (https://about.citiprogram.org/course/data-or-specimens-only-research/), (3) signing the data use agreement, and (4) submitting an access request (approval typically within 1-2 weeks). The authors had no special access privileges.
Funding: This study was financially supported by The Key Laboratory of Respiratory Inflammatory Injury and Precision Diagnosis and Treatment, Chongqing Municipal Health and Health Committee in the form of an award received by DW. This study was also financially supported by the National Natural Science Foundation of China in the form of a grant (82270091) received by DW. No additional external funding was received for this study.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Acute Respiratory Distress Syndrome (ARDS) is a life-threatening condition characterized by diffuse alveolar damage, noncardiogenic pulmonary edema, and a severe inflammatory response, leading to impaired gas exchange and respiratory failure [1–3]. Despite advancements in critical care management, including optimized ventilatory strategies and supportive therapies, ARDS remains associated with high mortality rates of 35%–46%, imposing a significant burden on healthcare systems worldwide [4]
Epidemiological data highlight the significant prevalence of ARDS in intensive care settings. A 6-month prospective study at a UK University Hospital ICU found that 43 of 344 admitted patients (12.5%) developed ARDS, consistent with the reported incidence range of 2.5%–19% [4]. A large international cohort study, reported that ARDS accounted for 10.4% of ICU admissions and 23.4% of patients requiring mechanical ventilation, with a period prevalence of 30.0% mild, 46.6% moderate, and 23.4% severe ARDS [5]. These statistics underscore the need for early and accurate prognostic assessment to improve outcomes and reduce medical costs.
Recent research has focused on identifying prognostic biomarkers for ARDS, including inflammatory markers and indicators of alveolar epithelial or endothelial injury. However, these markers often reflect only specific pathological aspects of ARDS and may not capture the comprehensive disease burden [6,7], Traditional tools like the Acute Physiology Score III (APS III) and Sequential Organ Failure Assessment (SOFA) scores are widely used to gauge disease severity and mortality risk [8]. However, their complexity, requirement for multiple variables, and inconsistent performance in certain patient subgroups limit their practical utility in resource-constrained settings. Thus, there remains an urgent need for simple, reliable, and complementary prognostic tools to improve risk stratification, especially given the variability in healthcare practices across regions, where universally applicable biomarkers are essential.
Metabolic acidosis is a common complication in ARDS and may reflect the degree of systemic hypoxia and inflammation [9–12]. The anion gap (AG), a routinely measured biomarker, quantifies the difference between unmeasured serum cations and anions, aiding in diagnosing acid-base imbalances and identifying causes of metabolic acidosis [13,14]. However, hypoalbuminemia, a common condition in critically ill patients, can significantly affect AG calculations, potentially leading to false negatives in detecting metabolic acidosis [15,16]. To overcome the limitations of conventional AG, the albumin-corrected anion gap (ACAG) was developed, adjusting for serum albumin levels to provide a more accurate reflection of acid-base status [17].
Previous studies have demonstrated the prognostic value of ACAG in various critical care settings, including septic patients with diabetes mellitus, congestive heart failure, and acute kidney injury (AKI) [18–20]. Compared with the conventional anion gap (AG), ACAG consistently shows superior predictive performance. For example, in ICU patients with asthma, Wang et al. reported that ACAG predicted 30-day mortality more accurately than AG, with areas under the curve (AUCs) of 0.703 versus 0.642 respectively, using an optimal cut-off of 20.38 mmol/L [21]. Similarly, in patients with acute myocardial infarction, elevated ACAG was independently associated with all-cause mortality (HR 1.423, 95% CI 1.206–1.678), and its AUC for predicting 360-day mortality (0.651) exceeded that of AG (0.609), further supporting the superior prognostic utility of ACAG over AG in diverse critical care settings [22]. Other investigations have also confirmed ACAG’s predictive advantage over conventional AG in various conditions [23].
In the context of ARDS specifically, Li et al. demonstrated that elevated AG was associated with 90-day mortality (HR 1.06, 95% CI 1.02–1.10); however, this study did not account for hypoalbuminemia, potentially underestimating the true extent of metabolic acidosis in this population [24]. Given that ARDS is frequently accompanied by both metabolic acidosis and hypoalbuminemia, the conventional AG may provide an incomplete assessment of acid-base status. Therefore, ACAG may offer superior prognostic value in ARDS patients, though this specific application remains unexplored. Our study aims to address this gap by evaluating ACAG as a predictor of 28-day mortality in ARDS patients. We investigated the relationship between ACAG and 28-day mortality and developed predictive models integrating ACAG with other clinical parameters using machine learning (ML) approaches. By establishing the prognostic value of ACAG in ARDS, our findings provide clinicians with a simple and readily available tool for risk stratification and outcome assessment in this high-mortality population.
Methods
Data source and study population
This is a retrospective study based on the clinical data of patients with ARDS from MIMIC-IV version 2.2, which includes information on over 65,000 ICU admissions and more than 200,000 emergency department admissions at Beth Israel Deaconess Medical Center (BIDMC) in Boston, Massachusetts, between 2008 and 2019 [25]. Patient confidentiality is maintained through de-identification (real data replaced with arbitrary numbers), eliminating the need for ethical approval and informed consent. The author accessed the database after completing the CITI Program (certificate #62816135). Data extraction and analysis procedures are detailed below, allowing replication by researchers with MIMIC-IV access.
Admission information for ARDS patients was identified if they met the ICD-10 code J80 (ARDS) or fulfilled the Berlin definition [26]. The Berlin definition includes the following criteria: (1) acute onset of respiratory symptoms; (2) bilateral opacities on chest imaging; (3) a partial pressure of arterial oxygen (PaO₂) to fraction of inspired oxygen (FiO₂) ratio < 300 mmHg with a minimum positive end-expiratory pressure (PEEP) ≥ 5 cmH2O; and (4) absence of cardiac failure. Patients were excluded if they met any of the following conditions: (1) missing anion gap or albumin values in the first laboratory test; (2) ICU stay < 24 hours; (3) multiple ICU admissions (only the first admission was considered); or (4) age < 18 year. Ultimately, this study included 942 patients (Fig 1).
Data extraction
Data was extracted using Navicat software. Characteristics were collected at the time of the patients’ admission including age, gender, race/ethnicity. Vital signs: Heart rate, mean blood pressure (MBP), body temperature, pulse oxygen saturation (SpO₂). Comorbidities: Diabetes, liver disease, renal disease, cerebrovascular disease, sepsis, myocardial infarction, and malignancy. Laboratory parameters: Anion gap (AG), albumin, white blood cell count (WBC), hemoglobin, platelet count, red cell distribution width (RDW), mean corpuscular volume (MCV), bicarbonate, blood urea nitrogen (BUN), creatinine, sodium, potassium, chloride, glucose, absolute neutrophil count, absolute lymphocyte count, absolute monocyte count, and absolute eosinophil count. Interventions: Renal replacement therapy (RRT) and vasopressor use. Severity scores: Sequential Organ Failure Assessment (SOFA) score and Acute Physiology Score III (APS III).
The ACAG value was calculated using the formula: ACAG = AG + [4.4– albumin (g/dL)] × 2.5 [17,27]. Missing data were handled using multiple imputation by chained equations (MICE) in R. A detailed summary of missingness for each variable is provided in Table S2 in S1 File. Variables with more than 25% missing values were excluded from the analysis. For the remaining variables, five imputed datasets were generated, and the estimates from each dataset were combined using Rubin’s rules to obtain pooled effect estimates and standard errors. MICE was chosen because it flexibly handles both continuous and categorical variables, preserves sample size, and yields valid estimates by incorporating uncertainty in the imputation process.
Sampling size
The sample size was calculated according to the cross-sectional study sample size formula [28].
Where α = 0.05 (Z = 1.96), d = 0.05, and p represents the prevalence of the primary outcome. Based on previous studies, the mortality of ARDS patients was estimated at 46% [4]. The required sample size was therefore 381. In the present study, 942 patients were included, exceeding the minimum required sample size, indicating that the study had adequate statistical power and representativeness.
Groups and clinical outcome
In line with previous research, ACAG levels > 20 mmol/L are generally considered high in clinical practice, while values between 12–20 mmol/L are regarded as normal [29]. In our study, to better reflect the distribution of ACAG within the cohort, patients were stratified into quartiles (Q1 < 15.2, Q2 15.2–17.5, Q3 17.5–20.8, Q4 > 20.8). The primary clinical outcome was the 28-day mortality after ICU admission.
Statistical analysis
Normally distributed continuous variables are presented as the means (standard deviations [SDs]) and were analyzed using analysis of variance (ANOVA). Nonnormally distributed variables were analyzed using the Mann-Whitney U test or the Kruskal-Wallis test. Categorical variables are expressed as numbers and percentages and were analyzed using the χ2 test or Fisher’s exact test. Kaplan-Meier survival curves were constructed to compare 28-day mortality across ACAG quartiles, with differences assessed using the log-rank test. Restricted cubic spline (RCS) analysis was performed to examine the potentially non-linear relationship between ACAG as a continuous variable and 28-day mortality. Multivariate Cox proportional hazards regression was used to evaluate the independent association between ACAG and 28-day mortality after adjusting for potential confounders. Hazard ratios (HR) and 95% confidence intervals (95% CI) were reported. We constructed three models with progressive adjustment: Model I: Unadjusted model; Model II: Adjusted for demographics and clinical characteristics (age, gender, race, vital signs, and comorbidities); Model III: Fully adjusted model including all variables in Model II plus laboratory parameters, severity scores, and interventions. Subgroup analyses were conducted to assess whether the association between ACAG and mortality was consistent across different patient subgroups, including age, gender, comorbidities, and interventions.
To identify key predictive features, we employed two complementary feature selection methods: Boruta Algorithm: A wrapper algorithm based on random forest that determines feature importance by comparing original attributes with their randomly permuted copies; LASSO (Least Absolute Shrinkage and Selection Operator) Regression: A regularization technique that performs variable selection by penalizing the absolute size of the regression coefficients. Variables selected by both methods were included in the final prediction models. Multicollinearity among selected features was assessed using variance inflation factors (VIF), with values < 5 considered acceptable. The dataset was randomly divided into a training set (70%, n = 658) and a validation set (30%, n = 284). Six machine learning algorithms were implemented: Logistic Regression (LR), K-Nearest Neighbors (KNN), Random Forest (RF), Extreme Gradient Boosting (XGBoost), MLP (Multilayer Perceptron), Enet (Elastic Net) algorithms were employed to assess the 28-day death risk in ARDS patients, respectively. The ROC curves and AUCs were utilized to assess model performance. Decision curve analysis (DCA) was implemented to evaluate clinical effectiveness, while calibration curves were employed to measure the accuracy of absolute risk predictions. For the best-performing model, we further explored feature importance using partial dependence plots (PDPs) and SHAP (SHapley Additive exPlanations) values to understand how each feature contributes to the predictions. The detailed of model development details is provided in Table S2 in S1 File.
All statistical analyses were conducted using R software (version 4.4.5), with a p-value < 0.05 considered statistically significant.
Results
Participant characteristics
After applying the inclusion and exclusion criteria, a total of 942 eligible patients with ARDS were included in our analysis (Fig 1). Table 1 presents the baseline characteristics of patients stratified by ACAG quartiles. Patients in higher ACAG quartiles exhibited significantly higher heart rates, body temperatures, white blood cell counts, platelet counts, red cell distribution width (RDW), serum creatinine, blood urea nitrogen (BUN), and lactate levels. They also had higher SOFA and APS III scores, indicating greater disease severity. Additionally, these patients had higher prevalence of diabetes, malignant cancer, myocardial infarction, acute kidney injury (AKI), and sepsis, and were more likely to require renal replacement therapy (RRT). Conversely, they had lower levels of bicarbonate, albumin, and chloride, consistent with more severe metabolic acidosis. The overall 28-day mortality rate in the cohort was 21.3% (201/942). Notably, mortality rates increased progressively across ACAG quartiles: 18.8% in Q1, 16.0% in Q2, 21.5% in Q3, and 29.1% in Q4 (P < 0.001)
Kaplan-Meier survival curve analysis
During the 28-day follow-up period, 201 patients (21.3%) experienced mortality. Kaplan-Meier survival curves for 28-day mortality stratified by ACAG quartiles (Fig 2) demonstrated a significant separation between groups, with higher ACAG levels associated with poorer survival (log-rank test, P < 0.001).
Association between ACAG and 28-day mortality and the subgroup analysis
The relationship between ACAG and 28-day mortality was further examined using Cox proportional hazards regression models with different levels of adjustment (Table 2). Covariates included in the multivariable models were selected based on clinical relevance and previously reported prognostic importance in critically ill patients, in order to minimize potential confounding. Higher levels of ACAG were significantly associated with increased 28-day mortality in patients with ARDS. In the unadjusted model, we observed a significant association between ACAG and 28-day mortality in ARDS patients (HR:1.04, 95% CI 1.01–1.06, P = 0.003). Even in the fully adjusted model, ACAG persisted as an independent risk factor. When analyzed as quartiles variable, patients in the highest ACAG(Q4) quartile had a significantly higher risk of 28-day mortality compared to those in the lowest quartile(Q1) in the fully adjusted model (HR = 1.66; 95% CI: 1.14–2.42; p = 0.008). Moreover, there was a significant linear trend across quartiles (P for trend < 0.05), further supporting a dose-response relationship. Moreover, in the multivariable restricted cubic spline model (Fig 3), a linear association was observed between ACAG and the risk of 28-day mortality (P for nonlinear = 0.113, P for overall < 0.001).
Curves represent estimated hazard ratios, and shaded ribbons represent 95% confidence intervals. HR hazard ratio; CI confidence interval.
The restricted cubic spline analysis (Fig 3) confirmed a predominantly linear association between ACAG and the risk of 28-day mortality (P for nonlinearity = 0.113, P for overall association < 0.001), with the risk increasing progressively as ACAG levels rose above approximately 15 mmol/L.
Subgroup analysis
Subgroup analyses across demographics, comorbidities, and therapeutic interventions generally demonstrated a consistent prognostic value of ACAG in ARDS patients. Notably, significant interactions were only observed in subgroups with myocardial infarction and malignant cancer, suggesting potential effect modification in these populations (Fig 4).
Feature selection
First, LASSO regression with fivefold cross-validation was applied to eliminate non-informative variables. The optimal penalty parameter (λ) minimizing cross-validation error was selected to balance model parsimony and generalizability (Fig 5A), resulting in 18 features with non-zero coefficients (Fig 5B). Subsequently, the Boruta algorithm, a random forest–based wrapper method, was employed to further evaluate feature relevance by comparing original predictors with permuted shadow features (Fig 6), confirming 15 variables as statistically significant. To ensure robustness and clinical interpretability, we retained the intersection of features consistently identified by both methods, yielding a final predictor set comprising ACAG, age, malignant cancer, APS III score, SOFA score, heart rate, body temperature, SpO₂, glucose, WBC count, absolute neutrophil count, and albumin. Variance inflation factor (VIF) analysis confirmed the absence of significant multicollinearity, with all VIF values < 5.
(A) LASSO coefficient of variables. (B) Cross-validation curve.
Machine learning model development
The performance of the six machine learning models in predicting 28-day mortality is presented in Fig 7A. The Random Forest (RF) algorithm demonstrated the best discrimination with an AUC of 0.73 in the validation set, followed by K-Nearest Neighbors (AUC = 0.72), Elastic Net (AUC = 0.71), Multilayer Perceptron (AUC = 0.71), XGBoost (AUC = 0.71), and Logistic Regression (AUC = 0.69). Decision curve analysis (Fig 7B) showed that all models provided substantial net benefit across a wide range of threshold probabilities, confirming their potential clinical utility. Calibration curves indicated good agreement between predicted and observed outcomes for the RF, LR, KNN, and XGBoost models (Fig S1 in S1 file). Taken together, the Random Forest algorithm performed best overall.
A ROC curve of the machine learning model. B Decision curve analysis (DCA). LR,Logistic Regression, KNN, K-Nearest Neighbors, RF,Random Forest, XGBoost,Extreme Gradient Boosting, MLP, Multilayer Perceptron, Enet, Elastic Net.
Interpretability analysis
To enhance interpretability, we employed SHAP analysis to examine both the global and individual-level contributions of predictive features in the Random Forest model, which achieved the highest overall performance. The SHAP-based feature importance ranking plot (Fig 8A) and swarm plot (Fig 8B) depict the global impact of each variable on model predictions. On the horizontal axis, SHAP values quantify both the magnitude and direction of each feature’s contribution to the predicted outcome, while the vertical axis ranks the variables according to their cumulative importance. Our analysis revealed that APS III score, SOFA score, WBC count, SpO₂, and ACAG were among the influential predictors.
(A) SHAP feature importance plot; (B) SHAP summary plot; (C, D) SHAP force plot.
Additionally, SHAP force plots were applied to visualize case-specific predictions (Fig 8C and 8D). These exemplary cases illustrate the model’s decision-making process for a survivor and a non-survivor. The visualization employs a dual-color coding scheme, with red denoting risk-enhancing factors (positive SHAP values) and blue representing protective factors (negative SHAP values). In the non-survivor case, the model predicted a substantially elevated 28-day mortality risk compared with the baseline population, driven by the predominance of risk-enhancing variables. Such individualized explanations provide clinicians with transparent insights into how different factors interact to shape mortality risk, thereby facilitating early recognition of clinical deterioration and enabling data-driven therapeutic decision-making in ARDS management.
Discussion
This retrospective study comprehensively investigated the prognostic significance of the albumin-corrected anion gap (ACAG) in patients with acute respiratory distress syndrome (ARDS), and its association with 28-day all-cause mortality. Our results demonstrate that higher ACAG levels are independently associated with an increased risk of short-term mortality, with a 4% increase in mortality risk per 1 mmol/L increase in ACAG in our fully adjusted model. This relationship was validated through multiple analytical approaches, including restricted cubic splines (RCS), Kaplan–Meier survival analysis, Cox proportional hazards regression, and machine learning-based predictive modeling. These findings highlight ACAG as a promising, simple, and accessible biomarker for risk stratification in ARDS, with potential clinical utility in the intensive care unit (ICU) setting.
Acid–base balance plays a central role in maintaining physiological stability in critical illness [30]. Metabolic acidosis, the most common acid–base disturbance in critically ill patients, can be categorized into high anion gap (AG) and normal AG metabolic acidosis [10,31]. The AG has long been used as a diagnostic tool and has shown prognostic value in several conditions, including acute ischemic stroke and coronary artery disease [32,33]. However, since albumin constitutes a substantial portion of unmeasured anions, hypoalbuminemia, common in critically ill patients, can lead to a falsely low AG, potentially misguiding clinical assessment [34]. Hypoalbuminemia reflects both nutritional status and the acute phase response to inflammation, both of which have prognostic implications in critical illness. A secondary analysis of 794 septic patients with acute kidney injury undergoing continuous renal replacement therapy in Korea showed that each 1 g/dL increase in albumin level was independently associated with a 25% and 27% reduction in the risk of death at 28 and 90 d, respectively [35].
In ARDS, the pathophysiological mechanisms such as widespread endothelial injury, impaired oxygenation, and systemic hypoperfusion contribute to increased lactic acid production and accumulation, a key cause of high AG metabolic acidosis [2]. Additionally, liver and kidney dysfunction, which are common in ARDS and further impair acid clearance, can exacerbate acid–base imbalance [36,37]. Clinical evidence suggests that a greater degree of metabolic acidosis at admission, reflected by a lower base deficit, is associated with an increased risk of developing acute lung injury [37]. Given the limitations of conventional AG in the context of hypoalbuminemia, the albumin-corrected anion gap (ACAG) provides a more accurate reflection of acid–base disturbances [17]. Compared with AG alone, ACAG better reflects metabolic status and disease burden, supporting its role as a complementary prognostic biomarker in ARDS.
Our findings extend the existing evidence base by specifically demonstrating ACAG’s prognostic value in ARDS patients. This aligns with prior research linking elevated AG and ACAG to worse outcomes across various critical illnesses, including congestive heart failure, acute kidney injury, chronic obstructive pulmonary disease, sepsis, and asthma [18–20,38,39]. For instance, in acute respiratory failure, elevated ACAG has been linked to 28-day mortality [40].
Subgroup analysis in our study showed that the association between ACAG and mortality was consistent across most clinical subgroups, supporting its generalizability as a prognostic marker. However, significant interactions were observed in patients with myocardial infarction and malignancy, suggesting that comorbid conditions may modify the prognostic utility of ACAG. A recent large-scale retrospective cohort study demonstrated that ACAG was independently associated with an increased risk of in-hospital mortality in patients with acute myocardial infarction (AMI) [41]. The different strength of association in these subgroups highlights the need for disease-specific considerations when interpreting ACAG values and the potential for developing tailored risk prediction models for different patient populations.
A key strength of our study is the application of machine learning to identify important predictors and build predictive models. Variable selection techniques, including Boruta and least absolute shrinkage and selection operator (LASSO), consistently identified ACAG as a key prognostic feature. In the random forest model, which achieved the highest predictive accuracy (AUC = 0.73), ACAG ranked alongside APS III, SOFA, white blood cell (WBC) count, SpO₂, and heart rate, underscoring its potential to complement complex severity scores. Although predictive performance was moderate, ACAG is a simple, readily available marker requiring only two laboratory measurements (anion gap and albumin), which can be obtained rapidly at the bedside for early identification of high-risk patients. Integrating ACAG with established scores may enhance risk stratification and support timely clinical decision-making in the ICU, particularly in patients with hypoalbuminemia where traditional scores like APS III or SOFA may underestimate metabolic derangements [3,8]. This aligns with prior studies in other populations, such as AKI in heart failure, where combining ACAG with SOFA or APS III improved prognostic performance (AUCs 0.68–0.75) [42]. Similarly, in patients with AKI, ACAG outperformed the conventional anion gap and further enhanced the prognostic ability of the SOFA score (AUC = 0.662) [20].
ACAG provides particular value in patients with hypoalbuminemia, where conventional anion gap may underestimate metabolic acidosis severity, offering more accurate acid-base status assessment for precise clinical decision-making. Since ACAG calculation requires only routine laboratory measurements obtained at ICU admission, it can be rapidly implemented without additional testing burden across diverse healthcare settings, including resource-limited environments, while providing immediate prognostic information to guide early clinical decisions and optimize resource allocation.
Despite these promising findings, several limitations should be acknowledged. First, this was a single-center retrospective study based on data from a single tertiary care hospital in the United States, which may introduce potential selection bias and limit the generalizability of our findings to other settings, patient populations, or healthcare systems. Second, the study relied on a single database (MIMIC-IV), and external validation in independent cohorts was not performed. Third, although we adjusted for multiple potential confounders, residual confounding cannot be entirely ruled out due to the observational nature of the study; unmeasured variables such as genetic factors, pre-hospital treatment, or specific ventilator settings could influence both ACAG levels and mortality outcomes. Fourth, ACAG was assessed only at ICU admission, and dynamic changes over time were not evaluated. Serial measurements of ACAG could potentially provide additional prognostic information and better reflect the evolving metabolic status of critically ill patients. Fifth, while we demonstrated an association between ACAG and mortality, causality cannot be inferred. Finally, missing data were handled using multiple imputation by chained equations, which assumes missing at random; residual bias due to missing not at random cannot be entirely excluded.
Conclusion
Elevated ACAG is an independent predictor of 28-day mortality in ARDS patients. As a readily available and easily interpretable parameter, ACAG may serve as a valuable adjunct to existing severity scores for early risk stratification in the ICU.
Supporting information
S1 File. Tables S1–S2 and Fig S1: missingness, model development detail, Calibration plot.
https://doi.org/10.1371/journal.pone.0336662.s001
(DOCX)
References
- 1. Meyer NJ, Gattinoni L, Calfee CS. Acute respiratory distress syndrome. Lancet. 2021;398(10300):622–37. pmid:34217425
- 2. Gorman EA, O’Kane CM, McAuley DF. Acute respiratory distress syndrome in adults: diagnosis, outcomes, long-term sequelae, and management. Lancet. 2022;400(10358):1157–70. pmid:36070788
- 3. Ferreira PN, Duarte JA, Fustiga J, Anastácio M. Performance of Acute Physiology and Chronic Health Evaluation (APACHE II), Simplified Acute Physiology Score (SAPS III), and Sequential Organ Failure Assessment (SOFA) Scores in a Medical Intermediate Care Unit. Cureus. 2025;17(7):e89010. pmid:40895916
- 4. Summers C, Singh NR, Worpole L, Simmonds R, Babar J, Condliffe AM, et al. Incidence and recognition of acute respiratory distress syndrome in a UK intensive care unit. Thorax. 2016;71(11):1050–1. pmid:27552782
- 5. Bellani G, Laffey JG, Pham T, Fan E, Brochard L, Esteban A, et al. Epidemiology, Patterns of Care, and Mortality for Patients With Acute Respiratory Distress Syndrome in Intensive Care Units in 50 Countries. JAMA. 2016;315(8):788–800. pmid:26903337
- 6. Li X, Shang Y, Zhang J, Mu G, Duan Y, Lu Z, et al. Predictive value of H3K18 lactylation for early detection and prognosis of sepsis-related ARDS: a prospective observational clinical study. Shock. 2025.
- 7. Zhao J, Ji Y, Li B, Yang K, Sun Q, Ma T, et al. Longitudinal Changes in Peripheral and Alveolar Monocyte and Inflammatory Biomarkers are Distinct in Hypercapnia Patients Following Pulmonary Sepsis-Induced ARDS. J Inflamm Res. 2025;18:5217–33. pmid:40255660
- 8. Fan S, Ma J. The value of five scoring systems in predicting the prognosis of patients with sepsis-associated acute respiratory failure. Sci Rep. 2024;14(1):4760. pmid:38413621
- 9. Gendreau S, Geri G, Pham T, Vieillard-Baron A, Mekontso Dessap A. The role of acute hypercapnia on mortality and short-term physiology in patients mechanically ventilated for ARDS: a systematic review and meta-analysis. Intensive Care Med. 2022;48(5):517–34. pmid:35294565
- 10. Achanti A, Szerlip HM. Acid-Base Disorders in the Critically Ill Patient. Clin J Am Soc Nephrol. 2023;18(1):102–12. pmid:35998977
- 11. Lee CH, Banoei MM, Ansari M, Cheng MP, Lamontagne F, Griesdale D, et al. Using a targeted metabolomics approach to explore differences in ARDS associated with COVID-19 compared to ARDS caused by H1N1 influenza and bacterial pneumonia. Crit Care. 2024;28(1):63. pmid:38414082
- 12. Noritomi DT, Soriano FG, Kellum JA, Cappi SB, Biselli PJC, Libório AB, et al. Metabolic acidosis in patients with severe sepsis and septic shock: a longitudinal quantitative study. Crit Care Med. 2009;37(10):2733–9. pmid:19885998
- 13. Kraut JA, Madias NE. Serum anion gap: its uses and limitations in clinical medicine. Clin J Am Soc Nephrol. 2007;2(1):162–74. pmid:17699401
- 14. Brenner BE. Clinical significance of the elevated anion gap. Am J Med. 1985;79(3):289–96. pmid:4036980
- 15. Kim MJ, Kim YH, Sol IS, Kim SY, Kim JD, Kim HY, et al. Serum anion gap at admission as a predictor of mortality in the pediatric intensive care unit. Sci Rep. 2017;7(1):1456. pmid:28469150
- 16. Nanji AA, Campbell DJ, Pudek MR. Decreased anion gap associated with hypoalbuminemia and polyclonal gammopathy. JAMA. 1981;246(8):859–60. pmid:6166764
- 17. Hatherill M, Waggie Z, Purves L, Reynolds L, Argent A. Correction of the anion gap for albumin in order to detect occult tissue anions in shock. Arch Dis Child. 2002;87(6):526–9. pmid:12456555
- 18. Yang C, Jiang Y, Zhang C, Min Y, Huang X. The predictive values of admission characteristics for 28-day all-cause mortality in septic patients with diabetes mellitus: a study from the MIMIC database. Front Endocrinol (Lausanne). 2023;14:1237866. pmid:37608790
- 19. Li N, Li J, Wang K. Independent prognostic importance of the albumin-corrected anion gap in critically ill patients with congestive heart failure: a retrospective study from MIMIC-IV database. BMC Cardiovasc Disord. 2024;24(1):735. pmid:39707198
- 20. Gao P, Min J, Zhong L, Shao M. Association between albumin corrected anion gap and all-cause mortality in critically ill patients with acute kidney injury: a retrospective study based on MIMIC-IV database. Ren Fail. 2023;45(2):2282708. pmid:37975171
- 21. Wang S, Teng H, Han H, Xu Y. The relationship between albumin corrected anion gap levels and mortality in patients with asthma in the ICU. Sci Rep. 2023;13(1):16903. pmid:37803051
- 22. Sheng H, Lu J, Zhong L, Hu B, Sun X, Dong H. The correlation between albumin-corrected anion gap and prognosis in patients with acute myocardial infarction. ESC Heart Fail. 2024;11(2):826–36. pmid:38164072
- 23. Huang S, Zhang Q, Wei F, Kutryk MJB, Zhang J. Association of albumin-corrected anion gap with mortality in ICU patients with heart failure and acute kidney injury: analysis of the MIMIC-IV database. Eur J Med Res. 2025;30(1):745. pmid:40796876
- 24. Li Y, Luo W, Wang Q, Chen Y, Bai F, Zeng Q, et al. Anion gap predicting 90-Day mortality and guiding furosemide use in ARDS. Sci Rep. 2025;15(1):4954. pmid:39930113
- 25. Johnson AEW, Bulgarelli L, Shen L, Gayles A, Shammout A, Horng S, et al. MIMIC-IV, a freely accessible electronic health record dataset. Sci Data. 2023;10(1):1. pmid:36596836
- 26. ARDS Definition Task Force, Ranieri VM, Rubenfeld GD, Thompson BT, Ferguson ND, Caldwell E, et al. Acute respiratory distress syndrome: the Berlin Definition. JAMA. 2012;307(23):2526–33. pmid:22797452
- 27. Fenves AZ, Emmett M. Approach to Patients With High Anion Gap Metabolic Acidosis: Core Curriculum 2021. Am J Kidney Dis. 2021;78(4):590–600. pmid:34400023
- 28. Naing L, Nordin RB, Abdul Rahman H, Naing YT. Sample size calculation for prevalence studies using Scalex and ScalaR calculators. BMC Med Res Methodol. 2022;22(1):209. pmid:35907796
- 29. He X, Liao X, Xie Z, Jiang C, Kang Y. Albumin corrected anion gap is an independent risk factor for long-term mortality of patients with sepsis. Zhonghua Wei Zhong Bing Ji Jiu Yi Xue. 2017;29(2):117–21. pmid:28625257
- 30. Greenberg KI, Lecker SH. Metabolic acidosis. Adv Kidney Dis Health. 2025;32(1):61–8.
- 31. Tanemoto M. Progression of Metabolic Acidosis in Chronic Kidney Disease. Kidney Dis (Basel). 2020;6(1):59–63. pmid:32021875
- 32. Jhou H-J, Chen P-H, Yang L-Y, Chang S-H, Lee C-H. Plasma Anion Gap and Risk of In-Hospital Mortality in Patients with Acute Ischemic Stroke: Analysis from the MIMIC-IV Database. J Pers Med. 2021;11(10):1004. pmid:34683145
- 33. Li M, Li C, Wang J, Yuan Q. The association between anion gap and prognosis in patients myocardial infarction with congestive heart failure: a retrospective analysis of the MIMIC-IV database. Int J Emerg Med. 2025;18(1):33. pmid:39994567
- 34. Pratumvinit B, Lam L, Kongruttanachok N, Hemthong C, Kost GJ, Kamkang P, et al. Anion gap reference intervals show instrument dependence and weak correlation with albumin levels. Clin Chim Acta. 2020;500:172–9. pmid:31669932
- 35. Sheng S, Zhang Y-H, Ma H-K, Huang Y. Albumin levels predict mortality in sepsis patients with acute kidney injury undergoing continuous renal replacement therapy: a secondary analysis based on a retrospective cohort study. BMC Nephrol. 2022;23(1):52. pmid:35109818
- 36. Lee JY, Alexeyev M, Kozhukhar N, Pastukh V, White R, Stevens T. Carbonic anhydrase IX is a critical determinant of pulmonary microvascular endothelial cell pH regulation and angiogenesis during acidosis. Am J Physiol Lung Cell Mol Physiol. 2018;315(1):L41–51. pmid:29631360
- 37. Eberhard LW, Morabito DJ, Matthay MA, Mackersie RC, Campbell AR, Marks JD, et al. Initial severity of metabolic acidosis predicts the development of acute lung injury in severely traumatized patients. Crit Care Med. 2000;28(1):125–31. pmid:10667511
- 38. Giri M, Puri A, Huang L, Guo S. Association between albumin corrected anion gap and in-hospital mortality in critically ill patients with chronic obstructive pulmonary disease. Ther Adv Respir Dis. 2025;19:17534666251315352. pmid:39866132
- 39. Zhou Q, Miao Y, Wang P, Li F, Li J, Li N. Association between albumin corrected anion gap and mortality in septic older adults. Geriatr Nurs. 2024;60:580–5. pmid:39461109
- 40. Qu J, Tang X, Cheng Y, Xiong W, Zhao Y. Association Between Albumin Corrected Anion Gap and 28-Day All-Cause Mortality in Patients With Acute Respiratory Failure in ICU: A Retrospective Study Based on the MIMIC-IV Database. Clin Respir J. 2025;19(7):e70100. pmid:40631543
- 41. Lu Z, Yao Y, Xu Y, Zhang X, Wang J. Albumin corrected anion gap for predicting in-hospital death among patients with acute myocardial infarction: A retrospective cohort study. Clinics (Sao Paulo). 2024;79:100455. pmid:39079461
- 42. Ruan A-F, Zheng J-W, Sun S-Q, Liu X-Z, Chen T-L. The association of albumin-corrected anion gap and acute kidney injury in heart failure patients: a competing risk model analysis. BMC Cardiovasc Disord. 2025;25(1):277. pmid:40217175