Construction of an automated machine learning-based predictive model for postoperative pulmonary complications risk in non-small cell lung cancer patients undergoing thoracoscopic surgery

Xie Qiu; Shuo Hu; Shumin Dong; Haijun Sun

doi:10.1371/journal.pone.0333413

Abstract

Objective

To develop a predictive framework integrating machine learning and clinical parameters for postoperative pulmonary complications (PPCs) in non-small cell lung cancer (NSCLC) patients undergoing video-assisted thoracic surgery (VATS).

Methods

This retrospective study analyzed 286 NSCLC patients (2022–2024), incorporating 13 demographic, metabolic-inflammatory, and surgical variables. An Improved Blood-Sucking Leech Optimizer (IBSLO) enhanced via Cubic mapping and opposition-based learning was developed. Model performance was evaluated using AUC-ROC, F1-score, and decision curve analysis (DCA). SHAP interpretation identified key predictors.

Results

The IBSLO demonstrated significantly superior convergence performance versus original BSLO, ant lion optimizer (ALO), Harris hawks optimization (HHO), and whale optimization algorithm (WOA) across all 12 CEC2022 test functions. Subsequently, the IBSLO-optimized automated machine learning (AutoML) model achieved ROC-AUC/PR-AUC values of 0.9038/0.8091 (training set) and 0.8775/0.8175 (testing set), significantly outperforming four baseline models: logistic regression (LR), support vector machine (SVM), XGBoost, and LightGBM. SHAP interpretability identified six key predictors: preoperative leukocyte count, body mass index (BMI), surgical approach, age, intraoperative blood loss, and C-reactive protein (CRP). Decision curve analysis demonstrated significantly higher net clinical benefit of the AutoML model compared to conventional methods across expanded threshold probability ranges (training set: 8–99%; testing set: 3–80%).

Conclusion

This study establishes an interpretable machine learning framework that improves preoperative risk stratification for NSCLC patients, offering actionable guidance for thoracic oncology practice.

Citation: Qiu X, Hu S, Dong S, Sun H (2025) Construction of an automated machine learning-based predictive model for postoperative pulmonary complications risk in non-small cell lung cancer patients undergoing thoracoscopic surgery. PLoS One 20(9): e0333413. https://doi.org/10.1371/journal.pone.0333413

Editor: Hyun-Sung Lee, Baylor College of Medicine, UNITED STATES OF AMERICA

Received: June 13, 2025; Accepted: September 12, 2025; Published: September 26, 2025

Copyright: © 2025 Qiu et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: Data cannot be shared publicly because of Patient privacy. Data are available from the Lianyungang First People’s Hospital Institutional Data Access (contact via Email: lilong@tly.email) for researchers who meet the criteria for access to confidential data.

Funding: The author(s) received no specific funding for this work.

Competing interests: The authors have declared that no competing interests exist.

1. Introduction

Lung cancer remains the leading cause of tumor-related mortality worldwide [1], with non-small cell lung cancer (NSCLC) accounting for over 80% of pathological subtypes. Compared to small cell lung cancer, NSCLC exhibits slower progression and delayed metastasis, making video-assisted thoracoscopic surgery (VATS) the primary therapeutic approach for early-stage cases [2]. As a minimally invasive technique, VATS achieves lobectomy or sublobar resection through 3–4 micro-incisions, demonstrating superior perioperative outcomes in intraoperative blood loss and postoperative recovery compared to conventional thoracotomy [3]. Clinical evidence [4–5] indicates that 52.84% of NSCLC patients undergo lobectomy, while 47.16% receive sublobar resection. Notably, despite reduced surgical trauma in VATS, the incidence of postoperative pulmonary complications (PPCs) remains as high as 31.12%, underscoring the imperative for refined perioperative risk stratification [6].

PPCs represent a prevalent clinical challenge following thoracoscopic NSCLC resection, with pathogenesis involving intricate interactions among physiological status, surgical stress, and perioperative management [7]. Studies confirm [8] that PPCs significantly elevate risks of ventilator dependency and 30-day readmission rates. Although clinical practice employs risk assessment tools such as the ARISCAT scoring system [9], their predictive efficacy is constrained by static variable selection and linear modeling assumptions, failing to adequately integrate dynamic inflammatory markers (e.g., C-reactive protein) or surgical trauma parameters [10–11]. Current predictive models exhibit dual limitations: methodologically, traditional logistic regression struggles to capture nonlinear relationships among multidimensional clinical features [12], while conventional machine learning algorithms (e.g., random forest, support vector machines) suffer from compromised generalizability due to manual hyperparameter optimization [13]; regarding data integration, extant studies predominantly focus on isolated metrics (e.g., pulmonary function or surgical factors), lacking systematic synthesis of metabolic indices (BMI, blood glucose), inflammatory markers (white blood cell count, neutrophil ratio), and surgical approaches (lobectomy/sublobar resection) [14–15]. These deficiencies engender feature representation bias in real-world clinical scenarios, particularly impairing sensitivity for high-risk cohorts with metabolic syndrome or occult inflammatory responses [16].

The expanding application of artificial intelligence in medical prediction has validated the utility of machine learning in clinical outcome forecasting [17–19], yet challenges persist, including suboptimal feature selection, limited algorithm generalizability, and insufficient model interpretability [20]. Recent advancements in metaheuristic algorithms and automated machine learning (AutoML) offer novel pathways for enhancing predictive performance [21]. However, prevalent optimization algorithms often converge to local optima when processing high-dimensional clinical data and lack tailored improvements addressing biomedical data characteristics. Furthermore, most studies confine efforts to model development without bridging the translational gap to clinical implementation, hindering the transformation of research outcomes into actionable diagnostic tools [22].

To address these challenges, our study focuses on two pivotal scientific inquiries: (1) establishing a comprehensive evaluation framework integrating metabolic, inflammatory, and surgical trauma parameters; and (2) refining AutoML optimization algorithms to enhance model interpretability and generalizability. Through innovative algorithmic design and multidimensional data synthesis, we aim to develop a precision-enhanced PPCs prediction framework, providing theoretical and technical foundations for personalized perioperative management.

2. Methods

2.1. Study population

This single-center retrospective study enrolled 286 NSCLC patients admitted to Lianyungang First People’s Hospital between January 2022 and December 2024. The Ethics Committee granted informed consent exemption in accordance with national regulations for retrospective studies using anonymized data. All data were anonymized prior to accessed them.

Inclusion Criteria

(1) Confirmed diagnosis of lung cancer and complication criteria [23]; (2) VATS-based lobectomy/segmentectomy/wedge resection with systematic lymph node dissection; (3) Postoperative pathological confirmation of NSCLC.

Exclusion Criteria:

(1) Intraoperative conversion to open thoracotomy; (2) Metastatic lung tumors; (3) Incomplete medical records; (4) Patients receiving preoperative radiotherapy/chemotherapy; (5) Previous thoracic surgery.

2.2. Data collection

Data were accessed for research purposes on February 20, 2025. All clinical data were extracted from electronic medical records and categorized into four domains: (1) Demographics: Gender, age, body mass index (BMI), smoking history, comorbidities; (2) Clinical Parameters: Preoperative laboratory tests (leukocyte count, platelet count, C-reactive protein (CRP), hemoglobin); (3) Surgical Variables: Surgical approach (lobectomy/segmentectomy/wedge resection), intraoperative blood loss, pathological stage, histologic subtype; (4) Outcomes: Occurrence of postoperative pulmonary complications (PPCs), including atelectasis, pleural effusion, persistent air leak, pneumothorax, and pneumonia. PPCs were assessed as a binary endpoint (present or absent) based on predefined criteria, without severity grading. Data were retrieved using the hospital’s standardized EHR template with cross-validation by two independent researchers.

2.3. Study design

(1) Automated Machine Learning Model: This study employs an AutoML framework based on optimization algorithms, integrating in-depth three synergistic mechanisms: base-learner selection, feature screening, and hyperparameter optimization. To ensure methodological rigor, the original dataset underwent stratified random assignment into training and held-out independent test sets at the experimental outset. All subsequent procedures—including feature selection, model configuration refinement, and cross-validation assessment—were strictly confined within the training subset. The framework uniformly encodes three decision spaces into a hybrid solution vector:

Where the base-learner type is discretely defined (k: 1 = Logistic Regression [LR], 2 = Support Vector Machine [SVM], 3 = XGBoost, 4 = LightGBM); feature selection follows binary 0/1 encoding; and hyperparameter space adapts dynamically to the selected base model. Driven by swarm intelligence algorithms, each iteration comprises: (a) identifying the candidate base-learner per k-value in the solution vector; (b) extracting a feature subset via the solution vector; and (c) injecting adaptive parameters to instantiate the model. Configured model instances then undergo rigorous ten-fold cross-validation within the training set, forming a synergistic feedback loop for “architecture–feature representation–parameterization.” Synergistic optimization is governed by a dynamically weighted fitness function:

This function holistically balances three critical dimensions: predictive accuracy (ACC term), feature sparsity (ℓ₀norm), and computational efficiency (exponential decay term). Weight coefficients α(t), β(t), γ(t) adapt across iterations—prioritizing accuracy initially, balancing accuracy and sparsity mid-phase, and emphasizing model parsimony terminally (where α(t) ≈ β(t)). Performance benchmarking includes traditional models (LR, SVM) and ensemble learners (XGBoost, LightGBM). For individual sample prediction, the AutoML model yields class probability confidence: For a new sample with feature vector x, the classification probability output through forward propagation is denoted as:

Where denotes the sigmoid activation , the engineered feature transformation, the output layer weight vector, and b the bias term.

(2) Improved Swarm Intelligence Algorithm: An enhanced swarm intelligence algorithm is proposed, utilizing the Blood-Sucking Leech Optimizer (BSLO) to guide AutoML optimization [24]. This novel approach draws inspiration from hematophagous leech foraging behavior, mathematically formalized through five predation strategies: directional exploration, directional exploitation, directional switching mechanisms, non-directional search strategies, and retracing mechanisms. To maximize optimization performance, we incorporate cubic chaotic mapping initialization and a dynamic opposition-based learning strategy into the BSLO framework, significantly enhancing stochastic diversity while amplifying global optimization capabilities. The resulting algorithm, designated as the Improved Blood-Sucking Leech Optimizer (IBSLO), demonstrates superior convergence properties and elevated solution quality compared to conventional swarm intelligence methodologies. To validate IBSLO’s efficacy, performance was benchmarked against original BSLO, ant lion optimizer (ALO), Harris hawks optimization (HHO), and whale optimization algorithm (WOA) using all 12 CEC2022 test functions [25]. Testing parameters: variable dimension = 10, population size = 30, maximum iterations = 500, with 30 independent runs for statistical robustness. The CEC2022 benchmark suite comprises twelve meticulously designed numerical optimization problems, systematically categorized into three archetypal groups for comprehensive algorithm evaluation. Unimodal functions (F1-F3), featuring singular global optima yet characterized by either precipitous gradients or locally plateaued regions, primarily assess convergence velocity and local exploitation capabilities. Foundational multimodal functions (F4-F8) incorporate asymmetric deformations, variable rotation factors, and stochastic perturbations to generate deceptive local optima, thereby rigorously examining algorithmic efficacy in escaping local entrapment while maintaining global exploration competence. Composite functions (F9-F12) exhibit heightened complexity through hybrid function topologies, heterogeneously scaled variable transformations, and ill-conditioned matrices, thoroughly challenging algorithm robustness against high-dimensional nonlinear couplings and intricate variable interdependencies. All algorithms underwent 30 independent trials to mitigate stochastic bias, with boxplot visualizations directly illustrating both convergence precision (logarithmic difference from theoretical optima) and stability (variance distribution across repetitions) across all problem categories. Specifically, compact interquartile ranges coupled with low medians denote superior stability, whereas elongated box structures with elevated outlier densities reveal convergence inconsistencies on specific function landscapes. Algorithm comparisons employed synthetic benchmark functions exclusively, whereas clinical predictor analysis utilized all available patient variables.
(3) Model Development and Evaluation: The dataset was partitioned into training (n = 229, 80%) and testing (n = 57, 20%) sets. Ten-fold cross-validation was applied to mitigate overfitting/underfitting risks. Performance metrics included accuracy (ACC), sensitivity (SEN), specificity (SPE), F1-score, AUC-ROC, and PR-AUC. Clinical utility was further assessed via decision curve analysis (DCA).
(4) Interpretability Analysis: SHAP (SHapley Additive exPlanations), grounded in cooperative game theory, quantified feature contributions. Two visualization tools were deployed: SHAP Summary Plot: Depicts feature importance and impact direction using color gradients (red: high values, blue: low values). SHAP Importance Plot: Ranks global feature contributions by absolute SHAP values.
(5) Clinical Decision System: An interactive decision support system was developed using MATLAB App Designer (2024a), enabling real-time PPCs risk prediction and therapeutic recommendations via structured input interfaces.

2.4. Statistical analysis

Research datasets underwent standardized processing within SPSS version 26.0 analytical software. Continuous variables conforming to normal distributions were expressed as means ± standard deviations (x ± s), while unordered categorical variables were presented through frequency counts and proportions (n(%)). For intergroup comparisons, continuous variables first underwent normality assessment. When both groups exhibited normal distributions with homogeneous variances, independent samples t-tests were employed. Intergroup analysis of categorical variables utilized Pearson’s chi-square tests. Statistical significance was determined by P-values derived from two-tailed hypothesis testing, adopting α = 0.05 as the significance threshold. All analytical findings were systematically organized into structured tabular formats for comprehensive presentation.

3. Results

3.1 Training and testing cohort characteristics

Among 286 NSCLC patients, 89 (31.12%) developed postoperative pulmonary complications (PPCs), including pneumonia (n = 52), pleural effusion (n = 12), atelectasis (n = 7), pneumothorax (n = 9), and persistent air leak (n = 9). The dataset was stratified into training (n = 229) and testing (n = 57) sets. Comparative analysis confirmed no significant differences in baseline characteristics between cohorts (P > 0.05) (Table 1).

Download:

Table 1. Baseline Characteristics of Training and Testing Cohorts.

https://doi.org/10.1371/journal.pone.0333413.t001

3.2. Algorithm enhancement performance

Box plots derived from 30 independent runs demonstrated IBSLO’s superior optimization stability across most CEC2022 benchmark functions compared to BSLO, ALO, HHO, and WOA (Fig 1). Convergence curve analysis revealed IBSLO’s accelerated convergence rate and reduced susceptibility to local optima during iterations (Fig 2).

Download:

Fig 1. Optimization Performance Comparison of Metaheuristic Algorithms.

Note: Box plots illustrating optimization stability and robustness across CEC2022 test functions over 30 independent runs.

https://doi.org/10.1371/journal.pone.0333413.g001

Download:

Fig 2. Convergence Behavior Comparison.

Note: Convergence trajectories demonstrating search efficiency and local optima avoidance capabilities.

https://doi.org/10.1371/journal.pone.0333413.g002

3.3. Model training outcomes

The AutoML framework achieved peak performance on the training dataset, demonstrating area under the receiver operating characteristic curve (AUC-ROC) of 0.9038 and area under the precision-recall curve (AUC-PR) of 0.8091 (detailed in Table 2 and Fig 3). Through coordinated optimization architecture, LightGBM emerged as the optimal base learner, with identified critical predictors encompassing preoperative leukocyte count, body mass index (BMI), surgical approach, patient age, intraoperative blood loss, and C-reactive protein (CRP) levels. For hyperparameter optimization, the ideal configuration comprised: a learning rate of 0.005 (search range: [10⁻⁵, 0.1]), tree depth of 7 (range: 3–12), subsample ratio of 0.8 (range: 0.6–1.0), and L2 regularization intensity of 10⁻⁴ (range: [10⁻⁶, 10⁻²]). Performance comparisons in Table 2 indicate that the AutoML model, when configured accordingly, comprehensively surpassed conventional approaches—enhancing precision (0.7093 vs. LightGBM’s 0.5859) by 21.1%, elevating recall (0.8592 vs. 0.8169) by 5.2%, and boosting the F1-score (0.7771 vs. 0.6824) by 13.9% over prevailing alternatives.

Download:

Table 2. Cross-validation performance metrics on the training set.

https://doi.org/10.1371/journal.pone.0333413.t002

Download:

Fig 3. Training Set Performance Evaluation.

Note: (A) ROC curve; (B) Precision-Recall curve.

https://doi.org/10.1371/journal.pone.0333413.g003

3.4. Testing set validation

The AutoML model maintained robust performance on the testing set, yielding ROC-AUC and PR-AUC values of 0.8775 and 0.8175 (Table 3, Fig 4).

Download:

Table 3. Internal validation performance metrics.

https://doi.org/10.1371/journal.pone.0333413.t003

Download:

Fig 4. Testing Set Classification Performance.

Note: (A) ROC curve; (B) Precision-Recall curve.

https://doi.org/10.1371/journal.pone.0333413.g004

3.5. Interpretability analysis

As shown in Fig 5, SHAP analysis ranked predictive feature importance as follows: 1-Preoperative leukocyte count; 2-BMI; 3-Surgical approach; 4-Age; 5-Intraoperative blood loss; 6-CRP. The interaction heatmap visualization reveals synergistic effects between age and WBC, as well as between surgical approach and CPR.

Download:

Fig 5. Machine learning interpretability visualization.

Note: (A) SHAP summary plot; (B) SHAP feature importance ranking; (C) Heat map of SHAP interaction.

https://doi.org/10.1371/journal.pone.0333413.g005

3.6. Clinical utility

(1) Decision Curve Analysis (DCA)

Decision curve analysis for the predictive model (Fig 6) reveals that across threshold probabilities—spanning 8% to 99% for the training cohort and 3% to 80% for the testing cohort—implementation of the AutoML predictive framework yields superior net clinical benefit compared to conventional methodologies. The model maintains sustained high performance throughout an extensive spectrum of threshold probabilities, demonstrating not only robust generalization capacity but also remarkable stability in predictive consistency. This steady trajectory of the net benefit curve underscores the framework’s resilience across diverse clinical decision-making scenarios.

Download:

Fig 6. Decision curve analysis for predictive models.

Note: (A) Training set; (B) Testing set. Net benefit (Y-axis) calculated against two extreme scenarios: “treat all” (red dashed) and “treat none” (black dashed).

https://doi.org/10.1371/journal.pone.0333413.g006

(2) Software implementation

To address usability barriers in AI clinical deployment, we developed an intuitive risk prediction system (Fig 7). This platform enables clinicians to input six preoperative parameters via a user-friendly interface, generating real-time PPCs risk assessments within seconds.

Download:

Fig 7. Clinical decision support system interface.

Note: The interface integrates feature entry (“Input Parameters”), predictive computation (“Calculate”), and risk output (“Prediction Results”) modules.

https://doi.org/10.1371/journal.pone.0333413.g007

4. Discussion

Our study successfully addressed critical challenges in high-dimensional clinical data modeling through the IBSLO. The Cubic mapping initialization reduced the average convergence iterations on standard benchmark functions (CEC2022), while the dynamic opposition-based learning strategy significantly improved local optimum avoidance rates. Specifically, convergence curve analysis over 30 independent runs revealed that IBSLO consistently escaped local optima traps across all 12 CEC2022 benchmark functions—notably complex multimodal landscapes F7 and F11. Comparative quantification showed IBSLO achieved lower stagnation frequency than the next-best comparator (HHO) and maintained solution diversity longer during iterative searches. Our findings validate the restructured value of metaheuristic algorithms in medical data scenarios through their demonstrable superiority in capturing complex predictor interactions. These performance differentials originate from the algorithm’s capacity to resolve high-dimensional interactions between surgical trauma and physiological reserves via iterative feature-space partitioning—a capability absent in generalized linear methodologies. The synergistic effects of metabolic indices (BMI), inflammatory markers (CRP), and surgical trauma parameters were quantitatively verified for the first time in our multidimensional feature integration, addressing the ARISCAT scoring system’s insensitivity to dynamic biomarkers [26]. Compared to the static risk stratification framework proposed by Lee et al. [27], our AutoML framework demonstrates three breakthroughs: (1) Robustness against data heterogeneity, evidenced by 10-fold cross-validation (training set AUC = 0.9038; test set AUC = 0.8775); (2) SHAP interpretability analysis revealing preoperative leukocyte count and BMI as dominant decision factors, effectively transforming the “black-box model” into a clinically interpretable pathway; and (3) Integrated decision curve analysis (DCA) showing significantly higher clinical net benefits than conventional approaches, providing quantitative support for personalized interventions. This end-to-end “prediction-interpretation-application” framework transcends the limitations of prior studies confined to model development [28].

The proposed AutoML framework from our institute significantly enhances model robustness against data heterogeneity. At its core, this framework intelligently amalgamates complementary advantages of diverse base learners through collaborative search mechanisms—incorporating both globally focused linear models (e.g., Logistic Regression and Support Vector Machines) known for noise tolerance and structural coherence, alongside gradient-boosted decision trees (e.g., XGBoost and LightGBM) excelling in capturing intricate nonlinear relationships and localized patterns. This integrative dynamic selection architecture enables adaptive component orchestration when confronting imbalanced distributions or heterogeneous patterns within datasets, thereby mitigating dependence on homogeneous data assumptions. Concurrently, the framework effectively curtails overfitting risks in small-sample scenarios through dual mechanisms: Explicit dimensionality reduction via embedded feature selection eliminates redundant or irrelevant predictors, while the dynamically adjusted fitness function explicitly penalizes model complexity through feature-sparsity regularization terms. These strategies synergize with strictly isolated dataset protocols employing exclusively training data for nested ten-fold cross-validation—safeguarding against information leakage to final test sets. Collectively, these layered protections substantially fortify model generalizability.

SHAP interpretability analysis elucidated complex interaction patterns between key predictors and PPCs. The interaction heatmap visualization reveals significant synergistic effects between age and WBC, as well as between surgical approach and CRP levels in PPCs. Suggesting age-dependent amplification of leukocyte-mediated pulmonary injury pathways. Concurrently, more extensive surgical approaches potentiate the detrimental impact of elevated CRP, where inflammatory cascades triggered by surgical trauma appear to synergize with baseline systemic inflammation via cytokine-mediated pleural injury mechanisms. These high-dimensional interactions—quantified through SHAP force values—illustrate critical threshold effects where predictor combinations generate supra-additive risk magnitudes beyond their individual impacts, highlighting the AutoML framework’s capacity to detect nonlinear synergies. Preoperative leukocyte levels emerged as the primary predictor, with elevated values (>8.0 × 10⁹/L) reflecting subclinical inflammatory activation and correlating exponentially with pneumonia risk [29]. BMI manifests differential pathophysiological effects on pulmonary outcomes through compartmentalized biological pathways—nutritional depletion impairing respiratory mechanics in underweight cohorts versus adipose-driven chronic inflammation exacerbating pleural stress in overweight patients. These dichotomous mechanisms operate independently rather than through curvilinear continuity [30]. Surgical modality analyses revealed a dose-dependent relationship between anatomical resection extent and complications: lobectomy’s higher incidence of persistent air leaks likely stems from diminished compensatory capacity in residual lung tissue [31]. Intraoperative blood loss operated via dual pathways: direct tissue hypoxia (hemoglobin reduction (1 g/dL corresponding to 3.2% decline in oxygen carriers)) and transfusion-induced immune dysregulation (delayed CRP peaks with amplified magnitude). Notably, quadratic age associations in patients >65 years demonstrated nonlinear coupling between physiological declines in pulmonary reserve and prolonged mechanical ventilation. These discoveries not only refine the pathophysiological understanding of PPCs but also quantify factor-specific weights via AutoML, establishing evidence-based foundations for personalized risk-stratified interventions. Regarding the potential association between prolonged operative time and PPCs highlighted by recent literature [32], our initial analysis incorporated operative duration as a candidate feature. However, during model construction, this variable demonstrated no significant predictive value for PPCs in the trainingor testing cohorts. We attribute this outcome to two interrelated factors: first, the limited sample size restricted statistical power for detecting subtle relationships, particularly given the heterogeneity in surgical complexity within our cohort; second, operative time data contained missing values, which necessitated mean imputation—a method that may obscure biologically relevant nonlinear effects in real-world surgical settings. Future multi-center validation incorporating granular operative metrics is warranted to elucidate this relationship.

Our study innovatively developed a clinical decision support system (CDSS) with three transformative advancements: (1) High practical utility requiring only six preoperative variables for real-time risk prediction; (2) MATLAB App Designer integration enabling IBSLO-driven optimization (response latency <2 s); and (3) SHAP-driven risk heatmaps improving predictive precision. This system bridges theoretical models to bedside applications, demonstrating artificial intelligence’s feasibility in perioperative management. For clinical deployment, we propose risk-stratified monitoring protocols: > 70% risk triggers intensive surveillance, 40–70% justifies moderate assessment intervals, and <40% permits standard care—translating algorithmic outputs into resource-efficient interventions.

4.1. Limitations

Three constraints warrant acknowledgement: (1) Single-center retrospective design (80.79% Stage I cases) necessitates multi-center validation cohorts (e.g., Stage III inclusion) to assess generalizability; (2) Current feature engineering excludes postoperative dynamic monitoring (e.g., drainage curves), potentially limiting early detection of delayed complications (e.g., persistent air leaks); (3) IBSLO’s convergence stability requires refinement in >10-dimensional feature spaces. Future work will integrate federated learning frameworks to enable multi-institutional model training without compromising data privacy; (4) Our study utilized a binary outcome definition for postoperative pulmonary complications (PPCs), without incorporating standardized severity grading systems such as the Clavien-Dindo classification. This constitutes a limitation, as it fails to distinguish between minor complications (e.g., those resolving spontaneously) and clinically severe events (e.g., Grade 3 + requiring invasive interventions), thereby potentially diminishing the predictive model’s granular clinical applicability. However, we are actively addressing this gap in our ongoing research, which focuses on integrating severity stratification to develop refined predictive algorithms.

5. Conclusion

Through systematic innovation in machine learning, this study established a multidimensional (metabolic-inflammatory-surgical) predictive framework for NSCLC postoperative complications. Algorithmically, the enhanced IBSLO improved metaheuristic search efficiency, overcoming traditional models’ representational limitations in feature interactions. Clinically, the CDSS established a closed-loop “prediction-interpretation-intervention” management framework, significantly enhancing PPCs risk assessment precision. Our findings validate AutoML’s unique value in perioperative medicine: SHAP-guided identification of modifiable risks (BMI, leukocyte levels) and real-time computational support for precision interventions provide novel paradigms for intelligent surgical advancement. Future optimizations should prioritize dynamic biomarker monitoring, multimodal data fusion, and embedded hardware development to ultimately realize a full-cycle “predict-prevent-treat” intelligent management platform.

References

1. Thai AA, Solomon BJ, Sequist LV, Gainor JF, Heist RS. Lung cancer. Lancet. 2021;398(10299):535–54.
- View Article
- Google Scholar
2. Alexander M, Kim SY, Cheng H. Update 2020: management of non-small cell lung cancer. Lung. 2020;198(6):897–907. pmid:33175991
- View Article
- PubMed/NCBI
- Google Scholar
3. Mithoowani H, Febbraro M. Non-small-cell lung cancer in 2022: a review for general practitioners in oncology. Curr Oncol. 2022;29(3):1828–39. pmid:35323350
- View Article
- PubMed/NCBI
- Google Scholar
4. Miao D, Zhao J, Han Y, Zhou J, Li X, Zhang T, et al. Management of locally advanced non-small cell lung cancer: State of the art and future directions. Cancer Commun (Lond). 2024;44(1):23–46. pmid:37985191
- View Article
- PubMed/NCBI
- Google Scholar
5. Remon J, Soria J-C, Peters S, ESMO Guidelines Committee. Electronic address: clinicalguidelines@esmo.org. Early and locally advanced non-small-cell lung cancer: an update of the ESMO Clinical Practice Guidelines focusing on diagnosis, staging, systemic and local therapy. Ann Oncol. 2021;32(12):1637–42. pmid:34481037
- View Article
- PubMed/NCBI
- Google Scholar
6. Yamanashi K, Marumo S, Shoji T, Fukui T, Sumitomo R, Otake Y, et al. The relationship between perioperative administration of inhaled corticosteroid and postoperative respiratory complications after pulmonary resection for non-small-cell lung cancer in patients with chronic obstructive pulmonary disease. Gen Thorac Cardiovasc Surg. 2015;63(12):652–9. pmid:26419246
- View Article
- PubMed/NCBI
- Google Scholar
7. Wang Y, Hu X, Su M-C, Wang Y-W, Che G-W. Postoperative elevations of neutrophil-to-lymphocyte and platelet-to-lymphocyte ratios predict postoperative pulmonary complications in non-small cell lung cancer patients: a retrospective cohort study. Curr Med Sci. 2020;40(2):339–47. pmid:32337695
- View Article
- PubMed/NCBI
- Google Scholar
8. Gravier F-E, Smondack P, Prieur G, Medrinal C, Combret Y, Muir J-F, et al. Effects of exercise training in people with non-small cell lung cancer before lung resection: a systematic review and meta-analysis. Thorax. 2022;77(5):486–96. pmid:34429375
- View Article
- PubMed/NCBI
- Google Scholar
9. Ülger G, Sazak H, Baldemir R, Zengin M, Kaybal O, İncekara F, et al. The effectiveness of ARISCAT Risk Index, other scoring systems, and parameters in predicting pulmonary complications after thoracic surgery. Med (Baltimore). 2022;101(30):e29723. pmid:35905198
- View Article
- PubMed/NCBI
- Google Scholar
10. Laurent H, Aubreton S, Galvaing G, Pereira B, Merle P, Richard R, et al. Preoperative respiratory muscle endurance training improves ventilatory capacity and prevents pulmonary postoperative complications after lung surgery. Eur J Phys Rehabil Med. 2020;56(1):73–81. pmid:31489810
- View Article
- PubMed/NCBI
- Google Scholar
11. Zhou T, Sun C. Effect of physical manipulation pulmonary rehabilitation on lung cancer patients after thoracoscopic lobectomy. Thorac Cancer. 2022;13(3):308–15. pmid:34882313
- View Article
- PubMed/NCBI
- Google Scholar
12. Song X, Liu X, Liu F, Wang C. Comparison of machine learning and logistic regression models in predicting acute kidney injury: A systematic review and meta-analysis. Int J Med Inform. 2021;151:104484.
- View Article
- Google Scholar
13. Guo W, Liu J, Dong F, Song M, Li Z, Khan MKH, et al. Review of machine learning and deep learning models for toxicity prediction. Exp Biol Med (Maywood). 2023;248(21):1952–73. pmid:38057999
- View Article
- PubMed/NCBI
- Google Scholar
14. Ma S, Li F, Li J, Wang L, Song H. Risk factor analysis and nomogram prediction model construction of postoperative complications of thoracoscopic non-small cell lung cancer. J Thorac Dis. 2024;16(6):3655–67. pmid:38983183
- View Article
- PubMed/NCBI
- Google Scholar
15. Zhai Y, Lin X, Wei Q, Pu Y, Pang Y. Interpretable prediction of cardiopulmonary complications after non-small cell lung cancer surgery based on machine learning and SHapley additive exPlanations. Heliyon. 2023;9(7):e17772. pmid:37483738
- View Article
- PubMed/NCBI
- Google Scholar
16. Xiu W, Zheng J, Zhou Y, Du H, Li J, Li W, et al. A nomogram for the prediction of the survival of patients with advanced non-small cell lung cancer and interstitial lung disease. Cancer Med. 2023;12(10):11375–84. pmid:36999934
- View Article
- PubMed/NCBI
- Google Scholar
17. Gao C, Zhang R, Chen X, Yao T, Song Q, Ye W, et al. Integrating Internet multisource big data to predict the occurrence and development of COVID-19 cryptic transmission. NPJ Digit Med. 2022;5(1):161. pmid:36307547
- View Article
- PubMed/NCBI
- Google Scholar
18. Li L, Han X, Zhang Z, Han T, Wu P, Xu Y, et al. Construction of prognosis prediction model and visualization system of acute paraquat poisoning based on improved machine learning model. Digit Health. 2024;10:20552076241287891. pmid:39398894
- View Article
- PubMed/NCBI
- Google Scholar
19. Jung YJ, Ahn J, Park S, Sun J-M, Lee S-H, Ahn JS, et al. Machine learning prediction of the case-fatality of COVID-19 and risk factors for adverse outcomes in patients with non-small cell lung cancer. Transl Cancer Res. 2024;13(6):2587–95. pmid:38988924
- View Article
- PubMed/NCBI
- Google Scholar
20. Galal A, Talal M, Moustafa A. Applications of machine learning in metabolomics: disease modeling and classification. Front Genet. 2022;13:1017340. pmid:36506316
- View Article
- PubMed/NCBI
- Google Scholar
21. Xu J, Tian F, Wang L, Miao Z. Binary particle swarm optimization intelligent feature optimization algorithm‐based magnetic resonance image in the diagnosis of adrenal tumor. Contrast Media Mol Imaging. 2022;2022(1).
- View Article
- Google Scholar
22. Aung YYM, Wong DCS, Ting DSW. The promise of artificial intelligence: a review of the opportunities and challenges of artificial intelligence in healthcare. British Med Bull. 2021;139(1):4–15.
- View Article
- Google Scholar
23. Riely GJ, Wood DE, Ettinger DS, Aisner DL, Akerley W, Bauman JR, et al. Non-small cell lung cancer, version 4.2024, NCCN clinical practice guidelines in oncology. J Natl Compr Canc Netw. 2024;22(4):249–74. pmid:38754467
- View Article
- PubMed/NCBI
- Google Scholar
24. Bai J, Nguyen-Xuan H, Atroshchenko E, Kosec G, Wang L, Abdel Wahab M. Blood-sucking leech optimizer. Adv Eng Soft. 2024;195:103696.
- View Article
- Google Scholar
25. Sharma P, Raju S. Metaheuristic optimization algorithms: a comprehensive overview and classification of benchmark test functions. Soft Comput. 2023;28(4):3123–86.
- View Article
- Google Scholar
26. Zorrilla-Vaca A, Grant MC, Rehman M, Sarin P, Mendez-Pino L, Urman RD, et al. Performance comparison of pulmonary risk scoring systems in lung resection. J Cardiothorac Vasc Anesthesia. 2023;37(9):1734–43.
- View Article
- Google Scholar
27. Lee SC, Lee JG, Lee SH, Kim EY, Chang J, Kim DJ, et al. Prediction of postoperative pulmonary complications using preoperative controlling nutritional status (CONUT) score in patients with resectable non-small cell lung cancer. Sci Rep. 2020;10(1):12385. pmid:32709867
- View Article
- PubMed/NCBI
- Google Scholar
28. Veiga Oliveira P, Cabral D, Antunes M, Torres C, Alvoeiro M, Rodrigues C, et al. Lung resection for non-small-cell lung cancer - a new risk score to predict major perioperative complications. Port J Card Thorac Vasc Surg. 2022;28(4):31–6. pmid:35334178
- View Article
- PubMed/NCBI
- Google Scholar
29. Benej M, Capov I, Skrickova J, Hejduk K, Pestal A, Wechsler J, et al. Association of the postoperative white blood cells (WBC) count in peripheral blood after radical surgical treatment of left upper lobe non-small cell lung cancer (NSCLC) with overall survival - single center results. Bratisl Lek Listy. 2017;118(5):299–301. pmid:28516794
- View Article
- PubMed/NCBI
- Google Scholar
30. Turna A, Özçıbık Işık G, Ekinci Fidan M, Sarbay İ, Kılıç B, Kara HV, et al. Can postoperative complications be reduced by the application of ERAS protocols in operated non-small cell lung cancer patients? Turk Gogus Kalp Damar Cerrahisi Derg. 2023;31(2):256–68. pmid:37484631
- View Article
- PubMed/NCBI
- Google Scholar
31. Messina G, Natale G, Bove M, Opromolla G, Di Filippo V, Martone M, et al. Intraoperative ventilatory leak: Real-time guidance for management of air leak in lung cancer patients undergoing VATS lobectomy. Thorac Cancer. 2023;14(18):1782–8. pmid:37144333
- View Article
- PubMed/NCBI
- Google Scholar
32. de Angelis P, Tan KS, Chudgar NP, Dycoco J, Adusumilli PS, Bains MS, et al. Operative time is associated with postoperative complications after pulmonary lobectomy. Ann Surg. 2022;278(6):e1259–66.
- View Article
- Google Scholar

[ref1] 1. Thai AA, Solomon BJ, Sequist LV, Gainor JF, Heist RS. Lung cancer. Lancet. 2021;398(10299):535–54.
View Article
Google Scholar

[2] View Article

[3] Google Scholar

[ref2] 2. Alexander M, Kim SY, Cheng H. Update 2020: management of non-small cell lung cancer. Lung. 2020;198(6):897–907. pmid:33175991
View Article
PubMed/NCBI
Google Scholar

[5] View Article

[6] PubMed/NCBI

[7] Google Scholar

[ref3] 3. Mithoowani H, Febbraro M. Non-small-cell lung cancer in 2022: a review for general practitioners in oncology. Curr Oncol. 2022;29(3):1828–39. pmid:35323350
View Article
PubMed/NCBI
Google Scholar

[9] View Article

[10] PubMed/NCBI

[11] Google Scholar

[ref4] 4. Miao D, Zhao J, Han Y, Zhou J, Li X, Zhang T, et al. Management of locally advanced non-small cell lung cancer: State of the art and future directions. Cancer Commun (Lond). 2024;44(1):23–46. pmid:37985191
View Article
PubMed/NCBI
Google Scholar

[13] View Article

[14] PubMed/NCBI

[15] Google Scholar

[ref5] 5. Remon J, Soria J-C, Peters S, ESMO Guidelines Committee. Electronic address: clinicalguidelines@esmo.org. Early and locally advanced non-small-cell lung cancer: an update of the ESMO Clinical Practice Guidelines focusing on diagnosis, staging, systemic and local therapy. Ann Oncol. 2021;32(12):1637–42. pmid:34481037
View Article
PubMed/NCBI
Google Scholar

[17] View Article

[18] PubMed/NCBI

[19] Google Scholar

[ref6] 6. Yamanashi K, Marumo S, Shoji T, Fukui T, Sumitomo R, Otake Y, et al. The relationship between perioperative administration of inhaled corticosteroid and postoperative respiratory complications after pulmonary resection for non-small-cell lung cancer in patients with chronic obstructive pulmonary disease. Gen Thorac Cardiovasc Surg. 2015;63(12):652–9. pmid:26419246
View Article
PubMed/NCBI
Google Scholar

[21] View Article

[22] PubMed/NCBI

[23] Google Scholar

[ref7] 7. Wang Y, Hu X, Su M-C, Wang Y-W, Che G-W. Postoperative elevations of neutrophil-to-lymphocyte and platelet-to-lymphocyte ratios predict postoperative pulmonary complications in non-small cell lung cancer patients: a retrospective cohort study. Curr Med Sci. 2020;40(2):339–47. pmid:32337695
View Article
PubMed/NCBI
Google Scholar

[25] View Article

[26] PubMed/NCBI

[27] Google Scholar

[ref8] 8. Gravier F-E, Smondack P, Prieur G, Medrinal C, Combret Y, Muir J-F, et al. Effects of exercise training in people with non-small cell lung cancer before lung resection: a systematic review and meta-analysis. Thorax. 2022;77(5):486–96. pmid:34429375
View Article
PubMed/NCBI
Google Scholar

[29] View Article

[30] PubMed/NCBI

[31] Google Scholar

[ref9] 9. Ülger G, Sazak H, Baldemir R, Zengin M, Kaybal O, İncekara F, et al. The effectiveness of ARISCAT Risk Index, other scoring systems, and parameters in predicting pulmonary complications after thoracic surgery. Med (Baltimore). 2022;101(30):e29723. pmid:35905198
View Article
PubMed/NCBI
Google Scholar

[33] View Article

[34] PubMed/NCBI

[35] Google Scholar

[ref10] 10. Laurent H, Aubreton S, Galvaing G, Pereira B, Merle P, Richard R, et al. Preoperative respiratory muscle endurance training improves ventilatory capacity and prevents pulmonary postoperative complications after lung surgery. Eur J Phys Rehabil Med. 2020;56(1):73–81. pmid:31489810
View Article
PubMed/NCBI
Google Scholar

[37] View Article

[38] PubMed/NCBI

[39] Google Scholar

[ref11] 11. Zhou T, Sun C. Effect of physical manipulation pulmonary rehabilitation on lung cancer patients after thoracoscopic lobectomy. Thorac Cancer. 2022;13(3):308–15. pmid:34882313
View Article
PubMed/NCBI
Google Scholar

[41] View Article

[42] PubMed/NCBI

[43] Google Scholar

[ref12] 12. Song X, Liu X, Liu F, Wang C. Comparison of machine learning and logistic regression models in predicting acute kidney injury: A systematic review and meta-analysis. Int J Med Inform. 2021;151:104484.
View Article
Google Scholar

[45] View Article

[46] Google Scholar

[ref13] 13. Guo W, Liu J, Dong F, Song M, Li Z, Khan MKH, et al. Review of machine learning and deep learning models for toxicity prediction. Exp Biol Med (Maywood). 2023;248(21):1952–73. pmid:38057999
View Article
PubMed/NCBI
Google Scholar

[48] View Article

[49] PubMed/NCBI

[50] Google Scholar

[ref14] 14. Ma S, Li F, Li J, Wang L, Song H. Risk factor analysis and nomogram prediction model construction of postoperative complications of thoracoscopic non-small cell lung cancer. J Thorac Dis. 2024;16(6):3655–67. pmid:38983183
View Article
PubMed/NCBI
Google Scholar

[52] View Article

[53] PubMed/NCBI

[54] Google Scholar

[ref15] 15. Zhai Y, Lin X, Wei Q, Pu Y, Pang Y. Interpretable prediction of cardiopulmonary complications after non-small cell lung cancer surgery based on machine learning and SHapley additive exPlanations. Heliyon. 2023;9(7):e17772. pmid:37483738
View Article
PubMed/NCBI
Google Scholar

[56] View Article

[57] PubMed/NCBI

[58] Google Scholar

[ref16] 16. Xiu W, Zheng J, Zhou Y, Du H, Li J, Li W, et al. A nomogram for the prediction of the survival of patients with advanced non-small cell lung cancer and interstitial lung disease. Cancer Med. 2023;12(10):11375–84. pmid:36999934
View Article
PubMed/NCBI
Google Scholar

[60] View Article

[61] PubMed/NCBI

[62] Google Scholar

[ref17] 17. Gao C, Zhang R, Chen X, Yao T, Song Q, Ye W, et al. Integrating Internet multisource big data to predict the occurrence and development of COVID-19 cryptic transmission. NPJ Digit Med. 2022;5(1):161. pmid:36307547
View Article
PubMed/NCBI
Google Scholar

[64] View Article

[65] PubMed/NCBI

[66] Google Scholar

[ref18] 18. Li L, Han X, Zhang Z, Han T, Wu P, Xu Y, et al. Construction of prognosis prediction model and visualization system of acute paraquat poisoning based on improved machine learning model. Digit Health. 2024;10:20552076241287891. pmid:39398894
View Article
PubMed/NCBI
Google Scholar

[68] View Article

[69] PubMed/NCBI

[70] Google Scholar

[ref19] 19. Jung YJ, Ahn J, Park S, Sun J-M, Lee S-H, Ahn JS, et al. Machine learning prediction of the case-fatality of COVID-19 and risk factors for adverse outcomes in patients with non-small cell lung cancer. Transl Cancer Res. 2024;13(6):2587–95. pmid:38988924
View Article
PubMed/NCBI
Google Scholar

[72] View Article

[73] PubMed/NCBI

[74] Google Scholar

[ref20] 20. Galal A, Talal M, Moustafa A. Applications of machine learning in metabolomics: disease modeling and classification. Front Genet. 2022;13:1017340. pmid:36506316
View Article
PubMed/NCBI
Google Scholar

[76] View Article

[77] PubMed/NCBI

[78] Google Scholar

[ref21] 21. Xu J, Tian F, Wang L, Miao Z. Binary particle swarm optimization intelligent feature optimization algorithm‐based magnetic resonance image in the diagnosis of adrenal tumor. Contrast Media Mol Imaging. 2022;2022(1).
View Article
Google Scholar

[80] View Article

[81] Google Scholar

[ref22] 22. Aung YYM, Wong DCS, Ting DSW. The promise of artificial intelligence: a review of the opportunities and challenges of artificial intelligence in healthcare. British Med Bull. 2021;139(1):4–15.
View Article
Google Scholar

[83] View Article

[84] Google Scholar

[ref23] 23. Riely GJ, Wood DE, Ettinger DS, Aisner DL, Akerley W, Bauman JR, et al. Non-small cell lung cancer, version 4.2024, NCCN clinical practice guidelines in oncology. J Natl Compr Canc Netw. 2024;22(4):249–74. pmid:38754467
View Article
PubMed/NCBI
Google Scholar

[86] View Article

[87] PubMed/NCBI

[88] Google Scholar

[ref24] 24. Bai J, Nguyen-Xuan H, Atroshchenko E, Kosec G, Wang L, Abdel Wahab M. Blood-sucking leech optimizer. Adv Eng Soft. 2024;195:103696.
View Article
Google Scholar

[90] View Article

[91] Google Scholar

[ref25] 25. Sharma P, Raju S. Metaheuristic optimization algorithms: a comprehensive overview and classification of benchmark test functions. Soft Comput. 2023;28(4):3123–86.
View Article
Google Scholar

[93] View Article

[94] Google Scholar

[ref26] 26. Zorrilla-Vaca A, Grant MC, Rehman M, Sarin P, Mendez-Pino L, Urman RD, et al. Performance comparison of pulmonary risk scoring systems in lung resection. J Cardiothorac Vasc Anesthesia. 2023;37(9):1734–43.
View Article
Google Scholar

[96] View Article

[97] Google Scholar

[ref27] 27. Lee SC, Lee JG, Lee SH, Kim EY, Chang J, Kim DJ, et al. Prediction of postoperative pulmonary complications using preoperative controlling nutritional status (CONUT) score in patients with resectable non-small cell lung cancer. Sci Rep. 2020;10(1):12385. pmid:32709867
View Article
PubMed/NCBI
Google Scholar

[99] View Article

[100] PubMed/NCBI

[101] Google Scholar

[ref28] 28. Veiga Oliveira P, Cabral D, Antunes M, Torres C, Alvoeiro M, Rodrigues C, et al. Lung resection for non-small-cell lung cancer - a new risk score to predict major perioperative complications. Port J Card Thorac Vasc Surg. 2022;28(4):31–6. pmid:35334178
View Article
PubMed/NCBI
Google Scholar

[103] View Article

[104] PubMed/NCBI

[105] Google Scholar

[ref29] 29. Benej M, Capov I, Skrickova J, Hejduk K, Pestal A, Wechsler J, et al. Association of the postoperative white blood cells (WBC) count in peripheral blood after radical surgical treatment of left upper lobe non-small cell lung cancer (NSCLC) with overall survival - single center results. Bratisl Lek Listy. 2017;118(5):299–301. pmid:28516794
View Article
PubMed/NCBI
Google Scholar

[107] View Article

[108] PubMed/NCBI

[109] Google Scholar

[ref30] 30. Turna A, Özçıbık Işık G, Ekinci Fidan M, Sarbay İ, Kılıç B, Kara HV, et al. Can postoperative complications be reduced by the application of ERAS protocols in operated non-small cell lung cancer patients? Turk Gogus Kalp Damar Cerrahisi Derg. 2023;31(2):256–68. pmid:37484631
View Article
PubMed/NCBI
Google Scholar

[111] View Article

[112] PubMed/NCBI

[113] Google Scholar

[ref31] 31. Messina G, Natale G, Bove M, Opromolla G, Di Filippo V, Martone M, et al. Intraoperative ventilatory leak: Real-time guidance for management of air leak in lung cancer patients undergoing VATS lobectomy. Thorac Cancer. 2023;14(18):1782–8. pmid:37144333
View Article
PubMed/NCBI
Google Scholar

[115] View Article

[116] PubMed/NCBI

[117] Google Scholar

[ref32] 32. de Angelis P, Tan KS, Chudgar NP, Dycoco J, Adusumilli PS, Bains MS, et al. Operative time is associated with postoperative complications after pulmonary lobectomy. Ann Surg. 2022;278(6):e1259–66.
View Article
Google Scholar

[119] View Article

[120] Google Scholar

Figures

Abstract

Objective

Methods

Results

Conclusion

1. Introduction

2. Methods

2.1. Study population

2.2. Data collection

2.3. Study design

2.4. Statistical analysis

3. Results

3.1 Training and testing cohort characteristics

3.2. Algorithm enhancement performance

3.3. Model training outcomes

3.4. Testing set validation

3.5. Interpretability analysis

3.6. Clinical utility

4. Discussion

4.1. Limitations

5. Conclusion

References