Figures
Abstract
Background
Metabolic syndrome (MetS) is characterized by chronic inflammation and can be worsened by circadian disruption, which is common among shift work. Machine learning can predict the risk of MetS in shift workers using inflammatory biomarkers. Most investigations have focused on the general population rather than shift workers, a distinct group that requires continuous health monitoring; therefore, we aimed to examine the relationship between inflammatory indicators and MetS using blood cell counts in this high-risk group of shift workers who require long-term health monitoring and to enhance the biological understanding of MetS by applying machine learning methods.
Methods
In this cross-sectional study, we analyzed data from shift workers included in the National Health and Nutrition Examination Survey between 2005–2010 and 2017–2018. Prediction models, including random forest (RF), extreme gradient boosting (XGBoost), light gradient boosting machine (LightGBM), and linear regression (LR), were developed and validated. We compared the model performance and conducted stratified analysis, smooth curve fitting, and threshold effect analysis to further explore the relationship between inflammation and MetS risk in shift workers.
Results
The analysis included 3,079 participants in total. Each machine learning model demonstrated good predictive performance in assessing MetS risk among shift workers. LightGBM achieved the area under the curve (AUC) of 0.944 in training dataset and 0.722 in testing dataset; XGBoost had an AUC of 0.818 in training dataset and 0.747 in testing dataset; and LR had an AUC of 0.763 in training dataset and 0.699 in testing dataset, RF had an AUC of 0.741 in training dataset and 0.729 in testing dataset.
Furthermore, the analysis revealed that body mass index, age, neutrophil, lymphocyte, monocyte, and platelet counts, along with their derived inflammatory indices, were significant predictors. Multivariate logistic regression adjusted for lifestyle and health factors showed that lymphocytes remained consistently associated with MetS in shift workers. Generalized additive model analysis revealed complex non-linear relationships between lymphocytes and platelets. Inflammatory factors strongly predicted MetS risk in shift workers, with their effects varying by concentration threshold, particularly for lymphocytes (k = 2.2, right side p < 0.001).
Conclusion
Lymphocyte counts and related composite indices are significant predictors of MetS risk in shift workers. Consistent monitoring of these biomarkers may be useful for early odds-based stratification of MetS in this high-risk population, whereas any preventive implications would require confirmation in longitudinal and interventional studies.
Citation: Bao Y, Wang R (2025) Lymphocytes and related inflammatory factors as predictors of metabolic syndrome risk in shift workers: A machine learning approach based on large-scale population data. PLoS One 20(12): e0339673. https://doi.org/10.1371/journal.pone.0339673
Editor: Aleksandra Klisic, University of Montenegro-Faculty of Medicine, MONTENEGRO
Received: March 8, 2025; Accepted: December 10, 2025; Published: December 29, 2025
Copyright: © 2025 Bao, Wang. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the manuscript and its Supporting Information files.
Funding: This research was supported by the Zhejiang Provincial Public Service and Application Research Foundation, China (Grant No.LTGY24H270009).
Competing interests: The authors have declared that no competing interests exist.
Abbreviations: AUC, Under the curve; ACC, Accuracy; AISI, Systemic inflammatory composite index; BMI, Body mass index; CBC, Complete blood count; CI, Confidence interval; GAM, Generalized additive model; GED, General educational development; HEI2020, Healthy eating index-2020; IL-6, Interleukin 6; LDL, Low-density lipoprotein; LightGBM, Light gradient boosting machine; LR, Linear regression; MEC, Mobile examination center; ML, Machine learning; MLR, Monocyte-to-lymphocyte ratio; NHANES, National Health and Nutrition Examination Survey; NLR, Neutrophil-to-lymphocyte ratio; NMLR, Neutrophil-to-monocyte ratio; OR, Odds ratios; PHQ, Patient health questionnaire; PIR, Poverty-income ratio; PLR, Platelet-to-lymphocyte ratio; RF, Random forest; ROC, Receiver operating characteristic; SHAP, SHapley Additive exPlanations; SII, Systemic immune-inflammation index; SIRI, Systemic inflammatory response index; TNF-α, Tumor necrosis factor alpha; XGBoost, eXtreme Gradient Boosting
Introduction
Metabolic syndrome (MetS) is characterized by a cluster of metabolic abnormalities and cardiovascular risk factors, including central obesity, hypertriglyceridemia, low high-density lipoprotein cholesterol, hyperglycemia, and high blood pressure. Diagnosis requires the fulfillment of at least three of the five criteria [1]. An unhealthy diet and/or lifestyle contribute to the development of MetS, affecting approximately one-quarter of the global population, with prevalence increasing with age [2].
However, prevalence rates vary across countries. For instance, the estimated prevalence of MetS is 33.4% in the United States, 28.8% in Turkey, and 14.4% in China [3–5]. MetS significantly increases the risk of cardiovascular disease, cancer, and chronic kidney disease, significant causes of morbidity and mortality worldwide [6–8]. Because MetS imposes significant health and economic burdens, identifying new risk factors and improving early prevention remain key priorities for researchers in related fields.
The influence of work-related stress factors on MetS remains unclear. Shift workers face unique challenges, particularly irregular working schedules and night shifts. With rapid industrial development, shift work has become increasingly common worldwide. Approximately 15–30% of the population in Europe and the United States engage in shift work across various occupations, and this proportion continues to rise. This percentage is similarly high in China.
Disruptions in circadian rhythms negatively affect sleep quality, fatigue levels, and emotional well-being and may also accelerate the onset of MetS, stroke, and cancer [9]. The body’s internal biological clock regulates the sleep-wake cycle, metabolism, and hormone secretion. Shift work forces a misalignment between circadian rhythms and environmental cues, resulting in circadian rhythm disorders that disrupt the timing and efficiency of metabolic processes. These disruptions can lead to glucose intolerance, weight gain, and lipid metabolism disorders [10], which are key components of MetS.
Inflammation often coexists in patients with MetS and circadian rhythm disorders, suggesting a complex interaction among these conditions. Chronic inflammation is consistently linked to the pathophysiology of MetS. Inflammatory markers worsen insulin resistance and impair endothelial function. Additionally, they promote fat accumulation and increase the risk of cardiovascular complications [11]. Previous cross-sectional studies have primarily concentrated on the role of inflammatory factors and their derived indices in metabolic disorders [12,13].
Moreover, persistent low-grade inflammation is common among shift-workers and acts as a significant trigger for circadian rhythm disorders [14,15]. As a high-risk group for circadian disruption, shift workers exhibit inflammatory responses that remain insufficiently characterized, and comprehensive analyses of inflammation at different levels in MetS are lacking.
Current research has examined the association between inflammation and circadian rhythm disruptions related to MetS, relying mainly on traditional statistical methods. For instance, Li et al. conducted an observational study using correlation analysis, linear regression, and receiver operating characteristic (ROC) curve calculations to investigate whether disruptions in the circadian clock contributed to MetS components in patients with obstructive sleep apnea [16]. Similarly, Romero-Cabrera et al. conducted a prospective randomized controlled trial lasting up to 4 years, carefully selecting intervention methods and assessment criteria and applying statistical analyses such as analysis of variance (ANOVA) and linear mixed-effects models for repeated measures [17].
However, these traditional methods often require large datasets, involve multiple assumptions, and have strict application criteria.
In contrast, machine learning (ML) provides a robust approach for computer-assisted data mining and analysis and excels at identifying complex patterns and nonlinear relationships within data. This methodology is widely applied in fields such as engineering and medicine, particularly for developing predictive tools [18,19]. Several studies have used ML algorithms to predict the risk of MetS based on physical examination findings or genetic information. Li et al. developed a super learner model that integrates multiple ML algorithms using physical examination variables, such as vital signs, serum indicators, and comorbidities, to predict MetS risk [20]. Similarly, Huang et al. extracted genomic deoxyribonucleic acid (DNA) from serum and applied genetic risk score models based on ML to predict the occurrence of MetS [21]. However, most of these investigations have focused on the general population rather than shift workers, a distinct group that requires continuous health monitoring.
In this study, we used data from the National Health and Nutrition Examination Survey (NHANES) to develop an ML prediction model for assessing the risk of MetS among shift workers. The primary objectives are twofold: (1) to examine the relationship between inflammatory indicators and MetS using blood cell counts in this high-risk group of shift workers who require long-term health monitoring and (2) to enhance the biological understanding of MetS by applying ML methods.
Methods
Study population and design
The study population was derived from the NHANES, a comprehensive survey that collects nationally representative health and nutritional data from participants in the United States. Extensive household interviews were conducted to obtain demographic and health history information. Physical examinations and blood sample collections were performed in a Mobile Examination Center (MEC). Serum samples were analyzed by the Laboratory Sciences Division of the National Center for Environmental Health at the Centers for Disease Control and Prevention.
This cross-sectional study used publicly available data from NHANES for the years 2005–2010 and 2017–2018 and was approved by the National Center for Health Statistics Research Ethics Review Board. Further details are available on the official NHANES website (https://www.cdc.gov/nchs/nhanes/about_nhanes.htm). The original study protocol and formal approval from the Research Ethics Review Board (protocol #2005−06; #2011−17) can be accessed on the NHANES Research Ethics Review Board website (https://www.cdc.gov/nchs/nhanes/irba98.htm). All participants provided written informed consent at the time of registration. Our analysis included 3,079 employed individuals aged 20 years or older who completed both interviews and MEC assessments. Participants were required to have a shift work schedule to be included in the study. We excluded pregnant women and multiracial individuals to minimize potential bias. Pregnancy induces significant physiological changes that can affect MetS assessment. Similarly, participants from different racial backgrounds were excluded because MetS risk factors can vary by ethnicity. As the MetS definition used in this study is primarily suited to European and American populations, we focused on individuals from these groups to maintain consistency.
Assessment of shift work schedule
Daytime work was defined as standard weekday work between 9:00 a.m. and 5:00 p.m. Shift work included any schedule that differed from these regular daytime hours [22]. Shift work status was determined using questionnaire items such as: “Which best describes your hours worked?” and “What was your overall work schedule for the past 3 months?” The response options included “evening or nights, “early mornings,” or “variable schedule (early mornings, days, and nights).” For analysis, participants who reported standard daytime work were classified as the “Regular” group, and those who reported any nonstandard schedule were classified as “Irregular.”
Assessment of MetS
MetS was defined according to the harmonized criteria proposed in the joint interim statement of the International Diabetes Federation Task Force on Epidemiology and Prevention, National Heart, Lung, and Blood Institute, American Heart Association, World Heart Federation, International Atherosclerosis Society, and the International Association for the Study of Obesity [23]. Participants were diagnosed with MetS if they met three or more of the following criteria: (1) waist circumference: ≥ 102 cm in men or ≥ 88 cm in women; (2) triglycerides: ≥ 150 mg/dL or current use of lipid-lowering medication; (3) High-density lipoprotein cholesterol (HDL-C): HDL-C < 40 mg/dL for men or < 50 mg/dL for women; (4) Blood pressure: systolic ≥ 130 mmHg and/or diastolic ≥ 85 mmHg, or current use of antihypertensive medication; and (5) Fasting glucose: elevated levels or current use of antihyperglycemic agents.
Assessment of inflammation
Fasting venous blood samples were collected from all participants to measure leukocyte, neutrophil, lymphocyte, monocyte, and platelet counts, expressed as thousands of cells per microliter (1000 cells/μL). The following complete blood count (CBC)-derived inflammatory indicators were calculated: Systemic Immune-Inflammation Index (SII), Systemic Inflammatory Response Index (SIRI), Systemic Inflammatory Composite Index (AISI), Monocyte-to-Lymphocyte Ratio (MLR), Neutrophil-to-Monocyte Ratio (NMLR), Platelet-to-Lymphocyte ratio (PLR), and Neutrophil-to-Lymphocyte Ratio (NLR) [24]. The formulas were defined as follows:
Covariates
In this study, we examined several sociodemographic factors, including age, sex, and race (non-Hispanic Black, non-Hispanic white, Mexican American and other Hispanic). Education level was characterized as less than high school, high school diploma/GED, or college and above. The poverty-income ratio (PIR) and alcohol consumption (non-drinker, 1–5 drinks/month, 5–10 drinks/month, or ≥10 drinks/month) were also assessed. Smoking status was classified as never smoked, former smoker, or current smoker. Body mass index (BMI) and self-reported diet quality (excellent, very good, good, fair, or poor) were included.
Dietary quality was further evaluated using the Healthy Eating Index-2020 (HEI-2020), which measures adherence to the Dietary Guidelines for Americans based on 13 food and nutrient components. Furthermore, we considered health complications (including weak/failing kidneys, stroke, asthma, anemia, arthritis, congestive heart failure, coronary heart disease, chronic bronchitis, cancer, or malignancy) and Patient Health Questionnaire (PHQ) scores. These covariates were selected based on previous research [25,26].
Variable selection and model construction
Four standard ML models were developed using the shift worker dataset, which served as the model development cohort. A total of 32 variables collected in this study were included in the modeling process. The models applied in this analysis were random forest (RF), extreme gradient boosting (XGBoost), light gradient boosting machine (LightGBM), and linear regression (LR).
The dataset was randomly divided into training and test sets following the widely accepted Pareto principle, with a 70:30 split. Specifically, 70% of the data was used for model selection and tuning, while the remaining 30% was reserved for evaluating performance. During the training, 10-fold cross-validation was employed to ensure robust model validation. In this procedure, the training set was divided into 10 subsets: nine were used to train the model, and one served as the validation set. This process was repeated 10 times so that each subset was used once for validation. The algorithm’s final accuracy was then calculated as the average across the 10 iterations. Following this validation process, the test set (30% of the data) was used to evaluate how well the model generalized to unseen data.
The analytic sample was randomly split into a training dataset accounting for 70% and an independent test dataset accounting for 30%. The training dataset was used for model development and internal validation, and the test dataset was reserved solely for evaluating generalization to unseen data. For all ML algorithms, performance within the training dataset was assessed using cross-validated out-of-fold predictions, or out-of-bag (OOB) predictions for RF. The primary performance metric was AUC.
For LR, XGBoost, and LightGBM, we implemented 10-fold cross-validation using the training dataset. In each iteration, nine folds were used to fit the model and the remaining fold served as the validation fold, so that every observation was predicted only from models that had not been trained on it. Hyperparameters for the XGBoost and LightGBM, such as learning rate, maximum depth, and the number of boosting iterations, were tuned to maximize cross-validated AUC [27]. The LR model was fitted without penalization as a baseline parametric classifier and did not require extensive hyperparameter tuning. After cross-validation, each algorithm was refitted on the full training dataset using the optimal hyperparameters, and its generalization performance was evaluated on the independent 30% test dataset.
The RF model was implemented using bootstrap aggregation with OOB estimation. Each tree was grown on a bootstrap sample of approximately 80% of the training observations, with the remaining observations serving as OOB samples for that tree. For each participant, the OOB prediction was obtained by aggregating votes from trees that did not include that participant in their bootstrap sample, providing an internal cross-validation. To reduce overfitting, we restricted tree complexity by using 1,000 trees, sampling 80% of observations per tree, setting the number of variables considered at each split to the square root of the total number of predictors, and enforcing a minimum terminal node size of 5 observations [28,29].
All models were implemented using the “RandomForest,” “lightgbm,” “treeshap,” “xgboost,” and “fastshap” packages.
Model validation and explainability
To thoroughly evaluate model performance, we utilized standard binary classification metrics, including area under the curve (AUC), accuracy (ACC), sensitivity (recall), and specificity. Furthermore, we analyzed the ROC curve to gain further insight into model behavior. The bootstrap method was used to refine estimates of model performance.
A major limitation of complex ML models is their inherent lack of interpretability, often referred to as the “black box” nature [30,31]. To enhance interpretability, we used SHapley Additive ExPlanations (SHAP) values from the ‘shap’ package, which are based on game theory, to visualize feature importance. This approach enhances model transparency and interpretability, thereby improving the reliability of its predictions [32].
In our study, we utilized the permutation importance method to identify the most significant features for predicting MetS in shift workers. This method evaluates feature importance by measuring how randomly shuffling a feature’s values affects the model’s predictive performance. To ensure reliability and minimize errors, we conducted 1,000 permutations for each feature across all models, yielding importance values that were averaged and ranked to determine their relative contributions. This methodology enables researchers and practitioners to better understand the factors influencing model outcomes by analyzing feature importance, recognizing interdependencies, and considering key elements for informed decision-making [30].
Statistical analysis
Continuous variables were presented as mean, median, minimum, and maximum values, whereas categorical variables were expressed as frequencies and percentages. Participants were divided into MetS and non-MetS groups according to diagnostic criteria. Differences between the two groups were analyzed using the Wilcoxon rank-sum or Kruskal–Wallis test for continuous variables and the Fisher exact or chi-squared test for categorical variables.
Missing data were addressed using multiple imputation, a flexible, simulation-based statistical method commonly applied to handle missing values or nonresponse in the NHANES dataset [33]. To ensure nationally representative estimates, analyses were weighted per NHANES guidelines.
Logistic regression analysis was used to examine the correlation between inflammatory markers and MetS, with results expressed as odds ratios (OR) and 95% confidence intervals (95% CIs). Given that the relationships between inflammatory indicators and MetS may vary among participants due to multiple influencing factors, we conducted stratified analyses to assess these differences in participants with MetS. We also applied a generalized additive model (GAM) to explore potential non-linear relationships. When a non-linear association was detected, a two-piecewise linear regression model was constructed to calculate the threshold effect of inflammation levels on MetS using the resulting smoothing plots. When the smoothing curve revealed a relationship between MetS status and inflammation levels, a recursive method was used to automatically identify the inflection point for subsequent analyses.
All statistical analyses were performed using the R software (http://www.r-project.org) and EmpowerStats (http://www.empowerstats.com), with statistical significance defined as p < 0.05.
Results
Participant characteristics
We screened the shift worker population and included 3,079 individuals in the study cohort. Participants were divided into two groups: MetS (unweighted n = 529, weighted n = 9,291,315) and non-MetS (unweighted n = 2,250, weighted n = 26,216,750). A detailed flowchart of the selection process is presented in Fig 1. Table 1 shows the baseline characteristics of the two groups. Participants with MetS were generally older (median age: 45 vs. 38, p < 0.001), more likely to be overweight (34.2% vs. 30.5%) or obese (61.6% vs. 32.2%), and exhibited higher levels of inflammatory markers, including neutrophils (median: 4.00 vs. 3.60, p < 0.001), lymphocytes (median: 2.10 vs. 2.00, p = 0.024), monocytes (median: 0.60 vs. 0.50, p = 0.009), SIRI (median: 0.91 vs. 1.02, p = 0.048) and AISI (median: 223 vs. 246, p = 0.027). No significant differences were observed in shift work patterns or other inflammatory indicators between the groups.
NHANES: National health and nutrition examination survey; MetS: Metabolic syndrome; XGBoost: eXtreme gradient boosting; LightGBM: Light gradient boosting machine.
Development and validation of prediction models
Of all participants, 2,166 were randomly assigned to the training set and 913 to the test set. All variables were included in training the four ML models, and the AUC was used to evaluate their discriminative performance.
In the LightGBM model (Fig 2A, red line), the AUC for the training dataset was 0.944, which was considerably higher than that for the test dataset (Fig 2B, red line; 0.722). In comparison, the RF model demonstrated an AUC of 0.741 in the training dataset (Fig 2A, purple line) and 0.729 in the test dataset (Fig 2B, purple line). Similarly, the XGBoost model achieved an AUC of 0.818 for the training dataset (Fig 2A, blue line) and 0.747 for the test dataset (Fig 2B, blue line). LR yielded AUC values, 0.763 for the training dataset (Fig 2A, green line) and 0.699 for the test dataset (Fig 2B, green line). All models showed some degree of performance decline when applied to the test dataset; however, the decrease was less significant in RF and XGBoost than in LightGBM. Evaluation of calibration (Fig 2C) and DCA (Fig 2E) for all four models on the training set confirmed that LR (Fig 2C, blue line; Fig 2E, blue-green line) performed acceptable. followed closely by XGBoost (Figs 2C and 2E, purple line), LightGBM (Figs 2C and 2E, brown and green lines), and RF (Fig 2C and 2E, orange and blue line). When evaluated on the test dataset (Figs 2D and 2F), XGBoost (Figs 2D and 2F, purple line) was the best-performing algorithms, followed closely by LR (Figs 2D and 2F, blue and blue-green line), LightGBM (Figs 2D and 2F, brown and green lines), and RF (Figs 2D and 2F, orange and blue line). Per-model performance metrics, including AUC values with 95% CIs estimated via bootstrap resampling, as well as sensitivity, specificity, and calibration data at clinically relevant thresholds, are provided in the S1 and S2 Table.
A. ROC curves of the four algorithms on the training set; B. ROC curves on the test set; C. Calibration curve for the training set; D. Calibration curve for the test set; E. DCA curve for the training set; F. DCA curve for the test set. ROC curves: comparing the discriminative ability of the four algorithms in the training and test sets, with the AUC representing overall classification performance. Calibration curves illustrate the agreement between predicted and observed probabilities, the closer the curve is to the 45° diagonal, the better the calibration. DCA curve: illustrating the net clinical benefit of each model across a range of threshold probabilities.
The bar plot highlights the importance of various features across four models: LightGBM, LR, XGBoost, and RF. In the LightGBM model, the top 15 features identified include BMI, age, sex, and several inflammatory ratios such as MLR, PLR, AISI, SII, SIRI, NMLR and NLR, among others (Fig 3A). Fig 3B compares individual incidence risk (f(x) = −3,21) with the overall average risk (E[f(x)] = −2.64), showing that the individual has a lower risk than the population mean. Factors such as PIR, sex, and AISI contribute to this elevated risk.
A: Bar plot of feature importance based on lightgbm. B: SHAP force plot on lightgbm. C: Beeswarm plot of effects of features based on lightgbm-shap. D: Bar plot of feature importance based on LR. E: SHAP force plot on LR. F: Beeswarm plot of effects of features based on LR-shap. G: Bar plot of feature importance based on RF. H: SHAP force plot on RF. I: Beeswarm plot of effects of features based on RF-shap. J: Bar plot of feature importance based on Xgboost. K: SHAP force plot on Xgboost. L: Beeswarm plot of effects of features based on Xgboost -shap. Bar plots: showing the ranked feature importance scores for each algorithm. SHAP force plots: displaying the individual contribution of each feature to model predictions. SHAP beeswarm plots: summarizing the distribution and magnitude of feature effects, where color represents feature value and position indicates the SHAP value (positive or negative influence on prediction).
The beeswarm plot presents the overall importance of features in the LightGBM, LR, XGBoost, and RF models. A color scale on the right indicates the relative value of each feature, where yellow dots represent high values and purple dots indicate low values. The violin plot, aligned at the midline, aggregates the dots representing each case in the internal validation set; the vertical spread reflects the number of cases sharing identical SHAP values. Fig 3C displays the SHAP values of the 15 features and their association with MetS risk among shift workers in the LightGBM model. Higher values of BMI, age, MLR, and platelet counts were associated with increased risk, while higher values of PLR, PIR, SII, SIRI, and other inflammatory markers were associated with decreased risk.
Figs 3D and 3F present the top 15 features contributing to predictions in the LR model, Figs 3G and 3I show the top 15 predictive features in the XGBoost model, and Figs 3J and 3L depict the top 15 influential features identified by the RF model. The findings indicate that, in addition to established risk factors for MetS such as BMI and age, inflammatory predictors, including lymphocytes, neutrophils, monocytes, and platelets, are also significant [34,35]. Moreover, previous studies have reported associations between these inflammatory factors and both shift work and MetS [15,36]. The risk contribution of each feature closely mirrors that observed in the LightGBM model. When comparing individual and overall incidence risks, the LR model (Fig 3E, f(x) = −589415, E[f(x)] = 0), the XGBoost model (Fig 3H, f(x) = 0.133, E[f(x)] = 0.172), and the RF model (Fig 3K, f(x) = −552, E[f(x)] = 0) all demonstrated lower individual risks than the overall average risk. Across all models, the individual risk consistently falls below the overall risk.
The relationship between hub feature levels and SHAP values (Fig 4.) showed that the varying trends of different inflammatory factors influenced the model’s predictive performance. The SHAP value of neutrophils increased sharply with neutrophil count (Fig 4A), peaking within the range of 3.5–3.7, and then exhibited a U-shaped trend, first decreasing and then increasing, with the lowest value observed within the range of 10–12. The SHAP value of lymphocytes (Fig 4B) showed an overall stable upward trend, with a slight U-shaped pattern within the range of 1.6–2.5. In contrast, the SHAP value of monocytes (Fig 4C) showed an opposite trend to that of neutrophils, decreasing sharply to its lowest point (0.6–0.7), followed by an inverted U-shaped trend. The SHAP value of platelets (Fig 4D) exhibited two inverted U-shaped patterns: the first occurred within the range of 100–250, reaching a trough around 250–260, followed by the second inverted U-shaped trend. Platelet counts within the range of approximately 220–300 and > 500 indicated a negative contribution to the model predictions.
A: SHAP values of neutrophils. B: SHAP values of lymphocytes. C: SHAP values of monocytes. D: SHAP values of platelets. Scatter plots show how SHAP values vary with neutrophil, lymphocyte, monocyte, and platelet counts, respectively, reflecting each feature’s influence on model output.
To facilitate interpretation of feature effects and potential interactions, SHAP dependence plots were generated exclusively for the LR model. Unlike tree-based models such as LightGBM, RF, and XGBoost, which can capture complex non-linear relationships [29,31], LR offers a more transparent and interpretable framework for visualizing the relationship between individual predictors and model outputs. The inherently linear structure of LR enables clearer identification of both the direction and magnitude of feature contributions, as well as more interpretable interactions via color-coded modifiers in SHAP plots. In the LR model, age and lymphocyte count exhibited a positive and approximately linear association with SHAP values, while sex emerged as a prominent interaction factor. In contrast to non-linear models that often produce unstable SHAP dependence plots, the LR model provides more stable and interpretable visualizations. This clarity makes LR particularly valuable for examining feature relationships in analyses based on SHAP values [28].
The SHAP dependence plot displays the distribution of output values for a single feature and uses color coding to introduce a second feature, enhancing the analysis of potential feature interactions. In the LR model (Fig 5.), age (Fig 5A), and lymphocyte count (Fig 5B) were positively and linearly correlated with SHAP values. Chronic bronchitis and shift schedule were used for color classification; however, their effects on the SHAP values of age and lymphocytes were relatively low. Neutrophils (Fig 5C) also demonstrated a positive linear correlation with SHAP values, with alcohol consumption as an interaction factor significantly influencing the SHAP values. Conversely, platelet count (Fig 5D) exhibited a negative linear correlation with SHAP values, and PIR, as an interaction factor, significantly influenced SHAP values. Monocyte counts (Fig 5E) were negatively correlated with the model predictions. Among the categorical variables, participants who were overweight or obese demonstrated significantly higher SHAP values than those in other groups (Fig 5F), suggesting that the interaction between BMI positively influenced model predictions.
A: SHAP feature dependence plot (Age & Chronic bronchitis). B: SHAP feature dependence plot (Lymphocytes & Shift schedule). C: SHAP feature dependence plot (Neutrophils & Alcohol). D: SHAP feature dependence plot (Platelets & PIR). E: SHAP feature dependence plot (Monocytes & Arthritis). F: SHAP feature dependence plot (BMI & Chronic bronchitis). Each plot illustrates the interaction between two variables, where the x-axis represents the feature value and the y-axis represents the SHAP value, indicating the strength and direction of each feature’s contribution to the model prediction.
Association between inflammation factors and MetS in shift workers
Different models were used to adjust for covariates in the multivariate logistic regression analyses examining the association between inflammatory factors and MetS risk (Table 2). For neutrophils, Models I (unadjusted) and II (adjusted for age, sex, and race) showed a significant positive association with MetS risk; however, this association was no longer significant in Model III (further adjusted for shift schedule, PIR, alcohol consumption, smoking, BMI, self-reported diet quality, HEI2020, weak/failing kidneys, stroke, asthma, anemia, arthritis, coronary heart disease, chronic bronchitis, cancer or malignancy, and PHQ score). Similarly, lymphocytes remained consistently associated with elevated MetS risk across all models, with the strongest associations observed in Models II and III, suggesting their potential as reliable predictors. In Model I, monocytes were significantly associated with increased risk. However, the association became non-significant in Model III, indicating that additional covariates may mediate this relationship. When analyzed categorically, participants in the third quartile (Q3: 0.563–0.6) had significantly lower odds of MetS across all models (Model I: OR: 0.02; 95% CI: 0.00–0.08, p < 0.001; Model II: OR: 0.02; 95% CI: 0.00–0.09, p < 0.001; Model III: OR: 0.01, 95% CI: 0.00–0.06, p < 0.001) compared with the reference group. For platelet count, no significant associations were observed in any model (all ORs = 1.00, all p > 0.05). However, in the categorical analysis using quartiles, significant associations were observed for Q3 [255–287) across all models (Model I: OR: 0.53; 95% CI: 0.40–0.70, p < 0.001; Model II: OR: 0.65; 95% CI: 0.49–0.87, p = 0.004; Model III: OR: 0.56; 95% CI: 0.41–0.75, p < 0.001). Additional analyses of other inflammatory markers are presented in S3 Table.
These findings suggest that inflammatory factors are associated with prevalent MetS among shift workers and may be informative when assessing MetS-related risk profiles. However, these effects were attenuated after comprehensive covariate adjustments, warranting further investigation.
Fig 6 illustrates that neutrophils (Fig 6A), lymphocytes (Fig 6B), and monocytes (Fig 6C) were significantly associated with an increased risk of MetS (neutrophils: OR: 1.09; 95% CI: 1.03–1.16, p = 0.002, lymphocytes: OR: 1.21; 95% CI: 1.06–1.38, p = 0.006, monocytes: OR: 1.71; 95% CI: 1.02–2.88, p = 0.044). Among non-Hispanic White participants, both neutrophils (OR: 1.13; 95% CI: 1.03–1.23, p = 0.009) and monocytes (OR: 3.87; 95% CI: 1.72–-8.73, p = 0.001) demonstrated strong associations with MetS risk. Moreover, a negative correlation was observed between alcohol consumption and inflammatory factors.
Effect of different numbers of inflammation factors on MetS
To better understand how variations in inflammatory factor counts influence MetS risk, we conducted a GAM analysis to identify non-linear relationships and potential threshold effects (Fig 7 and Table 3). The results indicated that the risk of MetS among shift workers increased steadily with higher neutrophil counts (Fig 7A). For lymphocytes (Fig 7B), a non-linear relationship was observed, with an inflection point at 2.2. Below this threshold, the association was inverse but not significant (OR: 0.8; 95% CI: 0.6–1.0, p = 0.083), whereas above the threshold, higher counts were significantly associated with an increased risk of MetS (OR: 1.5; 95% CI: 1.2–1.8, p < 0.001; likelihood ratio test, p = 0.002). Monocytes (Fig 7C) demonstrated a linear association with MetS risk, showing no evidence of a threshold effect. Platelet counts (Fig 7D) exhibited a more complex, S-shaped pattern, rising at low levels, plateauing between 160 and 350, and then increasing again after minor fluctuations. Two-piecewise regression identified an inflection point at 257. The effect estimates were 1.0 (95% CI: 1.0–1.0, p < 0.001) on the left and 1.0 (95% CI: 1.0–1.0, p = 0.001) on the right. Although statistically significant, the OR of 1.0 indicated no meaningful association between platelet counts and MetS. Additional analyses of other imflammatory markers are presented in S4 Table.
The x-axis represents each continuous variable, and the y-axis shows the estimated smooth function of the log(relative risk). The central red dotted line denotes the fitted smooth effect estimated by the GAM, while the upper and lower blue dotted lines indicate the 95% confidence interval. The horizontal line at zero represents the reference level, with values above or below indicating positive or negative associations, respectively.
Discussion
We analyzed NHANES datasets from 2005–2010 and 2017–2018 to evaluate the risk of MetS among shift workers. All four ML models (LightGBM, RF, XGBoost, and LR) demonstrated modest-to-good discriminative performance, with LightGBM achieving the highest AUC values. These findings suggest that non-linear algorithms may better capture capturing complex relationships among variables. Consistent with our findings, previous studies predicting MetS and other chronic diseases have reported that tree-based nonlinear algorithms outperform traditional linear models, particularly in handling high-dimensional data and identifying variable interactions [37,38]. For example, LightGBM and RF have been widely applied in the prediction of diabetes and cardiovascular disease, consistently showing superior performance compared with LR [39,40]. The primary objective of our study was to compare relative model performance and identify key predictors. Furthermore, through SHAP-based interpretative analysis, we found that inflammation-related markers, including lymphocytes, neutrophils, monocytes, and their derived ratios such as the NLR, MLR, SII, and SIRI, consistently ranked among the top predictors of MetS in shift workers. These findings suggest that inflammation may be closely linked with prevalent MetS in this population and may serve as a marker of adverse metabolic status.
BMI and age, as conventional risk factors, also remained among the predominant contributors. Shift work disrupts circadian rhythms, leading to irregular eating patterns that promote higher calorie intake and subsequent weight gain [41]. Furthermore, exposure to artificial light at night significantly interferes with lipid metabolism, leading to excessive hepatic fat accumulation over time [42]. Artificial light exposure may also impair sleep quality by increasing glucose and lipid absorption, reducing fatty acid release, and worsening metabolic imbalances. With aging, physiological functions decline and self-repair capacity weakens, thereby increasing susceptibility to MetS-related conditions including obesity, hyperglycemia, and hyperlipidemia. Aging also disrupts circadian rhythms [43], resulting in decreased expression of clock genes and disruption of normal biological cycles. Such circadian rhythm disorders can, in turn, accelerate aging, increase oxidative stress and inflammation, and trigger age-associated diseases including coronary heart disease and osteoarthritis [44]. Together, these mechanisms may help explain why higher BMI and older age are consistently associated with MetS.
To further investigate the impact of inflammatory factors on MetS in shift workers, we applied LR and GAM analyses. LR identified lymphocytes as relatively independent predictors, while GAM revealed both linear and non-linear associations of neutrophils, lymphocytes, monocytes, and platelets with MetS among shift workers. These findings suggest that inflammatory responses may be linked with both the presence and severity of MetS in this population and may involve threshold-like patterns that warrant further investigation.
Consistent with previous studies, alterations in shift patterns may modulate the activity of inflammatory cells through mechanisms such as circadian rhythm disruption, chronic stress, and increased metabolic load [45–47]. We observed that when lymphocyte levels exceeded approximately 2.2 × 10⁹/L, the odds of prevalent MetS were significantly higher, suggesting that lymphocytes not only reflect the degree of systemic inflammation but may also mark a potential threshold of immune activation related to MetS.
In shift workers, adipose tissue inflammation may serve as an essential pathway linking lymphocytes to metabolic dysfunction. Circadian disruption and lifestyle imbalance make long-term night shift workers particularly susceptible to obesity and insulin resistance [48]. Under these conditions, T cells and natural killer cells have been reported to accumulate in adipose tissue and release proinflammatory cytokines such as interferon-gamma and tumor necrosis factor-alpha (TNF-α), which are associated with macrophage polarization toward the M1 phenotype and impaired insulin signaling [49]. As these subsets belong to the circulating lymphocyte pool, their expansion contributes to elevated peripheral lymphocyte counts [50]. These mechanisms, described in previous experimental studies, provide a biological context that may help interpret the observed lymphocyte and MetS associations, although our cross-sectional data cannot confirm these pathways.
Circadian disruption itself constitutes a chronic stress burden, disturbing the hypothalamic-pituitary-adrenal axis and sympathetic activity, and altering cortisol and melatonin rhythms. These changes influence immune cell generation, circulation, and trafficking [51]. Lymphocytes, which are highly sensitive to circadian cues, may undergo metabolic reprogramming under prolonged night shift exposure, adopting a more proinflammatory phenotype [52,53]. Once their levels exceed a critical threshold, amplified cytokine release is associated with more pronounced adipose tissue inflammation and insulin resistance, which may be related to a higher likelihood of MetS.
We also found that neutrophil and monocyte counts were positively and approximately linearly associated with the probability of prevalent MetS. This finding does not entirely align with the conventional leukocyte distribution pattern, in which neutrophil and lymphocyte counts often show antagonistic behavior during acute inflammatory responses. One possible explanation is that the interplay between chronic metabolic disturbances and circadian disruption may alter this antagonistic relationship. According to report, Chronic inflammation in adipose tissue and metabolic dysregulation, such as fat accumulation and insulin resistance, may directly promote leukocyte proliferation [54,55]. Furthermore, circadian rhythm disruption has been shown to enhance the secretion of pro-inflammatory cytokines including interleukin (IL)-6, IL-10, IL-17, and TNF-α, thereby exacerbating leukocyte activation and contributing to the concurrent elevation of neutrophil and monocyte counts [55,56]. In contrast, platelets were not significantly associated with MetS, although previous studies has reported associations between platelet-related indices and metabolic risk [57]. Circadian disruption, common among shift workers, can alter platelet production and activity by affecting the bone marrow microenvironment and through stress-related sympathetic and hormonal pathways, potentially promoting proinflammatory and procoagulant states [58,59]. However, the observed decrease in SHAP values for neutrophils within the 10–12 count range may have been influenced by the relatively sparse sample distribution in this interval, which could affect the robustness of this result.
In this study, composite inflammatory indices, including MLR, NMLR, NLR, PLR, SII, SIRI, and AISI showed relatively strong associations with MetS and frequently ranked among the top 15 features in importance analyses. Notably, all these indices are closely associated with lymphocyte counts. Unlike single-cell measures, these indices integrate data from multiple immune cell types and may more accurately reflect the balance between pro- and anti-inflammatory processes. Prior studies have similarly reported that MLR, NMLR, NLR, PLR, SII, SIRI, and AISI are associated with metabolic disorders and cardiovascular events and can improve risk stratification [60,61]. Jiang et al. demonstrated that SII and SIRI independently predicted MetS, and their levels were positively correlated with obesity severity in individuals with obesity [62,63]. Mechanistically, these composite indices may more effectively capture the interplay between innate and adaptive immunity, enhancing sensitivity to early inflammatory disturbances that precede the onset of metabolic abnormalities.
Importantly, racial differences were observed. Among non-Hispanic White participants, both neutrophils and monocytes exhibited stronger associations with MetS risk than the overall population, with the effect particularly pronounced for monocytes (OR: 3.87). In contrast, lymphocytes showed a stronger association among non-Hispanic Black participants (OR: 1.46, p = 0.003). These findings suggest that inflammatory pathways may exert population-specific effects, potentially influenced by genetic background, dietary patterns, lifestyle behaviours, and socioeconomic conditions [64,65]. Such ethnic heterogeneity highlights the need for tailored predictive models and targeted interventions to more accurately account for differential inflammatory responses across diverse populations.
This study has several limitations. First, the study population was primarily derived from the United States, which may limit the generalizability of the findings to other regions and ethnic groups. Additional research in multi-ethnic and multinational cohorts is warranted to assess the applicability of these results. Second, despite adjustment for a wide range of covariates, residual confounding from unmeasured variables cannot be excluded. Several important factors, such as sleep quality, psychological stress, and occupational category (e.g., healthcare vs. industrial shift workers), were not available in the dataset. These unmeasured variables may be related to both inflammatory status and metabolic outcomes. Advanced statistical methods and longitudinal study designs will be necessary to better address potential bias. Moreover, future studies directly comparing shift workers and day workers would help clarify how shift work is linked to metabolic risk. From a modeling perspective, the four ML algorithms achieved only modest discrimination in the held-out test set and were not externally validated, so generalizability of the prediction models to other shift-working populations remains uncertain. In addition, although NHANES sampling weights and complex survey design were applied in descriptive and regression analyses, the ML models were fit without survey weighting and are best regarded as internal, sample-based tools for benchmarking algorithms and exploring predictors associated with MetS rather than definitive population-level risk prediction tools.
Conclusion
We applied ML techniques to identify key risk factors for MetS in shift workers. Among the tested models, the RF model demonstrated the highest discriminative performance. Our analysis revealed that inflammatory biomarkers, including neutrophils, lymphocytes, monocytes, and platelets, were particularly effective in predicting MetS risk, with lymphocytes showing the strongest correlation. Further investigation of threshold values showed that lymphocyte levels exceeding approximately 2.2 × 109/L were closely associated with MetS, suggesting that even slight elevations above this threshold could be clinically significant. Overall, these findings suggest that lymphocyte counts and composite inflammatory indices may be useful markers for early odds-based stratification of MetS among shift workers, whereas the extent to which monitoring or modifying these markers is associated with changes in MetS odds will need to be clarified in longitudinal and interventional studies.
Supporting information
S1 Table. Performance metrics of machine learning models prediction on the training dataset.
https://doi.org/10.1371/journal.pone.0339673.s001
(PDF)
S2 Table. Performance metrics of machine learning models prediction on the training dataset.
https://doi.org/10.1371/journal.pone.0339673.s002
(PDF)
S3 Table. Multivariate logistic regression analysis of different inflammation factors.
https://doi.org/10.1371/journal.pone.0339673.s003
(PDF)
S4 Table. The threshold effect analysis of Neutrophils, Lymphocytes, Platelets and monocytes on MetS.
https://doi.org/10.1371/journal.pone.0339673.s004
(PDF)
Acknowledgments
We would like to thank NHANES for providing publicly available data. We also appreciate Yuhai Shou (graduated from the University of Pennsylvania) for his help in the writing of this article.
References
- 1. Grundy SM, Brewer HBJ, Cleeman JI, Smith SCJ, Lenfant C. Definition of metabolic syndrome: report of the National Heart, Lung, and Blood Institute/American Heart Association conference on scientific issues related to definition. Arterioscler Thromb Vasc Biol. 2004;24(2):e13–18.
- 2. Tárraga Marcos PJ, López-González ÁA, Martínez-Almoyna Rifá E, Paublini Oliveira H, Martorell Sánchez C, Tárraga López PJ, et al. The Prevalence of Metabolic Syndrome and Hypertriglyceridemic Waist Based on Sociodemographic Variables and Healthy Habits in Healthcare Workers: A Retrospective Study. Life (Basel). 2025;15(1):81. pmid:39860021
- 3. Moore JX, Chaudhary N, Akinyemiju T. Metabolic syndrome prevalence by race/ethnicity and sex in the United States, National Health and Nutrition Examination Survey, 1988-2012. Prev Chronic Dis. 2017;14:E24.
- 4. Gündogan K, Bayram F, Capak M, Tanriverdi F, Karaman A, Ozturk A, et al. Prevalence of metabolic syndrome in the Mediterranean region of Turkey: evaluation of hypertension, diabetes mellitus, obesity, and dyslipidemia. Metab Syndr Relat Disord. 2009;7(5):427–34. pmid:19754305
- 5. Lan Y, Mai Z, Zhou S, Liu Y, Li S, Zhao Z, et al. Prevalence of metabolic syndrome in China: An up-dated cross-sectional study. PLoS One. 2018;13(4):e0196012. pmid:29668762
- 6. Duan M, Zhao X, Li S, Miao G, Bai L, Zhang Q, et al. Metabolic score for insulin resistance (METS-IR) predicts all-cause and cardiovascular mortality in the general population: evidence from NHANES 2001-2018. Cardiovasc Diabetol. 2024;23(1):243. pmid:38987779
- 7. Mili N, Paschou SA, Goulis DG, Dimopoulos M-A, Lambrinoudaki I, Psaltopoulou T. Obesity, metabolic syndrome, and cancer: pathophysiological and therapeutic associations. Endocrine. 2021;74(3):478–97. pmid:34625915
- 8. Hsu C, McCulloch CE, Iribarren C, Darbinian J, Go AS. Body mass index and risk for end-stage renal disease. Ann Intern Med. 2006;144(1):21–8. pmid:16389251
- 9. Wu Q-J, Sun H, Wen Z-Y, Zhang M, Wang H-Y, He X-H, et al. Shift work and health outcomes: an umbrella review of systematic reviews and meta-analyses of epidemiological studies. J Clin Sleep Med. 2022;18(2):653–62. pmid:34473048
- 10. Pivonello C, Negri M, Patalano R, Amatrudo F, Montò T, Liccardi A, et al. The role of melatonin in the molecular mechanisms underlying metaflammation and infections in obesity: A narrative review. Obes Rev. 2022;23(3):e13390. pmid:34861097
- 11. Engin A. Endothelial Dysfunction in Obesity and Therapeutic Targets. Adv Exp Med Biol. 2024;1460:489–538. pmid:39287863
- 12. Guo J, Mutailipu K, Wen X, Yin J, You H, Qu S, et al. Association between lymphocyte to high-density lipoprotein cholesterol ratio and insulin resistance and metabolic syndrome in US adults: results from NHANES 2007-2018. Lipids Health Dis. 2025;24(1):9. pmid:39794792
- 13. Sarrafan-Chaharsoughi Z, Sinaii N, Demidowich AP, Yanovski JA. The association of Neutrophil-to-Lymphocyte ratio with metabolic syndrome in U.S. Adults: Findings from the 1999-2018 National Health and Nutrition Examination survey. J Clin Transl Endocrinol. 2024;39:100382. pmid:39790833
- 14. Harding BN, Espinosa A, Castaño-Vinyals G, Pozo OJ, Skene DJ, Bustamante M, et al. Identification of Predictors of Shift Work Adaptation and Its Association With Immune, Hormonal and Metabolite Biomarkers. J Pineal Res. 2024;76(8):e70017. pmid:39688072
- 15. McDermott JE, Jacobs JM, Merrill NJ, Mitchell HD, Arshad OA, McClure R, et al. Molecular-Level Dysregulation of Insulin Pathways and Inflammatory Processes in Peripheral Blood Mononuclear Cells by Circadian Misalignment. J Proteome Res. 2024;23(5):1547–58. pmid:38619923
- 16. Li X, Liu X, Meng Q, Wu X, Bing X, Guo N, et al. Circadian clock disruptions link oxidative stress and systemic inflammation to metabolic syndrome in obstructive sleep apnea patients. Front Physiol. 2022;13:932596. pmid:36105285
- 17. Romero-Cabrera JL, Garaulet M, Jimenez-Torres J, Alcala-Diaz JF, Quintana Navarro GM, Martin-Piedra L, et al. Chronodisruption and diet associated with increased cardiometabolic risk in coronary heart disease patients: the CORDIOPREV study. Transl Res. 2022;242:79–92. pmid:34752950
- 18. Gardner W, Winkler DA, Muir BW, Pigram PJ. Applications of multivariate analysis and unsupervised machine learning to ToF-SIMS images of organic, bioorganic, and biological systems. Biointerphases. 2022;17(2):020802. pmid:35345884
- 19. Hussain W, Mabrok M, Gao H, Rabhi FA, Rashed EA. Revolutionising healthcare with artificial intelligence: A bibliometric analysis of 40 years of progress in health systems. Digit Health. 2024;10. pmid:38817839
- 20. Li S, Zhang Y, Fu K, Fang K, Zheng L, Lin Y, et al. Machine learning ensemble meets clinical practice: developing a real-world risk prediction model for metabolic syndrome using super learner and scorecard approaches. J Adv Res. 2025.
- 21. Huang T, Li Y, Wang S, Qiao S, Zheng X, Xiong W, et al. Assessing individual genetic susceptibility to metabolic syndrome: interpretable machine learning method. Ann Med. 2025;57(1):2519679. pmid:40544475
- 22. Huang Y, Jing D, Su J, Huang Z, Liu H, Tao J, et al. Association of Night Shift Work With Chronic Spontaneous Urticaria and Effect Modification by Circadian Dysfunction Among Workers. Front Public Health. 2021;9:751579. pmid:34926376
- 23. Alberti KGMM, Eckel RH, Grundy SM, Zimmet PZ, Cleeman JI, Donato KA, et al. Harmonizing the metabolic syndrome: a joint interim statement of the International Diabetes Federation Task Force on Epidemiology and Prevention; National Heart, Lung, and Blood Institute; American Heart Association; World Heart Federation; International Atherosclerosis Society; and International Association for the Study of Obesity. Circulation. 2009;120(16):1640–5. pmid:19805654
- 24. Guo B, Liu X, Si Q, Zhang D, Li M, Li X, et al. Associations of CBC-Derived inflammatory indicators with sarcopenia and mortality in adults: evidence from Nhanes 1999 ∼ 2006. BMC Geriatr. 2024;24(1):432.
- 25. Wang M, Ma G, Tao Z. The association of neutrophil-to-lymphocyte ratio with cardiovascular and all-cause mortality among the metabolic syndrome population. BMC Cardiovasc Disord. 2024;24(1):594. pmid:39462314
- 26. Kurexi A, Peng J, Yao J, Wang L, Wang Q. Association of “a body shape index” with the risk of developing colorectal cancer in U.S. patients with metabolic syndrome: evidence from the NHANES 1999-2018. BMC Gastroenterol. 2024;24(1):447. pmid:39627686
- 27. Rathnayake N, Rathnayake U, Dang TL, Hoshino Y. Water level prediction using soft computing techniques: A case study in the Malwathu Oya, Sri Lanka. PLoS One. 2023;18(4):e0282847. pmid:37099590
- 28. Rathnayake N, Rathnayake U, Chathuranika I, Dang TL, Hoshino Y. Cascaded-ANFIS to simulate nonlinear rainfall–runoff relationship. Applied Soft Computing. 2023;147:110722.
- 29. Kularathne S, Perera A, Rathnayake N, Rathnayake U, Hoshino Y. Analyzing the impact of socioeconomic indicators on gender inequality in Sri Lanka: A machine learning-based approach. PLoS One. 2024;19(12):e0312395. pmid:39724101
- 30. Makumbura RK, Mampitiya L, Rathnayake N, Meddage DPP, Henna S, Dang TL, et al. Advancing water quality assessment and prediction using machine learning models, coupled with explainable artificial intelligence (XAI) techniques like shapley additive explanations (SHAP) for interpreting the black-box nature. Results in Engineering. 2024;23:102831.
- 31. Ekanayake IU, Meddage DPP, Rathnayake U. A novel approach to explain the black-box nature of machine learning in compressive strength predictions of concrete using Shapley additive explanations (SHAP). Case Studies in Construction Materials. 2022;16:e01059.
- 32. Meddage DPP, Ekanayake IU, Weerasuriya AU, Lewangamage CS, Tse KT, Miyanawala TP, et al. Explainable Machine Learning (XML) to predict external wind pressure of a low-rise building in urban-like settings. Journal of Wind Engineering and Industrial Aerodynamics. 2022;226:105027.
- 33. Wang YE, Kakigi C, Barbosa D, Porco T, Chen R, Wang S, et al. Oral Contraceptive Use and Prevalence of Self-Reported Glaucoma or Ocular Hypertension in the United States. Ophthalmology. 2016;123(4):729–36. pmid:26948305
- 34. Zeng P, Li M, Cao J, Zeng L, Jiang C, Lin F. Association of metabolic syndrome severity with frailty progression among Chinese middle and old-aged adults: a longitudinal study. Cardiovasc Diabetol. 2024;23(1):302. pmid:39152431
- 35. Guo T, Zheng S, Chen T, Chu C, Ren J, Sun Y, et al. The association of long-term trajectories of BMI, its variability, and metabolic syndrome: a 30-year prospective cohort study. EClinicalMedicine. 2024;69:102486. pmid:38370536
- 36. Meng G, Zhu Q, Shao J, Zhang Q, Liu L, Wu H, et al. Comparing the diagnostic ability of inflammatory markers in metabolic syndrome. Clin Chim Acta. 2017;475:1–6. pmid:28974361
- 37. Lee S, Lee H, Choi JR, Koh SB. Development and Validation of Prediction Model for Risk Reduction of Metabolic Syndrome by Body Weight Control: A Prospective Population-based Study. Sci Rep. 2020;10(1):10006. pmid:32561810
- 38. Khurshid MR, Manzoor S, Sadiq T, Hussain L, Khan MS, Dutta AK. Unveiling diabetes onset: Optimized XGBoost with Bayesian optimization for enhanced prediction. PLoS One. 2025;20(1):e0310218. pmid:39854291
- 39. Xu C, Shi F, Ding W, Fang C, Fang C. Development and validation of a machine learning model for cardiovascular disease risk prediction in type 2 diabetes patients. Sci Rep. 2025;15(1):32818. pmid:40999026
- 40. Dinh A, Miertschin S, Young A, Mohanty SD. A data-driven approach to predicting diabetes and cardiovascular disease with machine learning. BMC Med Inform Decis Mak. 2019;19(1):211. pmid:31694707
- 41. Kolbe I, Oster H. Chronodisruption, Metabolic Homeostasis, and the Regulation of Inflammation in Adipose Tissues. Yale J Biol Med. 2019;92(2):317–25. pmid:31249492
- 42. Guan Q, Wang Z, Cao J, Dong Y, Tang S, Chen Y. Melatonin restores hepatic lipid metabolic homeostasis disrupted by blue light at night in high-fat diet-fed mice. J Pineal Res. 2024;76(4):e12963. pmid:38779971
- 43. Shen W, Cai L, Li J, Sun Y, Wang B, Wang N, et al. Association of night shift work and biological ageing: the mediating role of body mass index. Age Ageing. 2024;53(11):afae242. pmid:39497270
- 44. de Souza KA, Jackson M, Chen J, Reyes J, Muayad J, Tran E, et al. Shift work schedules alter immune cell regulation and accelerate cognitive impairment during aging. J Neuroinflammation. 2025;22(1):4. pmid:39780172
- 45. Boivin DB, Boudreau P, Kosmadopoulos A. Disturbance of the Circadian System in Shift Work and Its Health Impact. J Biol Rhythms. 2022;37(1):3–28. pmid:34969316
- 46. Schettini MAS, Passos RF do N, Koike BDV. Shift Work and Metabolic Syndrome Updates: A Systematic Review. Sleep Sci. 2023;16(2):237–47. pmid:37425967
- 47. Morris CJ, Purvis TE, Hu K, Scheer FAJL. Circadian misalignment increases cardiovascular disease risk factors in humans. Proc Natl Acad Sci U S A. 2016;113(10):E1402–11. pmid:26858430
- 48. Antunes LC, Levandovski R, Dantas G, Caumo W, Hidalgo MP. Obesity and shift work: chronobiological aspects. Nutr Res Rev. 2010;23(1):155–68. pmid:20122305
- 49. Chatzigeorgiou A, Karalis KP, Bornstein SR, Chavakis T. Lymphocytes in obesity-related adipose tissue inflammation. Diabetologia. 2012;55(10):2583–92. pmid:22733483
- 50. Pan Z, Li M, Zhang P, Li T, Liu R, Liu J, et al. Peripheral Blood Lymphocyte Subsets and Heterogeneity of B Cell Subsets in Patients of Idiopathic Inflammatory Myositis with Different Myositis-specific Autoantibodies. Inflammation. 2025;48(1):118–32. pmid:38755405
- 51. Patel H, Qureshi RA, Jha N, Prajapati VP. Assessment of hypothalamic-pituitary-adrenal (HPA) axis function in chronic stress: correlation with cortisol rhythms and immune markers. Eur Heart J Digit Health. 2025;15:823–6.
- 52.
Pandey A, Kothari A, Kar SK. Sleep and circadian rhythm in peripheral inflammation: Realm of interconnected frontiers. Circadian rhythms, sleep and inflammation. Cham: Springer Nature Switzerland. 2025:57–72.
- 53. Abdolla N, Narmuratova G, Abdusattarova Y, Kali A, Perfilyeva Y, Ostapchuk Y, et al. The Impact of High-Fat Diet and Circadian Disruptions on Obesity and Immune System Dysfunction. ES Food Agrofor. 2025.
- 54. Feješ A, Šebeková K, Borbélyová V. Pathophysiological Role of Neutrophil Extracellular Traps in Diet-Induced Obesity and Metabolic Syndrome in Animal Models. Nutrients. 2025;17(2):241. pmid:39861371
- 55. Markova I, Hüttl M, Gayova N, Miklankova D, Cerna K, Kavanova M, et al. Visceral Adipose Tissue Inflammation and Vascular Complications in a Rat Model with Severe Dyslipidemia: Sex Differences and PAI-1 Tissue Involvement. Biomolecules. 2024;15(1):19. pmid:39858414
- 56. Sinton MC, Shorthouse O, Costain A, Quintana JF. Interleukin-17 and fat: Timing is everything. Immunity. 2025;58(1):15–7. pmid:39813991
- 57. Wu C, Hu Z, Zhang P. Association of the platelet-to-high-density lipoprotein cholesterol ratio (PHR) with metabolic syndrome and metabolic overweight/obesity phenotypes: A study based on the Dryad database. PLoS One. 2025;20(5):e0321625. pmid:40327659
- 58. Farahzadi R, Valipour B, Montazersaheb S, Fathi E. Targeting the stem cell niche micro-environment as therapeutic strategies in aging. Front Cell Dev Biol. 2023;11:1162136. pmid:37274742
- 59. Wang F, Ma X, Zhou M, Pan X, Ni J, Gao M, et al. Serum pigment epithelium-derived factor levels are independently correlated with the presence of coronary artery disease. Cardiovasc Diabetol. 2013;12:56. pmid:23547730
- 60. Qiu Z, Huang C, Xu C, Xu Y. Predictive role of neutrophil-to-lymphocyte ratio in metabolic syndrome: Meta-analysis of 70,937 individuals. BMC Endocr Disord. 2024;24(1):155. pmid:39164679
- 61. He Y, Li L, Zhou T, Yang H, Liu T, Hu H. Association Between Inflammation Indices Derived From Complete Blood Count and Coronary Artery Calcification. J Inflamm Res. 2025;18:3807–16. pmid:40103804
- 62. Jiang P, Chen J, Li J. Association of the systemic immune-inflammatory index and systemic inflammatory response index with all-cause and cardiovascular mortality in individuals with metabolic inflammatory syndrome. Eur J Med Res. 2025;30(1):444. pmid:40452063
- 63. Zhou Y, Wang Y, Wu T, Zhang A, Li Y. Association between obesity and systemic immune inflammation index, systemic inflammation response index among US adults: a population-based analysis. Lipids Health Dis. 2024;23(1):245. pmid:39127686
- 64. Buonacera A, Stancanelli B, Colaci M, Malatino L. Neutrophil to Lymphocyte Ratio: An Emerging Marker of the Relationships between the Immune System and Diseases. Int J Mol Sci. 2022;23(7):3636. pmid:35408994
- 65. Xiu J, Lin X, Chen Q, Yu P, Lu J, Yang Y, et al. The aggregate index of systemic inflammation (AISI): a novel predictor for hypertension. Front Cardiovasc Med. 2023;10:1163900. pmid:37265570