Machine learning-based prediction of metabolic dysfunction-associated steatotic liver disease using National Health and Nutrition Examination Survey (NHANES) data

Yong Zhang; Xiang Liu; Xingqiang Zhang; Yangfan Fei; Xiaoxu Li

doi:10.1371/journal.pone.0335656

Abstract

Objective

With the global increase in obesity rates and lifestyle changes, metabolic dysfunction-associated steatotic liver disease (MASLD) has become a prevalent chronic liver disorder, affecting approximately 25% of the global population. This disease can progress to cirrhosis and liver cancer, posing a significant threat to public health. To facilitate early diagnosis and intervention, this study aims to develop an efficient and reliable prediction model for MASLD using machine learning algorithm.

Methods

This study included 9,232 participants aged 20 years and older from the 2017–2020 National Health and Nutrition Examination Survey (NHANES). After excluding individuals with frequent alcohol consumption, hepatitis B/C infection, those lacking liver ultrasound examinations, and samples with missing data, a total of 2,460 subjects were ultimately included. The dataset was split into training and testing sets in an 80:20 ratio. Five machine learning algorithms—XGBoost, Random Forest (RF), and Logistic Regression (LR), among others—were utilized to build prediction models, while Recursive Feature Elimination (RFE) was employed to identify key predictive factors.

Results

Comparison of the five algorithms revealed that the XGBoost algorithm performed the best. Twelve key features were selected through Recursive Feature Elimination (RFE), and the model achieved an AUC of 0.8740 on the testing set, demonstrating excellent predictive accuracy and discriminative ability. SHAP plot analysis of the model showed that waist circumference, BMI, and other factors played a pivotal role in the prediction of MASLD.

Conclusion

The prediction model developed using the XGBoost algorithm and the 12 selected features demonstrates high efficiency and stability in assessing MASLD risk. This model offers innovative technical solutions and data-driven support for the clinical early identification of high-risk populations, with the potential to optimize and refine MASLD prevention and control strategies.

Citation: Zhang Y, Liu X, Zhang X, Fei Y, Li X (2025) Machine learning-based prediction of metabolic dysfunction-associated steatotic liver disease using National Health and Nutrition Examination Survey (NHANES) data. PLoS One 20(11): e0335656. https://doi.org/10.1371/journal.pone.0335656

Editor: Aleksandra Klisic, University of Montenegro-Faculty of Medicine, MONTENEGRO

Received: July 23, 2025; Accepted: October 13, 2025; Published: November 12, 2025

Copyright: © 2025 Zhang et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: All data are available from the the National Health and Nutrition Examination Survey (NHANES) database (https://wwwn.cdc.gov/nchs/nhanes/Default.aspx).

Funding: The author(s) received no specific funding for this work.

Competing interests: The authors have declared that no competing interests exist.

Introduction

Metabolic dysfunction-associated steatotic liver disease (MASLD), one of the most prevalent chronic liver disorders worldwide, is spreading at an alarming rate. Epidemiological studies indicate that MASLD affects approximately 25% of the global population [1]. Currently, the global prevalence of MASLD is approximately 30.2%, with regional variations of 30.9% in Asia, 16.1% in Australia, 30.2% in Europe, 29% in North America, and 34% in South America [2]. The disease spectrum ranges from simple steatosis to non-alcoholic steatohepatitis (NASH), and may progress to liver fibrosis, cirrhosis, and even hepatocellular carcinoma [3]. More critically, MASLD is closely associated with systemic diseases, including metabolic syndrome, type 2 diabetes (T2D), and cardiovascular diseases, significantly increasing all-cause mortality [4]. Cardiovascular diseases (CVD) are the most common cause of mortality in MASLD patients. MASLD and CVD share several common risk factors including obesity, insulin resistance, and T2D [5]. Importantly, NASH also increases the risk of extra-hepatic complications, especially cardiovascular diseases (CVD), which are among the most common causes of death in NASH patients [6]. Indeed, the alterations in hepatic lipid metabolism that lead to MASLD also drive the development of atherogenic dyslipidemia. Altered glucose metabolism and insulin resistance, also hallmarks of MASLD, can further exacerbate CVD risk in these patients [5]. As a result, MASLD has become a major public health concern that threatens global well-being [7].

Early diagnosis and intervention are critical for improving the prognosis of MASLD patients [8]. During the simple steatosis stage, lifestyle modifications and the management of metabolic risk factors can effectively reverse the disease and prevent its progression to more severe stages [9]. Nevertheless, MASLD remains significantly underdiagnosed, particularly in its early phases, due to several clinical challenges. Firstly, the majority of patients with simple steatosis are asymptomatic and exhibit normal conventional liver enzyme levels, reducing the likelihood of clinical detection. Secondly, there is a lack of standardized, widely accessible non-invasive screening protocols specifically targeting at-risk populations in primary care settings. Moreover, healthcare providers often prioritize managing established metabolic conditions such as diabetes or hypertension over screening for MASLD, further contributing to underdiagnosis. As a result, many patients are only diagnosed at advanced stages, missing the optimal window for intervention [10]. Traditional diagnostic methods, such as liver biopsy (the gold standard), are invasive, costly, and associated with potential complications, limiting their applicability in large-scale screening [11]. Non-invasive approaches, such as serological markers and imaging examinations, are widely used but suffer from limited sensitivity and specificity for early steatosis, failing to meet the requirements for early and accurate diagnosis [12].

In recent years, machine learning has demonstrated significant potential in disease prediction due to its powerful data processing and pattern recognition capabilities [13,14]. By uncovering hidden associations within large clinical datasets, machine learning algorithms can construct high-precision predictive models for early disease detection [15]. Although several exploratory studies have applied machine learning in MASLD research, existing models still have room for improvement in terms of feature optimization, algorithm selection, and clinical applicability [16]. Therefore, leveraging the large-scale and multi-dimensional data from the National Health and Nutrition Examination Survey (NHANES) [17], this study systematically compares the performance of multiple machine learning algorithms and optimizes the model structure using Recursive Feature Elimination (RFE) [18]. The resulting model offers several key advantages over previous approaches: it relies on easily obtainable clinical indicators, which significantly reduces prediction costs and enhances feasibility for widespread implementation; furthermore, the model improves clinical interpretability by identifying the most influential risk factors. In terms of interpretability, most previous ML models for MASLD (such as traditional deep learning and early tree-based models) often suffer from a “black-box nature,” making it difficult to quantify the impact of features on prediction results. This limits clinicians’ trust in model outputs and hinders practical application. In this study, SHAP (SHapley Additive exPlanations) analysis was integrated into the XGBoost model. This not only clarifies the predictive contribution of core features (e.g., waist circumference, ALT) but also visually presents the direction of each indicator’s impact on disease risk through visualization results (e.g., how increased waist circumference elevates the probability of MASLD). This aligns the model’s decision-making logic with clinical understanding of MASLD’s pathological mechanisms (e.g., central obesity driving disease development), significantly enhancing clinical acceptability. In terms of accessibility, some previous models rely on advanced imaging features (e.g., MRI fat quantification parameters) or multi-omics data. Such indicators require advanced equipment or complex detection processes, which are difficult to obtain in primary care settings and thus limit practicality. This study only uses routinely collectible indicators, including demographic information, results of physical examinations (e.g., waist circumference), and basic biochemical indicators (e.g., ALT). No additional testing costs or equipment investment are needed, allowing primary care institutions to apply the model directly using routine clinical data. This greatly lowers the threshold for implementation and better meets the clinical needs of large-scale preliminary screening.The goal is to develop an efficient and reliable MASLD prediction model, offering new strategies for the early identification and precise prevention of the disease.

Methods

Data source and study design

This study utilized data from the National Health and Nutrition Examination Survey (NHANES) database (2017–2020), a continuous, nationally representative survey program that systematically collects multi-dimensional data from the non-institutionalized U.S. population through questionnaires, physical examinations, and laboratory tests, providing invaluable resources for public health research [19]. For the prediction of Metabolic dysfunction-associated steatotic liver disease (MASLD), strict inclusion and exclusion criteria were applied: initial screening used the liver controlled attenuation parameter (CAP) from transient elastography (FibroScan) to diagnose samples with CAP ≥ 302 dB/m as fatty liver [20]. Subsequent exclusions included frequent drinkers (≥2 times/week), individuals under 20 years of age, hepatitis B/C virus carriers, and those without liver ultrasound results. To address biases induced by missing values, a case-wise deletion approach was employed, removing any subjects with missing data on key indicators to ensure data integrity. Following this, 2,460 eligible participants remained after screening and data cleaning. Using Python 3.9, the dataset was divided into an 80:20 training set (n = 1,968) and test set (n = 492), with 24 candidate features—including demographics (gender, age, race, education), BMI, and liver function indices—used to construct a high-quality research cohort through strict implementation of the inclusion criteria, thus laying a solid foundation for subsequent model training and validation.

Features

This study incorporated a wide range of multi-dimensional feature variables to construct the MASLD prediction model. Demographic characteristics included gender (male, female), age, race (Mexican American, other Hispanic, non-Hispanic White, non-Hispanic Black, other races), and educational attainment (Less Than 9th Grade, 9–11th Grade, High School Graduate, Some College or AA degree, College Graduate or above), which help to capture population-based differences in disease susceptibility. Physical measurement indicators included body mass index (BMI, kg/m²), waist circumference (cm), systolic blood pressure (SBP, mmHg), and diastolic blood pressure (DBP, mmHg), which were used to assess obesity and blood pressure status—both of which are significant risk factors for MASLD. Biochemical indicators included liver function markers such as alanine aminotransferase (ALT, IU/L), alkaline phosphatase (ALP, IU/L), and aspartate aminotransferase (AST, IU/L), as well as metabolic-related indices such as blood urea nitrogen (BUN, mmol/L), creatine phosphokinase (CPK, IU/L), and globulin (g/L). These biochemical markers reflect the body’s physiological and metabolic status from multiple perspectives and possess potential predictive value for MASLD development.

ML algorithms

This study utilized five machine learning algorithms [21] to develop MASLD prediction models: Logistic Regression (LR), Random Forest (RF), LightGBM (LGBM), CatBoost, and XGBoost. As a classic linear classification algorithm, LR features a simple structure and strong interpretability, often serving as a benchmark for evaluating the performance of other algorithms [22]. RF reduces variance by integrating multiple decision trees, enhancing model stability and generalization ability, and excels at handling high-dimensional, nonlinear data [23]. LGBM optimizes computational efficiency through histogram-based algorithms, offering low memory consumption, fast training speed, and high prediction accuracy for large-scale datasets [24]. CatBoost is optimized for categorical variables, allowing for efficient processing of large-scale categorical data without the need for complex feature engineering [25]. XGBoost, an optimized implementation of Gradient Boosting Decision Trees (GBDT), supports parallel computing and regularization, demonstrating exceptional performance in handling nonlinear data and complex relationships [26]. The finally selected XGBoost parameters (learning_rate = 0.02, max_depth = 4, min_child_weight = 5, subsample = 0.8, reg_alpha = 2, reg_lambda = 20, etc.) correspond to the highest average validation AUC (0.874).

In the model evaluation phase, the area under the receiver operating characteristic curve (AUC) was used as the primary metric, complemented by a comprehensive analysis of accuracy, sensitivity (recall), specificity, and other performance indicators. To optimize algorithm performance, grid search was applied to all models for hyperparameter tuning, systematically exploring parameter combinations (e.g., learning rate, tree depth, and number of estimators for boosting algorithms) to determine the optimal configuration. After tuning, the XGBoost algorithm achieved an AUC of 0.9020 in the training set and 0.8738 in the test set, significantly outperforming other algorithms and thereby being selected as the core prediction model. Subsequent feature selection using the Recursive Feature Elimination (RFE) algorithm retained 12 key features for the final model, which achieved AUC values of 0.8960 and 0.8740 for the training and test sets, respectively. These results fully validate the model’s effectiveness and stability.

Statistical analyses

This study employed a rigorous statistical analysis approach, using appropriate statistical tests for different data types to ensure scientific validity and reliability. For categorical variables such as gender, race, and educational attainment, chi-square tests were used to evaluate distribution differences between the MASLD and non-MASLD groups, with results presented as percentages (%) representing the proportion of each categorical feature in the respective groups.

For continuous variables, normality was first assessed using the Shapiro-Wilk test and the Kolmogorov-Smirnov test. If the data conformed to a normal distribution, independent sample t-tests were applied to compare mean differences between groups, with results expressed as mean ± standard deviation (Mean ± SD). For non-normally distributed data, the Wilcoxon Mann-Whitney U test was used for intergroup comparisons, and data characteristics were reported as median (interquartile range [IQR]). In this study, a two-sided p-value of < 0.05 was set as the threshold for statistical significance, where values below this cutoff indicated statistically significant differences between the groups.

Results

Baseline characteristics

A total of 2,460 participants were included in this study, with 715 (29.1%) in the MASLD group and 1,745 (70.9%) in the non-MASLD group. Baseline characteristic analysis revealed significant differences (P < 0.05) between the two groups across multiple indicators (Table 1). In terms of demographic features, the MASLD group had a significantly higher proportion of males (51.75%) compared to the non-MASLD group (42.52%, P < 0.001) and a greater mean age (median: 55.0 years vs. 49.0 years), with racial distribution also showing statistically significant differences (P < 0.001). Regarding educational attainment, only 19.58% of the MASLD group held a bachelor’s degree or higher, compared to 25.16% in the non-MASLD group (P = 0.031).

Download:

Table 1. Baseline characteristics of participants by MASLD status.

https://doi.org/10.1371/journal.pone.0335656.t001

For physical measurements and biochemical parameters, the MASLD group exhibited significantly higher values in BMI (median: 34.2 kg/m² vs. 27.2 kg/m²), waist circumference (114.0 cm vs. 94.7 cm), blood pressure, liver function markers (e.g., ALT, GGT), and metabolic indices (e.g., SUA, triglycerides, CRP, GLU) (all P < 0.001). Conversely, HDL levels were significantly lower in the MASLD group (1.14 mmol/L vs. 1.37 mmol/L, P < 0.001). By contrast, indices such as CPK, LDH, and TBIL showed no significant differences between the groups (P > 0.05).These baseline differences indicate that factors such as gender, age, obesity, and metabolic disorders are closely associated with MASLD pathogenesis, providing critical evidence for feature selection and predictive analysis in subsequent machine learning models.

Performance evaluation of classification models

In the test set evaluation of the Metabolic dysfunction-associated steatotic liver disease (MASLD) prediction model (Fig 1), receiver operating characteristic (ROC) curves were plotted to compare the performance of Logistic Regression (LR), Random Forest (RF), LightGBM (LGBM), CatBoost, and XGBoost models. The results demonstrated that the XGBoost model outperformed the others, achieving an AUC of 0.8738, followed by CatBoost (0.8625), RF (0.8658), LGBM (0.8633), and LR (0.8541). The ROC curves for all models were positioned above the diagonal, indicating predictive capabilities, with XGBoost showing the highest accuracy in distinguishing MASLD cases from non-cases.

Download:

Fig 1. Comparison of receiver operating characteristic (ROC) curves of five machine learning models in non - alcoholic fatty liver disease (NAFLD) prediction.

https://doi.org/10.1371/journal.pone.0335656.g001

Table 2 presents a comprehensive evaluation of five machine learning algorithms—Logistic Regression (LR), Random Forest (RF), LightGBM (LGBM), CatBoost, and XGBoost—using various metrics, including the area under the receiver operating characteristic curve (AUC), accuracy, precision, sensitivity, specificity, and F1 score. The results indicated that the XGBoost algorithm exhibited the best performance in both the training and test sets, with an AUC of 0.9020 in the training set and 0.8738 in the test set. It also outperformed the other algorithms in test set accuracy (0.7988), precision (0.6170), specificity (0.7937), and F1 score (0.7009). In comparison, the test set AUCs for LR, RF, LGBM, and CatBoost ranged from 0.8541 to 0.8658, with some models displaying imbalances between sensitivity and specificity. The XGBoost algorithm’s balanced performance across multiple metrics, along with its notable advantages, underscores its superior applicability and reliability for Metabolic dysfunction-associated steatotic liver disease (MASLD) prediction, providing robust support for the development of efficient prediction models.

Download:

Table 2. Performance of models by different algorithms.

https://doi.org/10.1371/journal.pone.0335656.t002

Feature selection and final prediction model

Feature selection is a critical step in enhancing model performance and generalization ability for metabolic dysfunction-associated steatotic liver disease (MASLD) prediction. This study initially included 24 candidate features, covering multi-dimensional variables such as gender, age, race, educational attainment, BMI, and liver function indices. To reduce the risk of overfitting caused by feature redundancy, Recursive Feature Elimination (RFE) was applied based on the XGBoost model for feature screening.

The RFE algorithm operates iteratively: in each iteration, it removes the feature with the lowest contribution to the model, re-trains the model, and evaluates its performance until the preset number of features or performance criteria are met. After feature screening via RFE (Fig 2), a total of 12 key features were retained, including Age, Race, Body Mass Index (BMI), Waist Circumference, Alanine Transaminase (ALT), Blood Urea Nitrogen (BUN), Globulin, Gamma-Glutamyl Transpeptidase (GGT), High-Density Lipoprotein (HDL), Triglycerides, C-Reactive Protein (CRP), and Glucose (GLU). These features exhibit clear biological significance in MASLD pathogenesis and were significantly associated with MASLD in univariate analysis (P < 0.05).

Download:

Fig 2. Performance of the XGBoost model based on 12 selected variables for NAFLD prediction.

https://doi.org/10.1371/journal.pone.0335656.g002

An XGBoost prediction model was reconstructed based on the selected features, and the results showed that the model maintained strong performance on both the training set and the test set (Fig 3), with AUC values of 0.8960 and 0.8740, respectively. Notably, the test set AUC of the optimized model (0.8740) was nearly identical to that of the initial model (0.8738), suggesting that the feature combination effectively reduced model complexity while maintaining high prediction accuracy. The model’s AUC is 0.874 with a 95% confidence interval of 0.842–0.906. In addition, the model’s accuracy, precision, sensitivity, specificity, and F1 score on the test set were 0.7907, 0.6010, 0.8322, 0.7736, and 0.6979, respectively (Table 3), further confirming the effectiveness of the feature selection and algorithm combination. These results demonstrate that the selected feature combination accurately captures the key risk factors of MASLD, providing robust data support for efficient and reliable disease prediction in clinical practice.

Download:

Table 3. Evaluation metrics of the XGBoost model with 12 selected features on training and test sets.

https://doi.org/10.1371/journal.pone.0335656.t003

Download:

Fig 3. ROC curves of the training set and the testing set under the XGBoost algorithm.

https://doi.org/10.1371/journal.pone.0335656.g003

Post-hoc interpretation results

This study utilized SHAP (SHapley Additive exPlanations) value visualization to analyze in detail the influence of the 12 selected features on the output of the XGBoost prediction model (Fig 4). The results revealed that waist circumference, blood glucose (GLU), and triglycerides were core features influencing model predictions, as evidenced by their wide distribution ranges and large absolute SHAP values (as shown in the figure). SHAP value analysis demonstrated a significant positive correlation: as waist circumference, blood glucose, and triglyceride levels increased, the predicted probability of MASLD increased substantially. Specifically, high values of waist circumference and triglycerides (represented by the red distribution) were directly associated with higher MASLD prediction probabilities (positive SHAP values). Although blood glucose showed complex interactions in certain samples—where extremely high values occasionally reduced prediction probabilities—the overall trend indicated a dominant positive contribution to MASLD risk.

Download:

Fig 4. SHAP Values of 12 Selected Predictors for NAFLD in the XGBoost Final Model.

https://doi.org/10.1371/journal.pone.0335656.g004

To intuitively present the effect size, we supplemented the SHAP Bar chart of core features (Fig 5), using “mean absolute SHAP value” to quantify the average impact magnitude of each feature on the model’s prediction results. As shown in the figure, waist circumference has the largest mean absolute SHAP value (approximately 0.7), indicating that it has the strongest effect size—consistent with the mechanism that “central obesity is the core driving factor of MASLD”. The effect sizes of metabolic indicators such as blood glucose and triglycerides decrease in turn (approximately 0.35 and 0.22, respectively), clearly reflecting hierarchical differences and enhancing clinical interpretability.

Download:

Fig 5. SHAP Feature Importance: Mean Absolute SHAP Values in the XGBoost Final Model for Predicting MASLD.

https://doi.org/10.1371/journal.pone.0335656.g005

Discussion

This study developed a prediction model for Metabolic dysfunction-associated steatotic liver disease (MASLD) using machine learning algorithms based on data from the National Health and Nutrition Examination Survey (NHANES), with findings of substantial significance for clinical practice and future research.

In model construction, the XGBoost algorithm identified 12 core features via the Recursive Feature Elimination (RFE) method, including metabolic and liver function indicators that align closely with the pathogenesis of MASLD. Among these, metabolic indicators such as waist circumference, triglycerides, and blood glucose had significant impacts in SHAP value analysis, highlighting that metabolic disorders are key driving factors for MASLD [27–29]. This aligns with existing knowledge that central obesity, dyslipidemia, and glycometabolic abnormalities accelerate the progression of MASLD by promoting hepatic fat deposition, inflammatory responses, and insulin resistance [30]. The final XGBoost model achieved an AUC of 0.8740 on the test set, significantly outperforming other algorithms. These results indicate that the model effectively captures MASLD risk features and offers a quantitative tool for the early identification of high-risk populations in clinical settings.

From a clinical application perspective, the core features selected in this study are all routine diagnostic indicators that are easily accessible. The prediction model built on these features demonstrates high practicality and generalizability. Clinicians can monitor patients’ waist circumference, blood glucose, liver function, and other relevant indicators, and combine these measurements with the model’s predictions to implement early interventions for individuals at high risk of MASLD. Such interventions could include dietary adjustments and increased physical activity, which may help delay disease progression and reduce the risk of severe complications, such as cirrhosis and liver cancer [31,32].

However, this study has certain limitations. On one hand, it relies on liver ultrasound for MASLD diagnosis, and the absence of liver biopsy as a pathological gold standard may result in missed diagnoses of some mild-to-moderate cases, potentially affecting the model’s accuracy in identifying early-stage disease [33]. On the other hand, the study sample is primarily drawn from the U.S. population, whose genetic background, lifestyle, and environmental exposures exhibit regional specificity. The model’s generalizability to other racial or regional populations still requires further validation.

Future research could enhance the model’s applicability to different populations by incorporating multi-center and transnational cohort data, alongside multimodal information such as liver elastography and metabolomics. Additionally, long-term follow-up studies could dynamically evaluate the model’s efficacy in predicting disease progression, thereby providing stronger support for the precise prevention and control of MASLD.

Conclusions

This study developed a Metabolic dysfunction-associated steatotic liver disease (MASLD) prediction model using machine learning algorithms based on nationally representative data from the National Health and Nutrition Examination Survey (NHANES), providing an innovative approach for early disease risk identification. Among the five evaluated algorithms—including XGBoost, Random Forest, and Logistic Regression—the XGBoost model demonstrated superior predictive performance. Through Recursive Feature Elimination (RFE), 12 core predictors were selected from 24 candidate variables, with metabolic indicators (e.g., waist circumference, triglycerides, blood glucose) and liver function parameters (e.g., alanine aminotransferase, gamma-glutamyl transferase) playing pivotal roles in model construction. The model achieved an area under the receiver operating characteristic curve (AUC) of 0.8740 on the test set, significantly outperforming other algorithms and exhibiting robust discriminative and predictive capabilities for MASLD risk. SHAP value analysis further emphasized the dominant influence of metabolic features, such as waist circumference and triglycerides, aligning closely with the metabolic dysfunction-driven pathogenesis of MASLD and offering a quantitative basis for clinical prioritization of key risk factors.

Despite these strengths, the study has several limitations. First, the reliance on liver ultrasound for MASLD diagnosis—without the pathological gold standard of liver biopsy—may lead to missed diagnoses of mild-to-moderate cases. Second, the predominantly U.S.-based sample population limits the generalizability of the findings to other racial or geographic groups. Future research could improve the model’s universality and predictive accuracy by incorporating multi-center, cross-national cohort data and integrating multimodal information, such as liver elastography and metabolomics. Such advancements would strengthen global efforts toward early MASLD prevention and control by providing more robust technical support.

Supporting information

S1 Data.

https://doi.org/10.1371/journal.pone.0335656.s001

(ZIP)

References

1. Zhang X, Linden S, Levesley CR, He X, Yang Z, Barnet SD, et al. Projected Trends in Metabolic Dysfunction-Associated Steatotic Liver Disease Mortality Through 2040. JAMA Netw Open. 2025;8(6):e2516367. pmid:40526381
- View Article
- PubMed/NCBI
- Google Scholar
2. Pouwels S, Sakran N, Graham Y, Leal A, Pintar T, Yang W, et al. Non-alcoholic fatty liver disease (NAFLD): a review of pathophysiology, clinical management and effects of weight loss. BMC Endocr Disord. 2022;22(1):63. pmid:35287643
- View Article
- PubMed/NCBI
- Google Scholar
3. Chen K, Ma J, Jia X, Ai W, Ma Z, Pan Q. Advancing the understanding of NAFLD to hepatocellular carcinoma development: From experimental models to humans. Biochim Biophys Acta Rev Cancer. 2019;1871(1):117–25. pmid:30528647
- View Article
- PubMed/NCBI
- Google Scholar
4. Cotter TG, Rinella M. Nonalcoholic Fatty Liver Disease 2020: The State of the Disease. Gastroenterology. 2020;158(7):1851–64. pmid:32061595
- View Article
- PubMed/NCBI
- Google Scholar
5. Rinella ME, Neuschwander-Tetri BA, Siddiqui MS, Abdelmalek MF, Caldwell S, Barb D, et al. AASLD Practice Guidance on the clinical assessment and management of nonalcoholic fatty liver disease. Hepatology. 2023;77(5):1797–835. pmid:36727674
- View Article
- PubMed/NCBI
- Google Scholar
6. Saiman Y, Duarte-Rojo A, Rinella ME. Fatty Liver Disease: Diagnosis and Stratification. Annu Rev Med. 2022;73:529–44. pmid:34809436
- View Article
- PubMed/NCBI
- Google Scholar
7. Nassir F. NAFLD: Mechanisms, Treatments, and Biomarkers. Biomolecules. 2022;12(6):824. pmid:35740949
- View Article
- PubMed/NCBI
- Google Scholar
8. Sheka AC, Adeyi O, Thompson J, Hameed B, Crawford PA, Ikramuddin S. Nonalcoholic Steatohepatitis: A Review. JAMA. 2020;323(12):1175–83. pmid:32207804
- View Article
- PubMed/NCBI
- Google Scholar
9. Masoodi M, Gastaldelli A, Hyötyläinen T, Arretxe E, Alonso C, Gaggini M, et al. Metabolomics and lipidomics in NAFLD: biomarkers and non-invasive diagnostic tests. Nat Rev Gastroenterol Hepatol. 2021;18(12):835–56. pmid:34508238
- View Article
- PubMed/NCBI
- Google Scholar
10. Israelsen M, Francque S, Tsochatzis EA, Krag A. Steatotic liver disease. Lancet. 2024;404(10464):1761–78. pmid:39488409
- View Article
- PubMed/NCBI
- Google Scholar
11. Gong H, You X, Jin M, Meng Y, Zhang H, Yang S, et al. Graph neural network and multi-data heterogeneous networks for microbe-disease prediction. Front Microbiol. 2022;13:1077111. pmid:36620040
- View Article
- PubMed/NCBI
- Google Scholar
12. Zhang X, Xie J, You X, Gong H. Risk factors and drug discovery for cognitive impairment in type 2 diabetes mellitus using artificial intelligence interpretation and graph neural networks. Front Endocrinol (Lausanne). 2023;14:1213711. pmid:37693358
- View Article
- PubMed/NCBI
- Google Scholar
13. Liang D, Wang L, Zhong P, Lin J, Chen L, Chen Q, et al. Perspective: Global Burden of Iodine Deficiency: Insights and Projections to 2050 Using XGBoost and SHAP. Adv Nutr. 2025;16(3):100384. pmid:39914495
- View Article
- PubMed/NCBI
- Google Scholar
14. Chang D, Truong E, Mena EA, Pacheco F, Wong M, Guindi M, et al. Machine learning models are superior to noninvasive tests in identifying clinically significant stages of NAFLD and NAFLD-related cirrhosis. Hepatology. 2023;77(2):546–57. pmid:35809234
- View Article
- PubMed/NCBI
- Google Scholar
15. Liu Y, Yang S, Wei J. The relationship between visceral adiposity index and estimated pulse wave velocity: insights from NHANES database. Front Nutr. 2025;12:1544084. pmid:40568423
- View Article
- PubMed/NCBI
- Google Scholar
16. Al-Syouf R, Aljarrah OY, Bani-Hani R, Alma’aitah A. Ensemble Machine Learning Models Utilizing a Hybrid Recursive Feature Elimination (RFE) Technique for Detecting GPS Spoofing Attacks Against Unmanned Aerial Vehicles. Sensors (Basel). 2025;25(8):2388. pmid:40285076
- View Article
- PubMed/NCBI
- Google Scholar
17. Johnson CL, Dohrmann SM, Kerckove V d, Diallo MS, Clark J, Mohadjer LK, et al. National Health and Nutrition Examination Survey: National Youth Fitness Survey Estimation Procedures, 2012. Vital and Health Statistics Series 2, Data Evaluation and Methods Research. 2014;(168):1–25.
- View Article
- Google Scholar
18. Liu L, Lin J, Liu L, Gao J, Xu G, Yin M, et al. Automated machine learning models for nonalcoholic fatty liver disease assessed by controlled attenuation parameter from the NHANES 2017-2020. Digit Health. 2024;10:20552076241272535. pmid:39119551
- View Article
- PubMed/NCBI
- Google Scholar
19. Guo J, Zhang R, Dong R, Yang F, Wang Y, Miao W. Interpretable Machine Learning Model for Predicting the Prognosis of Guillain-Barré Syndrome Patients. J Inflamm Res. 2024;17:5901–13. pmid:39247840
- View Article
- PubMed/NCBI
- Google Scholar
20. Stoltzfus JC. Logistic regression: a brief primer. Acad Emerg Med. 2011;18(10):1099–104. pmid:21996075
- View Article
- PubMed/NCBI
- Google Scholar
21. Hu J, Szymczak S. A review on longitudinal data analysis with random forest. Brief Bioinform. 2023;24(2):bbad002. pmid:36653905
- View Article
- PubMed/NCBI
- Google Scholar
22. Wang W, Sheng R, Liao S, Wu Z, Wang L, Liu C, et al. LightGBM is an Effective Predictive Model for Postoperative Complications in Gastric Cancer: A Study Integrating Radiomics with Ensemble Learning. J Imaging Inform Med. 2024;37(6):3034–48. pmid:38940888
- View Article
- PubMed/NCBI
- Google Scholar
23. Ajin RS, Segoni S, Fanti R. Optimization of SVR and CatBoost models using metaheuristic algorithms to assess landslide susceptibility. Sci Rep. 2024;14(1):24851. pmid:39438526
- View Article
- PubMed/NCBI
- Google Scholar
24. Gong H, Wang M, Zhang H, Elahe MF, Jin M. An Explainable AI Approach for the Rapid Diagnosis of COVID-19 Using Ensemble Learning Algorithms. Front Public Health. 2022;10:874455. pmid:35801239
- View Article
- PubMed/NCBI
- Google Scholar
25. Chen Q, Hu P, Hou X, Sun Y, Jiao M, Peng L, et al. Association between triglyceride-glucose related indices and mortality among individuals with non-alcoholic fatty liver disease or metabolic dysfunction-associated steatotic liver disease. Cardiovasc Diabetol. 2024;23(1):232. pmid:38965572
- View Article
- PubMed/NCBI
- Google Scholar
26. Golabi P, Paik JM, Kumar A, Al Shabeeb R, Eberly KE, Cusi K, et al. Nonalcoholic fatty liver disease (NAFLD) and associated mortality in individuals with type 2 diabetes, pre-diabetes, metabolically unhealthy, and metabolically healthy individuals in the United States. Metabolism. 2023;146:155642. pmid:37380016
- View Article
- PubMed/NCBI
- Google Scholar
27. Angelini G, Panunzi S, Castagneto-Gissey L, Pellicanò F, De Gaetano A, Pompili M, et al. Accurate liquid biopsy for the diagnosis of non-alcoholic steatohepatitis and liver fibrosis. Gut. 2023;72(2):392–403. pmid:35820779
- View Article
- PubMed/NCBI
- Google Scholar
28. Handelsman Y, Anderson JE, Bakris GL, Ballantyne CM, Bhatt DL, Bloomgarden ZT, et al. DCRM 2.0: Multispecialty practice recommendations for the management of diabetes, cardiorenal, and metabolic diseases. Metabolism. 2024;159:155931. pmid:38852020
- View Article
- PubMed/NCBI
- Google Scholar
29. Benedé-Ubieto R, Cubero FJ, Nevzorova YA. Breaking the barriers: the role of gut homeostasis in Metabolic-Associated Steatotic Liver Disease (MASLD). Gut Microbes. 2024;16(1):2331460. pmid:38512763
- View Article
- PubMed/NCBI
- Google Scholar
30. Blais P, Husain N, Kramer JR, Kowalkowski M, El-Serag H, Kanwal F. Nonalcoholic fatty liver disease is underrecognized in the primary care setting. Am J Gastroenterol. 2015;110(1):10–4. pmid:24890441
- View Article
- PubMed/NCBI
- Google Scholar
31. Abdelhameed F, Kite C, Lagojda L, Dallaway A, Chatha KK, Chaggar SS, et al. Non-invasive Scores and Serum Biomarkers for Fatty Liver in the Era of Metabolic Dysfunction-associated Steatotic Liver Disease (MASLD): A Comprehensive Review From NAFLD to MAFLD and MASLD. Curr Obes Rep. 2024;13(3):510–31. pmid:38809396
- View Article
- PubMed/NCBI
- Google Scholar
32. Paternostro R, Trauner M. Current treatment of non-alcoholic fatty liver disease. J Intern Med. 2022;292(2):190–204.
- View Article
- Google Scholar
33. Bedossa P. Diagnosis of non-alcoholic fatty liver disease/non-alcoholic steatohepatitis: why liver biopsy is essential. Liver international: official journal of the International Association for the Study of the Liver. 2018;38(Suppl 1):64–6.
- View Article
- Google Scholar

[ref1] 1. Zhang X, Linden S, Levesley CR, He X, Yang Z, Barnet SD, et al. Projected Trends in Metabolic Dysfunction-Associated Steatotic Liver Disease Mortality Through 2040. JAMA Netw Open. 2025;8(6):e2516367. pmid:40526381
View Article
PubMed/NCBI
Google Scholar

[2] View Article

[3] PubMed/NCBI

[4] Google Scholar

[ref2] 2. Pouwels S, Sakran N, Graham Y, Leal A, Pintar T, Yang W, et al. Non-alcoholic fatty liver disease (NAFLD): a review of pathophysiology, clinical management and effects of weight loss. BMC Endocr Disord. 2022;22(1):63. pmid:35287643
View Article
PubMed/NCBI
Google Scholar

[6] View Article

[7] PubMed/NCBI

[8] Google Scholar

[ref3] 3. Chen K, Ma J, Jia X, Ai W, Ma Z, Pan Q. Advancing the understanding of NAFLD to hepatocellular carcinoma development: From experimental models to humans. Biochim Biophys Acta Rev Cancer. 2019;1871(1):117–25. pmid:30528647
View Article
PubMed/NCBI
Google Scholar

[10] View Article

[11] PubMed/NCBI

[12] Google Scholar

[ref4] 4. Cotter TG, Rinella M. Nonalcoholic Fatty Liver Disease 2020: The State of the Disease. Gastroenterology. 2020;158(7):1851–64. pmid:32061595
View Article
PubMed/NCBI
Google Scholar

[14] View Article

[15] PubMed/NCBI

[16] Google Scholar

[ref5] 5. Rinella ME, Neuschwander-Tetri BA, Siddiqui MS, Abdelmalek MF, Caldwell S, Barb D, et al. AASLD Practice Guidance on the clinical assessment and management of nonalcoholic fatty liver disease. Hepatology. 2023;77(5):1797–835. pmid:36727674
View Article
PubMed/NCBI
Google Scholar

[18] View Article

[19] PubMed/NCBI

[20] Google Scholar

[ref6] 6. Saiman Y, Duarte-Rojo A, Rinella ME. Fatty Liver Disease: Diagnosis and Stratification. Annu Rev Med. 2022;73:529–44. pmid:34809436
View Article
PubMed/NCBI
Google Scholar

[22] View Article

[23] PubMed/NCBI

[24] Google Scholar

[ref7] 7. Nassir F. NAFLD: Mechanisms, Treatments, and Biomarkers. Biomolecules. 2022;12(6):824. pmid:35740949
View Article
PubMed/NCBI
Google Scholar

[26] View Article

[27] PubMed/NCBI

[28] Google Scholar

[ref8] 8. Sheka AC, Adeyi O, Thompson J, Hameed B, Crawford PA, Ikramuddin S. Nonalcoholic Steatohepatitis: A Review. JAMA. 2020;323(12):1175–83. pmid:32207804
View Article
PubMed/NCBI
Google Scholar

[30] View Article

[31] PubMed/NCBI

[32] Google Scholar

[ref9] 9. Masoodi M, Gastaldelli A, Hyötyläinen T, Arretxe E, Alonso C, Gaggini M, et al. Metabolomics and lipidomics in NAFLD: biomarkers and non-invasive diagnostic tests. Nat Rev Gastroenterol Hepatol. 2021;18(12):835–56. pmid:34508238
View Article
PubMed/NCBI
Google Scholar

[34] View Article

[35] PubMed/NCBI

[36] Google Scholar

[ref10] 10. Israelsen M, Francque S, Tsochatzis EA, Krag A. Steatotic liver disease. Lancet. 2024;404(10464):1761–78. pmid:39488409
View Article
PubMed/NCBI
Google Scholar

[38] View Article

[39] PubMed/NCBI

[40] Google Scholar

[ref11] 11. Gong H, You X, Jin M, Meng Y, Zhang H, Yang S, et al. Graph neural network and multi-data heterogeneous networks for microbe-disease prediction. Front Microbiol. 2022;13:1077111. pmid:36620040
View Article
PubMed/NCBI
Google Scholar

[42] View Article

[43] PubMed/NCBI

[44] Google Scholar

[ref12] 12. Zhang X, Xie J, You X, Gong H. Risk factors and drug discovery for cognitive impairment in type 2 diabetes mellitus using artificial intelligence interpretation and graph neural networks. Front Endocrinol (Lausanne). 2023;14:1213711. pmid:37693358
View Article
PubMed/NCBI
Google Scholar

[46] View Article

[47] PubMed/NCBI

[48] Google Scholar

[ref13] 13. Liang D, Wang L, Zhong P, Lin J, Chen L, Chen Q, et al. Perspective: Global Burden of Iodine Deficiency: Insights and Projections to 2050 Using XGBoost and SHAP. Adv Nutr. 2025;16(3):100384. pmid:39914495
View Article
PubMed/NCBI
Google Scholar

[50] View Article

[51] PubMed/NCBI

[52] Google Scholar

[ref14] 14. Chang D, Truong E, Mena EA, Pacheco F, Wong M, Guindi M, et al. Machine learning models are superior to noninvasive tests in identifying clinically significant stages of NAFLD and NAFLD-related cirrhosis. Hepatology. 2023;77(2):546–57. pmid:35809234
View Article
PubMed/NCBI
Google Scholar

[54] View Article

[55] PubMed/NCBI

[56] Google Scholar

[ref15] 15. Liu Y, Yang S, Wei J. The relationship between visceral adiposity index and estimated pulse wave velocity: insights from NHANES database. Front Nutr. 2025;12:1544084. pmid:40568423
View Article
PubMed/NCBI
Google Scholar

[58] View Article

[59] PubMed/NCBI

[60] Google Scholar

[ref16] 16. Al-Syouf R, Aljarrah OY, Bani-Hani R, Alma’aitah A. Ensemble Machine Learning Models Utilizing a Hybrid Recursive Feature Elimination (RFE) Technique for Detecting GPS Spoofing Attacks Against Unmanned Aerial Vehicles. Sensors (Basel). 2025;25(8):2388. pmid:40285076
View Article
PubMed/NCBI
Google Scholar

[62] View Article

[63] PubMed/NCBI

[64] Google Scholar

[ref17] 17. Johnson CL, Dohrmann SM, Kerckove V d, Diallo MS, Clark J, Mohadjer LK, et al. National Health and Nutrition Examination Survey: National Youth Fitness Survey Estimation Procedures, 2012. Vital and Health Statistics Series 2, Data Evaluation and Methods Research. 2014;(168):1–25.
View Article
Google Scholar

[66] View Article

[67] Google Scholar

[ref18] 18. Liu L, Lin J, Liu L, Gao J, Xu G, Yin M, et al. Automated machine learning models for nonalcoholic fatty liver disease assessed by controlled attenuation parameter from the NHANES 2017-2020. Digit Health. 2024;10:20552076241272535. pmid:39119551
View Article
PubMed/NCBI
Google Scholar

[69] View Article

[70] PubMed/NCBI

[71] Google Scholar

[ref19] 19. Guo J, Zhang R, Dong R, Yang F, Wang Y, Miao W. Interpretable Machine Learning Model for Predicting the Prognosis of Guillain-Barré Syndrome Patients. J Inflamm Res. 2024;17:5901–13. pmid:39247840
View Article
PubMed/NCBI
Google Scholar

[73] View Article

[74] PubMed/NCBI

[75] Google Scholar

[ref20] 20. Stoltzfus JC. Logistic regression: a brief primer. Acad Emerg Med. 2011;18(10):1099–104. pmid:21996075
View Article
PubMed/NCBI
Google Scholar

[77] View Article

[78] PubMed/NCBI

[79] Google Scholar

[ref21] 21. Hu J, Szymczak S. A review on longitudinal data analysis with random forest. Brief Bioinform. 2023;24(2):bbad002. pmid:36653905
View Article
PubMed/NCBI
Google Scholar

[81] View Article

[82] PubMed/NCBI

[83] Google Scholar

[ref22] 22. Wang W, Sheng R, Liao S, Wu Z, Wang L, Liu C, et al. LightGBM is an Effective Predictive Model for Postoperative Complications in Gastric Cancer: A Study Integrating Radiomics with Ensemble Learning. J Imaging Inform Med. 2024;37(6):3034–48. pmid:38940888
View Article
PubMed/NCBI
Google Scholar

[85] View Article

[86] PubMed/NCBI

[87] Google Scholar

[ref23] 23. Ajin RS, Segoni S, Fanti R. Optimization of SVR and CatBoost models using metaheuristic algorithms to assess landslide susceptibility. Sci Rep. 2024;14(1):24851. pmid:39438526
View Article
PubMed/NCBI
Google Scholar

[89] View Article

[90] PubMed/NCBI

[91] Google Scholar

[ref24] 24. Gong H, Wang M, Zhang H, Elahe MF, Jin M. An Explainable AI Approach for the Rapid Diagnosis of COVID-19 Using Ensemble Learning Algorithms. Front Public Health. 2022;10:874455. pmid:35801239
View Article
PubMed/NCBI
Google Scholar

[93] View Article

[94] PubMed/NCBI

[95] Google Scholar

[ref25] 25. Chen Q, Hu P, Hou X, Sun Y, Jiao M, Peng L, et al. Association between triglyceride-glucose related indices and mortality among individuals with non-alcoholic fatty liver disease or metabolic dysfunction-associated steatotic liver disease. Cardiovasc Diabetol. 2024;23(1):232. pmid:38965572
View Article
PubMed/NCBI
Google Scholar

[97] View Article

[98] PubMed/NCBI

[99] Google Scholar

[ref26] 26. Golabi P, Paik JM, Kumar A, Al Shabeeb R, Eberly KE, Cusi K, et al. Nonalcoholic fatty liver disease (NAFLD) and associated mortality in individuals with type 2 diabetes, pre-diabetes, metabolically unhealthy, and metabolically healthy individuals in the United States. Metabolism. 2023;146:155642. pmid:37380016
View Article
PubMed/NCBI
Google Scholar

[101] View Article

[102] PubMed/NCBI

[103] Google Scholar

[ref27] 27. Angelini G, Panunzi S, Castagneto-Gissey L, Pellicanò F, De Gaetano A, Pompili M, et al. Accurate liquid biopsy for the diagnosis of non-alcoholic steatohepatitis and liver fibrosis. Gut. 2023;72(2):392–403. pmid:35820779
View Article
PubMed/NCBI
Google Scholar

[105] View Article

[106] PubMed/NCBI

[107] Google Scholar

[ref28] 28. Handelsman Y, Anderson JE, Bakris GL, Ballantyne CM, Bhatt DL, Bloomgarden ZT, et al. DCRM 2.0: Multispecialty practice recommendations for the management of diabetes, cardiorenal, and metabolic diseases. Metabolism. 2024;159:155931. pmid:38852020
View Article
PubMed/NCBI
Google Scholar

[109] View Article

[110] PubMed/NCBI

[111] Google Scholar

[ref29] 29. Benedé-Ubieto R, Cubero FJ, Nevzorova YA. Breaking the barriers: the role of gut homeostasis in Metabolic-Associated Steatotic Liver Disease (MASLD). Gut Microbes. 2024;16(1):2331460. pmid:38512763
View Article
PubMed/NCBI
Google Scholar

[113] View Article

[114] PubMed/NCBI

[115] Google Scholar

[ref30] 30. Blais P, Husain N, Kramer JR, Kowalkowski M, El-Serag H, Kanwal F. Nonalcoholic fatty liver disease is underrecognized in the primary care setting. Am J Gastroenterol. 2015;110(1):10–4. pmid:24890441
View Article
PubMed/NCBI
Google Scholar

[117] View Article

[118] PubMed/NCBI

[119] Google Scholar

[ref31] 31. Abdelhameed F, Kite C, Lagojda L, Dallaway A, Chatha KK, Chaggar SS, et al. Non-invasive Scores and Serum Biomarkers for Fatty Liver in the Era of Metabolic Dysfunction-associated Steatotic Liver Disease (MASLD): A Comprehensive Review From NAFLD to MAFLD and MASLD. Curr Obes Rep. 2024;13(3):510–31. pmid:38809396
View Article
PubMed/NCBI
Google Scholar

[121] View Article

[122] PubMed/NCBI

[123] Google Scholar

[ref32] 32. Paternostro R, Trauner M. Current treatment of non-alcoholic fatty liver disease. J Intern Med. 2022;292(2):190–204.
View Article
Google Scholar

[125] View Article

[126] Google Scholar

[ref33] 33. Bedossa P. Diagnosis of non-alcoholic fatty liver disease/non-alcoholic steatohepatitis: why liver biopsy is essential. Liver international: official journal of the International Association for the Study of the Liver. 2018;38(Suppl 1):64–6.
View Article
Google Scholar

[128] View Article

[129] Google Scholar

Figures

Abstract

Objective

Methods

Results

Conclusion

Introduction

Methods

Data source and study design

Features

ML algorithms

Statistical analyses

Results

Baseline characteristics

Performance evaluation of classification models

Feature selection and final prediction model

Post-hoc interpretation results

Discussion

Conclusions

Supporting information

S1 Data.

References