Machine learning-based prediction of hepatocellular carcinoma risk in steatotic liver disease: A nationwide cohort study

Log Young Kim; Ji Soo Lee; Jeong-Ju Yoo; Eun Ju Cho; Sang Gyune Kim; Young Seok Kim

doi:10.1371/journal.pone.0349593

Abstract

Background and aims

Steatotic liver disease (SLD) has emerged as an important risk factor for hepatocellular carcinoma (HCC), often in the absence of cirrhosis. We aimed to develop explainable machine learning (ML) models to predict HCC risk in individuals with SLD using routinely collected screening data.

Methods

Using the Korean National Health Insurance Service database, we included adults aged 20–79 years who underwent national health screening in 2017. SLD was defined as a fatty liver index (FLI) ≥ 60. Multiple ML algorithms, including deep learning models, were trained using a 7:3 train–test split with repeated non-replacement undersampling at a 1:3 case-to-control ratio to address extreme class imbalance.

Results

Among 1,241,560 adults with SLD, 2,152 (0.17%) developed HCC during a 6-year follow-up period. In the internal validation cohort, the final weighted multi-head attention deep neural network ensemble achieved an area under the receiver operating characteristic curve of 0.923, with a sensitivity of 71.36% and specificity of 93.65%. SHapley Additive exPlanations consistently identified age, sex, triglycerides, total cholesterol, aminotransferases, gamma-glutamyl transferase (GGT), Charlson Comorbidity Index, and FLI as key contributors to HCC risk. In multivariable Cox models, older age, male sex, elevated GGT, higher aspartate aminotransferase and FLI, and greater comorbidity burden were positively associated with HCC risk, whereas higher triglyceride and total cholesterol levels were inversely associated. Model-based risk stratification identified four groups with distinct HCC-free survival curves; the extremely high-risk group had an approximately 74.9-fold higher hazard of HCC than the low-risk group (95% CI, 55.3–101.5).

Conclusions

Overall, this explainable ML model based on routine health screening variables enables robust HCC risk stratification and may help inform future targeted surveillance strategies in SLD populations after external validation.

Citation: Kim LY, Lee JS, Yoo J-J, Cho EJ, Kim SG, Kim YS (2026) Machine learning-based prediction of hepatocellular carcinoma risk in steatotic liver disease: A nationwide cohort study. PLoS One 21(5): e0349593. https://doi.org/10.1371/journal.pone.0349593

Editor: Tyng-Yuan Jang, Kaohsiung Medical University, TAIWAN

Received: February 12, 2026; Accepted: May 2, 2026; Published: May 28, 2026

Copyright: © 2026 Kim et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: The data supporting the findings of this study are available from the National Health Insurance Sharing Service (NHISS) of the Republic of Korea. However, these data are not publicly available due to legal and ethical restrictions imposed by the National Health Insurance Service (NHIS), as the data contain potentially identifiable and sensitive personal health information. Access to the data is controlled by the NHIS data access committee. Data are available to qualified researchers who obtain approval from their Institutional Review Board (IRB) and the NHIS inquiry committee. Interested researchers can apply for access through the NHISS portal (http://nhiss.nhis.or.kr/bd/ab/bdaba021eng.do) or contact the NHIS data sharing service (email: nhiss@nhis.or.kr) for further information.

Funding: This work was supported in part by grants from the Seoul National University Hospital (04-2025-2180), the Liver Research Foundation of Korea (Bio Future Strategies Research Project), and the Research Supporting Program of the Korean Association for the Study of the Liver (KASL2024-04). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests: No authors have competing interests.

Abbreviations: ALT, alanine aminotransferase; AST, aspartate aminotransferase; AUROC, area under the receiver operating characteristic curve; BMI, body mass index; CCI, Charlson comorbidity index; CNN, convolutional neural network; DNN, deep neural network; eGFR, estimated glomerular filtration rate; FLI, fatty liver index; GBM, gradient boosting machine; GGT, gamma-glutamyl transferase; HCC, hepatocellular carcinoma; HDL, high-density lipoprotein; HRs, hazard ratios; ICD-10, International Classification Of Diseases, 10th revision; IRB, Institutional Review Board; KNN, k-nearest neighbors; LDL, low-density lipoprotein; LR, logistic regression; MHA, multi-head attention; ML, machine learning; NB, naive Bayes; NHIS, National Health Insurance Service; RF, random forest; SHAP, Shapley additive explanations; SLD, steatotic liver disease; SVM, support vector machine; XGBoost, extreme gradient boosting

Introduction

Steatotic liver disease (SLD) is the most common cause of chronic liver disease worldwide and is increasingly recognized as a major contributor to hepatocellular carcinoma (HCC) [1,2]. As the prevalence of metabolic syndrome rises, the global burden of HCC attributable to SLD continues to grow. Surveillance in this population is challenging because the annual incidence of HCC among SLD patients without cirrhosis is low; however, the large at-risk population results in a substantial absolute number of HCC cases [3,4]. Furthermore, a clinically meaningful proportion of HCC in SLD occurs in the absence of cirrhosis, limiting the effectiveness of surveillance strategies restricted to cirrhotic patients [5].

Current international guidelines generally advise against universal HCC surveillance in patients with SLD without cirrhosis due to insufficient evidence supporting its cost-effectiveness [6,7]. Consequently, most individuals with SLD are excluded from surveillance programs, even though high-risk metabolic phenotypes may go unrecognized in routine care. Conventional noninvasive fibrosis tests, such as FIB-4 or the NAFLD fibrosis score, serve as surrogates for fibrosis and predictors of HCC risk; however, their population-level application is limited by reliance on specialized testing and complete clinical data, which are often unavailable in unselected screening populations [8]. These gaps underscore the need for a precision screening method to identify the small subset of patients with SLD at high risk for HCC within the broader population.

Machine learning (ML) models can capture complex non-linear interactions among clinical variables and may improve risk discrimination under conditions of extreme class imbalance [9,10]. Using a nationwide cohort of more than 1.2 million adults with SLD, we developed and validated interpretable ML models to identify high-risk subgroups within the broader SLD population and to inform a targeted framework for HCC surveillance.

Patients and methods

Ethics statement

The study protocol was approved by the Institutional Review Board (IRB) of Soonchunhyang University Bucheon Hospital (IRB no. SCHBC 2025-04-006). The study was conducted in accordance with the Declaration of Helsinki and its subsequent amendments. The requirement for informed consent was waived by the IRB because the data were anonymized in accordance with confidentiality guidelines. This study was supported by Soonchunhyang University Research Fund.

Data source and study population

This retrospective cohort study used data from the Korean National Health Insurance Service (NHIS), which provides mandatory health coverage to more than 97% of the Korean population. Data were accessed for research purposes in 01/07/2025. Because imaging modalities are not included in the NHIS health screening program, hepatic steatosis was assessed using the fatty liver index (FLI), a validated surrogate marker ranging from 0 to 100; values < 30 indicate low risk, whereas values ≥ 60 indicate high risk of SLD [11].

We initially identified 8,233,669 adults aged 20–79 years who participated in the 2017 National Health Screening Program. We excluded individuals with a prior diagnosis of HCC before the index year (International Classification of Diseases, 10th revision [ICD-10] code C22.0; n = 9,129), those who died within 1 year of screening (n = 5,264), those with insufficient diagnostic records to meet the operational definition of HCC (fewer than three HCC-related claims between 2018 and 2023; n = 1), those with incomplete health screening data (n = 27), those with documented chronic hepatitis B or C or HIV infection (ICD-10 codes B18.0–B18.2 and B20–B24; n = 258,173), and those with FLI < 60 (n = 6,719,515). The final analytic cohort comprised 1,241,560 individuals with FLI ≥ 60 (Supplementary Materials; Fig S1 in S1 File).

Study outcome

The primary outcome was the development of HCC between January 1, 2018, and December 31, 2023. HCC was defined as three or more claims with the ICD-10 code C22.0, a claims-based definition previously validated in NHIS studies with a positive predictive value exceeding 90% [12]. The study population was followed from baseline until HCC diagnosis, death, or December 31, 2023, whichever occurred first.

Variables

Baseline demographic and clinical characteristics were obtained from the 2017 health screening data. Demographic variables included age and sex. Anthropometric measurements included body mass index (BMI), waist circumference, and systolic and diastolic blood pressure. Laboratory parameters comprised fasting plasma glucose, alanine aminotransferase (ALT), aspartate aminotransferase (AST), gamma-glutamyl transferase (GGT), total cholesterol, low-density lipoprotein (LDL) cholesterol, high-density lipoprotein (HDL) cholesterol, triglycerides, hemoglobin, serum creatinine, estimated glomerular filtration rate (eGFR), and urine protein. Comorbidity was assessed using the Charlson Comorbidity Index (CCI), calculated from diagnostic codes recorded in the year preceding the index examination. ICD-10 codes are listed in Supplementary Materials; Table S1 in S1 File.

Lifestyle information was obtained from standardized self-administered questionnaires. Smoking status was categorized as never, former, or current. Alcohol consumption was categorized as none/mild or moderate-to-heavy. Based on the national screening criteria, moderate-to-heavy drinking was defined as consuming more than 7 units for men and 5 units for women at least twice weekly. Mild (social) drinking was defined as consuming up to 6 units for men and 4 units for women no more than once weekly. Physical activity was assessed as the self-reported number of weekly sessions of moderate-to-vigorous exercise.

Prior to model development, we performed data preprocessing to ensure data quality. Individuals with missing values in key variables were excluded (n = 27, corresponding to those with incomplete health screening data shown in the Fig. S1 in S1 File), and implausible values (e.g., physiologically impossible or extreme values) were removed based on predefined clinical criteria. For variables with limited missingness such as family history, missing values were treated as absence based on the structure of the questionnaire.

Model development and selection

The study cohort was randomly divided into a development set (70%) and an internal validation set (30%), stratified by HCC status to maintain similar event rates in both sets. The development set was used for model training, hyperparameter tuning, and model selection, while the validation set was reserved for final assessment of model discrimination, calibration, and risk stratification.

To address extreme class imbalance (event rate 0.17%), we initially constructed a balanced training dataset including all HCC cases in the development set and a random sample of non-HCC controls at a 1:3 case-to-control ratio. On this subset, we trained and compared multiple predictive models. Deep learning architectures included standard deep neural networks (DNN), convolutional neural networks (CNN) for tabular data, a multi-head attention (MHA)–based DNN, and an MHA-CNN model. Classical ML algorithms included extreme gradient boosting (XGBoost), random forest (RF), gradient boosting machines (GBM), logistic regression (LR), support vector machines (SVM), naive Bayes (NB), and k-nearest neighbors (KNN). We also evaluated a hybrid ensemble model combining XGBoost and RF (Ensemble XGB–RF) to assess the potential benefit of aggregating tree-based methods. Hyperparameters were optimized using cross-validation. Deep learning models were trained with binary cross-entropy loss and the Adam optimizer, with early stopping to prevent overfitting.

Based on area under the receiver operating characteristic curve (AUROC) and F-measure in the validation set, the MHA-DNN, MHA-CNN, and XGBoost models were selected as the top three performers. MHA-DNN is a deep neural network architecture incorporating multi-head attention mechanisms, which enable the model to capture complex and non-linear relationships among input features by attending to multiple representation subspaces simultaneously [13]. MHA-CNN combines convolutional neural networks with multi-head attention to capture both local feature interactions and global dependencies [14]. XGBoost is an optimized gradient boosting algorithm based on decision trees, known for its efficiency and strong performance in structured data [15].

Explainability and variable selection

To identify robust predictors of HCC, SHapley Additive exPlanations (SHAP) were applied to the top three models. Mean absolute SHAP values were calculated for each feature to quantify its global contribution to predictions. Common important variables, features consistently ranked highly across all three models, included AST, GGT, triglycerides, CCI, and FLI. To assess clinical and biological plausibility, these AI-selected features were further examined using Cox proportional hazards models and Kaplan–Meier survival analyses.

Construction of the final MHA-DNN ensemble

MHA-DNN was chosen as the final base learner for risk prediction. We applied a repeated non-replacement undersampling strategy to generate 20 distinct training subsets, each containing all HCC cases and a unique set of non-HCC controls at a 1:3 ratio. An ensemble risk score was then constructed by aggregating predictions from the three best-performing MHA-DNN models, weighted at 0.7, 0.2, and 0.1, respectively.

Performance evaluation and risk stratification

Model performance was assessed using AUROC, sensitivity, specificity, and F-measure at the optimal cutoff determined by the Youden index. For risk stratification and survival analysis, predicted risk scores were applied to the full dataset without sampling in both the training and validation cohorts. Finally, individuals in both the training and validation cohorts were categorized into four risk groups. Survival outcomes across these groups were compared using Kaplan–Meier curves and log-rank tests, and hazard ratios (HRs) were estimated using Cox proportional hazards models.

Statistical analysis

Continuous variables are presented as means ± standard deviations and compared using Student’s t-test. Categorical variables are expressed as counts and percentages and compared using Chi-square tests. All analyses were conducted using SAS version 9.4 (SAS Institute, Cary, NC, United States) and R version 4.5.1 (R Foundation for Statistical Computing, Vienna, Austria; http://www.R-project.org). A two-sided P-value < 0.05 was considered statistically significant.

Results

Study population characteristics

Among 1,241,560 participants with FLI ≥ 60, 2,152 (0.17%) developed HCC over 6 years. Individuals who developed HCC were older, more frequently male, and had a higher prevalence of cirrhosis, diabetes, hypertension, and dyslipidemia compared with those who remained HCC-free (Table 1). Metabolic and biochemical profiles also differed between groups. Compared with individuals who did not develop HCC, those who developed HCC exhibited more adverse glycemic and liver enzyme profiles, while triglyceride and total cholesterol levels tended to be lower, despite a similar or higher burden of SLD (mean FLI, 78.3 ± 11.0 vs. 76.5 ± 11.0, P < 0.0001). Comorbidity burden, reflected by the CCI, was higher in the HCC group, consistent with clustering of metabolic and systemic risk factors.

Download:

Table 1. Baseline characteristics of study population stratified by hepatocellular carcinoma development.

https://doi.org/10.1371/journal.pone.0349593.t001

Performance of candidate models

To identify the most effective approach for predicting HCC, we first evaluated the performance of various ML and deep learning architectures (Supplementary Materials; Table S2 in S1 File). The MHA-DNN, MHA-CNN, and XGBoost achieved the highest validation AUROC values (~0.92) along with favorable sensitivity, specificity, F-measure, and geometric mean (Fig 1). Other models, such as RF and LR, showed slightly lower sensitivity or overall F-measure at comparable specificity. Based on these results, the MHA-DNN, MHA-CNN, and XGBoost were selected as the top-performing models for SHAP-based interpretation and variable selection.

Download:

Fig 1. Receiver operating characteristic curves of 12 artificial intelligence algorithms for predicting hepatocellular carcinoma in steatotic liver disease.

https://doi.org/10.1371/journal.pone.0349593.g001

Identification of common key predictors via explainable AI

SHAP analysis of the top three models revealed a consistent set of important predictors, including age, sex, triglycerides, total cholesterol, AST, ALT, GGT, CCI, and FLI. SHAP summary plots (Fig 2) provided a granular view of how these features influenced model output; notably, lower values of TG and total cholesterol were frequently associated with an increased HCC risk.

Download:

Fig 2. SHAP-based feature importance for the three top-performing models.

Summary plot showing the impact of each feature on hepatocellular carcinoma risk predictions. Features are ranked by mean absolute SHAP values across all predictions. The color gradient represents feature values (red = high, blue = low). Horizontal position indicates the effect on model output, with positive values increasing HCC prediction probability. Age was the most important feature, followed by gamma-glutamyl transferase and liver transaminases. The consistent rankings across different model architectures support the biological relevance of these predictors.

https://doi.org/10.1371/journal.pone.0349593.g002

Kaplan–Meier analyses further demonstrated distinct risk patterns for incident HCC across categories of these predictors. For instance, age and liver enzymes showed clear dose-dependent associations. For triglyceride levels (Supplementary Materials; Fig. S2G in S1 File), the group with triglycerides < 150 mg/dL exhibited a significantly higher cumulative incidence of HCC compared to the group with triglycerides ≥ 150 mg/dL (P < 0.0001), supporting the inverse association captured by the models.

Using the common important variables, we performed explanatory Cox regression analyses (Table 2). Because ALT was strongly correlated with AST (Supplementary Materials; Fig S3 in S1 File), only AST was included in multivariable Cox models to avoid multicollinearity. All eight variables, including older age, male sex, elevated GGT, higher AST, and lower lipid levels, showed significant associations with incident HCC in univariate analyses and remained independently associated after adjustment (Table 2).

Download:

Table 2. High-risk features for hepatocellular carcinoma: univariable and multivariable hazard ratios across model rankings.

https://doi.org/10.1371/journal.pone.0349593.t002

Development and performance of the final weighted ensemble model

Given the superior performance of the MHA-DNN, we developed a final weighted voting ensemble combining predictions from the three best-performing iterations among 20 repeated non-replacement undersampling training sessions. In the independent validation set, this ensemble model achieved an AUROC of 0.9237, sensitivity of 71.36%, and specificity of 93.65% (Table 3), comparable to performance in the training set (AUROC 0.9277), suggesting minimal overfitting.

Download:

Table 3. Performance of the final multi-head attention deep neural network ensemble.

https://doi.org/10.1371/journal.pone.0349593.t003

The population was stratified into four risk groups: low, medium, high, and extremely high risk. Kaplan–Meier analysis based on predicted risk scores demonstrated clear separation of HCC-free survival across risk groups in both the validation cohort (Fig 3A) and the training cohort (Fig 3B) (P < 0.0001 for both). The cumulative incidence of HCC increased stepwise across risk groups, with the highest incidence observed in the extremely high-risk group (Fig 3). Similar patterns were observed in both cohorts, supporting the robustness and consistency of the model. In univariate Cox models, HRs for HCC compared with the low-risk group were 7.0 for medium-risk, 37.5 for high-risk, and 74.9 for extremely high-risk individuals. The cumulative incidence in the extremely high-risk group approached levels typically used as thresholds for HCC surveillance in patients with cirrhosis.

Download:

Fig 3. Kaplan–Meier curves for cumulative incidence of HCC according to predicted risk groups.

(A) Validation cohort. (B) Training cohort.

https://doi.org/10.1371/journal.pone.0349593.g003

Discussion

In this nationwide cohort of over 1.2 million adults with SLD, we developed and validated an explainable ML model to identify individuals with SLD at increased risk of HCC. In the internal validation cohort, the final model demonstrated high discrimination (AUROC 0.9237) and effectively stratified the population into risk groups with markedly different cumulative incidences. Although the extremely high-risk stratum represented only a small fraction of the cohort, it had an approximately 75-fold higher hazard of HCC than the low-risk group.

Model interpretation suggested a biologically coherent metabolic phenotype associated with HCC risk. SHAP analysis revealed a ‘metabolic burnout’ signature, characterized by lower levels of triglycerides and total cholesterol alongside elevated AST and GGT in individuals at high risk for HCC. This pattern is consistent with prior observations of impaired hepatic lipoprotein export and metabolic derangement in advanced fibrosis and burned-out steatohepatitis [16,17].

Specifically, the high predictive value of GGT and AST may reflect persistent hepatic injury, oxidative stress, and systemic inflammation, which have been implicated in hepatocarcinogenesis [18,19]. Furthermore, our findings are consistent with prior reports that progression toward advanced fibrosis may be accompanied by lower lipid levels [20,21]. While categorical analysis in the Kaplan–Meier curves (Fig. S2G in S1 File) primarily highlights this inverse relationship, our MHA-based models incorporated these values as continuous multivariable features within a broader risk-prediction framework. The superior performance of MHA-based models may reflect their ability to integrate these metabolic signatures with systemic factors, such as the CCI, which captures the cumulative burden of chronic comorbidity.

Methodologically, we addressed extreme class imbalance using repeated non-replacement undersampling and ensembling. We conducted 20 independent training sessions with non-replacement undersampling and constructed a final weighted voting MHA-DNN ensemble by aggregating predictions from the three best-performing iterations. This approach improved stability in a rare-event setting (0.17% event rate) while preserving sensitivity for HCC detection without compromising specificity [22]. It also complements linear noninvasive fibrosis tests, such as FIB-4 and the NAFLD fibrosis score, which primarily identify advanced fibrosis rather than predict incident HCC [23].

These findings may inform refinement of surveillance strategies in SLD. Current international guidelines generally discourage universal HCC surveillance in individuals with SLD without cirrhosis [6,24]. In contrast, the extremely high-risk stratum identified by our model likely exceeds commonly cited annual incidence thresholds for surveillance cost-effectiveness [25,26]. If externally validated, this framework could support targeted surveillance for high-risk individuals while reducing unnecessary testing in low-risk populations.

Our study has several limitations. First, steatosis was defined using FLI rather than histology or transient elastography, although FLI is well validated for epidemiologic studies [11,27,28]. The retrospective design and reliance on administrative data may introduce misclassification despite rigorous exclusion criteria. Second, in the Korean NHIS system, patients with confirmed HCC are typically registered in a special reimbursement program that requires strict diagnostic criteria and is closely linked to ICD-10 code C22.0. Therefore, misclassification of typical HCC cases under alternative codes such as C22.8 is likely to be minimal, although a small degree of underestimation cannot be excluded. Third, the cohort was exclusively Korean, requiring external validation in diverse populations, particularly given ethnic differences in adiposity distribution and metabolic risk [29]. Finally, a simplified risk score based on conventional statistical models, such as Cox regression using selected predictors, may be feasible and could improve clinical usability. However, such approaches may not fully capture the complex non-linear interactions identified by machine learning models. Future studies should focus on developing and validating simplified scoring systems derived from these findings.

In conclusion, an explainable ML model effectively stratified HCC risk in a large SLD population, highlighting a metabolic burnout signature of low triglycerides and liver injury. This approach may provide a basis for future external validation and evaluation of risk-based surveillance strategies.

Supporting information

S1 File. Supplementary materials.

Supplementary figures and tables including Fig. S1–S3 and Tables S1–S2.

https://doi.org/10.1371/journal.pone.0349593.s001

(DOCX)

References

1. Singh SP, Madke T, Chand P. Global epidemiology of hepatocellular carcinoma. J Clin Exp Hepatol. 2025;15(2):102446. pmid:39659901
- View Article
- PubMed/NCBI
- Google Scholar
2. Rinella ME, Lazarus JV, Ratziu V, Francque SM, Sanyal AJ, Kanwal F, et al. A multisociety Delphi consensus statement on new fatty liver disease nomenclature. Hepatology. 2023;78(6):1966–86. pmid:37363821
- View Article
- PubMed/NCBI
- Google Scholar
3. Geh D, Manas DM, Reeves HL. Hepatocellular carcinoma in non-alcoholic fatty liver disease-a review of an emerging challenge facing clinicians. Hepatobiliary Surg Nutr. 2021;10(1):59–75. pmid:33575290
- View Article
- PubMed/NCBI
- Google Scholar
4. Armandi A, Rosso C, Caviglia GP, Bugianesi E. An updated overview on hepatocellular carcinoma in patients with Metabolic dysfunction-Associated Steatotic Liver Disease: Trends, pathophysiology and risk-based surveillance. Metabolism. 2025;162:156080. pmid:39571891
- View Article
- PubMed/NCBI
- Google Scholar
5. Piscaglia F, Svegliati-Baroni G, Barchetti A, Pecorelli A, Marinelli S, Tiribelli C, et al. Clinical patterns of hepatocellular carcinoma in nonalcoholic fatty liver disease: a multicenter prospective study. Hepatology. 2016;63(3):827–38. pmid:26599351
- View Article
- PubMed/NCBI
- Google Scholar
6. Singal AG, Llovet JM, Yarchoan M, Mehta N, Heimbach JK, Dawson LA, et al. AASLD Practice Guidance on prevention, diagnosis, and treatment of hepatocellular carcinoma. Hepatology. 2023;78(6):1922–65. pmid:37199193
- View Article
- PubMed/NCBI
- Google Scholar
7. European Association for the Study of the Liver. EASL Clinical Practice Guidelines on the management of hepatocellular carcinoma. J Hepatol. 2025;82(2):315–74. pmid:39690085
- View Article
- PubMed/NCBI
- Google Scholar
8. Mózes FE, Lee JA, Vali Y, Alzoubi O, Staufer K, Trauner M, et al. Performance of non-invasive tests and histology for the prediction of clinical outcomes in patients with non-alcoholic fatty liver disease: an individual participant data meta-analysis. Lancet Gastroenterol Hepatol. 2023;8(8):704–13. pmid:37290471
- View Article
- PubMed/NCBI
- Google Scholar
9. Topol EJ. High-performance medicine: the convergence of human and artificial intelligence. Nat Med. 2019;25(1):44–56. pmid:30617339
- View Article
- PubMed/NCBI
- Google Scholar
10. Calderaro J, Seraphin TP, Luedde T, Simon TG. Artificial intelligence for the prevention and clinical management of hepatocellular carcinoma. J Hepatol. 2022;76(6):1348–61. pmid:35589255
- View Article
- PubMed/NCBI
- Google Scholar
11. Cho E-J, Jung G-C, Kwak M-S, Yang J-I, Yim J-Y, Yu S-J, et al. Fatty liver index for predicting nonalcoholic fatty liver disease in an asymptomatic Korean population. Diagnostics (Basel). 2021;11(12):2233. pmid:34943469
- View Article
- PubMed/NCBI
- Google Scholar
12. Yang MS, Park M, Back JH, Lee GH, Shin JH, Kim K, et al. Validation of cancer diagnosis based on the National Health Insurance Service Database versus the National Cancer Registry Database in Korea. Cancer Res Treat. 2022;54(2):352–61. pmid:34353000
- View Article
- PubMed/NCBI
- Google Scholar
13. Vaswani A, Shazeer N, Parmar N, et al. Attention is all you need. Proceedings of the 31st International Conference on Neural Information Processing Systems. Long Beach, California, USA: Curran Associates Inc.; 2017. pp. 6000–10.
14. Lecun Y, Bottou L, Bengio Y, Haffner P. Gradient-based learning applied to document recognition. Proc IEEE. 1998;86(11):2278–324.
- View Article
- Google Scholar
15. Chen T, Guestrin C. XGBoost: A Scalable Tree Boosting System. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. San Francisco, California, USA: Association for Computing Machinery; 2016. pp. 785–94.
16. Eslam M, Valenti L, Romeo S. Genetics and epigenetics of NAFLD and NASH: clinical impact. J Hepatol. 2018;68(2):268–79. pmid:29122391
- View Article
- PubMed/NCBI
- Google Scholar
17. Jiang ZG, Tsugawa Y, Tapper EB, Lai M, Afdhal N, Robson SC, et al. Low-fasting triglyceride levels are associated with non-invasive markers of advanced liver fibrosis among adults in the United States. Aliment Pharmacol Ther. 2015;42(1):106–16. pmid:25913437
- View Article
- PubMed/NCBI
- Google Scholar
18. Cho EJ, Jeong S-M, Chung GE, Yoo J-J, Cho Y, Lee K-N, et al. Gamma-glutamyl transferase and risk of all-cause and disease-specific mortality: a Nationwide Cohort Study. Sci Rep. 2023;13(1):1751. pmid:36720971
- View Article
- PubMed/NCBI
- Google Scholar
19. Lee D-H, Blomhoff R, Jacobs DR Jr. Is serum gamma glutamyltransferase a marker of oxidative stress? Free Radic Res. 2004;38(6):535–9. pmid:15346644
- View Article
- PubMed/NCBI
- Google Scholar
20. Cho Y, Cho EJ, Yoo J-J, Chang Y, Chung GE, Jeong S-M, et al. Association between Lipid Profiles and the Incidence of Hepatocellular Carcinoma: A Nationwide Population-Based Study. Cancers (Basel). 2021;13(7):1599. pmid:33808412
- View Article
- PubMed/NCBI
- Google Scholar
21. Kawamura Y, Arase Y, Ikeda K, Seko Y, Imai N, Hosaka T, et al. Large-scale long-term follow-up study of Japanese patients with non-alcoholic Fatty liver disease for the onset of hepatocellular carcinoma. Am J Gastroenterol. 2012;107(2):253–61. pmid:22008893
- View Article
- PubMed/NCBI
- Google Scholar
22. Johnson JM, Khoshgoftaar TM. Survey on deep learning with class imbalance. J Big Data. 2019;6(1).
- View Article
- Google Scholar
23. European Association for the Study of the Liver (EASL), European Association for the Study of Diabetes (EASD), European Association for the Study of Obesity (EASO). EASL-EASD-EASO Clinical Practice Guidelines on the management of metabolic dysfunction-associated steatotic liver disease (MASLD). J Hepatol. 2024;81(3):492–542. pmid:38851997
- View Article
- PubMed/NCBI
- Google Scholar
24. European Association for the Study of the L. EASL Clinical Practice Guidelines on the management of hepatocellular carcinoma. J Hepatol 2025;82:315–74.
- View Article
- Google Scholar
25. Cadier B, Bulsei J, Nahon P, Seror O, Laurent A, Rosa I, et al. Early detection and curative treatment of hepatocellular carcinoma: a cost-effectiveness analysis in France and in the United States. Hepatology. 2017;65(4):1237–48. pmid:28176349
- View Article
- PubMed/NCBI
- Google Scholar
26. Sarasin FP, Giostra E, Hadengue A. Cost-effectiveness of screening for detection of small hepatocellular carcinoma in western patients with Child-Pugh class A cirrhosis. Am J Med. 1996;101(4):422–34. pmid:8873514
- View Article
- PubMed/NCBI
- Google Scholar
27. Bedogni G, Bellentani S, Miglioli L, Masutti F, Passalacqua M, Castiglione A, et al. The Fatty Liver Index: a simple and accurate predictor of hepatic steatosis in the general population. BMC Gastroenterol. 2006;6:33. pmid:17081293
- View Article
- PubMed/NCBI
- Google Scholar
28. Kim JH, Kwon SY, Lee SW, Lee CH. Validation of fatty liver index and lipid accumulation product for predicting fatty liver in Korean population. Liver Int. 2011;31(10):1600–1. pmid:22093336
- View Article
- PubMed/NCBI
- Google Scholar
29. Younossi ZM, Kalligeros M, Henry L. Epidemiology of metabolic dysfunction-associated steatotic liver disease. Clin Mol Hepatol. 2025;31(Suppl):S32–50. pmid:39159948
- View Article
- PubMed/NCBI
- Google Scholar

[ref1] 1. Singh SP, Madke T, Chand P. Global epidemiology of hepatocellular carcinoma. J Clin Exp Hepatol. 2025;15(2):102446. pmid:39659901
View Article
PubMed/NCBI
Google Scholar

[2] View Article

[3] PubMed/NCBI

[4] Google Scholar

[ref2] 2. Rinella ME, Lazarus JV, Ratziu V, Francque SM, Sanyal AJ, Kanwal F, et al. A multisociety Delphi consensus statement on new fatty liver disease nomenclature. Hepatology. 2023;78(6):1966–86. pmid:37363821
View Article
PubMed/NCBI
Google Scholar

[6] View Article

[7] PubMed/NCBI

[8] Google Scholar

[ref3] 3. Geh D, Manas DM, Reeves HL. Hepatocellular carcinoma in non-alcoholic fatty liver disease-a review of an emerging challenge facing clinicians. Hepatobiliary Surg Nutr. 2021;10(1):59–75. pmid:33575290
View Article
PubMed/NCBI
Google Scholar

[10] View Article

[11] PubMed/NCBI

[12] Google Scholar

[ref4] 4. Armandi A, Rosso C, Caviglia GP, Bugianesi E. An updated overview on hepatocellular carcinoma in patients with Metabolic dysfunction-Associated Steatotic Liver Disease: Trends, pathophysiology and risk-based surveillance. Metabolism. 2025;162:156080. pmid:39571891
View Article
PubMed/NCBI
Google Scholar

[14] View Article

[15] PubMed/NCBI

[16] Google Scholar

[ref5] 5. Piscaglia F, Svegliati-Baroni G, Barchetti A, Pecorelli A, Marinelli S, Tiribelli C, et al. Clinical patterns of hepatocellular carcinoma in nonalcoholic fatty liver disease: a multicenter prospective study. Hepatology. 2016;63(3):827–38. pmid:26599351
View Article
PubMed/NCBI
Google Scholar

[18] View Article

[19] PubMed/NCBI

[20] Google Scholar

[ref6] 6. Singal AG, Llovet JM, Yarchoan M, Mehta N, Heimbach JK, Dawson LA, et al. AASLD Practice Guidance on prevention, diagnosis, and treatment of hepatocellular carcinoma. Hepatology. 2023;78(6):1922–65. pmid:37199193
View Article
PubMed/NCBI
Google Scholar

[22] View Article

[23] PubMed/NCBI

[24] Google Scholar

[ref7] 7. European Association for the Study of the Liver. EASL Clinical Practice Guidelines on the management of hepatocellular carcinoma. J Hepatol. 2025;82(2):315–74. pmid:39690085
View Article
PubMed/NCBI
Google Scholar

[26] View Article

[27] PubMed/NCBI

[28] Google Scholar

[ref8] 8. Mózes FE, Lee JA, Vali Y, Alzoubi O, Staufer K, Trauner M, et al. Performance of non-invasive tests and histology for the prediction of clinical outcomes in patients with non-alcoholic fatty liver disease: an individual participant data meta-analysis. Lancet Gastroenterol Hepatol. 2023;8(8):704–13. pmid:37290471
View Article
PubMed/NCBI
Google Scholar

[30] View Article

[31] PubMed/NCBI

[32] Google Scholar

[ref9] 9. Topol EJ. High-performance medicine: the convergence of human and artificial intelligence. Nat Med. 2019;25(1):44–56. pmid:30617339
View Article
PubMed/NCBI
Google Scholar

[34] View Article

[35] PubMed/NCBI

[36] Google Scholar

[ref10] 10. Calderaro J, Seraphin TP, Luedde T, Simon TG. Artificial intelligence for the prevention and clinical management of hepatocellular carcinoma. J Hepatol. 2022;76(6):1348–61. pmid:35589255
View Article
PubMed/NCBI
Google Scholar

[38] View Article

[39] PubMed/NCBI

[40] Google Scholar

[ref11] 11. Cho E-J, Jung G-C, Kwak M-S, Yang J-I, Yim J-Y, Yu S-J, et al. Fatty liver index for predicting nonalcoholic fatty liver disease in an asymptomatic Korean population. Diagnostics (Basel). 2021;11(12):2233. pmid:34943469
View Article
PubMed/NCBI
Google Scholar

[42] View Article

[43] PubMed/NCBI

[44] Google Scholar

[ref12] 12. Yang MS, Park M, Back JH, Lee GH, Shin JH, Kim K, et al. Validation of cancer diagnosis based on the National Health Insurance Service Database versus the National Cancer Registry Database in Korea. Cancer Res Treat. 2022;54(2):352–61. pmid:34353000
View Article
PubMed/NCBI
Google Scholar

[46] View Article

[47] PubMed/NCBI

[48] Google Scholar

[ref13] 13. Vaswani A, Shazeer N, Parmar N, et al. Attention is all you need. Proceedings of the 31st International Conference on Neural Information Processing Systems. Long Beach, California, USA: Curran Associates Inc.; 2017. pp. 6000–10.

[ref14] 14. Lecun Y, Bottou L, Bengio Y, Haffner P. Gradient-based learning applied to document recognition. Proc IEEE. 1998;86(11):2278–324.
View Article
Google Scholar

[51] View Article

[52] Google Scholar

[ref15] 15. Chen T, Guestrin C. XGBoost: A Scalable Tree Boosting System. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. San Francisco, California, USA: Association for Computing Machinery; 2016. pp. 785–94.

[ref16] 16. Eslam M, Valenti L, Romeo S. Genetics and epigenetics of NAFLD and NASH: clinical impact. J Hepatol. 2018;68(2):268–79. pmid:29122391
View Article
PubMed/NCBI
Google Scholar

[55] View Article

[56] PubMed/NCBI

[57] Google Scholar

[ref17] 17. Jiang ZG, Tsugawa Y, Tapper EB, Lai M, Afdhal N, Robson SC, et al. Low-fasting triglyceride levels are associated with non-invasive markers of advanced liver fibrosis among adults in the United States. Aliment Pharmacol Ther. 2015;42(1):106–16. pmid:25913437
View Article
PubMed/NCBI
Google Scholar

[59] View Article

[60] PubMed/NCBI

[61] Google Scholar

[ref18] 18. Cho EJ, Jeong S-M, Chung GE, Yoo J-J, Cho Y, Lee K-N, et al. Gamma-glutamyl transferase and risk of all-cause and disease-specific mortality: a Nationwide Cohort Study. Sci Rep. 2023;13(1):1751. pmid:36720971
View Article
PubMed/NCBI
Google Scholar

[63] View Article

[64] PubMed/NCBI

[65] Google Scholar

[ref19] 19. Lee D-H, Blomhoff R, Jacobs DR Jr. Is serum gamma glutamyltransferase a marker of oxidative stress? Free Radic Res. 2004;38(6):535–9. pmid:15346644
View Article
PubMed/NCBI
Google Scholar

[67] View Article

[68] PubMed/NCBI

[69] Google Scholar

[ref20] 20. Cho Y, Cho EJ, Yoo J-J, Chang Y, Chung GE, Jeong S-M, et al. Association between Lipid Profiles and the Incidence of Hepatocellular Carcinoma: A Nationwide Population-Based Study. Cancers (Basel). 2021;13(7):1599. pmid:33808412
View Article
PubMed/NCBI
Google Scholar

[71] View Article

[72] PubMed/NCBI

[73] Google Scholar

[ref21] 21. Kawamura Y, Arase Y, Ikeda K, Seko Y, Imai N, Hosaka T, et al. Large-scale long-term follow-up study of Japanese patients with non-alcoholic Fatty liver disease for the onset of hepatocellular carcinoma. Am J Gastroenterol. 2012;107(2):253–61. pmid:22008893
View Article
PubMed/NCBI
Google Scholar

[75] View Article

[76] PubMed/NCBI

[77] Google Scholar

[ref22] 22. Johnson JM, Khoshgoftaar TM. Survey on deep learning with class imbalance. J Big Data. 2019;6(1).
View Article
Google Scholar

[79] View Article

[80] Google Scholar

[ref23] 23. European Association for the Study of the Liver (EASL), European Association for the Study of Diabetes (EASD), European Association for the Study of Obesity (EASO). EASL-EASD-EASO Clinical Practice Guidelines on the management of metabolic dysfunction-associated steatotic liver disease (MASLD). J Hepatol. 2024;81(3):492–542. pmid:38851997
View Article
PubMed/NCBI
Google Scholar

[82] View Article

[83] PubMed/NCBI

[84] Google Scholar

[ref24] 24. European Association for the Study of the L. EASL Clinical Practice Guidelines on the management of hepatocellular carcinoma. J Hepatol 2025;82:315–74.
View Article
Google Scholar

[86] View Article

[87] Google Scholar

[ref25] 25. Cadier B, Bulsei J, Nahon P, Seror O, Laurent A, Rosa I, et al. Early detection and curative treatment of hepatocellular carcinoma: a cost-effectiveness analysis in France and in the United States. Hepatology. 2017;65(4):1237–48. pmid:28176349
View Article
PubMed/NCBI
Google Scholar

[89] View Article

[90] PubMed/NCBI

[91] Google Scholar

[ref26] 26. Sarasin FP, Giostra E, Hadengue A. Cost-effectiveness of screening for detection of small hepatocellular carcinoma in western patients with Child-Pugh class A cirrhosis. Am J Med. 1996;101(4):422–34. pmid:8873514
View Article
PubMed/NCBI
Google Scholar

[93] View Article

[94] PubMed/NCBI

[95] Google Scholar

[ref27] 27. Bedogni G, Bellentani S, Miglioli L, Masutti F, Passalacqua M, Castiglione A, et al. The Fatty Liver Index: a simple and accurate predictor of hepatic steatosis in the general population. BMC Gastroenterol. 2006;6:33. pmid:17081293
View Article
PubMed/NCBI
Google Scholar

[97] View Article

[98] PubMed/NCBI

[99] Google Scholar

[ref28] 28. Kim JH, Kwon SY, Lee SW, Lee CH. Validation of fatty liver index and lipid accumulation product for predicting fatty liver in Korean population. Liver Int. 2011;31(10):1600–1. pmid:22093336
View Article
PubMed/NCBI
Google Scholar

[101] View Article

[102] PubMed/NCBI

[103] Google Scholar

[ref29] 29. Younossi ZM, Kalligeros M, Henry L. Epidemiology of metabolic dysfunction-associated steatotic liver disease. Clin Mol Hepatol. 2025;31(Suppl):S32–50. pmid:39159948
View Article
PubMed/NCBI
Google Scholar

[105] View Article

[106] PubMed/NCBI

[107] Google Scholar

Figures

Abstract

Background and aims

Methods

Results

Conclusions

Introduction

Patients and methods

Ethics statement

Data source and study population

Study outcome

Variables

Model development and selection

Explainability and variable selection

Construction of the final MHA-DNN ensemble

Performance evaluation and risk stratification

Statistical analysis

Results

Study population characteristics

Performance of candidate models

Identification of common key predictors via explainable AI

Development and performance of the final weighted ensemble model

Discussion

Supporting information

S1 File. Supplementary materials.

References