Figures
Abstract
Hepatocellular carcinoma (HCC) is a malignant tumor with a high incidence and fatality. The occurrence and progression of HCC are tightly linked to ferroptosis, a unique type of cell death. To accurately predict the prognosis, immunological traits, treatment sensitivity, and drug prediction for patients with HCC, this work attempts to develop a novel ferroptosis-related gene signature (nFRGs). Several machine learning techniques were applied to build the nFRGs model utilizing data from The Cancer Genome Atlas (TCGA) and GSE14520 datasets. Different analysis packages in R version 4.4.1 were also used for prognosis analysis, molecular function analysis, somatic mutation analysis, immunotherapy response analysis, immunotherapy evaluation, drug sensitivity analysis, and drug prediction to compare the differences between the low-risk and high-risk groups. The nFRGs model includes five ferroptosis-related genes (KIF20A, NT5DC2, G6PD, SLC7A11, and EZH2). The results indicate that nFRGs are an independent prognostic risk factor for HCC patients, and patients in the high-risk group have a worse prognosis. Our nFRGs model shows better accuracy and reliability in predicting the prognosis of HCC patients than other existing ferroptosis-related gene models. Both the high- and low-risk groups of nFRGs had differentially expressed genes (DEGs) enriched in pathways mostly associated with immunological traits and tumor progression. The high-risk group exhibited clear immune escape characteristics, with significant upregulation in the expression of immune checkpoints and TIDE scores. Furthermore, IPS analysis also revealed that the high-risk group is less responsive to immunotherapy, while the low-risk group showed a better potential for immune therapy response, which further highlights the potential of nFRGs as a predictor for immunotherapy outcomes. This suggests a stronger immune suppression status in high-risk patients, potentially leading to a poorer response to immune checkpoint inhibitors (ICIs). In contrast, the low-risk group displayed lower immune escape features, making them potentially more susceptible to immune responses. Additionally, there were significant differences in gene mutations, molecular functions, and other factors between the low-risk and high-risk groups. Lastly, our investigation predicted possible medications that would work well for the model and found sensitive chemotherapeutic and targeted medications for both high-risk and low-risk groups. In conclusion, nFRGs could serve as a novel prognostic biomarker, providing valuable insights for personalized treatment strategies.
Citation: Wu C, Chen X, Zhang Y, Du Y, Xu J, Peng Y, et al. (2025) A novel ferroptosis-related signature for predicting prognosis, immune characteristics, and treatment prediction in hepatocellular carcinoma. PLoS One 20(6): e0322158. https://doi.org/10.1371/journal.pone.0322158
Editor: Xiaosheng Tan, Rutgers: Rutgers The State University of New Jersey, UNITED STATES OF AMERICA
Received: December 22, 2024; Accepted: May 11, 2025; Published: June 4, 2025
Copyright: © 2025 Wu et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the manuscript and its Supporting Information files.
Funding: This research was funded by the National Natural Science Foundation of China grant number [No. 82460957], the Guangxi Natural Science Foundation Project [No. 2022GXNSFAA035460, No. 2024GXNSFDA010005], and the Guangxi Graduate Education Innovation Program [No. YCSW2023395]. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Abbreviation: HCC, Hepatocellular carcinoma; nFRGs, novel ferroptosis-related gene signature; TCGA, The Cancer Genome Atlas; DEGs, Differentially expressed genes; ICIs, immune checkpoint inhibitors; TIDE, Tumor Immune Dysfunction and Exclusion; FRGs, Ferroptosis-related genes; OS, Overall survival; GEO, Gene Expression Omnibus; Cox, Cox proportional hazards regression model; C-index, Concordance Index; AUC, Area Under the Curve; HPA, Human Protein Atlas; ROC, Receiver operating characteristic; HR, Hazard ratio; PCA, Principal Component Analysis; GO, Gene Ontology; KEGG, Kyoto Encyclopedia of Genes and Genomes; GSEA, Gene Set Enrichment Analysis; MsigDB, Molecular Signatures Database; STRING, Search Tool for the Retrieval of Interacting Genes; TME, Tumor immune microenvironment; IC50, Half maximal inhibitory concentration; DsigDB, Database of Signatures of Gene Expression; Coef, Coefficients; SNPs, Single nucleotide polymorphisms
Introduction
The incidence and mortality rates of HCC, one of the most prevalent malignant tumors in the world, are increasing yearly [1,2]. The prognosis of HCC is still not adequate despite major advancements in treatment approaches, such as immunotherapy and targeted therapy [3–6]. Therefore, increasing the survival rate and quality of life for patients with HCC requires the development of novel predictive models and treatment approaches.
Ferroptosis, a form of programmed cell death induced by iron-dependent lipid peroxidation, plays an important role in the occurrence and progression of various tumors [7,8]. Ferroptosis has been implicated in the development, progression, and mechanisms of treatment resistance of HCC in recent years [9,10]. By controlling lipid peroxidation and iron homeostasis inside the tumor cells, ferroptosis influences the survival and death of tumor cells [11]. Ferroptosis-related genes (FRGs) have shown great promise in the prognostic assessment of HCC in a growing number of studies [12–17]. As a result, prognostic models built using these genes can enhance HCC prediction skills while also providing a scientific foundation for individualized care.
HCC immunological milieu is essential to the development and spread of the tumor. The immunological microenvironment of HCC affects immune cell invasion, immune escape, and immune tolerance [18,19]. Ferroptosis’s function in controlling the immunological milieu of HCC has drawn more and more attention. According to studies, ferroptosis influences immune cell activity in the tumor immunological milieu, which modifies the immune response in addition to directly affecting the survival of HCC cells [20,21]. To comprehend the immune escape mechanisms of HCC and create novel immunotherapy approaches, it is crucial to investigate the relationships between ferroptosis-related genes and the immunological milieu of HCC.
Furthermore, with a deeper understanding of the mechanisms of ferroptosis, an increasing number of drugs have been found to have the potential to regulate ferroptosis. By focusing on iron metabolism or lipid peroxidation pathways, these medications can cause ferroptosis in HCC cells. They can also improve the efficacy of immunotherapy by altering the immune microenvironment [22,23]. Thus, ferroptosis-related gene-based drug prediction models offer fresh perspectives on the accurate management of HCC and could potentially be a useful strategy for enhancing the therapeutic result of HCC, especially when combined with immunotherapy and targeted medications.
In conclusion, new routes for the clinical diagnosis and treatment of HCC are opened by the potential of FRGs in prognostic assessment, immunological microenvironments, and medication treatment. Ferroptosis-related gene-based prognostic models, immune microenvironment analysis, and medication prediction will all contribute to better treatment outcomes and a higher standard of living for HCC patients.
Materials and methods
HCC patients’ RNA sequencing and related clinical data were sourced from the TCGA database (https://portal.gdc.cancer.gov/). The TCGA-LIHC dataset includes 50 normal tissue samples and 369 HCC samples for differential expression analysis. After excluding samples with overall survival (OS) < 30 days or incomplete clinical information, 341 HCC samples were included in the subsequent analysis (Table 1). Among the 341 HCC patients in the TCGA cohort, the OS distribution was as follows: 219 patients survived, and 122 patients died, with a median OS time of 630 days.The Gene Expression Omnibus (GEO) database (GSE14520) includes 241 normal tissue samples and 247 HCC samples for differential expression analysis. After excluding samples with overall survival (OS) < 30 days or incomplete clinical information, 221 HCC samples were included in the subsequent analysis (Table 1). Among the 221 HCC patients in the GSE14520 cohort, the OS distribution was as follows: 136 patients survived, and 85 patients died, with a median OS time of 1569 days.To identify ferroptosis-related genes (FRGs) associated with the prognosis of HCC, we gathered FRGs from the FerrDb database (http://www.zhounan.org/ferrdb/), which includes genes that promote, inhibit, or are involved in ferroptosis. The research workflow is displayed in Fig.1.
DEGs analysis
Using the “DESeq2” R program, the TCGA database’s DEGs between tumor and normal samples were filtered. Either “Wald” or “LRT”, which will then use either Wald significance tests (defined by nbinomWaldTest), or the likelihood ratio test on the difference in deviance between a full and reduced model formula (defined by nbinomLRT).The filtering limits were |log2(FC)| > 1 and false discovery rate (FDR) < 0.05.
Construction and analysis of nFRGs
The tumor samples were randomly split into training and testing sets at an 8:2 ratio using the “caret” R package. To find FRGs with prognostic significance, we first chose the intersection of FRGs and upregulated DEGs. Subsequently, we performed univariate Cox proportional hazards regression model (Cox)regression analysis independently in both the training and testing sets.
We employed the “Mime1” R package, which offers a combined analysis of ten popular machine learning techniques to produce different prediction models, to choose the best model [24]. To select the best-performing model, we ranked the models according to their performance using Mime1 and computed the Concordance Index (C-index) and the 1-, 2-, and 3-year Area Under the Curve (AUC)values for each model in the training and validation sets. After comparing the C-index and AUC values of various algorithm combinations, we selected the model that performed well in both the training and validation sets. We then used this model for further analysis and prediction. Along with RNA-level analysis, we also evaluated the protein expression of prognostic genes in normal liver and HCC tissues using immunohistochemistry staining maps that we acquired from the Human Protein Atlas (HPA) database (https://www.proteinatlas.org/).
Next, we used the “survminer” and “survival” packages for analysis and divided the training set into high-risk and low-risk groups based on the median risk score of the model. The “survminer” package was used to perform Kaplan-Meier survival curve analysis, and time-dependent receiver operating characteristic (ROC) curves and AUC values for 1-, 2-, and 3-year survival were used to evaluate the model’s accuracy. To evaluate the target factors’ influence on prognosis, we also conducted univariate Cox regression analysis on datasets like TCGA_train, TCGA_test, TCGA, and GSE14520. Each feature’s hazard ratio (HR) and 95% CI were determined as part of the study, and the statistical significance (P-value) of each was assessed. The results from various datasets were then combined using meta-analysis, which used fixed-effects and random-effects models to assess the robustness and consistency of characteristics across datasets. R software was used for all statistical analyses, and p < 0.05 was chosen as the significance level.
Finally, using the median risk score from the TCGA and GSE14520 datasets to categorize the samples into high-risk and low-risk groups, we analyzed the distribution of risk scores, patient survival status, and gene expression. We then performed Principal Component Analysis (PCA) on the two risk groups to assess the differences in their multidimensional distributions. To evaluate the performance of our model, we compared it with previous ferroptosis-related prognostic gene models, using HR, C-index, and AUC.
Nomogram construction and validation
To ascertain whether the developed risk model could function as an independent prognostic factor for HCC, univariate and multivariate Cox regression analyses were conducted on the clinical features and risk groups of the TCGA and GSE14520 datasets using the “survival” package. Lastly, a nomogram was created using the “rms” R package to forecast the OS of HCC patients at one, two, and three years.
Functional enrichment analysis
We used the “limma” R package to perform DEGs on the two subgroups to examine the biological function and molecular mechanisms that differ between the high-risk and low-risk groups according to the median risk score of nFRGs. Next, we used the “clusterProfiler” program to do enrichment analysis on the derived DEGs using Gene Ontology (GO) and the Kyoto Encyclopedia of Genes and Genomes (KEGG). To identify possible biological pathways and functions, we lastly carried out Gene Set Enrichment Analysis (GSEA) using the “clusterProfiler” and “enrichplot” packages based on reference gene sets from the Molecular Signatures Database (MsigDB) database (symbols. gmt v7.0).
Protein-Protein Interaction (PPI) network construction
First, we used Pearson correlation analysis to assess the relationship between predictive genes. The “limma” packages were then used to conduct DEGs comparing high-risk and low-risk groups. These genes were added to the Search Tool for the Retrieval of Interacting Genes (STRING) online database (https://cn.string-db.org/) to create their PPI network, with a 0.4 confidence level. Cytoscape was used to visualize the PPI network and show the interaction network of the chosen nFRGs.
Gene mutation analysis
We obtained genetic variation data from the TCGA dataset and used the mutation data to determine each patient’s TMB score. Next, we examined the association between TMB and survival rate and contrasted the TMB variations between the high-risk and low-risk groups. We examined the quantity and kind of gene mutations in the high-risk and low-risk groups in the TCGA and GSE14520 datasets using the “Maftools” packages. We determined the main types of mutations and evaluated the prognostic-related gene mutation status in HCC. Furthermore, we examined the relationship between the main nFRG mutation types and the levels of mRNA expression in the TCGA and GSE14520 datasets.
Immune characteristics
We compared the immune cell infiltration disparities between high-risk and low-risk groups, evaluated the relationship between risk scores and 22 immune cell types, and quantified immune cell infiltration and immunological function for each sample using the “GSVA” package and the “CIBERSORT” algorithm. The immunological properties of the tumor immune microenvironment (TME) were assessed using the “estimate” package. The ESTIMATE score is a method that calculates the immune and stromal scores of the tumor microenvironment. These scores are indicative of tumor purity and the extent of immune infiltration, which can influence prognosis and treatment responses in HCC.Additionally, the expression variations of common immunological checkpoints in each subgroup were examined using the “limma” c. To predict how HCC patients will react to immunotherapy, the TIDE algorithm was used to determine the TIDE score for each sample and examine the variations in TIDE scores between high-risk and low-risk groups.The TIDE algorithm was employed to predict potential responses to immune checkpoint blockade in HCC, where a lower TIDE score indicates a better response to immunotherapy.The TIDE data used in this study was obtained from the online database (http://tide.dfci.harvard.edu/faq/).
Drug sensitivity analysis
We employed the “oncoPredict” R packages to evaluate the medication sensitivity of nFRGs in both high-risk and low-risk groups of HCC patients. Intending to offer recommendations for individualized treatment, this software forecasts medication sensitivity using gene expression data. We determined the expected half maximal inhibitory concentration (IC50) values for each of the 198 medications we chose for the analysis in patients with HCC. The IC50 value, which indicates how sensitive cells are to a drug, is the concentration of the drug that suppresses cell growth by 50%. We were able to determine viable treatment choices with varying drug sensitivities in various risk populations by comparing the IC50 values between the high-risk and low-risk groups.
Drug prediction based on nFRGs
We used the Database of Signatures of Gene Expression (DsigDB) database to further investigate possible therapeutic medications associated with the nFRGs in the study. We examined medications linked to the target genes based on their expression patterns. It is well known that the chosen medications significantly regulate particular genes.
Statistical analysis
R software (version 4.4.1) and the online analysis tool MicrobeX (https://www.bioinformatics.com.cn/) were used for all statistical analyses. With a significance level of p < 0.05, this study used several R packages, including Mime1, limma, survival, and caret. The criteria for statistical significance were p < 0.05, p < 0.01 and p < 0.001.
Results
nFRG construction and validation
50 normal samples and 369 HCC samples were among the 419 samples that were obtained from the TCGA database. 3349 elevated DEGs between tumor and normal tissues were found using differential analysis (Fig 2A). Furthermore, the FerrDb database yielded 484 FRGs. 55 differentially expressed FRGs were chosen after calculating the overlap of DEGs and FRGs (Fig 2B and S1 Table).
(B) Venn diagram of upregulated differentially expressed genes and ferroptosis-related genes (FRGs). (C) Venn diagram of univariate Cox genes in TCGA_train and TCGA_test datasets. (D) Lasso regression coefficient plot of the 11 FRGs. (E) Tuning parameters in the Lasso model. (F) Heatmap of the expression of five gene features in normal and tumor tissues from the TCGA dataset.
The 55 differentially expressed FRGs were subjected to univariate Cox regression analysis in the training and testing set that were separated from the TCGA dataset. According to the findings, 15 genes were linked to prognosis in the testing set and 20 genes were linked to prognosis in the training set. Eleven genes that were prognostic in both datasets were found by intersecting these two gene sets (Fig 2C). The number of feature genes was then decreased using LASSO regression analysis. Following the preprocessing of the gene expression data, a LASSO regression model was built. To maximize the predictive performance and control complexity of the model, the ideal λ value was chosen by cross-validation. Five nFRGs were ultimately chosen (Figs 2D and 2E): G6PD, SLC7A11, KIF20A, NT5DC2, and EZH2.
It should be mentioned that only 10 genes were shown in the Coefficients (Coef) of the LASSO regression model, even though 11 genes were found in the intersection analysis. This is probably because several genes’ regression coefficients were compressed to zero, meaning that they were not included in the current analysis since they lacked the necessary predictive power. Finally, we used a heatmap to show the expression of these 5 genes in the TCGA dataset (Fig 2F), and we examined the immunohistochemical expression of 4 of these genes in normal and HCC tissues using data from the HPA database. The findings demonstrated that while G6PD, KIF20A, NT5DC2, and EZH2 were not found in normal tissues, they were expressed in HCC tissues to varied degrees. Interestingly, the HPA database did not contain any immunohistochemistry expression data for SLC7A11 in liver tissue (S1 Fig).
Using the “Mime1” machine learning package, we fitted 87 predictive models to the TCGA-train dataset and determined the C-index for each model in the training and validation datasets (Fig 3A). The results show that the StepCox[forward] model performed best with a simple model in both the training and validation datasets, with average C-index values of 0.692 and 0.69, respectively (Fig 3A). In the training and validation datasets, its C-index values were 0.7, 0.72, 0.71, and 0.64, in that order (Fig 3B). The StepCox[forward] model performed well in the 3-year AUC values, according to the additional computation of the AUC values for each model at 1, 2, and 3 years (S2 Fig).
(B) C-index of the StepCox[forward] model across different datasets. (C) AUC values of the StepCox[forward] model for 1-year, 2-year, and 3-year outcomes in different datasets. (D-F) ROC curves for 1-year, 2-year, and 3-year outcomes in various datasets.
Therefore, StepCox[forward] was selected as the best model, and its AUC value was extracted (Fig 3C). To assess the StepCox[forward] model’s prognostic prediction performance, we then plotted the 1-, 2-, and 3-year ROC curves for the training and validation datasets (Figs 3D-F).
Finally, based on the median of the risk scores calculated by the StepCox[forward] model in the “Mime1” package, patients in both the training and validation cohorts were divided into high-risk and low-risk groups. The following formula is used to determine the risk score: 0.071 × KIF20A expression + 0.1015 × SLC7A11 expression + 0.1239 × NT5DC2 expression + (0.1452 × G6PD expression) + (0.1514 × EZH2 expression). High-risk patients in the training sample are substantially linked to a worse OS, according to the Kaplan-Meier survival curve (Fig 4A). Kaplan-Meier survival curves were analyzed using the log-rank test to compare survival differences between high- and low-risk groups.The validation set showed similar results (Figs 4B–D).To further validate whether the difference in diagnosis year affects survival analysis results, we divided the patients into two groups based on their diagnosis year: pre-2007 and post-2007, with 2007 marking a significant change in the treatment era. Initial survival analysis showed that diagnosis year alone was associated with differences in overall survival. However, when patients were stratified based on their risk scores, the survival differences between the pre-2007 and post-2007 groups were no longer statistically significant. This suggests that diagnosis year is not a decisive factor influencing survival outcomes. Instead, survival differences appear to be more strongly driven by the risk group (high-risk vs low-risk), which further supports the stability and robustness of the constructed risk model across different treatment eras (S3 Fig and S2 Table).
Dashed line represents the Median Overall Survival (OS). (E) Meta-analysis of univariate Cox regression. High: High risk group based on the ferroptosis-related gene signature (nFRGs). Low: Low risk group based on the ferroptosis-related gene signature (nFRGs).
To further evaluate the survival prognostic value of the selected features across different datasets, we performed univariate Cox regression analysis in each dataset and used meta-analysis to integrate the results. Univariate Cox regression was performed to estimate the hazard ratios (HR) for each feature’s impact on OS. According to specific findings, the chosen feature’s HR in the training set was 1.76, with a 95% CI of 1.16–2.67 and a P-value of 0.008, suggesting a significant correlation with OS (Fig 4E). The P-values were less than 0.05 in every test set, indicating statistical significance.
In the meta-analysis, we used both a random-effects model and a fixed-effects model to integrate the results from different datasets. The random-effects model was employed to account for heterogeneity across datasets, while the fixed-effects model was used when assuming homogeneity across the studies.With a P-value of 0.001 and an HR of 2.08 (95% CI: 1.66–2.61) after merging, the random-effects model showed that this trait had a consistent and statistically significant impact on survival prognosis across several datasets. When combined, the meta-analysis provided additional confirmation of this feature’s resilience and prognostic predictive significance across other datasets.
Risk Model Evaluation
To better stratify risk scores into high-risk and low-risk groups, we compared the use of the first quartile, median, and third quartile in univariate Cox regression analysis of clinical prognosis to select the optimal grouping method. The results showed that in the TCGA dataset, the low-high risk groups defined by the third quartile exhibited the best effect and significance, while the median grouping also showed significance, whereas the first quartile grouping did not demonstrate significant effects. In the GSE14520 dataset, only the median grouping showed significance, while both the first quartile and third quartile groupings did not show significant effects. Therefore, we concluded that using the median to define low and high-risk groups has stronger generalizability and can be applied in the GSE14520 dataset. Based on this, we divided patients in both the TCGA and GSE14520 cohorts into high-risk and low-risk groups according to the median risk score(S4 Fig).High-risk and low-risk patients were significantly clustered in both cohorts in the PCA analysis (Figs 5A and 5E). Gene expression levels, survival time, survival status, and distribution of risk scores were also examined. The TCGA cohort’s findings (Figs 5B–D) demonstrated that five genes had higher expression levels and that high-risk individuals died at higher rates. A similar pattern was also seen in the GSE14520 cohort’s data (Figs 5F-H). Overall, these results indicate that our prognostic model demonstrates accuracy and robustness across different datasets.
(A-D) The PCA 3D analysis, distribution of risk scores, correlation between survival time and survival status for each patient, and heatmap of the prognostic model genes in the risk subgroups for TCGA. (E-H) The PCA 3D analysis, distribution of risk scores, correlation between survival time and survival status for each patient, and heatmap of the prognostic model genes in the risk subgroups for GSE14520.
Comparison with previously published ferroptosis-related gene prognostic models
Through a literature search, we collected 6 ferroptosis-related gene prognostic models published in recent years and organized the corresponding genes, Coef, and PMID numbers for each model (Table 2 and S3 Table). This collection offers background knowledge and references for assessing our model, enabling a thorough comparison with earlier versions.
The findings demonstrate that our model performs at an intermediate to above-average level in both the average C-index and the C-index of the validation cohort when compared to existing prognostic models (Fig 6A). The AUC values for 1-, 2-, and 3-year survival showed a similar pattern (S5 Fig). We contrasted our model with six other published HCC FGRs prognostic models (Signature1 through Signature6 and Table 1) to confirm its prognostic prediction power. HR are represented by different colors in the heatmap shown in Fig 6B, where higher HR is indicated by darker hues. Statistical significance is indicated by the following symbols: ** for p < 0.01, and * for p < 0.05.
(A) C-index for each model in the different datasets. (B) HR heatmap for each model in the different datasets, with different colors representing different hazard ratios (HR), where darker colors indicate higher HR. Statistical significance is indicated by symbols: * for p < 0.05 and ** for p < 0.01, indicating statistical significance.
Our model showed a strong prognostic prediction capacity across four datasets when comparing the training and testing cohorts. The HR is near 4, with significance at p < 0.01, especially in the TCGA_test cohort, suggesting a better capacity to differentiate between individuals at high and low risk. The HR values of the other five published models, on the other hand, were comparatively weak and their significance was not as strong as that of our model, even though all models performed well in the training set (p < 0.01), except the Signature1 model, in the testing set. This implies that when extrapolated to separate datasets, their prognostic efficacy is lower, most likely as a result of variations in data processing techniques that result in poorer prediction performance. The robustness and superiority of this model in prognostic prediction for HCC are further validated by the comparison findings of our model with prior models across four datasets, which show higher C-index, AUC, and HR values.
Nomogram construction and validation
To find out if risk score grouping and other clinical characteristics could be independent variables for HCC prognosis, we employed univariate and multivariate Cox regression analysis. The TCGA clinical characteristics dataset’sunivariate Cox analysis revealed that risk score grouping, T staging, and TNM staging were all substantially correlated with prognosis (p < 0.05) (Fig 7A). Only the risk score grouping displayed comparable significant results in the multivariate Cox analysis (Fig 7B). These findings imply that the risk score may function as a stand-alone predictor of HCC.
(D-F) Forest plots of univariate and multivariate Cox analyses for risk score subgroups and clinical features in the GSE14520 dataset, along with the construction of the risk score nomogram.
We created a nomogram based on the risk score using the findings of the univariate and multivariate analyses from the TCGA entire cohort (Fig 7C). The nomogram’s ability to accurately predict the prognosis of patients with HCC was validated.
We also performed the same analysis on the GSE14520 clinical characteristics dataset’s external validation cohort. TNM staging, alpha-fetoprotein levels, tumor size, and risk score grouping were all substantially correlated with prognosis (p < 0.05), according to the univariate Cox analysis of the GSE14520 dataset (Fig 7D). Risk score grouping and TNM staging both produced significant results in the multivariate Cox analysis (Fig 7E). The risk score’s function as an independent prognostic factor for HCC is further supported by these findings.
Based on the univariate and multivariate analysis results from GSE14520, we constructed a nomogram using only the risk score from GSE14520 to compare with the TCGA nomogram (Fig 7F). These findings confirm once more how well the nomogram predicts the prognosis of patients with HCC.
DEGs functional enrichment analysis for high vs. low-risk groups
These DEGs were mainly involved in biological processes related to chromosome separation, nuclear chromosome separation, sister chromatid separation, and mitotic chromosome separation, according to the GO analysis’s GO-biological process (GO-BP) enrichment results. The DEGs were substantially enriched in cellular components such as centromere regions, condensed chromosomes, and chromosome regions, according to the GO-cellular component (GO-CC) enrichment data. These genes were linked to molecular functions such as arachidonic acid monooxygenase activity, steroid hydroxylase activity, and microtubule binding in the GO-molecular function (GO-MF) study (Figs 8A and 8B).
(C-D) KEGG enrichment analysis of DEGs between high-risk and low-risk groups. (E) Gene set C7 enrichment analysis (GSEA). (F) Gene set Hallmark enrichment analysis (GSEA).
According to the KEGG pathway enrichment analysis, these genes were found to be enriched in a number of biological pathways, including those pertaining to complement and coagulation cascades, the cell cycle, carbon metabolism, drug metabolism, DNA replication, and chemical carcinogenesis-DNA adducts (Figs 8C and 8D).
In the GSEA analysis, the results of the C7 gene set (Fig 8E) suggested that the immune microenvironment of the high-risk group might be more complex, involving significant activation or abnormal regulation of B cells, CD8 T cells, and natural killer (NK) cells. Additionally, the hallmark gene set’s GSEA analysis (Fig 8F) revealed that the high-risk group’s genes were enriched in important pathways such as MYC targets, G2M checkpoint, and E2F targets. Overall, these findings suggest that the identified DEGs may contribute to the regulation of cell cycle, immune response, and carcinogenic processes, emphasizing their potential relevance in disease prognosis and targeted therapeutic approaches (S4 Table).
PPI network construction
First, we identified the DEGs between the high-risk and low-risk groups. The STRING database was used to build the PPI network of these DEGs. A network diagram was then created by selecting five important model genes and the genes that are directly linked to these five genes using Cytoscape software. We used a Pearson correlation analysis heatmap to display the correlations between these prognostic genes to better investigate their linkages. KIF20A and EZH2 were shown to be strongly correlated (Figs 9A and 9B).
(B) Heatmap of nFRGs correlation.
Gene mutation analysis
After downloading the mutation data from the TCGA dataset, we further investigated the differences in TMB and survival between the high-risk and low-risk subgroups. Unfortunately, despite no significant difference in TMB between the high-risk and low-risk groups, and no clear correlation between TMB and survival (S6 Fig), patients with high TMB in the high-risk group generally exhibited poorer prognosis (Fig 10A). To further analyze the mutation patterns, we generated waterfall plots to show the top 20 genes with the highest mutation frequencies in the TCGA dataset and both risk subgroups (Figs 10B-D). Furthermore, we examined the main mutation types for the risk categories and discovered that the high-risk and low-risk groups were primarily affected by missense mutations, single nucleotide polymorphisms (SNPs), and C > T mutations, respectively (Figs 10E and 10F).
(B) A waterfall plot showing detailed mutation information for the top 20 genes in the TCGA dataset. (C, D) Waterfall plots showing detailed mutation information for the top 20 genes in the high-risk (C) and low-risk (D) groups. (E, F) Mutation information in the high-risk (E) and low-risk (F) groups categorized by different mutation types. (G, H) Correlation heatmaps of the top 20 mutated genes in the high-risk (G) and low-risk (H) groups.
Additionally, we examined the co-occurrence and mutually exclusive associations between the top 20 genes with the highest mutation frequencies in the low-risk group (Fig 10H) and the high-risk group (Fig 10G), as well as the median variation and various mutation types (Figs 10E and F). Different interactions are shown by the colors in the Fig: co-occurrence of gene pairs is shown in green, mutual exclusion is shown in red, and no significant association is shown in white. The level of significance is indicated by the dots or asterisks (p < 0.05 or p < 0.01).
TP53 and PCLO were found to have a significant co-occurrence association in the high-risk group (Fig 10G) (green, p < 0.05). Other gene pairs also showed notable mutation patterns in the low-risk group (Fig 10H). For instance, CTNNB1 and ARID2 were found to have a substantial co-occurrence connection (green, p < 0.01).
Correlation between risk scores and immune microenvironment
The immune cell composition of the tumor microenvironment differed significantly between the high-risk and low-risk groups in the immune infiltration study performed with ssGSEA and CIBERSORT (Figs 11A and 11C). Activated CD4 + T cells, effector memory CD4 + T cells, natural killer T cells, activated dendritic cells, T follicular helper cells, regulatory T cells (Tregs), and M0 macrophages were the most abundant immune response components in the high-risk group. Furthermore, in both analyses, eosinophils for the high-risk group displayed some degree of infiltration. The low-risk group, on the other hand, showed a distinct immune infiltration profile that was mainly enriched in resting mast cells, monocytes, M2 macrophages, activated NK cells, and CD4 + memory T cells.
(B) Comparison of immune function between the high-risk and low-risk groups using the ssGSEA algorithm. (C) Comparison of immune cell infiltration between the high-risk and low-risk groups using the CIBERSORT algorithm.
In terms of immunological function, the high-risk group expressed more MHC class I molecules, whereas the low-risk group displayed increased cytotoxic activity and active interferon responses (Fig 11B).
We examined the relationship between the risk score and infiltration of 22 immune cell types using Spearman’s rank correlation coefficient (ρ), which is suitable for detecting non-linear associations. Positive correlations were found between the risk score and activated CD4 + memory T cells (ρ = 0.20, p = 1.53e-04), follicular helper T cells (ρ = 0.23, p = 1.91e-05), M0 macrophages (ρ = 0.37, p = 8.63e-13), and resting dendritic cells (ρ = 0.21, p = 7.06e-05) (Figs 12B–D). However, no significant correlation was observed with memory B cells (ρ = 0.18, p = 1.05e-03), regulatory T cells (ρ = 0.17, p = 2.11e-03), or eosinophils (ρ = 0.16, p = 1.82e-04) (Figs 12A, 12E and 12F). Resting NK cells showed the strongest negative correlation with the risk score (ρ = −0.20, p = 1.05e-03) (Figs 12M), while the correlations for other immune cells were weaker (ρ > −0.2) (Figs 12H–L and 12N).
Subsequently, we performed a correlation analysis between the genes in the model and immune infiltrating cells and plotted the corresponding heatmap (Fig 13A). The findings indicate that there is a specific relationship between each gene and immune cells.
(A) Correlation analysis of nFGRs and immune infiltrating cells; (B) Comparison of immune checkpoint expression between the high-risk and low-risk groups; (C) Differences in StromalScore between the high-risk and low-risk groups; (D) Differences in TIDE scores between the high-risk and low-risk groups. (E)Differences in IPS between the high-risk and low-risk groups.
Immune response and immune checkpoints
42 immune checkpoints were found when we screened for common immunological checkpoints that differed significantly across the high-risk and low-risk groups. Only the IDO2 immune checkpoint exhibited an upregulation trend in the low-risk group, while most were upregulated in the high-risk group (Fig 13B).
We discovered a substantial difference in the stromal score between the low-risk and high-risk groups in the TME analysis. TumorPurity, ImmuneScore, and ESTIMATEScore did not change significantly (Fig 13C and S7 Fig). Additionally, the TIDE score analysis showed that the high-risk group’s TIDE score was substantially greater than the low-risk group’s (Fig 13D).To further explore the impact of risk score grouping on immunotherapy, we analyzed the Immune Prognostic Score (IPS). The results showed that in both the CTLA4-negative and PD1-negative group, as well as the CTLA4-positive and PD1-negative group, the IPS score of the low-risk group was significantly higher than that of the high-risk group.
Drug sensitivity analysis
We evaluated drug sensitivity by comparing the IC50 values between the high-risk and low-risk groups to find possible medications for the HCC prognostic model (Fig 14). According to the findings, there were notable variations among 103 medications. According to the data, people in the high-risk group were more sensitive to the majority of medications.
Drug prediction based on nFRGs
We compiled the significant correlations between each medicine and the associated genes after choosing the top ten candidate drugs based on the examination of the relationship between distinctive genes and drugs (Table 3 and S5 Table). Among these, acetaminophen (CTD 00005295) and 7646-79-9 (CTD 00000928) have P-values of 3.77E-04 and 8.84E-05, respectively, and are strongly linked to five distinctive genes (KIF20A, NT5DC2, G6PD, SLC7A11, and EZH2).
Discussion
In this study, we used univariate Cox regression, LASSO regression, and multivariate forward stepwise Cox regression to identify five genes—G6PD, NT5DC2, SLC7A11, KIF20A, and EZH2—as independent predictive characteristics for HCC. These characteristics successfully differentiated between high-risk and low-risk patients and showed strong predictive power in both the training and testing groups, indicating that high-risk patients have a lower OS. Additionally, The GEO database (GSE14520 dataset) was used as an external validation set to verify the reliability of the constructed predictive biomarkers.validation results from the GEO database supported these genetic characteristics’ potential as predictive biomarkers for HCC by confirming their accuracy and dependability.
Furthermore, this study is the first to demonstrate the originality and predictive power of the model we built (G6PD, KIF20A, EZH2, NT5DC2, SLC7A11), as well as to compare and assess several published FRGs prognostic models. According to the results, our model outperformed the majority of previously published models in every performance indicator, including average C-index, validation set C-index, and AUC values at 1, 2, and 3 years. In the testing set, several models had comparatively poor generalization ability, even though all models performed well in the training set. Our model’s consistent performance over several datasets indicates its potential for real-world use, indicating that future model development should concentrate on generalization skills to guarantee the model’s dependability on separate datasets.
However, the assumptions and limitations of the statistical analyses and models warrant further discussion. First, both the univariate and multivariate Cox regression models are based on the proportional hazards assumption, which assumes that the hazard ratios between different groups remain constant over time. However, this assumption may not hold in certain situations. Although we performed thorough data preprocessing and checks, there remains a risk that this assumption may be violated in some cases, which could impact the interpretation of the results. Furthermore, while LASSO regression is effective in reducing the number of variables and preventing overfitting, it may also lead to the exclusion of some potentially significant genes. Therefore, future studies should consider employing additional model validation techniques, such as cross-validation or external validation datasets, to further confirm the robustness and stability of the model.
Despite the excellent performance of all models in both the training and testing sets in this study, we also observed that certain previously published models exhibited poor generalization ability in the testing set, particularly in the context of clinical applications. This suggests that current predictive models may be prone to overfitting, highlighting the need to focus on enhancing the generalizability of the models during their development and optimization, ensuring their effectiveness in different datasets and clinical settings. Moreover, while our model outperformed existing FRG prognostic models across several indicators, the selection and weighting of these indicators may be influenced by the characteristics of the dataset and the analysis methods, potentially limiting the generalizability of the findings. Therefore, future studies should validate our findings in a broader range of external datasets to determine whether our results are applicable to a wider HCC patient population.
Another limitation is that we chose to exclude samples with a total survival time (OS) of less than 30 days to minimize the potential confounding effects of early non-tumor-related mortality on the analysis. Additionally, to ensure the reliability of the statistical analysis, we excluded samples with incomplete clinical data. We acknowledge that this complete-case analysis may introduce selection bias, but given the nature and proportion of missing data, we believe that directly applying multiple imputation methods could introduce additional uncertainty, which may impact the robustness of the results. Therefore, to address potential selection bias, we have minimized the impact of missing data as much as possible during the study design phase and conducted strict quality control in the statistical analysis.
Additionally, Another limitation is that our study primarily relied on data from public databases, which may suffer from sample selection bias or lack certain clinical information. Future research incorporating data from multi-center clinical samples would help to enhance the clinical applicability of the model. Additionally, although our study indicates that the selected genes are closely associated with the prognosis of HCC, how these genes interact in the tumor microenvironment and their specific mechanisms remain unclear. Therefore, future investigations should focus on further exploring the molecular mechanisms of these genes, particularly their roles in the liver cancer immune microenvironment.
In summary, while this study provides a robust statistical model for prognostic evaluation in HCC and validates the potential biomarkers associated with HCC prognosis, the assumptions of the statistical methods and the limitations of the models still need to be addressed and improved in future research. Specifically, validating the stability and generalizability of the models and addressing the challenges they may face in clinical applications will be crucial directions for future studies.
Ferroptosis-related pathways are crucial to the development of HCC, according to earlier research [25–27]. By controlling gene expression, cancer cells can develop resistance to ferroptosis [28], and the five FRGs that we found are important for both tumor growth and ferroptosis. G6PD (glucose-6-phosphate dehydrogenase) is a key enzyme in the pentose phosphate pathway and has been shown to inhibit ferroptosis in HCC by regulating the reductase (POR). POR is upregulated in G6PD deficiency, which prevents HCC cells from growing and spreading [29]. Additionally, G6PD expression is strongly linked to OS and the immune microenvironment in HCC, and its overexpression lowers ferroptosis in HCC, underscoring its potential as a tumor prognostic and therapeutic biomarker [30]. Increased expression of NT5DC2 (5’-nucleotidase domain-containing 2) has been linked to immunological infiltration and is a predictor of a poor outcome in patients with HCC, with possible uses in clinical stratification [31–34]. By controlling intracellular iron homeostasis and oxidative stress reactions, NT5DC2 contributes significantly to ferroptosis. NT5DC2 plays a crucial function in the regulation of ferroptosis, as evidenced by the fact that its knockdown increases cell susceptibility to ferroptosis inducers [35]. As a component of the system Xc^-, SLC7A11 (solute carrier family 7 member 11) facilitates glutathione synthesis and is in charge of cysteine transport, both of which are critical for antioxidant defense [26]. By controlling glutathione synthesis, SLC7A11 prevents ferroptosis and lowers lipid peroxide buildup. Research has demonstrated that SLC7A11 inhibits ferroptosis, which increases tumor growth in addition to HCC cell proliferation [36–40]. By controlling microtubule dynamics and cell division, the protein KIF20A (Kinesin Family Member 20A) plays a crucial role in HCC proliferation and tumor growth by facilitating chromosomal movement during cell division [41]. The transcription factor Gli2, which is a downstream target gene of the Hedgehog (Hh) signaling pathway, activates the FoxM1-MMB complex to enhance KIF20A expression, which in turn stimulates the formation of tumors and the proliferation of HCC cells. High KIF20A expression is strongly linked to HCC recurrence and a poor prognosis for patients, according to clinical studies [42]. Additionally, KIF20A interacts with the c-Myc pathway and immune evasion mechanisms to contribute to the development of HCC and the immunological response to treatment. By enhancing the benefits of anti-PD-1 immunotherapy, targeting KIF20A therapy may open up new avenues for precision medicine and immunotherapy in HCC [43]. Histone methyltransferase EZH2 (Enhancer of Zeste Homolog 2) mostly suppresses gene expression by trimethylating H3K27. In addition to controlling the expression of TFR2 (Transferrin Receptor 2) through epigenetic pathways, EZH2 is essential for the development, growth, metastasis, and immune evasion of HCC. This prevents ferroptosis and increases drug resistance in HCC cells [44]. As a result, EZH2 targeting could be a crucial approach to treating HCC [45]. The existing research has not disclosed the direct mechanistic relationship between EZH2 and KIF20A, even though we discovered a strong association between the two in the PPI network diagram and Pearson correlation analysis. However, research has revealed that EZH2 and KIF20A are strongly linked to the growth and prognosis of cancer and coexist in many prognostic indicators [46–50], offering crucial hints for further investigation.
After describing the individual roles of the ferroptosis-related genes (G6PD, KIF20A, EZH2, NT5DC2, SLC7A11), it is essential to investigate how these genes might interact with each other to influence the tumor microenvironment (TME) in a synergistic manner. These genes likely regulate ferroptosis through multiple interconnected mechanisms, where each gene not only plays a role in its specific pathway but also interacts with others to fine-tune the overall process. Their combined actions can influence critical biological processes such as iron homeostasis, oxidative stress, and antioxidant defense, which are all key factors in ferroptosis induction.
For instance, G6PD, as an important regulator of the pentose phosphate pathway, may impact the cellular oxidative stress balance, potentially enhancing or suppressing ferroptosis depending on the metabolic context. Meanwhile, SLC7A11, which is involved in glutathione synthesis and antioxidant defense, can work in conjunction with G6PD to modulate cellular resistance to ferroptosis by maintaining the cellular antioxidant capacity. KIF20A, which is linked to cell division, might influence the proliferation of tumor cells, thereby modulating their susceptibility to ferroptosis. The upregulation of KIF20A could potentially enhance the evasion of ferroptosis, promoting tumor growth and progression.
EZH2, a key epigenetic regulator, could influence the expression of other ferroptosis-related genes by altering chromatin structure, which might further contribute to ferroptosis resistance or enhance susceptibility. Similarly, NT5DC2’s role in regulating immune responses and its potential to influence ferroptosis suggests it might play a part in modulating the immune microenvironment. By interacting with immune cells, these genes could either support immune evasion or enhance immune responses, depending on their expression levels and activity.
The interaction of these genes creates a complex relationship between tumor progression, ferroptosis, and immune modulation. Some genes may suppress ferroptosis to evade immune detection, while others may induce ferroptosis to enhance immune recognition and tumor cell clearance. Understanding how these ferroptosis-related genes modulate the immune microenvironment and contribute to immune escape mechanisms is crucial for the development of new immunotherapies that target these pathways to promote ferroptosis in tumor cells and enhance anti-tumor immunity.
Our enrichment analysis revealed that the differentially expressed genes (DEGs) between the high- and low-risk groups are significantly enriched in key biological processes such as cell division, chromosome segregation, cell cycle, and mitosis, which are closely related to tumor proliferation and metastasis [51–54]. These results suggest that genes like G6PD may play a role in ferroptosis by regulating cell proliferation and division processes, further influencing tumor growth and metastasis.
Specifically, in terms of cellular components, the DEGs are enriched in centromeric and chromosomal regions, indicating that they may be involved in ferroptosis through the regulation of the cytoskeleton. For example, the role of KIF20A is closely related to the regulation of microtubule dynamics, and its key function in cell division may influence ferroptosis by regulating iron homeostasis and oxidative stress responses [41]. Additionally, SLC7A11 controls antioxidant defense by promoting the transport of cysteine and the synthesis of glutathione, thereby inhibiting ferroptosis. These processes are closely associated with the cell cycle and proliferation [26]. By regulating these cellular biological processes, these genes may exacerbate tumor proliferation and drug resistance in the high-risk group.
KEGG analysis shows that the DEGs in the high-risk group are enriched in key processes such as the cell cycle, carbon metabolism, and DNA replication, suggesting that these genes are closely related to cell proliferation and metabolic imbalance [55–57]. Notably, the role of EZH2 involves the epigenetic regulation of TFR2 expression, which further inhibits ferroptosis and enhances drug resistance [44], a process closely related to cell cycle regulation and tumor proliferation and metastasis. Therefore, the enrichment of these genes in the high-risk group may influence the occurrence of ferroptosis and promote tumor progression by altering iron homeostasis, oxidative stress, and cell cycle regulation.
GSEA analysis further revealed that the immune microenvironment of patients in the high-risk group is complex and closely related to immune evasion mechanisms [58–60]. This provides new insights into the role of these genes in the regulation of ferroptosis. For example, the expression of NT5DC2 is closely associated with immune infiltration and plays a crucial role in immune evasion [31–34], which may affect the response of patients to immunotherapy. Moreover, the interaction of KIF20A with the c-Myc pathway and immune evasion mechanisms may also be associated with both immune evasion and ferroptosis [42]. Additionally, our Hallmark gene set analysis revealed that genes in the high-risk group are enriched in pathways such as MYC target genes, G2M checkpoint, and E2F target genes, which are crucial for tumor progression, cell cycle regulation, and proliferation [61–63].
In summary, our enrichment analysis results indicate that DEGs in the high- and low-risk groups are closely related to the cytoskeleton, lipid metabolism, chromosomal regulation, proliferative processes, and the cell cycle. These genes are also associated with immune evasion strategies and a complex immune microenvironment. These findings provide further insights into the interactions of these ferroptosis-related genes and their potential in regulating ferroptosis within the tumor microenvironment (TME), as well as their viability as therapeutic targets for ferroptosis. Future studies can further explore the interactions between these genes, particularly in the TME, to understand how they synergistically regulate the occurrence of ferroptosis and provide a theoretical basis for the development of individualized treatment strategies.
It is imperative to investigate further how these findings might be used in therapeutic techniques, given the correlation found in our work between the DEGs in the high-risk group and the immunological microenvironment and immune evasion mechanisms. Immunotherapy has emerged as a novel therapeutic approach that has improved survival rates for HCC patients globally [64]. Immune checkpoint clinical research is comparatively advanced and well-established. Over time, immune checkpoint modification has gained popularity as an immunotherapy technique [65]. Furthermore, according to recent research, tumor mutational burden, or TMB, may be a predictive biomarker for immunotherapy, and a high TMB may indicate a favorable reaction to immune checkpoint inhibitors (ICIs) [66]. Initially, we examined the gene mutations in the prognostic genes in HCC and the gene mutations in the low-risk and high-risk groups. TMB did not significantly affect the survival rate, and there was no statistically significant difference in TMB between the high-risk and low-risk groups, according to the results. However, we discovered that missense mutations were prevalent, with the two main altered genes in both groups being TP53 and CTNNB1. It is commonly recognized that many types of cancer include mutations in TP53 and CTNNB1. It has been demonstrated that the tumor suppressor gene TP53 controls the metabolism of tumor cells [67,68].TP53-mutant HCC tissues, on the other hand, are often poorly differentiated and can be linked to angiogenesis and vascular invasion [69]. The protein β-catenin, encoded by the CTNNB1 gene, has been shown to encourage immune evasion and may influence how well HCC patients respond to immunotherapy [70]. Additionally, a deeper comprehension of the immune milieu will facilitate the creation of novel HCC treatment approaches.
The two patient groups based on risk scores differ significantly in the number of immune cells, according to the immune infiltration study. We discovered that the proportion of M0 macrophages, Tregs, and natural killer T (NKT) cells was higher in the high-risk group. Inhibiting anti-tumor immunity and influencing NK cell activity, Tregs and NKT cells are important components of the tumor microenvironment of HCC, contributing to tumor growth [71]. M0 macrophages, which are strongly linked to a poor prognosis, increased angiogenesis, and a low response to immunotherapy, also contribute significantly to the development of tumors in HCC. This discovery offers fresh approaches to risk assessment and tiered treatment in HCC [72]. The aforementioned findings imply that an immunosuppressive environment may be the cause of the high-risk group’s poor prognosis. Furthermore, immunotherapy based on ICIs has a lot of potential for treating HCC. We compared the two risk groups’ immunological checkpoints. ICIs may be more beneficial for low-risk patients, as the high-risk group exhibited greater expression of 41 immune checkpoints, indicating a stronger immune suppression status. Additionally, we discovered that low-risk patients had lower TIDE ratings, suggesting that immunotherapy may work better for them. Furthermore, we further evaluated immunotherapy using the IPS and found that the IPS of the low-risk group were generally higher than those of the high-risk group. In addition, the IPS of the low-risk group were significantly higher than those of the high-risk group in both the CTLA4-negative and PD1-negative group and the CTLA4-positive and PD1-negative group, further confirming the advantage of the low-risk group in immunotherapy.By analyzing the TIDE and IPS scores, clinicians can predict a patient’s response to immunotherapy based on their immune profile, thus providing a basis for personalized treatment. We found that the low-risk group exhibited more favorable immune responses in both TIDE and IPS scores, suggesting that these patients may respond better to immunotherapy, especially ICIs. This model helps clinicians identify patients who are likely to benefit from immunotherapy, avoiding ineffective treatment for low-responding patients, thereby improving treatment efficacy and reducing side effects.
Moreover, based on risk scores, clinicians can adjust treatment strategies. High-risk patients may require combination therapy, such as chemotherapy or targeted therapy, while low-risk patients may benefit from ICIs alone. Given the better response of the low-risk group to immunotherapy, ICIs may be the preferred treatment for this population. This model can also provide precise prognostic assessment and personalized follow-up management for patients, ensuring that high-risk patients receive more frequent monitoring and proactive intervention.
Finally, the immune scoring model can provide guidance for clinical trial design, helping to refine inclusion criteria, ensuring the efficacy and safety of immunotherapy, and serving as an efficacy biomarker to evaluate new drugs or treatment protocols. By integrating this model into clinical practice, it allows for tailored treatment plans, optimizing immunotherapy outcomes and improving prognosis, which holds significant clinical value, especially in the context of using ICIs in immunotherapy.
However, there are several limitations to the clinical application of this immune scoring model. One significant limitation is the lack of detailed treatment data in the public databases such as TCGA and GSE14520. These databases do not provide information about the specific treatments patients received, such as surgery, chemotherapy, or immunotherapy regimens. As a result, the model may conflate true prognostic effects (inherent tumor biology) with treatment-related factors (such as access to specific therapies), potentially leading to biased conclusions.
For instance, high-risk patients may have been ineligible for curative therapies or have received less aggressive treatments, which could affect their prognosis independently of the tumor’s biological behavior. This lack of treatment data introduces unmeasured confounding, limiting the model’s ability to fully assess the impact of treatment on patient outcomes. Future studies should aim to integrate treatment data, including detailed information about specific therapies and their effects on the tumor microenvironment, to refine this model and enhance its clinical applicability. Incorporating treatment-related variables will help differentiate the prognostic value of the immune scoring model from treatment-related biases, providing a clearer picture of how immune profiles influence treatment response.
We also used the prognostic model to assess the sensitivity to targeted medications and chemotherapy to forecast the treatment results of the patients. The findings indicated that the majority of medications were more sensitive in the high-risk group, indicating that these treatments would work better for high-risk individuals.
We also used DsigDB to screen the association between feature genes and medications to improve the therapy of nFRGs, and this revealed some promising therapeutic possibilities. By looking at the relationship between these medications and important genes, we discovered that several medications, including acetaminophen (CTD 00005295) and 7646-79-9 (CTD 00000928), exhibited strong connections with the feature genes. The five feature genes (KIF20A, NT5DC2, G6PD, SLC7A11, and EZH2) were substantially correlated with these two medications. The aforementioned medications may provide viable therapeutic approaches by modifying the expression of these genes, given their possible significance in tumor growth and therapy response. Additional research could examine how these medications work in particular pathological situations and assess their viability for clinical use
Conclusion
In conclusion, this research aims to determine five important genes with prognostic predictive value and investigate the possible contribution of ferroptosis-related gene features (nFRGs) to the prognosis of hepatocellular carcinoma (HCC). Our findings imply that low-risk patients might benefit more from immunotherapy, including immune checkpoint inhibitors (ICIs), as they exhibit lower immune suppression and better immune responsiveness. Conversely, the high-risk group shows increased expression of immune checkpoints, indicating a stronger immune-suppressive status.However, high-risk patients have greater sensitivity to targeted medications and chemotherapy, suggesting that they may respond better to these treatments. However, this study has certain limitations. As our analysis primarily relies on bioinformatics analysis using public datasets, the lack of in vitro or in vivo experimental validation of the selected genes limits the clinical applicability of the results. Nevertheless, based on the results from multidimensional data analysis and cross-validation, we have provided preliminary support for the potential application of the nFRGs model in HCC. Future experimental studies will help confirm the specific roles of these five selected genes in ferroptosis and hepatocellular carcinoma, and further validate their clinical translational value.
Supporting information
S1 Fig. Immunohistochemical expression of novel ferroptosis-related genes in normal and HCC tissues from the HPA database.
https://doi.org/10.1371/journal.pone.0322158.s001
(TIF)
S2 Fig. The AUC values of each model at 1 year, 2 years, and 3 years.
https://doi.org/10.1371/journal.pone.0322158.s002
(TIF)
S3 Fig. Kaplan-Meier survival analyses based on diagnosis year and risk stratification.
https://doi.org/10.1371/journal.pone.0322158.s003
(TIF)
S4 Fig. Forest plots: Univariate Cox analysis for risk score subgroups and clinical features in the TCGA and GSE14520 datasets.
https://doi.org/10.1371/journal.pone.0322158.s004
(TIF)
S5 Fig. The AUC values of our prognostic model compared with other ferroptosis-related prognostic models at 1 year, 2 years, and 3 years in each dataset.
https://doi.org/10.1371/journal.pone.0322158.s005
(TIF)
S7 Fig. Comparison of risk subgroups in the tumor microenvironment (TME).
https://doi.org/10.1371/journal.pone.0322158.s007
(TIF)
S1 Table. Intersection of ferroptosis and upregulated differential genes.
https://doi.org/10.1371/journal.pone.0322158.s008
(XLSX)
S2 Table. Pairwise comparison of overall survival among combined diagnosis year and risk group categories using Log-rank test.
https://doi.org/10.1371/journal.pone.0322158.s009
(XLSX)
S3 Table. Previously published ferroptosis-related prognostic models.
https://doi.org/10.1371/journal.pone.0322158.s010
(XLSX)
S4 Table. Functional analysis of differentially expressed genes between high- and low-risk subgroups.
https://doi.org/10.1371/journal.pone.0322158.s011
(XLSX)
S5 Table. Drug prediction for novel ferroptosis-related genes (nFRGs).
https://doi.org/10.1371/journal.pone.0322158.s012
(XLSX)
Acknowledgments
We sincerely thank the Huang Jingjing research group members for providing resources and support during the study. We also extend our gratitude to The Cancer Genome Atlas (TCGA) and the GEO database for their publicly available data, which were crucial for our analysis—special thanks to Professor Huang Jingjing for her valuable guidance in statistical analysis and bioinformatics. Additionally, we appreciate the technical assistance provided by the Guangxi Key Laboratory of Translational Medicine for Treating High-Incidence Infectious Diseases with Integrative Medicine and the constructive comments from the reviewers, which helped improve the quality of this manuscript.
References
- 1. Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A, et al. Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries. CA Cancer J Clin. 2021;71(3):209–49. pmid:33538338
- 2. Forner A, Reig M, Bruix J. Hepatocellular carcinoma. Lancet. 2018;391(10127):1301–14. pmid:29307467
- 3. Rebouissou S, Nault J-C. Advances in molecular classification and precision oncology in hepatocellular carcinoma. J Hepatol. 2020;72(2):215–29. pmid:31954487
- 4. Pillai A, Ahn J, Kulik L. Integrating Genomics Into Clinical Practice in Hepatocellular Carcinoma: The Challenges Ahead. Am J Gastroenterol. 2020;115(12):1960–9. pmid:33038134
- 5. Greten T.F, Lai C.W, Li G, Staveley-O’Carroll KF. Targeted and Immune-Based Therapies for Hepatocellular Carcinoma. Gastroenterology. 2024;167(4):819. pmid:38970609
- 6. Llovet JM, Montal R, Sia D, Finn RS. Molecular therapies and precision medicine for hepatocellular carcinoma. Nat Rev Clin Oncol. 2018;15(10):599–616. pmid:30061739
- 7. Dixon SJ, Lemberg KM, Lamprecht MR, Skouta R, Zaitsev EM, Gleason CE, et al. Ferroptosis: an iron-dependent form of nonapoptotic cell death. Cell. 2012;149(5):1060–72. pmid:22632970
- 8. Li D, Li Y. The interaction between ferroptosis and lipid metabolism in cancer. Signal Transduct Target Ther. 2020;5(1):108. pmid:32606298
- 9. Li J, Cao F, Yin H-L, Huang Z-J, Lin Z-T, Mao N, et al. Ferroptosis: past, present and future. Cell Death Dis. 2020;11(2):88. pmid:32015325
- 10. Chen J, Li X, Ge C, Min J, Wang F. The multifaceted role of ferroptosis in liver disease. Cell Death Differ. 2022;29(3):467–80. pmid:35075250
- 11. Yan H-F, Zou T, Tuo Q-Z, Xu S, Li H, Belaidi AA, et al. Ferroptosis: mechanisms and links with diseases. Signal Transduct Target Ther. 2021;6(1):49. pmid:33536413
- 12. Zhao C, Zhang Z, Tao J. A Novel Ferroptosis-Related Signature for Prediction of Prognosis, Immune Profiles and Drug Sensitivity in Hepatocellular Carcinoma Patients. Curr Oncol. 2022;29(10):6992–7011. pmid:36290827
- 13. Dai T, Li J, Lu X, Ye L, Yu H, Zhang L, et al. Prognostic Role and Potential Mechanisms of the Ferroptosis-Related Metabolic Gene Signature in Hepatocellular Carcinoma. Pharmgenomics Pers Med. 2021;14:927–45. pmid:34377010
- 14. Zhang B, Zhao J, Liu B, Shang Y, Chen F, Zhang S, et al. Development and Validation of a Novel Ferroptosis-Related Gene Signature for Prognosis and Immunotherapy in Hepatocellular Carcinoma. Front Mol Biosci. 2022;9:940575. pmid:35847985
- 15. Wang Q, Wang B, Ma X, Zhuang H, Xie Z, Tang C, et al. Identification of a Novel Ferroptosis-Related Gene Signature for Predicting Prognosis and Responsiveness to Immunotherapy in Hepatocellular Carcinoma. J Hepatocell Carcinoma. 2023;10:1–16. pmid:36647389
- 16. Wang W, Pan F, Lin X, Yuan J, Tao C, Wang R. Ferroptosis-Related Hub Genes in Hepatocellular Carcinoma: Prognostic Signature, Immune-Related, and Drug Resistance Analysis. Front Genet. 2022;13:907331. pmid:35938001
- 17. He Y, Wu Y, Song M, Yang Y, Yu Y, Xu S. Establishment and validation of a ferroptosis-related prognostic signature for hepatocellular carcinoma. Front Oncol. 2023;13:1149370. pmid:37143953
- 18. Chen C, Wang Z, Ding Y, Qin Y. Tumor microenvironment-mediated immune evasion in hepatocellular carcinoma. Front Immunol. 2023;14:1133308. pmid:36845131
- 19. Donne R, Lujambio A. The liver cancer immune microenvironment: Therapeutic implications for hepatocellular carcinoma. Hepatology. 2023;77(5):1773–96. pmid:35989535
- 20. Huang Y, Wang S, Ke A, Guo K. Ferroptosis and its interaction with tumor immune microenvironment in liver cancer. Biochim Biophys Acta Rev Cancer. 2023;1878(1):188848. pmid:36502929
- 21. Lei G, Zhuang L, Gan B. The roles of ferroptosis in cancer: Tumor suppression, tumor microenvironment, and therapeutic interventions. Cancer Cell. 2024;42(4):513–34. pmid:38593779
- 22. Zhang C, Liu X, Jin S, Chen Y, Guo R. Ferroptosis in cancer therapy: a novel approach to reversing drug resistance. Mol Cancer. 2022;21(1):47. pmid:35151318
- 23. Lei G, Zhuang L, Gan B. Targeting ferroptosis as a vulnerability in cancer. Nat Rev Cancer. 2022;22(7):381–96. pmid:35338310
- 24. Liu H, Zhang W, Zhang Y, Adegboro AA, Fasoranti DO, Dai L, et al. Mime: A flexible machine-learning framework to construct and visualize models for clinical characteristics prediction and feature selection. Computational and Structural Biotechnology Journal. 2024;23:2798–810.
- 25. Sun X, Ou Z, Chen R, Niu X, Chen D, Kang R, et al. Activation of the p62-Keap1-NRF2 pathway protects against ferroptosis in hepatocellular carcinoma cells. Hepatology. 2016;63(1):173–84. pmid:26403645
- 26. Gao R, Kalathur RKR, Coto-Llerena M, Ercan C, Buechel D, Shuang S, et al. YAP/TAZ and ATF4 drive resistance to Sorafenib in hepatocellular carcinoma by preventing ferroptosis. EMBO Mol Med. 2021;13(12):e14351. pmid:34664408
- 27. Yang R, Gao W, Wang Z, Jian H, Peng L, Yu X, et al. Polyphyllin I induced ferroptosis to suppress the progression of hepatocellular carcinoma through activation of the mitochondrial dysfunction via Nrf2/HO-1/GPX4 axis. Phytomedicine. 2024;122:155135. pmid:37856990
- 28. Friedmann Angeli JP, Krysko DV, Conrad M. Ferroptosis at the crossroads of cancer-acquired drug resistance and immune evasion. Nat Rev Cancer. 2019;19(7):405–14. pmid:31101865
- 29. Cao F, Luo A, Yang C. G6PD inhibits ferroptosis in hepatocellular carcinoma by targeting cytochrome P450 oxidoreductase. Cell Signal. 2021;87:110098. pmid:34325001
- 30. Zhou L, Zhang Q, Cheng J, Shen X, Li J, Chen M, et al. LncRNA SNHG1 upregulates FANCD2 and G6PD to suppress ferroptosis by sponging miR-199a-5p/3p in hepatocellular carcinoma. Drug Discov Ther. 2023;17(4):248–56. pmid:37599085
- 31. Seifried A, Schultz J, Gohla A. Human HAD phosphatases: structure, mechanism, and roles in health and disease. FEBS J. 2013;280(2):549–71. pmid:22607316
- 32. Chen J, Cao J, Wang P, He X. NT5DC2 is a novel prognostic marker in human hepatocellular carcinoma. Oncol Lett. 2020;20(4):70. pmid:32863903
- 33. Li R, Liu R, Zheng S, Liu W, Li H, Li D. Comprehensive Analysis of Prognostic Value and Immune Infiltration of the NT5DC Family in Hepatocellular Carcinoma. J Oncol. 2022;2022:2607878. pmid:35047040
- 34. Li K-S, Zhu X-D, Liu H-D, Zhang S-Z, Li X-L, Xiao N, et al. NT5DC2 promotes tumor cell proliferation by stabilizing EGFR in hepatocellular carcinoma. Cell Death Dis. 2020;11(5):335. pmid:32382041
- 35. Chen Y-C, Oses-Prieto JA, Pope LE, Burlingame AL, Dixon SJ, Renslo AR. Reactivity-Based Probe of the Iron(II)-Dependent Interactome Identifies New Cellular Modulators of Ferroptosis. J Am Chem Soc. 2020;142(45):19085–93. pmid:33124817
- 36. Chen Q, Zheng W, Guan J, Liu H, Dan Y, Zhu L, et al. SOCS2-enhanced ubiquitination of SLC7A11 promotes ferroptosis and radiosensitization in hepatocellular carcinoma. Cell Death Differ. 2023;30(1):137–51. pmid:35995846
- 37. Li D, Pan J, Zhang Y, Li Y, Jin S, Zhong C, et al. C8orf76 Modulates Ferroptosis in Liver Cancer via Transcriptionally Up-Regulating SLC7A11. Cancers (Basel). 2022;14(14):3410. pmid:35884471
- 38. Zhang B, Bao W, Zhang S, Chen B, Zhou X, Zhao J, et al. LncRNA HEPFAL accelerates ferroptosis in hepatocellular carcinoma by regulating SLC7A11 ubiquitination. Cell Death Dis. 2022;13(8):734. pmid:36008384
- 39. Tang J, Long G, Hu K, Xiao D, Liu S, Xiao L, et al. Targeting USP8 Inhibits O-GlcNAcylation of SLC7A11 to Promote Ferroptosis of Hepatocellular Carcinoma via Stabilization of OGT. Adv Sci (Weinh). 2023;10(33):e2302953. pmid:37867237
- 40. Bi F, Qiu Y, Wu Z, Liu S, Zuo D, Huang Z, et al. METTL9-SLC7A11 axis promotes hepatocellular carcinoma progression through ferroptosis inhibition. Cell Death Discov. 2023;9(1):428. pmid:38017014
- 41. Gasnereau I, Boissan M, Margall-Ducos G, Couchy G, Wendum D, Bourgain-Guglielmetti F, et al. KIF20A mRNA and its product MKlp2 are increased during hepatocyte proliferation and hepatocarcinogenesis. Am J Pathol. 2012;180(1):131–40. pmid:22056911
- 42. Shi C, Huang D, Lu N, Chen D, Zhang M, Yan Y, et al. Aberrantly activated Gli2-KIF20A axis is crucial for growth of hepatocellular carcinoma and predicts poor prognosis. Oncotarget. 2016;7(18):26206–19. pmid:27036048
- 43. Chen S, Zhao L, Liu J, Han P, Jiang W, Liu Y, et al. Inhibition of KIF20A enhances the immunotherapeutic effect of hepatocellular carcinoma by enhancing c-Myc ubiquitination. Cancer Lett. 2024;598:217105. pmid:38971490
- 44. Lai Y, Han X, Xie B, Xu Y, Yang Z, Wang D, et al. EZH2 suppresses ferroptosis in hepatocellular carcinoma and reduces sorafenib sensitivity through epigenetic regulation of TFR2. Cancer Sci. 2024;115(7):2220–34. pmid:38623968
- 45. Wang B, Liu Y, Liao Z, Wu H, Zhang B, Zhang L. EZH2 in hepatocellular carcinoma: progression, immunity, and potential targeting therapies. Exp Hematol Oncol. 2023;12(1):52. pmid:37268997
- 46. Gu P, Yang D, Zhu J, Zhang M, He X. Bioinformatics analysis identified hub genes in prostate cancer tumorigenesis and metastasis. Math Biosci Eng. 2021;18(4):3180–96. pmid:34198380
- 47. Chen X, Liao L, Li Y, Huang H, Huang Q, Deng S. Screening and Functional Prediction of Key Candidate Genes in Hepatitis B Virus-Associated Hepatocellular Carcinoma. Biomed Res Int. 2020;2020:7653506. pmid:33102593
- 48. Xu R, Lin L, Zhang B, Wang J, Zhao F, Liu X, et al. Identification of prognostic markers for hepatocellular carcinoma based on the epithelial-mesenchymal transition-related gene BIRC5. BMC Cancer. 2021;21(1):687. pmid:34112092
- 49. Solanki R, Zubbair Malik M, Alankar B, Ahmad FJ, Dohare R, Chauhan R, et al. Identification of novel biomarkers and potential molecular targets for uterine cancer using network-based approach. Pathol Res Pract. 2024;260:155431. pmid:39029376
- 50. He F, Zeng P, Ma S, Yang X, Liu H, Liu Q, et al. Identification and validation of a novel cuproptosis-related genes signature associated with prognosis, clinical implications and immunotherapy of hepatocellular carcinoma. Front Pharmacol. 2023;14:1088993. pmid:36843949
- 51. Goodson HV, Jonasson EM. Microtubules and Microtubule-Associated Proteins. Cold Spring Harb Perspect Biol. 2018;10(6):a022608. pmid:29858272
- 52. Jordan MA, Wilson L. Microtubules as a target for anticancer drugs. Nat Rev Cancer. 2004;4(4):253–65. pmid:15057285
- 53. Miller WL, Auchus RJ. The molecular biology, biochemistry, and physiology of human steroidogenesis and its disorders. Endocr Rev. 2011;32(1):81–151. pmid:21051590
- 54. Wang K, Shi J-H, Gao J, Sun Y, Wang Z, Shi X, et al. Arachidonic acid metabolism CYP450 pathway is deregulated in hepatocellular carcinoma and associated with microvascular invasion. Cell Biol Int. 2024;48(1):31–45. pmid:37655528
- 55. Poirier MC. DNA adducts as exposure biomarkers and indicators of cancer risk. Environ Health Perspect. 1997;105 Suppl 4(Suppl 4):907–12. pmid:9255579
- 56. Pio R, Corrales L, Lambris JD. The role of complement in tumor growth. Adv Exp Med Biol. 2014;772:229–62. pmid:24272362
- 57. Lima LG, Monteiro RQ. Activation of blood coagulation in cancer: implications for tumour progression. Biosci Rep. 2013;33(5):e00064. pmid:23889169
- 58. Helmink BA, Reddy SM, Gao J, Zhang S, Basar R, Thakur R, et al. B cells and tertiary lymphoid structures promote immunotherapy response. Nature. 2020;577(7791):549–55. pmid:31942075
- 59. Sun Y, Li F, Sonnemann H, Jackson KR, Talukder AH, Katailiha AS, et al. Evolution of CD8+ T Cell Receptor (TCR) Engineered Therapies for the Treatment of Cancer. Cells. 2021;10(9):2379. pmid:34572028
- 60. Xie G, Dong H, Liang Y, Ham JD, Rizwan R, Chen J. CAR-NK cells: A promising cellular immunotherapy for cancer. EBioMedicine. 2020;59:102975. pmid:32853984
- 61. Huntington JT, Tang X, Kent LN, Schmidt CR, Leone G. The Spectrum of E2F in Liver Disease--Mediated Regulation in Biology and Cancer. J Cell Physiol. 2016;231(7):1438–49. pmid:26566968
- 62. Xu X, Jiang C, Wang S, Tai Y, Wang T, Kang L, et al. HPIP is upregulated in liver cancer and promotes hepatoma cell proliferation via activation of G2/M transition. IUBMB Life. 2013;65(10):873–82. pmid:24038948
- 63. Duffy MJ, O’Grady S, Tang M, Crown J. MYC as a target for cancer treatment. Cancer Treat Rev. 2021;94:102154. pmid:33524794
- 64. El-Khoueiry AB, Sangro B, Yau T, Crocenzi TS, Kudo M, Hsu C, et al. Nivolumab in patients with advanced hepatocellular carcinoma (CheckMate 040): an open-label, non-comparative, phase 1/2 dose escalation and expansion trial. Lancet. 2017;389(10088):2492–502. pmid:28434648
- 65. Topalian SL, Forde PM, Emens LA, Yarchoan M, Smith KN, Pardoll DM. Neoadjuvant immune checkpoint blockade: A window of opportunity to advance cancer immunotherapy. Cancer Cell. 2023;41(9):1551–66. pmid:37595586
- 66. Chan TA, Yarchoan M, Jaffee E, Swanton C, Quezada SA, Stenzinger A, et al. Development of tumor mutation burden as an immunotherapy biomarker: utility for the oncology clinic. Ann Oncol. 2019;30(1):44–56. pmid:30395155
- 67. Voskarides K, Giannopoulou N. The Role of TP53 in Adaptation and Evolution. Cells. 2023;12(3):512. pmid:36766853
- 68. Aubrey BJ, Strasser A, Kelly GL. Tumor-Suppressor Functions of the TP53 Pathway. Cold Spring Harb Perspect Med. 2016;6(5):a026062. pmid:27141080
- 69. Calderaro J, Couchy G, Imbeaud S, Amaddeo G, Letouzé E, Blanc J-F, et al. Histological subtypes of hepatocellular carcinoma are related to gene mutations and molecular tumour classification. J Hepatol. 2017;67(4):727–38. pmid:28532995
- 70. Ruiz de Galarreta M, Bresnahan E, Molina-Sánchez P, Lindblad KE, Maier B, Sia D, et al. β-Catenin Activation Promotes Immune Escape and Resistance to Anti-PD-1 Therapy in Hepatocellular Carcinoma. Cancer Discov. 2019;9(8):1124–41. pmid:31186238
- 71. Bozward AG, Warricker F, Oo YH, Khakoo SI. Natural Killer Cells and Regulatory T Cells Cross Talk in Hepatocellular Carcinoma: Exploring Therapeutic Options for the Next Decade. Front Immunol. 2021;12:643310. pmid:33995362
- 72. Farha M, Jairath NK, Lawrence TS, El Naqa I. Characterization of the Tumor Immune Microenvironment Identifies M0 Macrophage-Enriched Cluster as a Poor Prognostic Factor in Hepatocellular Carcinoma. JCO Clin Cancer Inform. 2020;4:1002–13. pmid:33136432