Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Construction of a 5-gene prognostic signature based on oxidative stress related genes for predicting prognosis in osteosarcoma

  • Xiaofang Hong ,

    Contributed equally to this work with: Xiaofang Hong, Ribin Fu

    Roles Conceptualization, Project administration, Validation, Writing – original draft

    Affiliation Department of Stomatology, Zhongshan Hospital, Xiamen University, Xiamen, China

  • Ribin Fu

    Contributed equally to this work with: Xiaofang Hong, Ribin Fu

    Roles Formal analysis, Investigation, Software, Validation, Visualization, Writing – review & editing

    13799786280@163.com

    Affiliation Department of Joint Surgery and Sports Medicine, Zhongshan Hospital, Xiamen University, Xiamen, China

Abstract

Background

The understanding of the complex biological scenario of osteosarcoma will open the way to identifying new strategies for its treatment. Oxidative stress is a cancer-related biological scenario. At present, it is not clear the oxidative stress genes in affecting the prognosis and progression of osteosarcoma, the underlying mechanism as well as their impact on the classification of osteosarcoma subtypes.

Methods

We selected samples and sequencing data from TARGET data set and GSE21257 data set, and downloaded oxidative stress related-genes (OSRGs) from MsigDB. Univariate Cox analysis of OSRG was conducted using TARGET data, and the prognostic OSRG was screened to conduct unsupervised clustering analysis to identify the molecular subtypes of osteosarcoma. Through least absolute shrinkage and selection operator (LASSO) regression analysis and COX regression analysis of differentially expressed genes (DEGs) between subgroups, a risk assessment system for osteosarcoma was developed.

Results

45 prognosis-related OSRGs genes were acquired, and two molecular subtypes of osteosarcoma were clustered. C2 cluster displayed prolonged overall survival (OS) accompanied with high degree of immune infiltration and enriched immune pathways. While cell cycle related pathways were enriched in C2 cluster. Based on DEGs between subgroups and Lasso analysis, 5 hub genes (ZYX, GJA5, GAL, GRAMD1B, and CKMT2) were screened to establish a robust prognostic risk model independent of clinicopathological features. High-risk group had more patients with cancer metastasis and death as well as C1 subtype with poor prognosis. Low-risk group exhibited favorable OS and high immune infiltration status. Additionally, the risk assessment system was optimized by building decision tree and nomogram.

Conclusions

This study defined two molecular subtypes of osteosarcoma with different prognosis and tumor immune microenvironment status based on the expression of OSRGs, and provided a new risk assessment system for the prognosis of osteosarcoma.

Introduction

Osteosarcoma as one the most frequently detected malignant tumors in bone system usually occurs in the metaphysis of long renal tubules, accounting for 0.2% of all solid malignant tumors, and it mainly threatens children and adolescents with exuberant bone growth, accounting for 3–6% of childhood cancers [14]. For high-grade osteosarcoma, the combination of surgery and chemotherapy is the first choice [5]. Despite treatment, 30% to 40% of patients will relapse within 2 to 3 years after treatment [6]. Worse, there is no standard treatment for patients with recurrence or metastasis, and the 5-year survival rate is only 20% [79]. Therefore, digging effective therapeutic biomarkers [10] for predicting treatment response, and monitoring indicators for early metastasis [11] to ameliorate the survival of osteosarcoma patients is urgent.

Oxidative stress is a cancer-related biological situation, which means that the intracellular reactive oxygen species (ROS) is high [12, 13]. Tumor development and anticancer treatment response involves the regulation of oxidative stress [14]. Previous research showed evidence of the relationship between some molecular markers related to oxidative stress and the progression and prognosis of cancer. In gastric cancer, 11 prognostic oxidative stress molecules (SERPINE1, CTLA4, HBB, F5, AGT, KIT, GPX3, GAD1, CYP19A1, BBC3 and NOX4) were involved in tumor progression [15]. To independently evaluate the prognosis of operable breast cancer, past study produced a systematic oxidative stress score based on five oxidative stress biomarkers (CRE, TBIL, LDH, BUN and ALB) [16]. A recently published paper also constructed a risk model with three genes related to oxidative stress (ERO1A, CDC25C and ITGB4), which is related to the clinical manifestations of patients with lung adenocarcinoma [17]. More importantly, numerous studies have disclosed that anticancer therapies containing immunotherapies through regulating ROS levels are prospective [18]. For example, monoamine oxidase A regulated tumor-associated macrophage polarization through upregulating oxidative stress to improve cancer immunotherapy [19]. Therefore, Leone and other researchers pointed out that according to different oxidative stress genes, tumors should be classified into subcategories to ensure precision medicine-based method development in certain subgroup of cancer patients [20]. In this point, exploring the studies on the biomarkers of ROS in osteosarcoma and their impact on immunotherapy may provide new therapeutic strategies.

In this study, our plan is to classify osteosarcoma into subclasses and construct a risk assessment system by oxidative stress-related genes and to study the biological differences such as tumor immune microenvironment (TIME) and pathway enrichment analysis between subclasses and risk groups. We also aim to design a nomogram for promoting personalized osteosarcoma treatment.

Materials and methods

Collection of osteosarcoma data

Osteosarcoma patients’ clinical follow-up information and RNA-seq data were collected from the database of Therapeutically Applicable Research to Generate Effective Treatments (TARGET, https://ocg.cancer.gov/programs/target) and processed using R packages “TCGAbiolinks” [21]. Only the genes encoded in the database and expressed in at least 50% of the samples were retained, with a total of 84 samples. Osteosarcoma sample data in the GSE21257 dataset was achieved from Gene Expression Omnibus, 53 of them were included in this study.

Clustering for the oxidative stress related-gene

“GOBP_RESPONSE_TO_OXIDATIVE_STRESS” gene set was downloaded from Molecular Signatures Database (MsigDB), which covered 356 oxidative stress related-genes (OSRGs). OSRGs associated with osteosarcoma prognosis were screened and entered into the R package “ConsensusClusterPlus” [22] by performing univariate COX regression analysis of OSRGs in TARGET using the R “Survival” package [23]. In the process of unsupervised hierarchical clustering analysis, the algorithm we set was “hc”, the measurement distance was “pearson”, the bootstarps was 500, and the cluster range was 2–10.

Conclusion of tumor immune microenvironment (TIME)

Different methods were implemented to evaluate the indicators in the TIME. ESTIMATE method [24] applied immune score and stromal score and ESTIMATE score as the criteria for evaluating immune components, matrix content and tumor purity. The abundance of infiltrating immune cells in TIME was quantified by Microenvironment Cell Populations-counter (MCP-counter) [25] and ssGSEA [26] based on 28 immune cell subsets [27], TIMER [28], EPIC [29], respectively.

Prediction of the response to PD-1 therapy

T cell inflammatory microenvironment characterized by cytotoxic effector molecules, antigen presentation, active IFN- γ signal transduction, T cell active cytokines is a common feature of tumor biology in response to PD-1 checkpoint blocking [30]. In this study, IFN- γ scores and immunocyte infiltrated cytolytic (CYT) scoreswere calculated according to type 1 helper T (Th1) / IFN- γ gene signatures [31] and the expression of PRF1 and GZMA [32], respectively. Ayers et al. proposed T cell-inflamed gene expression profile (GEP) score, which predicts the potential clinical response of samples to PD-1 checkpoint blockade based on the T-cell-inflamed GEP of 18 genes [30]. Based on this, this study compared the performance of T cell inflamed GEP score between the clusters divided by clustering analysis.

Functional annotation

We extracted the GSEA hallmark set “h.all.v7.5.1.entrez.gmt” from MSigDB. The GSEA software tool [33] was adopted to calculate the pathway activity of different subgroups, and compared them between groups. The differential enrichment pathway between subgroups was determined by setting the threshold value within false discovery rate (FDR) < 0.05.

Differential expression analysis and establishment of a risk assessment system

LIMMA package [34] of R was adopted to identify OSRGs differentially expressed genes (DEGs) between relevant subtypes [8] with the threshold of FDR < 0.05 & | |fold change (FC)| > 1.5. The DEGs was analyzed by univariate COX regression, and genes showing potential prognostic value were screened with p < 0.001 as the threshold. LASSO Cox regression analysis in R with 10-fold cross-validation was performed to identify prognostic DEGs. To obtain the most concise risk assessment system, stepwise multivariate COX regression analysis was adopted to delete the variables that had little contribution to osteosarcoma prognosis by LASSO regression with “glmnet” package [35], and by combining the variables retained by multistep screening, a risk assessment system was established. Each sample in TARGET and GSE21257 datasets was assigned with a risk score by the risk assessment system and then normalized by Z score. Samples with Z score > 0 were divided into the high risk group, while Z score < 0 was divided into low risk group. The Kaplan-Meier curve (K-M curve) and logarithmic rank test were carried out to evaluate the survival difference between groups. The receiver operating characteristic [20] curves were subjected to describe the prognosis value of risk assessment system using the “timeROC” package [36].

Construction of a nomogram and a decision tree

The optimization model based on risk assessment system was constructed, including a decision tree and a nomogram. The sample clinical features (age, gender, metastatic) given in TARGET and the risk assessment system were selected as input features in “rpart” to recursive partition and build a decision tree model. To screen independent prognostic factors with a cut value of p < 0.05, we performed univariate and multivariate COX regression analysis before constructing nomogram. The independent prognostic factors were incorporated into the “RMS” package of R software [37] to construct nomogram, and the performance of nomogram model was evaluated based on calibration curve, decision curve analysis (DCA) and ROC curve.

Statistical analysis

The analysis and calculation of all data and the visualization of graph were performed using R software. Student T test was employed for comparisons between two groups, the statistical correlation of parametric variables was assessed by Pearson correlation. K-M curve was drawn by “survminer” package [38] and logarithmic rank test was adopted to analyze how overall survival (OS) was related to the risk score. Results with a p < 0.05 were considered as a statistical significance if not otherwise specified.

Results

Identification of two molecular subtypes of osteosarcoma with different clinical characteristics based on OSRGs unsupervised clustering

According to the OSRGs expression level given in TARGET, univariate COX regression analysis was conducted through the coxph function in R. The results showed that 45 OSRGs had statistical significance in the prediction of osteosarcoma. The hazard ratio [21] of 27 of them were < 1, was protective factors for osteosarcoma prognosis, and 18 of them were poor prognostic factors for osteosarcoma, with HR > 1 (Fig 1A). According to hierarchical clustering analysis for the expression of 45 OSRGs, we identified two molecular subtypes of osteosarcoma, C1 and C2 (Fig 1B). The OSRGs score of each sample in TARGET was calculated by ssGSEA and compared between the two subgroups. We found that the OSRGs score of C2 was higher than C1, and the difference of OSRGs score between the two groups was at the borderline significant (Fig 1C). Meanwhile, the expression of 45 OSRGs in two subgroups was shown directly in the form of heatmap (Fig 1D). It was found that in C1, 18 risk OSRGs were generally highly expressed and 27 protective OSRGs were generally under-expressed. However, in C2, the expression of 45 OSRGs showed an opposite trend than that in C1. The prognostic analysis of C1 and C2 subtypes in TARGET and GSE21257 demonstrated a significantly worse prognosis of C1 than C2 (Fig 1E and 1F). In term of age, gender and metastatic, the distribution of the two subgroups did not show statistical significance, although difference existed. The proportion of dead samples in C1 subgroup with poor prognosis was more than C2 (Fig 1G). These results together suggested that prognostic related OSRGs indeed could classify osteosarcoma patients into subcategories and effectively evaluate the prognostic differences between molecular subtypes.

thumbnail
Fig 1. Identification of two molecular subtypes of osteosarcoma with different clinical characteristics based on OSRGs unsupervised clustering.

A: Univariate COX regression forest plots for 45 OSRGs in the TARGET dataset. B: Sample clustering heatmap for consensus k = 2 in TARGET cohort. C: The box chart shows the OSRGs score difference between two subgroups in TARGET dataset. D: Expression heatmap of OSRGs associated with osteosarcoma prognosis in distinct subgroups of TARGET. E: K-M curves and survival differences for two clusters in the TARGET cohort. F: Prognostic differences between the two subgroups in the GSE21257 cohort. G: Comparison of clinical features of two subgroups in TARGET.

https://doi.org/10.1371/journal.pone.0295364.g001

The two subgroups of osteosarcoma showed different TIME and the response to PD-1 therapy

We explored the TIME of two subgroups defined by OSRGs in TARGET cohort. ESTIMATE showed significant differences between C1 and C2 in terms of immune score and stromal score and ESTIMATE score. Compared with C1, all three types of TIME score were significantly up-regulated in C2 (Fig 2A). MCP-counter quantified the abundance of 10 immune cells in C1 and C2, and the abundance of 9 immune cells showed significant difference between C1 and C2, and both were significantly higher in C2 (Fig 2B). Similarly, among the infiltrating immune cell enrichment scores obtained by ssGSEA, the scores of 13 kinds of immune cells (myeloid-derived suppressor cells (MDSC), central memory CD8 T cell, effector memory CD8 T cell, natural killer T cell, activated B cell, regulatory T cell, central memory CD4 T cell, type 1 T helper cell, natural killer T cell, macrophage, monocyte, activated dendritic cell, plasmacytoid dendritic cell) were higher than that in C1 (Fig 2C). Considering that T cell inflammatory microenvironment characterized by T cell active cytokines, cytotoxic effector molecules, and active IFN- γ signal transduction is an important part of blocking reactive immune microenvironment at PD-1 checkpoint. Here, the CYT score, IFN-γ score, T cell inflamed GEP score and autophagy score of C1 and C2 were analyzed, the scores of these T-cell inflammatory microenvironments in C2 were significantly higher than C1 (Fig 2D–2G). The results showed that C2 with high degree of immune infiltration was more likely to respond to PD-1 checkpoint blocking than C1.

thumbnail
Fig 2. The two subgroups of osteosarcoma showed different TIME and the response to anti-programmed death-1 (PD-1) therapy.

A: The difference of immune score, stromal score and ESTIMATE score between C1 and C2 calculated by ESTIMATE. B: The abundance difference of 10 immune cells in C1 and C2 quantified by MCP-counter. C: Differences in infiltrating immune cell enrichment scores calculated by ssGSEA between C1 and C2. D-G: Differences in CYT score, IFN- γ score, T cell inflamed GEP score and autophagy score between C1 and C2. (T-test, significance of difference is marked with *,* P <0.05; ** P <0.01; *** P <0.001; **** P <0.0001).

https://doi.org/10.1371/journal.pone.0295364.g002

Differences in signal pathways of two subgroups enrichment in osteosarcoma

The above results provided evidence for the TIME heterogeneity of two subgroups in osteosarcoma. Then, we studied the mediated functional differences between the two molecular subtypes. GSEA identified several signal pathways that showed significant differences between C1 and C2. In the TARGET dataset, compared with C2, 13 pathways in C1 were activated and 15 pathways were inhibited. Most of the pathways leading to cell-cycle regulation (including MYC targets, E2F targets, DNA repair, G2M checkpoint) were included in the list of activated pathways. The suppressed pathway was mainly involved in immune-associated regulation (including allograft rejection, inflammatory response, complement, and interferon alpha response), epithelial-mesenchymal transition and apoptosis (Fig 3A). Similarly, in the GSE21257 dataset, the pathways that were significantly activated and inhibited in C1 relative to C2 showed the same state as in TARGET (Fig 3B). In the two datasets, compared with C2, the pathways significantly activated and inhibited by C1 overlap a lot, and the biological functions mediated by C1 are surprisingly consistent as a whole (Fig 3C).

thumbnail
Fig 3. Differences in signal pathways of two subgroups enrichment in osteosarcoma.

A: GSEA analyzed the signaling pathways that showed significant differences between C1 and C2 in TARGET data sets. The left box represents the pathway that is significantly activated in C1 compared to C2, and the right box represents the pathway that is significantly inhibited in C1 vs C2. B: GSEA analyzed the signal pathways that showed significant differences between C1 and C2 in GSE21257 datasets. C: C1 Comparison of significantly activated and significantly inhibited pathways in C1 compared to C2 between TARGET and GSE21257 datasets.

https://doi.org/10.1371/journal.pone.0295364.g003

Establishment and verification of risk assessment system

Due to the heterogeneity of TIME and mediated biological pathways between C1 and C2, the DEGs between the two molecular subtypes was screened by difference analysis for further analysis. We identified 1005 DEGs between C1 and C2 in TARGET (Fig 4A). Compared with C2, 621 DEGs were down-regulated and 384 DEGs were up-regulated in C1 (Fig 4B). Univariate COX regression analysis screened 30 potential prognostic genes out of 1005 DEGs with p < 0.001 as the threshold. LASSO regression eliminated the multicollinearity and screened 16 genes (Fig 4C). Multivariate COX analysis based on stepwise regression finally identified the five genes that had the greatest impact on the prognosis of osteosarcoma, which were ZYX, GJA5, GAL, GRAMD1B and CKMT2 (Fig 4D). The established risk assessment system was: risk score = -0.552×ZYX-0.572×GJA5 +0.229×GAL +0.42×GRAMD1B+0.262×CKMT2. TARGET was used as the training set, and the ROC curve was plotted to evaluate the risk assessment system’s accuracy. In the first year, the AUC of ROC curve was 0.87, 0.84, 0.82 at 1 year, 3 years and 5 years, respectively (Fig 4E). The high-risk group manifested significantly worse survival rate than low-risk group of patients at different time points (Fig 4F). We tested the performance of the risk assessment system we created in the GSE21257 dataset. In this verification set, we also detect that the OS of the high-risk group was significantly lower than that of the low-risk group (Fig 4H). The AUC of the ROC curve reached 0.91 (in 1 year) and 0.78 in the 5 years (Fig 4G). It means that the risk evaluation system we created had a relatively stable and good performance in predicting the prognosis of osteosarcoma in both test set and verification set.

thumbnail
Fig 4. Establishment and verification of risk assessment system.

A: The volcano diagram shows the DEGs of C1 compared to C2 in TARGET. B: Heatmap shows the expression pattern of DEGs in C1 and C2 of TARGET. C: Lasso regression process of 30 genes with potential prognostic ability. D: Multivariate COX analysis based on stepwise regression was adopted to identify the key genes with the greatest prognostic impact on osteosarcoma. E: ROC curve assessed the accuracy of the risk assessment system in predicting the prognosis of osteosarcoma patients in the TARGET cohort. F: Survival K–M curves for the patients with diverse risk score in the TARGET cohort. G: ROC curve assessed the accuracy of the risk assessment system in predicting the prognosis of osteosarcoma patients in the GSE21257 cohort. H: Survival K–M curves for the patients with diverse risk score in the GSE21257 cohort.

https://doi.org/10.1371/journal.pone.0295364.g004

Clinical correlation of risk assessment system

In the TARGET cohort, we evaluated the clinical relevance of the risk assessment system. Firstly, the differences of clinical manifestations were analyzed in the two risk groups. In the distribution of gender and age between high-risk group and low-risk group, no significant difference has been detected. There were significant differences in the degree of metastatic, survival state and distribution of molecular subtypes between the two groups (Fig 5A). No significant difference was found in risk score between subgroups stratified by gender (female & male) and age (< = 15 & > 15). Risk score was significantly correlated with metastatic degree, survival status, molecular subtypes and risk type. Samples with metastasis, death, C1, and metastasis and high risk exhibited significantly up-regulated risk scores relative to the clinical states that were in opposition to them (Fig 5B).

thumbnail
Fig 5. Clinical correlation of risk assessment system.

A: Differences in the distribution of clinical features (chisq.test). B: The risk score calculated by the risk assessment system is compared between hierarchical subgroups according to gender, age, metastatic, status, and molecular subtypes. Ns: no significant difference; *: p<0.05, **: p<0.01, ***: p<0.001, ****: p<0.0001.

https://doi.org/10.1371/journal.pone.0295364.g005

TIME and biological processes of risk score calculated by the risk assessment system

We studied the TIME of risk score calculated by the risk assessment system. ESTIMATE analysis showed that the high-risk group was significantly correlated with lower stromal score and immune score and ESTIMATE score (Fig 6A). According to the relationship between the level of immune cells calculated by MCP-counter and ssGSEA and risk score, we found that the level of many kinds of immune cells in the high-risk group was significantly lower than that in the low-risk group, including monocytic lineage, plasmacytoid dendritic cell, B lineage, T cells, fibroblasts, neutrophils, myeloid dendritic cells, macrophage, natural killer cell, MDSC, and etc. (Fig 6B and 6C). The abundance of five immune cells in each risk group was calculated by the TIMER. Compared with the abundance in the low-risk group, the abundance of CD4 T cell, neutrophil and macrophage was significantly reduced in the high-risk group (S1A Fig). Among the 8 immune cells analyzed by EPIC, B cell, cancer associated fibroblast, endothelial cell and macrophage showed significantly lower abundance levels in the high-risk group than in the low-risk group (S1B Fig). The strength of the association between risk score and the state of immune cells was measured by correlation degree r calculated by Pearson correlation test. Risk score was indeed significantly negatively correlated with the abundance of multiple immune cell infiltration, including monocytic lineage, endothelial cells, neutrophils, B lineage, fibroblasts and myeloid dendritic cells, etc. (Fig 6D, S2 Fig). Moreover, the Pearson correlation test also showed that risk score was significantly negatively linked with the signal pathways regulating immunity, such as cytokine-cytokine receptor interaction, JAK-STAT signaling pathway, complement and coagulation cascade, FC γ R mediated phagocytosis (Fig 6E).

thumbnail
Fig 6. TIME and biological processes of risk score calculated by the risk assessment system.

A: The difference of stromal score and immune score and ESTIMATE score between high-risk group and low-risk group. B: The level of immune cells calculated by MCP-counter was in the high-risk group and the low-risk group. C: Comparison of tumor immune infiltrating cell score calculated by ssGSEA between high-risk and low-risk groups. D: The correlation between score and risk score of tumor immune infiltrating cells was calculated by Pearson correlation test. E: Display of signaling pathways significantly related to risk score. Ns: no significant difference; *: p<0.05, **: p<0.01, ***: p<0.001, ****: p<0.0001.

https://doi.org/10.1371/journal.pone.0295364.g006

Construction and performance evaluation of optimization model based on risk assessment system

We layered samples and optimized the risk assessment system more accurately by building a decision tree and nomogram. According to the age, gender, metastatic and risk assessment system of the samples in TARGET, recursive partitioning analysis was conducted, and a decision tree was obtained, which formed 4 nodes and 5 subgroups (Fig 7A). We detected statistical differences in OS among five subgroups (Fig 7B). It was very obvious that C2 and C3 were members of the low-risk group, and C4 and C5 were members of the high-risk group (Fig 7C). And from C1 to C5, the proportion of death samples in the group increased with the increase of the cluster number (Fig 7D). Metastasis and risk score was an independent variable for predicting OS of osteosarcoma (Fig 8A). Therefore, these independent variables are integrated together to build a nomogram (Fig 8B). The calibration curve showed that the OS predicted by nomogram was very close to the actual situation at different time points (Fig 8C). The DCA showed that in predicting OS, the risk assessment system and nomogram brought more net benefits than the independent prognostic variable metastatic, and also obtained the highest AUC values > 0.6 from 1 to 5yeras (Fig 8D and 8E).

thumbnail
Fig 7. Construction of a decision tree.

A: The decision tree classifier model is constructed according to the age, gender, metastatic and risk assessment system of the samples in TARGET. B: OS differences of five clusters divided by the decision tree. C: The distribution of five clusters divided by decision tree in high- and low-risk groups. D: Survival status statistics in the five clusters divided by the decision tree.

https://doi.org/10.1371/journal.pone.0295364.g007

thumbnail
Fig 8. Construction and evaluation of a nomogram based on risk assessment system.

A: Univariate COX regression and multivariate COX regression were used to analyze the clinical variables and risk score of TARGET. B: Nomogram integrating risk score and metastatic. C: The calibration curves of nomogram. D: Predictive net benefit of the nomogram, risk assessment system and metastatic. E: Comparing the performance of nomogram, risk assessment system, age, and metastatic osteosarcoma OS by comparing AUC (The AUC value of each indicator for 1–5 years is represented by the indicator’s representative color).

https://doi.org/10.1371/journal.pone.0295364.g008

Discussion

There is growing evidence that bone biology is particularly affected by the regulation of redox balance [39]. In healthy tissues, oxidative stress is involved in the maintenance of bone remodeling and the development of bone malignant tumors as well as a poor prognosis [40]. At present, a new strategy for the treatment of osteosarcoma called cold air ionomer based on the mechanism of oxidative stress has been developed [41]. And the survival factors and adaptive pathways for oxidative stress are being explored as targets for cancer treatment and intervention [42]. However, due to the heterogeneity of pathology, the development of therapeutic targets in OS is very complex, which limits the effectiveness of treatment and is conducive to tumor recurrence and drug resistance [4]. The classification of osteosarcoma based on molecular subtypes provides an opportunity for effective treatment of the disease [43]. In this study, according to 45 OSRGs with statistical significance for the prognosis of osteosarcoma, we divided osteosarcoma into two molecular subtypes. Through LASSO and COX regression analysis of DEGs between two molecular subtypes, an independent and prognostic risk assessment system for osteosarcoma was established.

Because osteosarcoma is a complex disease that interacts with its microenvironment and immune system [44], these aspects should not be ignored in the study of this disease. Immune cells, such as T cells, dendritic cells, and macrophages, are the main constitutes of the immunosuppressive TIME from the ROS-induced inflammation [45, 46]. Between the two subgroups, TIME of C2 with better prognosis had more degree of immune infiltration and were more likely to receive ICB therapy. A recently published paper revealed that ICB responders are characterized by CD8+ central memory T cell accumulation [47]. Consistently, we found high proportions of CD8+ central memory T cell in C2 group. What’s more, gastric cancer patients with response to anti PD-1 treatment had high CYT score [48]. High IFN- γ score and T cell inflamed GEP score are suggested to be favorable prognostic indicators for anti-PD-1/PD-L1 [49] or chemotherapeutics therapy [50]. These three high scores in C2 cluster further supported the conclusion that C2 had more possibility for PD-1 checkpoint blockade treatment. Collectively, these results explained the reason for osteosarcoma patients in C2’s better survival time and future treating strategy.

Considering the heterogeneity between molecular subtypes defined by specific oxidative stress genes, we identified the DEGs between the two molecular subtypes and deduced a risk assessment system composing of ZYX, GJA5, GAL, GRAMD1B, and CKMT2 for the prognosis of osteosarcoma. Many of the genes in the risk assessment system we have created have biological significance in the progression of cancer and are related to prognosis. Zyxin (ZYX) is a LIM domain protein found in cytoplasm and nucleus, which has been shown to be involved in apoptosis, migration and invasion of cancer [51, 52], and has potential as a prognostic marker for colorectal cancer 30697742. ZXY was selected a protective gene, which was in line with the findings that overexpressed ZXY could repress the progression of osteosarcoma through Rap1-mediated suppression of the MEK/ERK signaling pathway [53]. GJA5 was identified as one of OS related genes for hepatocellular carcinoma patients, who received transarterial chemoembolization treatment [54]. Nonetheless, no reports of this gene have been found in studies related to osteosarcoma. Galectins (GAL) are a family of endogenous glycan-binding proteins, among which, GAL1, GAL3 and GAL9 can be seen in many types of cancer progression and is considered to be a promising molecular target for the development of cancer therapeutic drugs [55, 56]. Underexpressed GAL1 was reported to inhibit the growth and invasion of osteosarcoma cells through depressing the MAPK/ERK pathway [57], and the inhibition of GAL3 has a therapeutic efficacy of Semliki Forest virus in pediatric osteosarcoma [58]. GRAMD1B inhibits cell migration by negatively regulating JAK / STAT and AKT signal transduction in breast cancer [59]. This gene was firstly discovered in an osteosarcoma model. CKMT2 was selected as a hypoxia-immune-related microenvironment prognostic gene for osteosarcoma [60] as a risk gene, which is consistent with our result. These findings provided evidence for genes in the risk assessment model as prognostic indicators of osteosarcoma.

There were some limitations in this study. First, the small number of OSRGs were obtained and the small number of patients were studied, as well as the lack of some key clinical information, inevitably limits the conclusion. Second, the conclusions were only based on bioinformatics analysis in two retrospective cohorts, and further experimental and clinical validation of our findings are warranted. Third, the internal molecular regulation mechanism of the risk assessment system was not considered, and further research is needed.

Conclusions

Overall, this study defined two osteosarcoma molecular subtypes. Wherein, C2 cluster displayed prolonged survival time and was more likely to benefit from PD-1 immunotherapy. A risk model composing of ZYX, GJA5, GAL, GRAMD1B, and CKMT2 was established for assessing osteosarcoma prognosis. Patients in low risk group had favorable OS and high degree of immune infiltration. Finally, a nomogram was also designed for future clinical practice. The oxidative stress-related subtypes defined in our study may help reveal the interaction between oxidative stress and the immune microenvironment, and its associated risk models and hub genes have potential implications for future clinical management and precision therapy.

Supporting information

S1 Fig. The abundance of immune cells in each risk group.

A: The abundance of five immune cells in each risk group was calculated by the TIMER. B: Abundance levels of the eight immune cells analyzed by EPIC in each risk group.

https://doi.org/10.1371/journal.pone.0295364.s001

(PDF)

S2 Fig. Correlation between abundance levels of immune cells analyzed by TIMER and EPIC and risk score.

https://doi.org/10.1371/journal.pone.0295364.s002

(PDF)

References

  1. 1. Nakano K. Challenges of Systemic Therapy Investigations for Bone Sarcomas. Int J Mol Sci. 2022;23(7):3540. pmid:35408900
  2. 2. Tobeiha M, Rajabi A, Raisi A, Mohajeri M, Yazdi SM, Davoodvandi A, et al. Potential of natural products in osteosarcoma treatment: Focus on molecular mechanisms. Biomed Pharmacother. 2021;144:112257. pmid:34688081
  3. 3. Kazantseva L, Becerra J, Santos-Ruiz L. Traditional Medicinal Plants as a Source of Inspiration for Osteosarcoma Therapy. Molecules. 2022;27(15). pmid:35956961
  4. 4. Moukengue B, Lallier M, Marchandet L, Baud’huin M, Verrecchia F, Ory B, et al. Origin and Therapies of Osteosarcoma. Cancers (Basel). 2022;14(14). pmid:35884563
  5. 5. Zhu T, Han J, Yang L, Cai Z, Sun W, Hua Y, et al. Immune Microenvironment in Osteosarcoma: Components, Therapeutic Strategies and Clinical Applications. Front Immunol. 2022;13:907550. pmid:35720360
  6. 6. Kaneuchi Y, Hakozaki M, Yamada H, Hasegawa O, Yamada S, Oka Y, et al. Very late relapse of high-grade osteosarcoma: A case report and review of the literature. Medicine (Baltimore). 2020;99(29):e21206. pmid:32702886
  7. 7. Odri GA, Tchicaya-Bouanga J, Yoon DJY, Modrowski D. Metastatic Progression of Osteosarcomas: A Review of Current Knowledge of Environmental versus Oncogenic Drivers. Cancers (Basel). 2022;14(2). pmid:35053522
  8. 8. Soghli N, Ferns GA, Sadeghsoltani F, Qujeq D, Yousefi T, Vaghari-Tabari M. MicroRNAs and osteosarcoma: Potential targets for inhibiting metastasis and increasing chemosensitivity. Biochem Pharmacol. 2022;201:115094. pmid:35588853
  9. 9. Lu Y, Zhang J, Chen Y, Kang Y, Liao Z, He Y, et al. Novel Immunotherapies for Osteosarcoma. Front Oncol. 2022;12:830546. pmid:35433427
  10. 10. Tang L, Hu H, Zhou Y, Huang Y, Wang Y, Zhang Y, et al. Expression and Clinical Significance of ACTA2 in Osteosarcoma Tissue. Oncologie. 2022;24(4):913–25.
  11. 11. Cao L, Yang D, Bai B. Mir-1247 Affects the Proliferation, Invasion and Apoptosis of Osteosarcoma Cells through SOX9. Oncologie. 2021;23(1):149–58.
  12. 12. Ebrahimi SO, Reiisi S, Shareef S. miRNAs, oxidative stress, and cancer: A comprehensive and updated review. J Cell Physiol. 2020;235(11):8812–25. pmid:32394436
  13. 13. Sosa V, Moline T, Somoza R, Paciucci R, Kondoh H, ME LL. Oxidative stress and cancer: an overview. Ageing Res Rev. 2013;12(1):376–90. pmid:23123177
  14. 14. Gorrini C, Harris IS, Mak TW. Modulation of oxidative stress as an anticancer strategy. Nat Rev Drug Discov. 2013;12(12):931–47. pmid:24287781
  15. 15. Wu Z, Wang L, Wen Z, Yao J. Integrated analysis identifies oxidative stress genes associated with progression and prognosis in gastric cancer. Sci Rep. 2021;11(1):3292. pmid:33558567
  16. 16. Zhang K, Ping L, Du T, Wang Y, Sun Y, Liang G, et al. A Novel Systematic Oxidative Stress Score Predicts the Prognosis of Patients with Operable Breast Cancer. Oxid Med Cell Longev. 2021;2021:9441896. pmid:34659642
  17. 17. Guo Q, Liu XL, Liu HS, Luo XY, Yuan Y, Ji YM, et al. The Risk Model Based on the Three Oxidative Stress-Related Genes Evaluates the Prognosis of LAC Patients. Oxid Med Cell Longev. 2022;2022:4022896. pmid:35783192
  18. 18. Nakamura H, Takada K. Reactive oxygen species in cancer: Current findings and future directions. Cancer Sci. 2021;112(10):3945–52. pmid:34286881
  19. 19. Wang YC, Wang X, Yu J, Ma F, Li Z, Zhou Y, et al. Targeting monoamine oxidase A-regulated tumor-associated macrophage polarization for cancer immunotherapy. Nat Commun. 2021;12(1):3530. pmid:34112755
  20. 20. Leone A, Roca MS, Ciardiello C, Costantini S, Budillon A. Oxidative Stress Gene Expression Profile Correlates with Cancer Patient Poor Prognosis: Identification of Crucial Pathways Might Select Novel Therapeutic Approaches. Oxid Med Cell Longev. 2017;2017:2597581. pmid:28770020
  21. 21. Colaprico A, Silva TC, Olsen C, Garofano L, Cava C, Garolini D, et al. TCGAbiolinks: an R/Bioconductor package for integrative analysis of TCGA data. Nucleic Acids Res. 2016;44(8):e71. pmid:26704973
  22. 22. Wilkerson MD, Hayes DN. ConsensusClusterPlus: a class discovery tool with confidence assessments and item tracking. Bioinformatics. 2010;26(12):1572–3. pmid:20427518
  23. 23. Therneau TM, Lumley T. Package ‘survival’. R Top Doc. 2015;128(10):28–33.
  24. 24. Yoshihara K, Shahmoradgoli M, Martinez E, Vegesna R, Kim H, Torres-Garcia W, et al. Inferring tumour purity and stromal and immune cell admixture from expression data. Nat Commun. 2013;4:2612. pmid:24113773
  25. 25. Becht E, Giraldo NA, Lacroix L, Buttard B, Elarouci N, Petitprez F, et al. Estimating the population abundance of tissue-infiltrating immune and stromal cell populations using gene expression. Genome Biol. 2016;17(1):218.
  26. 26. Barbie DA, Tamayo P, Boehm JS, Kim SY, Moody SE, Dunn IF, et al. Systematic RNA interference reveals that oncogenic KRAS-driven cancers require TBK1. Nature. 2009;462(7269):108–12. pmid:19847166
  27. 27. Charoentong P, Finotello F, Angelova M, Mayer C, Efremova M, Rieder D, et al. Pan-cancer Immunogenomic Analyses Reveal Genotype-Immunophenotype Relationships and Predictors of Response to Checkpoint Blockade. Cell Rep. 2017;18(1):248–62. pmid:28052254
  28. 28. Li T, Fan J, Wang B, Traugh N, Chen Q, Liu JS, et al. TIMER: A Web Server for Comprehensive Analysis of Tumor-Infiltrating Immune Cells. Cancer Res. 2017;77(21):e108–e10. pmid:29092952
  29. 29. Racle J, Gfeller D. EPIC: A Tool to Estimate the Proportions of Different Cell Types from Bulk Gene Expression Data. Methods Mol Biol. 2020;2120:233–48. pmid:32124324
  30. 30. Ayers M, Lunceford J, Nebozhyn M, Murphy E, Loboda A, Kaufman DR, et al. IFN-gamma-related mRNA profile predicts clinical response to PD-1 blockade. J Clin Invest. 2017;127(8):2930–40.
  31. 31. Danilova L, Ho WJ, Zhu Q, Vithayathil T, De Jesus-Acosta A, Azad NS, et al. Programmed Cell Death Ligand-1 (PD-L1) and CD8 Expression Profiling Identify an Immunologic Subtype of Pancreatic Ductal Adenocarcinomas with Favorable Survival. Cancer Immunol Res. 2019;7(6):886–95. pmid:31043417
  32. 32. Rooney MS, Shukla SA, Wu CJ, Getz G, Hacohen N. Molecular and genetic properties of tumors associated with local immune cytolytic activity. Cell. 2015;160(1–2):48–61. pmid:25594174
  33. 33. Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A. 2005;102(43):15545–50. pmid:16199517
  34. 34. Ritchie ME, Phipson B, Wu D, Hu Y, Law CW, Shi W, et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 2015;43(7):e47. pmid:25605792
  35. 35. Simon N, Friedman J, Hastie T, Tibshirani R. Regularization Paths for Cox’s Proportional Hazards Model via Coordinate Descent. J Stat Softw. 2011;39(5):1–13. pmid:27065756
  36. 36. Blanche P, Dartigues JF, Jacqmin-Gadda H. Estimating and comparing time-dependent areas under receiver operating characteristic curves for censored event times with competing risks. Stat Med. 2013;32(30):5381–97. pmid:24027076
  37. 37. He S, Jiang R, Sun H, Yang J, Ye C, Liu W, et al. Surgical efficacy and survival prediction of patients with unspecified malignant bone tumors. BMC Cancer. 2022;22(1):1078. pmid:36266614
  38. 38. Kassambara A, Kosinski M, Biecek P, Fabian S. Package ‘survminer’. Drawing Survival Curves using ‘ggplot2’(R package version 03 1). 2017.
  39. 39. Wauquier F, Leotoing L, Coxam V, Guicheux J, Wittrant Y. Oxidative stress in bone remodelling and disease. Trends Mol Med. 2009;15(10):468–77. pmid:19811952
  40. 40. Tornin J, Villasante A, Sole-Marti X, Ginebra MP, Canal C. Osteosarcoma tissue-engineered model challenges oxidative stress therapy revealing promoted cancer stem cell properties. Free Radic Biol Med. 2021;164:107–18. pmid:33401009
  41. 41. Mateu-Sanz M, Tornin J, Ginebra MP, Canal C. Cold Atmospheric Plasma: A New Strategy Based Primarily on Oxidative Stress for Osteosarcoma Therapy. J Clin Med. 2021;10(4). pmid:33672274
  42. 42. Fiocchetti M, Fernandez VS, Montalesi E, Marino M. Neuroglobin: A Novel Player in the Oxidative Stress Response of Cancer Cells. Oxid Med Cell Longev. 2019;2019:6315034. pmid:31354909
  43. 43. Joshi SS, Badgwell BD. Current treatment and recent progress in gastric cancer. CA Cancer J Clin. 2021;71(3):264–79. pmid:33592120
  44. 44. Lilienthal I, Herold N. Targeting Molecular Mechanisms Underlying Treatment Efficacy and Resistance in Osteosarcoma: A Review of Current and Future Strategies. Int J Mol Sci. 2020;21(18). pmid:32961800
  45. 45. Kuo CL, Ponneri Babuharisankar A, Lin YC, Lien HW, Lo YK, Chou HY, et al. Mitochondrial oxidative stress in the tumor microenvironment and cancer immunoescape: foe or friend? J Biomed Sci. 2022;29(1):74. pmid:36154922
  46. 46. Paijens ST, Vledder A, de Bruyn M, Nijman HW. Tumor-infiltrating lymphocytes in the immunotherapy era. Cell Mol Immunol. 2021;18(4):842–59. pmid:33139907
  47. 47. Wu TD, Madireddi S, de Almeida PE, Banchereau R, Chen YJ, Chitre AS, et al. Peripheral T cell expansion predicts tumour infiltration and clinical response. Nature. 2020;579(7798):274–8. pmid:32103181
  48. 48. Hu Q, Nonaka K, Wakiyama H, Miyashita Y, Fujimoto Y, Jogo T, et al. Cytolytic activity score as a biomarker for antitumor immunity and clinical outcome in patients with gastric cancer. Cancer Med. 2021;10(9):3129–38. pmid:33769705
  49. 49. Qian J, Wang C, Wang B, Yang J, Wang Y, Luo F, et al. The IFN-γ/PD-L1 axis between T cells and tumor microenvironment: hints for glioma anti-PD-1/PD-L1 therapy. J Neuroinflammation. 2018;15(1):290.
  50. 50. Haddad RI, Seiwert TY, Chow LQM, Gupta S, Weiss J, Gluck I, et al. Influence of tumor mutational burden, inflammatory gene expression profile, and PD-L1 expression on response to pembrolizumab in head and neck squamous cell carcinoma. J Immunother Cancer. 2022;10(2). pmid:35217573
  51. 51. Partynska A, Gomulkiewicz A, Dziegiel P, Podhorska-Okolow M. The Role of Zyxin in Carcinogenesis. Anticancer Res. 2020;40(11):5981–8. pmid:33109535
  52. 52. Wen XM, Luo T, Jiang Y, Wang LH, Luo Y, Chen Q, et al. Zyxin (ZYX) promotes invasion and acts as a biomarker for aggressive phenotypes of human glioblastoma multiforme. Lab Invest. 2020;100(6):812–23. pmid:31949244
  53. 53. Wei Z, Xia K, Zhou B, Zheng D, Guo W. Zyxin Inhibits the Proliferation, Migration, and Invasion of Osteosarcoma via Rap1-Mediated Inhibition of the MEK/ERK Signaling Pathway. Biomedicines. 2023;11(8). pmid:37626810
  54. 54. Wang D, Zhang L, Sun Z, Jiang H, Zhang J. A radiomics signature associated with underlying gene expression pattern for the prediction of prognosis and treatment response in hepatocellular carcinoma. Eur J Radiol. 2023;167:111086. pmid:37708675
  55. 55. Goud NS, Soukya PSL, Ghouse M, Komal D, Alvala R, Alvala M. Human Galectin-1 and Its Inhibitors: Privileged Target for Cancer and HIV. Mini Rev Med Chem. 2019;19(16):1369–78.
  56. 56. Mariño KV, Cagnoni AJ, Croci DO, Rabinovich GA. Targeting galectin-driven regulatory circuits in cancer and fibrosis. Nat Rev Drug Discov. 2023;22(4):295–316. pmid:36759557
  57. 57. Miao JH, Wang SQ, Zhang MH, Yu FB, Zhang L, Yu ZX, et al. Knockdown of galectin-1 suppresses the growth and invasion of osteosarcoma cells through inhibition of the MAPK/ERK pathway. Oncol Rep. 2014;32(4):1497–504. pmid:25069486
  58. 58. Herrador-Cañete G, Zalacain M, Labiano S, Laspidea V, Puigdelloses M, Marrodan L, et al. Galectin-3 inhibition boosts the therapeutic efficacy of Semliki Forest virus in pediatric osteosarcoma. Mol Ther Oncolytics. 2022;26:246–64. pmid:35949950
  59. 59. Khanna P, Lee JS, Sereemaspun A, Lee H, Baeg GH. GRAMD1B regulates cell migration in breast cancer cells through JAK/STAT and Akt signaling. Sci Rep. 2018;8(1):9511. pmid:29934528
  60. 60. Zhang W, Lyu P, Andreev D, Jia Y, Zhang F, Bozec A. Hypoxia-immune-related microenvironment prognostic signature for osteosarcoma. Front Cell Dev Biol. 2022;10:974851. pmid:36578780