Figures
Abstract
Background
The important regulatory role of competitive endogenous RNAs (ceRNAs) in hepatocellular carcinoma (HCC) has been confirmed. Tumor infiltrating lymphocytes (TILs) are of great significance to tumor outcome and prognosis. This study will systematically analyze the key factors affecting the prognosis of HCC from the perspective of ceRNA and TILs.
Methods
The Cancer Genome Atlas (TCGA) database was used for transcriptome data acquisition of HCC. Through the analysis of the Weighted Gene Co-expression Network Analysis (WCGNA), the two modules for co-expression of the disease were determined, and a ceRNA network was constructed. We used Cox regression and LASSO regression analysis to screen prognostic factors and constructed a risk score model. The Gene Expression Omnibus (GEO) was used to validate the model. The Kyoto Encyclopedia of Genes and Genomes (KEGG) was used for mRNAs functional analysis. The cell composition of TILs was analyzed by the CIBERSORT algorithm, and Pearson correlation analysis was utilized to explore the correlation between TILs and prognostic factors.
Results
We constructed a ceRNA regulatory network composed of 67 nodes through WGCNA, including 44 DElncRNAs, 19 DEGs, and 4 DEmiRNAs. And based on the expression of 4 DEGs in this network (RRM2, LDLR, TXNIP, and KIF23), a prognostic model of HCC with good specificity and sensitivity was developed. CIBERSORT analyzed the composition of TILs in HCC tumor tissues. Correlation analysis showed that RRM2 is significantly correlated with T cells CD4 memory activated, T cells CD4 memory resting, T cells CD8, and T cells follicular helper, and TXNIP is negatively correlated with B cells memory.
Citation: He Y, Xu R, Peng L, Hu X (2021) Tumor infiltrating lymphocytes associated competitive endogenous RNA networks as predictors of outcome in hepatic carcinoma based on WGCNA analysis. PLoS ONE 16(7): e0254829. https://doi.org/10.1371/journal.pone.0254829
Editor: Isabelle Chemin, Centre de Recherche en Cancerologie de Lyon, FRANCE
Received: January 8, 2021; Accepted: July 5, 2021; Published: July 29, 2021
Copyright: © 2021 He et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are available within the paper and its Supporting Information files.
Funding: The author(s) received no specific funding for this work.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Liver cancer is classified into hepatocellular carcinoma (HCC) and intrahepatic cholangiocarcinoma [1]. HCC accounts for 90% of primary liver cancers with estimated deaths ranking fourth among all kinds of cancers [2]. HCC is characterized by atypical clinical symptoms, difficult early diagnosis, a high degree of malignancy, early metastasis, and poor efficacy [3–5]. The incidence of liver cancer is still increasing, and the survival rate of 5-year is less than 18% [6]. Therefore, it is of great importance to understand the molecular mechanism of HCC development and to study potential foreseeable biological indicators to improve patient prognosis. The development of high throughput sequencing and computer technologies provides a methodological approach for the identification of disease-related genes in HCC, and also provides a new perspective on the study of cancer.
In recent years, the view that ceRNA is an important way to regulate the progression of various cancers including HCC has been widely accepted [7]. In the ceRNA network, microRNA (miRNA) is the core element, and lncRNA can produce sponge-like effects compete with ceRNA for miRNA response elements, thereby regulating mRNA expression, which is the so-called ceRNA hypothesis proposed by Salmena et al [8]. As a non-coding single-stranded RNA, miRNA is composed of approximately 22 nucleotides, and its binding site is located in the 3’-untranslated region, which can induce degradation of target mRNA, thereby inhibiting gene expression at the post-translated level and playing a variety of biological effects [9,10]. Regulating the expression of miRNA can affect a variety of cell functions. For example, loss of miRNA-10a (miR-10a) can induce intestinal tumors, and miRNA-138 can inhibit HCC cells by inhibiting SOX9 [11,12]. The ceRNA network is a major mode of regulation of miRNA expression. In gastric cancer, lncRNA HOTAIR can act as ceRNA, adsorb miR-331-3p, which in turn affects the proliferation, migration, and invasion of gastric cancer cells [13]. Therefore, we have reason to believe that ceRNA also plays an equally crucial regulatory role in HCC. Based on this, we will screen out the differentially expressed RNA in HCC tissues through the TCGA database to construct a ceRNA network, and explore the biological function and prognostic value of this network in HCC.
TILs are lymphocytes that infiltrate the stroma of tumor cells, and activated TILs can induce tumor cell apoptosis [14]. TILs have been proved to be associated with the survival of cancer patients. For example, the number of B cells infiltrating tumor tissues is believed to be correlated with the survival rate of a variety of cancer patients including breast cancer, ovarian cancer, and melanoma [15,16]. Meanwhile, TILs seem to have a predictive effect on neoadjuvant chemotherapy response [17]. At present, cancer immunotherapies are a key component of cancer clinical treatment [18]. As an immune organ, the liver contains a great number of congenital and adaptive immune cells [19]. The liver has been confirmed to contain numerous CD69 + NK cells [20]. Therefore, in-depth study of immune cells infiltrating in HCC tissues to find potentially related molecules that may reflect or regulate is essential for developing new drugs, enhancing the effectiveness of HCC immunotherapy, and predicting the prognosis of HCC patients. CIBERSORT is considered to be a reliable tool for calculating immune cell composition with excellent performance. Its main advantage lies in high sensitivity and specificity. It can quantify 22 human immune cell phenotypes at the same time, and through characterizing ~ 500 marker genes to calculate the relative ratio of each cell type [21]. WGCNA uses hierarchical clustering to identify clusters (modules) of highly related genes, uses module characteristic genes to define and classify clusters, and associates the modules with each other and the traits of external samples [22]. It is the most commonly used method for constructing co-expression networks and has been widely used to predict gene function, detect gene mutations in cancer, and discover new disease biomarkers [23,24]. Based on this, this study will construct a co-expression network through WGCNA, screen key genes in the ceRNA, and evaluate TILs in liver tumor tissues by CIBERSORT. In addition, the correlation between candidate genes in the constructed network and TILs in HCC tissues and their prognostic value was investigated.
Materials and methods
Data access
We obtained high-throughput RNA-sequencing data from the TCGA database (https://portal.gdc.cancer.gov/), including transcript data of 424 HCC patients and 425 patients miRNA-sequencing data. To eliminate the influence of gene length and sequencing depth on gene expression level, we calculated the reads per kilobase per million mapped reads (rpkm) for subsequent analysis. The ensemble grch38.84 was used for gene symbol conversion and annotate RNA categories. External ChIP-seq data sets for validation were downloaded from GEO database (https://www.ncbi.nlm.nih.gov/geo) including GSE14520, GSE22058, GSE25097, GSE57957 and GSE76427.
Identification of differentially expressed RNAs
Differentially expressed genes (DEGs), lncRNAs (DElncRNAs), and miRNA (DEmiRNAs) were performed by the “limma” Bioconductor R package [25]. The significance thresholds were identified of | log2 fold change (FC) | > 1 and false discovery rate (FDR) < 0.05. Finally, the volcano map was drawn using the “ggplot2” package in R software, while the heatmap was drawn using the “pheatmap” R package.
Construction of mRNA co-expression network
Based on mRNA expression, we used the “WCGNA” R package (Version 1.69) to construct the co-expression networks and screened the disease-related core modules [22]. The data set was grouped according to whether the samples were tumor tissues or not, including 374 tumor tissues and 50 non-tumor tissues. The nodes in the network represent mRNAs that are connected when the Pearson correlation coefficient between these nodes is greater than a specific threshold. In this study, the R-square cut-off value was set at 0.9. Conversion of a correlation matrix to adjacency matrix requires parameter β, and the value of β is set in such a way that the resulting adjacency matrix needs to approximate the scale-free topology feature. In this study, the soft threshold power is the parameter β, which is determined to be 8. Next, the topological overlap matrix (TOM) was calculated based on the adjacency matrix, and 1-TOM was used as a distance metric to perform the average linkage hierarchical clustering [26]. Correlation between module eigengenes and traits was evaluated with Pearson’s r correlation. The resulting co-expression network was built with a minimum module size of 50, merge cut height of 0.25, and signed network type.
Function annotation of mRNAs
Metascape (http://metascape.org) was used for KEGG analysis of mRNAs in two key modules, which showed the highest levels of positive or negative correlation with HCC, to understand the key pathways in the pathological process of HCC. In addition, the mRNAs in the ceRNA network we constructed also used this method for pathway analysis to explore the function of this ceRNA.
Construction of the DElncRNA-DEmiRNA-DEG regulatory network
We first used the miRcode database (https://www.mircode.org/) to forecast the miRNAs that interact with DElncRNAs, and cross the predicted miRNAs with DEmiRNAs to obtain the DElncRNA-DEmiRNA interactions pairs in the target dataset. Then we used TargetScan database (www.targetscan.org), miRDB database (http://mirdb.org/miRDB/), and miRTarBase database (http://mirtarbase.mbc.nctu.edu.tw) to predict the target mRNAs of the DEmiRNAs screened in the previous step, only predicted mRNAs with consistent predictions from the three databases were selected for further analysis. The predicted mRNAs were intersected with DEGs and the mRNAs in the core modules screened by WGCNA, and the final intersectant mRNAs were used for the construction of ceRNAs (Fig 4B). Cytoscape V3.7.1. was used for visualization.
Identification of prognostic signatures in ceRNA network
The 19 mRNAs in ceRNAs were further used to construct a prognostic risk score model. To make the model more reproducible, we also performed differential expression analysis on GSE14520, and selected mRNAs that were differentially expressed in GSE14520 and consistent with the trend of changes in the expression of these 19 mRNAs for the final analysis. Firstly, the 19 factors were assessed using the univariate Cox proportional hazard regression model, then the least absolute shrinkage and selection operator (LASSO) was applied to carry out dimensionality reduction for the variables in the regression model, and finally, a multivariate Cox regression analysis was carried out [27,28]. The "glmnet" in R software was used for LASSO Cox variable selection and model building, and the “survival” R package was used for comparison of the survival curves. The comprehensive risk score of each patient was calculated according to the linear combination of expressed values weighted by the LASSO Cox regression coefficient, and the formula is as follows.
In this equation, Exp represents the value of key mRNAs, and β represents the corresponding coefficient of the multivariate Cox regression analysis. According to the prognostic risk score, the HCC patients were regrouped. Cases with RiskScore > median were defined as the high-risk group, and vice versa, the low-risk group. We used "survplot" R package to plot the Kaplan-Meier (KM) survival curves In the high and low-risk groups of patients, and the Log-rank was used to compare the overall survival (OS) of the inter-group. We also conducted survival analysis on all signatures that make up the ceRNA network. The specificity and sensitivity of the model in prognosis prediction were evaluated according to the areas under the ROC curve at 1, 3, and 5 years.
Prognosis and expression validation for key DEGs
The Human Protein Atlas Database (HPA) (http://www.proteinatlas.org) is used to obtain representative immunohistochemical images of key DEGs between normal and hepatic carcinoma tissue. GSE14520, GSE22058, GSE25097, GSE57957, and GSE76427 are all used for expression verification of key prognostic genes. The GSE14520 data set contains 225 HCC tumors and 220 adjacent paracancerous tissues, with 221 complete survival data, this data set will be used as a validation set to verify the accuracy and sensitivity of the prognostic risk score model.
Analysis of lymphocyte infiltration through CIBERSORT
For analyzing the immunobiological characteristics of HCC patients, we use CIBERSORT (http://cibersort.stanford.edu/) to classify and quantify the abundance of TILs. The input data is the RNA expression matrix downloaded directly from the TCGA database [29]. For the next analysis, only cases with CIBERSORT P-value < 0.05 can be used. Violin plots were generated using the “vioplot” R package. Meanwhile, Spearman correlation analysis was used to investigate the correlation between prognosis-related DEGs and infiltrating lymphocytes.
Results
Differential expression analysis
The DEGs were screened out using FDR < 0.05 and | log2FC | > 1 as the cutoffs. A total of 2769 mRNAs (1741 downregulated and 1028 upregulated) (Fig 1A), 259 lncRNAs (119 downregulated and 140 upregulated) (Fig 1B), and 127 miRNAs (58 downregulated and 69 upregulated) (Fig 1C) were differently expressed between the HCC tumor and normal liver tissues. Differential analysis of validation set GSE14520 showed that there were 9405 differentially expressed genes, of which 4879 genes were up-regulated and 4526 genes were down-regulated (Fig 1D).
(A) mRNA. (B) lncRNA. (C) miRNA. (D) Differential analysis of GSE14520.
Construction of WGCNA and identification of hub modules
We used the "WGCNA" R package to explore the expression profiles of mRNAs in 374 HCC tissues and 50 normal tissues. The mRNAs were hierarchically clustered with the cut line value set to 20,000 to remove outliers (Fig 2A), but no samples were removed in this study. The independence and average connectivity of the modules is determined by the power value.
(A) Hierarchical clustering tree of 374 tumor and 50 normal liver tissues mRNAs expression patterns. (B) Identification of power value. The red line represents R2 > 0.9 when the power value is 8. (C) Clustering dendrogram and merging of mRNAs co-expression modules. (D) Correlation heatmap of mRNA modules and clinical traits. The correlation is related to color changes, red and blue represent positive and negative correlation, respectively. (E) Scatter diagram of mRNAs in the magenta module, which is positively correlated with HCC. (F) Scatter diagram of mRNAs in the yellow module, which is positively correlated with HCC.
Setting the power value to 8 can make the scale-free R2 reach 0.9, which is the lowest threshold (Fig 2B), to better reflect the scale-free topology of the co-expression network. According to the topological overlap measure (TOM), a total of 7 color modules are determined through dynamic tree cutting. The mRNAs that could not be included in any module were put into the gray module (Fig 2C). The correlation between modules and traits is shown in Fig 2D, in which magenta module (r = 0.86, P < 0.01) and yellow module (r = -0.57, P < 0.01) represent the most positive and negative correlations with HCC. The number of mRNAs contained in these two modules is 214 (magenta module) and 623 (yellow module) (Fig 2E and 2F), and all of these 837 mRNAs identified by WGCNA will be used in the next analysis.
The mRNAs in the selected core modules were performed KEGG pathway analysis. The pathway analysis of 214 mRNAs involved in the magenta module showed that they were mainly rich in mineral absorption (ko04978), malaria (ko05144), and MAPK signaling pathways (hsa04010), while the 623 mRNAs in the yellow module were mainly enriched in the cell cycle (hsa04110), DNA Replication (ko03030) and p53 signaling pathway (ko04115), etc (Fig 3A and 3B).
(A) The KEGG analysis of mRNAs involved in the magenta module. (B) The KEGG analysis of mRNAs involved in the yellow module.
Construction of the ceRNA network
Firstly, the miRNAs targeted by 259 DElncRNAs were foreseen using miRcode database and the predicted miRNAs were mapped to DEmiRNAs. A total of 175 DElncRNAs-DEmiRNAs pairs were identified, which included 58 DElncRNAs and 9 DEmiRNAs. Secondly, we used the 9 DEmiRNAs selected in the previous step to predict their target mRNAs through miRDB, miRTarBase, and TargetScan databases. The predicted mRNAs were intersected with DEGs and the mRNAs in the core modules screened by WGCNA, and 19 DEmiRNAs-DEGs pairs were obtained. Finally, we constructed a ceRNA network through Cytoscape including 44 DElncRNAs, 19 DEGs, and 4 DEmiRNAs (Fig 4A). The related flow chart is shown in Fig 4B. KEGG analysis shows that the pathway is mainly enriched in Hepatitis B (hsa05161) and Cell cycle (ko04110) (Fig 4B).
(A) ceRNA network. (B) Flow chart of construction and analysis of ceRNA network. Rectangles indicate DEmiRNAs, triangles indicate DElncRNAs, and ellipses indicate DEGs. There are blue and red marks that represent down-regulated and up-regulated RNAs, respectively. The KEGG analysis of the 19 mRNAs in the network was performed by the Metascape database with a P-value cutoff of 0.01.
Prognostic value of ceRNA network
We intersected the 19 DEGs included in the ceRNA network with the mRNAs in GSE14520 with statistically significant differences between tumor and normal tissues, and finally obtained a total of 14 mRNAs with the same trend of change (Fig 5A). These 14 mRNAs were used for Univariate Cox analysis to identify potential risk factors, and only 6 mRNAs with a statistical difference were selected (Table 1). Four mRNAs that remained after LASSO regression analysis were selected for the construction of the multivariate Cox prediction model (Fig 5B). In the prognosis model, four molecules with statistical differences are RRM2, LDLR, TXNIP, and KIF23 (Fig 5C). We calculated the risk score of each patient as follows:
(A) The 14 mRNAs used for the construction of the risk score model were produced by the intersection of DEGs in ceRNA and GSE14520. (B) Partial likelihood deviance of minimum number corresponds to the covariates revealed by the LASSO regression model. (C) Forest plots of hazard ratios (HR) of the four molecules with statistical differences OS-associated mRNAs. (D) The 1, 3, and 5 years of survival were analyzed by ROC curve to evaluate the efficiency of the clinical predictive model. (E) Survival curve of high and low-risk groups stratified by the prediction model. (F) The ROC curves of the external validation set GSE14520. (G) Survival curve of the external validation set GSE14520.
Patients were segmented into high-risk and low-risk groups using the median as a cut-off value. We used time-dependent ROC curves at 1, 3, and 5 years and KM plots to evaluate the three-mRNA signature in predicting the outcome of the HCC patients. In the training set, the area under the curve (AUC) for 1, 3, and 5 years of survival were 0.685, 0.668, and 0.643, respectively, which indicates that our prediction model constructed using these 4 mRNAs can accurately predict the survival of patients (Fig 5D). KM curve between high and low-risk groups showed a significant difference in OS (log-rank test P < 0.001) (Fig 5E). This result was further confirmed in the external validation set GSE14520. In the validation set, the AUC for 1, 3, and 5 years of survival were 0.616, 0.626, and 0.639, respectively. At the same time, KM curve also confirmed that the high-risk score group has a worse survival (Fig 5F and 5G).
To further explore the prognostic value of the ceRNA network we constructed, we also performed survival analysis on all the constituent elements of ceRNA. The results showed that 8 DEGs (CCDC71L, KPNA2, RRM2, E2F1, MCM4, CHAF1A, TXNIP, and KIF23), 10 DElncRNAs (C2orf48, LINC00152, HAO2-IT1, SNHG3, DIO3OS, LINC00261, WARS2-IT1, FBXL19-IT1), and one DEmiRNA (miR-139-5p) were found to have statistical significance in predicting OS (Fig 6).
Gene expression validation
We selected 5 data sets from the GEO database for external verification of 4 key model genes (RRM2, LDLR, TXNIP, and KIF23) expressions. The results showed that the expression of RRM2 and KIF23 was significantly up-regulated in HCC tumor tissues compared with adjacent tissues (P < 0.001). TXNIP was down-regulated in tumor tissues, with statistically significant differences in GSE14520, GSE25097, and GSE76427 (P < 0.001), while there was no statistically significant difference in GSE22058 and GSE57957 (P > 0.05). LDLR was confirmed to be down-regulated in HCC tumor tissues in all external data sets (P < 0.01) (Fig 7A–7E).
External validation data set downloaded from GEO including (A) GSE14520, (B) GSE22058, (C) GSE25097, (D) GSE57957, and (E) GSE76427 used to compare the differential expression of the target genes between HCC and para-cancerous tissues. The immunohistochemical images from the HPA database showed the expression of (F) RRM2, (G) KIF23, (H) LDLR, and (I) TXNIP in HCC tumor and normal tissue.
To assess the protein expression levels of these 4 mRNAs, we used the HPA database to obtain immunohistochemical images of normal and HCC tumor tissues. Among them, the difference of RRM2 was the most significant, the expression level of RRM2 in tumor tissues was significantly higher than that in normal tissues (Fig 7F). KIF23 was highly expressed in both normal liver tissues and tumor tissues (Fig 7G). LDLR and TXNIP were observed at different levels of down-regulated in HCC tumor tissues (Fig 7H and 7I).
CIBERSORT analysis of tumor-infiltrating lymphocytes
The CIBERSORT was used to deconvolute the relative composition of immune cells in the normal and tumor sample from the liver tissue. The bar plot shows the composition ratio of 22 immune cell types in all samples with CIBERSORT P < 0.05 (Fig 8A). The core plot shows the correlation among immune cell subpopulations (Fig 8B), and from the figure, it can be seen that Eosinophils and Dendritic cells activated have a significant positive correlation (correlation coefficient r = 0.97). We compared the infiltration of different immune cells in the tumor and normal liver tissue and found that tumor tissues have higher T cell regulatory (Tregs) infiltration, and the difference is statistically significant (P = 0.018) (Fig 9A). Next, the 4 key mRNAs selected were used for correlation analysis with different types of immune cells, and only the results with P < 0.05 and |r| > 0.4 were listed according to Pearson’s correlation coefficient analysis. The analysis showed that the expression of RRM2 has a strong positive correlation with T cells CD4 memory activated, T cells CD8, and T cells follicular helper, and a significant negative correlation with T cells CD4 memory resting (Fig 9B–9E). The expression of TXNIP is closely related to B cell memory (Fig 9F), and there is no immune cell significantly related to KIF23 and LDLR.
(A) The composition ratio of 22 immune cell types. (B) Correlation matrix between the 22 immune cell types.
(A) Different proportions of immune cell subsets between normal and HCC liver tissues. Pearson correlations indicate that the expression of RRM2 has a positive correlation with T cells CD4 memory activated (B), T cells CD8 (C), and T cells follicular helper (D), and a negative correlation with T cells CD4 memory resting (E). The expression of TXNIP has a positive correlation with B cell memory (F).
Discussion
Consistently, most HCC patients are diagnosed as advanced HCC at the initial diagnosis, which leads to poor prognosis and higher mortality of HCC worldwide [30]. In terms of treatment methods, cancer immunotherapy has become one of the most promising treatment methods for various advanced solid tumors, including HCC [31]. Immunotherapy has been widely used in the treatment of HCC and has shown remarkable efficacy. However, for some patients with HCC, immunotherapy is ineffective, which may be related to the unique microenvironment of the liver, because the liver is a special organ that induces immune tolerance rather than immunity [32]. This unique and highly heterogeneous immune composition may affect all aspects of HCC, so the implementation of immunotherapy requires a comprehensive understanding of HCC TILs. At present, the regulatory role of ceRNA in malignant tumors has been widely recognized, but the specific mechanism of regulating TILs to affect HCC remains to be determined. LncRNA acts as a ceRNA to target PCED1B mediated by miR-194-5p to promote glioma cell proliferation and inhibit cell apoptosis [33]. LncRNA H19 via sponging miR-152-3p up-regulates the expression of BRD4, thereby promoting the malignant behavior of multiple myeloma cells [34]. Therefore, we constructed a ceRNA regulatory network composed of 67 nodes through WGCNA, in which 44 DElncRNAs and 19 DEGs competed with a limited number of 4 DEmiRNAs. LASSO Cox regression analysis identified 4 key mRNAs in the ceRNA network, including RRM2, KiF23, LDLR, and TXNIP, that could effectively predict prognosis. Correlation analysis showed that these molecules were significantly correlated with TILs. Based on this, we speculated that the ceRNA network and related TILs might play a potentially significant role in the future treatment and prognosis prediction of HCC.
CeRNA plays an important regulatory role in a variety of physiological and pathological processes. C2orf48 was proved to be significantly correlated with the prognosis of liver cancer patients [35]. LINC00152 is associated with a poor prognosis of liver cancer. Cell experiments confirmed that LINC00152 may be involved in the regulation of HCC cell proliferation, cell apoptosis, and cell migration/invasion [36]. In addition, LINC00152 was confirmed to sponge miR-139 and relieve the inhibition of the target gene PIK3CA. Anti-miR-139 can reverse the inhibition of cell proliferation and migration/invasion induced by LINC00152 knockdown [37]. In our study, the low expression of miR-139 was associated with a worse prognosis. Other studies also showed that miR-139 was significantly down-regulated in HCC, and Upregulation of miR-139 promoted the apoptosis of HCC cells and inhibited the growth of tumor cells [38]. SNHG3 is up-regulated in liver cancer, silencing SNHG3 may impair tumor progression, and miR-139-5p has been proved to bind with SNHG3 to play a regulatory role [39]. DIO3OS may inhibit malignant biological behaviors by sponging miR-328 in HCC [40]. The deletion of LINC00261 has been observed in a variety of malignant tumors including HCC, breast cancer, and gastric cancer [41]. LINC00261 can inhibit glycolysis and proliferation of cancer cells [42]. FBXL19-AS1 has been reported to promote tumor progression in both cervical cancer and osteosarcoma [43,44]. In colon cancer, LINC00402 competitively binds miR-141 and miR-424 as the ceRNA of PHLPP2, and LINC00402 has also been confirmed to be associated with metastatic melanoma [45,46]. Other OS-related lncRNAs (WARS2-IT1, TPRG1-AS1, and HAO2-IT1) located in the ceRNA network are less studied, and further studies are needed to clarify the functions of these molecules in HCC and other tumors. We performed KEGG pathway analysis on 19 DEGs in the ceRNA network to characterize the biological functions of the ceRNA network. The enrichment results are shown in Fig 4B, which are mainly enriched in Hepatitis B and Cell cycle. Chronic HBV infection has long been considered an important cause of HCC [47]. HBV integrates into the genome of infected hepatocytes, and promotes the continuous imbalance of hepatocytes through chronic inflammatory damage, affects the DNA repair mechanism and promotes mutation events, and ultimately leads to the malignant transformation of hepatocytes [48]. The results suggest that the ceRNA network may affect the cell cycle through the interaction with cancer-related miRNAs, thereby affecting the transformation from hepatitis to HCC mediated by HBV. In addition, our study developed a risk score model based on 4 mRNAs (RRM2, KIF23, LDLR and TXNIP) in the ceRNA network, and explored the correlation between these signatures and TILs. The model we constructed showed that as RRM2 and KIF23 expression levels were upregulated, patients’ risk scores increased, which represented a worse survival outcome. Confirmed by previous research, compared with non-HCC tissues, the expression of RRM2 in HCC is up-regulated, and inhibiting RRM2 can inhibit the proliferation of HCC cells [49]. RRM2 promotes the occurrence of cell resistance by activating the EGFR/AKT pathway, while lncRNA can up-regulate RRM2 by binding to and inhibiting the expression of miR-139-5p, which is consistent with our research [50]. KIF23 is related to the worse prognosis of HCC patients and has a promoting effect on the proliferation and chemoresistance of HCC cells [51]. TXNIP has long been recognized as a potential tumor suppressor gene that inhibits cell growth and is involved in stress response, REDOX regulation, and cell proliferation [52,53].
CIEBERSORT analysis showed that compared with the non-tumor tissues, the infiltration of Tregs in HCC tumor tissues increased. Correlation analysis shows that RRM2 and TXNIP are correlated with CD4+ T cells and B lymphocytes, respectively. CD4+ T cells, mainly composed of regulatory T cells (Tregs) and CD4+ helper T cells (Th), among which helper T cells can mediate immunity by secreting various cytokines such as IL-4, IL-10, and interferon‑γ [54]. Tregs can suppress tumor immune response by releasing immunosuppressive cytokines [55]. CD4 + T cells are protective in the development of hepatocellular carcinoma in mouse models [56]. It has been reported that the degree of tumor infiltration of B cells is associated with the increased survival rate of HCC patients [57]. TXNIP has a potential role in the formation of germinal centers in peripheral lymphoid organs where B lymphocytes divide rapidly [53]. As a low-density lipoprotein (LDL) receptor, LDLR mediates the absorption of cholesterol by cells [58] and regulates extracellular cholesterol concentration. Also, LDLR is the main pathway to mediate the entry of cholesterol-rich lipoproteins into cancer cells. Blocking this pathway would inhibit tumorigenesis of adenocarcinoma cells, increase the sensitivity of tumor cells to chemotherapy drugs, and promote tumor regression [59]. In breast cancer, increased LDLR expression accelerates tumor growth in mouse models [60]. However, the situation may be the opposite in HCC. In the liver, neutrophil-specific microRNA-223 (miR-223) internalizes into hepatocytes through the expression of LDLR, thereby inhibiting the expression of hepatitis and fibrosis genes and improving nonalcoholic steatohepatitis [61]. Therefore, the lack of LDLR expression in the liver may lead to liver steatosis and persistent inflammation, thus promoting the occurrence of HCC. Therefore, due to the special role of the liver in fat metabolism, the biological role of LDLR in HCC may be different from that in other types of tumors. The specific mechanism needs to be confirmed by further studies.
This study mainly discussed the prognostic value of ceRNA network in HCC and its correlation with TILs, and constructed a prognostic model with good sensitivity and specificity. The research results suggest that the ceRNA we constructed may be involved in HCC through TILs and can become potential HCC therapeutic targets, but we still need to further study the molecular functions involved in this research in the following clinical, cellular, and animal experiments.
Conclusion
Our research uses WGCNA, combined with survival analysis and CIBERSORT cell composition analysis to identify the core genes in the prognostic-related ceRNA network and build a prognostic model based on this. The core genes of this prognostic model were subsequently confirmed to have a significant correlation with B lymphocytes and CD4+ T cells infiltrated in HCC tumor tissue. The results suggest that the 4 mRNAs in the model we constructed participate in the regulation of HCC immune cells, which can be used as potential prognostic indicators for HCC patients.
Supporting information
S1 File. The data contained in this file is all the data ultimately used for the analysis in this article.
https://doi.org/10.1371/journal.pone.0254829.s001
(RAR)
References
- 1. Ma L, Hernandez MO, Zhao Y, Mehta M, Tran B, Kelly M, et al. Tumor Cell Biodiversity Drives Microenvironmental Reprogramming in Liver Cancer. Cancer Cell. 2019 14;36(4):418–430.e6. pmid:31588021
- 2. Siegel RL, Miller KD, Jemal A. Cancer statistics, 2019. CA Cancer J Clin. 2019;69(1):7–34. pmid:30620402
- 3. Bosetti C, Turati F, La Vecchia C. Hepatocellular carcinoma epidemiology. Best Pract Res Clin Gastroenterol. 2014 Oct;28(5):753–70. pmid:25260306
- 4. Yang JD, Hainaut P, Gores GJ, Amadou A, Plymoth A, Roberts LR. A global view of hepatocellular carcinoma: trends, risk, prevention and management. Nat Rev Gastroenterol Hepatol. 2019;16(10):589–604. pmid:31439937
- 5. Bruix J, Gores GJ, Mazzaferro V. Hepatocellular carcinoma: clinical frontiers and perspectives. Gut. 2014 May;63(5):844–55. pmid:24531850
- 6. Siegel RL, Miller KD, Jemal A. Cancer statistics, 2020. CA Cancer J Clin. 2020 Jan;70(1):7–30. pmid:31912902
- 7. Zhang Z, Li J, He T, Ouyang Y, Huang Y, Liu Q, et al. The competitive endogenous RNA regulatory network reveals potential prognostic biomarkers for overall survival in hepatocellular carcinoma. Cancer Sci. 2019 Sep;110(9):2905–23. pmid:31335995
- 8. Salmena L, Poliseno L, Tay Y, Kats L, Pandolfi PP. A ceRNA hypothesis: the Rosetta Stone of a hidden RNA language? Cell. 2011 Aug 5;146(3):353–8. pmid:21802130
- 9. Hauptman N, Glavac D. MicroRNAs and long non-coding RNAs: prospects in diagnostics and therapy of cancer. Radiol Oncol. 2013;47(4):311–8. pmid:24294175
- 10. Kong YW, Ferland-McCollough D, Jackson TJ, Bushell M. microRNAs in cancer management. Lancet Oncol. 2012 Jun;13(6):e249–258. pmid:22652233
- 11. Stadthagen G, Tehler D, Høyland-Kroghsbo NM, Wen J, Krogh A, Jensen KT, et al. Loss of miR-10a activates lpo and collaborates with activated Wnt signaling in inducing intestinal neoplasia in female mice. PLoS Genet. 2013 Oct;9(10):e1003913. pmid:24204315
- 12. Liu Y, Zhang W, Liu K, Liu S, Ji B, Wang Y. miR-138 suppresses cell proliferation and invasion by inhibiting SOX9 in hepatocellular carcinoma. Am J Transl Res. 2016;8(5):2159–68. pmid:27347323
- 13. Liu X-H, Sun M, Nie F-Q, Ge Y-B, Zhang E-B, Yin D-D, et al. Lnc RNA HOTAIR functions as a competing endogenous RNA to regulate HER2 expression by sponging miR-331-3p in gastric cancer. Mol Cancer. 2014 Apr 28;13:92. pmid:24775712
- 14. Koyama S, Koike N, Adachi S. Fas receptor counterattack against tumor-infiltrating lymphocytes in vivo as a mechanism of immune escape in gastric carcinoma. J Cancer Res Clin Oncol. 2001 Jan;127(1):20–6. pmid:11206267
- 15. Binnewies M, Roberts EW, Kersten K, Chan V, Fearon DF, Merad M, et al. Understanding the tumor immune microenvironment (TIME) for effective therapy. Nat Med. 2018 May;24(5):541–50. pmid:29686425
- 16. Linnebacher M, Maletzki C. Tumor-infiltrating B cells: The ignored players in tumor immunology. Oncoimmunology. 2012 Oct 1;1(7):1186–8. pmid:23170274
- 17. Denkert C, Loibl S, Noske A, Roller M, Müller BM, Komor M, et al. Tumor-associated lymphocytes as an independent predictor of response to neoadjuvant chemotherapy in breast cancer. J Clin Oncol. 2010 Jan 1;28(1):105–13. pmid:19917869
- 18. Finn RS, Qin S, Ikeda M, Galle PR, Ducreux M, Kim T-Y, et al. Atezolizumab plus Bevacizumab in Unresectable Hepatocellular Carcinoma. N Engl J Med. 2020 May 14;382(20):1894–905. pmid:32402160
- 19. Racanelli V, Rehermann B. The liver as an immunological organ. Hepatology. 2006 Feb;43(2 Suppl 1):S54–62. pmid:16447271
- 20. Stegmann KA, Robertson F, Hansi N, Gill U, Pallant C, Christophides T, et al. CXCR6 marks a novel subset of T-bet(lo)Eomes(hi) natural killer cells residing in human liver. Sci Rep. 2016 May 23;6:26157. pmid:27210614
- 21. Newman AM, Liu CL, Green MR, Gentles AJ, Feng W, Xu Y, et al. Robust enumeration of cell subsets from tissue expression profiles. Nat Methods. 2015 May;12(5):453–7. pmid:25822800
- 22. Langfelder P, Horvath S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics. 2008 Dec 29;9:559. pmid:19114008
- 23. Fiscon G, Conte F, Farina L, Paci P. Network-Based Approaches to Explore Complex Biological Systems towards Network Medicine. Genes (Basel). 2018 Aug 31;9(9). pmid:30200360
- 24. Zhang J, Huang K. Normalized lmQCM: An Algorithm for Detecting Weak Quasi-Cliques in Weighted Graph with Applications in Gene Co-Expression Module Discovery in Cancers. Cancer Inform. 2014;13(Suppl 3):137–46. pmid:27486298
- 25. Smyth GK. Linear models and empirical bayes methods for assessing differential expression in microarray experiments. Stat Appl Genet Mol Biol. 2004;3:Article3. pmid:16646809
- 26. Yip AM, Horvath S. Gene network interconnectedness and the generalized topological overlap measure. BMC Bioinformatics. 2007 Jan 24;8:22. pmid:17250769
- 27. Alhamzawi R, Ali HTM. The Bayesian adaptive lasso regression. Math Biosci. 2018; 303:75–82.Epub 2018/06/20. pmid:29920251
- 28. Waldmann P, Ferencakovic M, Meszaros G, Khayatzadeh N, Curik I, Solkner J. AUTALASSO: an auto-matic adaptive LASSO for genome-wide prediction. BMC Bioinformatics. 2019; 20(1):167.Epub 2019/04/04. PubMed Central PMCID: PMC6444607. pmid:30940067
- 29. Su L, Zhang G, Kong X. Prognostic Significance of Pregnancy Zone Protein and Its Correlation with Immune Infiltrates in Hepatocellular Carcinoma. Cancer Manag Res. 2020;12:9883–91. pmid:33116846
- 30. Yang JD, Roberts LR. Hepatocellular carcinoma: A global view. Nat Rev Gastroenterol Hepatol. 2010 Aug;7(8):448–58. pmid:20628345
- 31. Xu W, Liu K, Chen M, Sun J-Y, McCaughan GW, Lu X-J, et al. Immunotherapy for hepatocellular carcinoma: recent advances and future perspectives. Ther Adv Med Oncol. 2019;11:1758835919862692. pmid:31384311
- 32. Thomson AW, Knolle PA. Antigen-presenting cell function in the tolerogenic liver environment. Nat Rev Immunol. 2010 Nov;10(11):753–66. pmid:20972472
- 33. Yang J, Yu D, Liu X, Changyong E, Yu S. LncRNA PCED1B-AS1 activates the proliferation and restricts the apoptosis of glioma through cooperating with miR-194-5p/PCED1B axis. J Cell Biochem. 2020 Feb;121(2):1823–33. pmid:31680313
- 34. Zheng J-F, Guo N-H, Zi F-M, Cheng J. Long Noncoding RNA H19 Promotes Tumorigenesis of Multiple Myeloma by Activating BRD4 Signaling by Targeting MicroRNA 152-3p. Mol Cell Biol. 2020 Jan 16;40(3).
- 35. Yue C, Ren Y, Ge H, Liang C, Xu Y, Li G, et al. Comprehensive analysis of potential prognostic genes for the construction of a competing endogenous RNA regulatory network in hepatocellular carcinoma. Onco Targets Ther. 2019;12:561–76. pmid:30679912
- 36. Deng X, Zhao XF, Liang XQ, Chen R, Pan YF, Liang J. Linc00152 promotes cancer progression in hepatitis B virus-associated hepatocellular carcinoma. Biomed Pharmacother. 2017 Jun;90:100–8. pmid:28343069
- 37. Li S-Q, Chen Q, Qin H-X, Yu Y-Q, Weng J, Mo Q-R, et al. Long Intergenic Nonprotein Coding RNA 0152 Promotes Hepatocellular Carcinoma Progression by Regulating Phosphatidylinositol 3-Kinase/Akt/Mammalian Target of Rapamycin Signaling Pathway through miR-139/PIK3CA. Am J Pathol. 2020 May;190(5):1095–107. pmid:31954697
- 38. Li P, Xiao Z, Luo J, Zhang Y, Lin L. MiR-139-5p, miR-940 and miR-193a-5p inhibit the growth of hepatocellular carcinoma by targeting SPOCK1. J Cell Mol Med. 2019 Apr;23(4):2475–88. pmid:30710422
- 39. Wu J, Liu L, Jin H, Li Q, Wang S, Peng B. LncSNHG3/miR-139-5p/BMI1 axis regulates proliferation, migration, and invasion in hepatocellular carcinoma. Onco Targets Ther. 2019;12:6623–38. pmid:31692508
- 40. Wang Z, Song L, Ye Y, Li W. Long Noncoding RNA DIO3OS Hinders Cell Malignant Behaviors of Hepatocellular Carcinoma Cells Through the microRNA-328/Hhip Axis. Cancer Manag Res. 2020;12:3903–14. pmid:32547226
- 41. Shahabi S, Kumaran V, Castillo J, Cong Z, Nandagopal G, Mullen DJ, et al. LINC00261 Is an Epigenetically Regulated Tumor Suppressor Essential for Activation of the DNA Damage Response. Cancer Res. 2019 Jun 15;79(12):3050–62. pmid:30796052
- 42. Zhai S, Xu Z, Xie J, Zhang J, Wang X, Peng C, et al. Epigenetic silencing of LncRNA LINC00261 promotes c-myc-mediated aerobic glycolysis by regulating miR-222-3p/HIPK2/ERK axis and sequestering IGF2BP1. Oncogene. 2021 Jan;40(2):277–91. pmid:33122827
- 43. Wan S, Ni G, Ding J, Huang Y. Long Noncoding RNA FBXL19-AS1 Expedites Cell Growth, Migration and Invasion in Cervical Cancer by miR-193a-5p/PIN1 Signaling. Cancer Manag Res. 2020;12:9741–52. pmid:33116834
- 44. Pan R, He Z, Ruan W, Li S, Chen H, Chen Z, et al. lncRNA FBXL19-AS1 regulates osteosarcoma cell proliferation, migration and invasion by sponging miR-346. Onco Targets Ther. 2018;11:8409–20. pmid:30555237
- 45. Wu H-K, Liu C, Li X-X, Ji W, Xin C-D, Hu Z-Q, et al. PHLPP2 is regulated by competing endogenous RNA network in pathogenesis of colon cancer. Aging (Albany NY). 2020 Jul 7;12(13):12812–40. pmid:32633726
- 46. Wang L-X, Wan C, Dong Z-B, Wang B-H, Liu H-Y, Li Y. Integrative Analysis of Long Noncoding RNA (lncRNA), microRNA (miRNA) and mRNA Expression and Construction of a Competing Endogenous RNA (ceRNA) Network in Metastatic Melanoma. Med Sci Monit. 2019 Apr 20;25:2896–907. pmid:31004080
- 47. Wang F-S, Fan J-G, Zhang Z, Gao B, Wang H-Y. The global burden of liver disease: the major impact of China. Hepatology. 2014 Dec;60(6):2099–108. pmid:25164003
- 48. Hernandez-Gea V, Toffanin S, Friedman SL, Llovet JM. Role of the microenvironment in the pathogenesis and treatment of hepatocellular carcinoma. Gastroenterology. 2013 Mar;144(3):512–27. pmid:23313965
- 49. Gao J, Chen H, Yu Y, Song J, Song H, Su X, et al. Inhibition of hepatocellular carcinoma growth using immunoliposomes for co-delivery of adriamycin and ribonucleotide reductase M2 siRNA. Biomaterials. 2013 Dec;34(38):10084–98. pmid:24060417
- 50. Huang N, Guo W, Ren K, Li W, Jiang Y, Sun J, et al. LncRNA AFAP1-AS1 Supresses miR-139-5p and Promotes Cell Proliferation and Chemotherapy Resistance of Non-small Cell Lung Cancer by Competitively Upregulating RRM2. Front Oncol. 2019;9:1103. pmid:31696057
- 51. Cheng C, Wu X, Shen Y, Li Q. KIF14 and KIF23 Promote Cell Proliferation and Chemoresistance in HCC Cells, and Predict Worse Prognosis of Patients with HCC. Cancer Manag Res. 2020;12:13241–57. pmid:33380832
- 52. Kaadige MR, Looper RE, Kamalanaadhan S, Ayer DE. Glutamine-dependent anapleurosis dictates glucose uptake and cell growth by regulating MondoA transcriptional activity. Proc Natl Acad Sci U S A. 2009 Sep 1;106(35):14878–83. pmid:19706488
- 53. Shao Y, Kim SY, Shin D, Kim MS, Suh H-W, Piao Z-H, et al. TXNIP regulates germinal center generation by suppressing BCL-6 expression. Immunol Lett. 2010 Apr 8;129(2):78–84. pmid:20156484
- 54. Murphy KM, Reiner SL. The lineage decisions of helper T cells. Nat Rev Immunol. 2002 Dec;2(12):933–44. pmid:12461566
- 55. Speiser DE, Ho P-C, Verdeil G. Regulatory circuits of T cell function in cancer. Nat Rev Immunol. 2016;16(10):599–611. pmid:27526640
- 56. Mossanen JC, Kohlhepp M, Wehr A, Krenkel O, Liepelt A, Roeth AA, et al. CXCR6 Inhibits Hepatocarcinogenesis by Promoting Natural Killer T- and CD4+ T-Cell-Dependent Control of Senescence. Gastroenterology. 2019;156(6):1877–1889.e4. pmid:30710528
- 57. Garnelo M, Tan A, Her Z, Yeong J, Lim CJ, Chen J, et al. Interaction between tumour-infiltrating B cells and T cells controls the progression of hepatocellular carcinoma. Gut. 2017;66(2):342–51. pmid:26669617
- 58. Go G-W, Mani A. Low-density lipoprotein receptor (LDLR) family orchestrates cholesterol homeostasis. Yale J Biol Med. 2012 Mar;85(1):19–28. pmid:22461740
- 59. Guillaumond F, Bidaut G, Ouaissi M, Servais S, Gouirand V, Olivares O, et al. Cholesterol uptake disruption, in association with chemotherapy, is a promising combined metabolic therapy for pancreatic adenocarcinoma. Proc Natl Acad Sci U S A. 2015 Feb 24;112(8):2473–8. pmid:25675507
- 60. Gallagher EJ, Zelenko Z, Neel BA, Antoniou IM, Rajan L, Kase N, et al. Elevated tumor LDLR expression accelerates LDL cholesterol-mediated breast cancer growth in mouse models of hyperlipidemia. Oncogene. 2017 16;36(46):6462–71. pmid:28759039
- 61. He Y, Rodrigues RM, Wang X, Seo W, Ma J, Hwang S, et al. Neutrophil-to-hepatocyte communication via LDLR-dependent miR-223-enriched extracellular vesicle transfer ameliorates nonalcoholic steatohepatitis. J Clin Invest. 2020 Dec 10;