Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

A pan-cancer bioinformatic analysis of the carcinogenic role of SMARCA1 in human carcinomas

  • Lei Dai ,

    Roles Formal analysis, Project administration, Writing – original draft, Writing – review & editing

    ‡ LD and JM are co-first authors on this work.

    Affiliation Department of Hepatopancreatobiliary Surgery, Ningbo Medical Centre Lihuili Hospital, Ningbo University, Ningbo, Zhejiang, China

  • Joseph Mugaanyi ,

    Roles Methodology, Software, Visualization

    ‡ LD and JM are co-first authors on this work.

    Affiliation Department of Hepatopancreatobiliary Surgery, Ningbo Medical Centre Lihuili Hospital, Ningbo University, Ningbo, Zhejiang, China

  • Tongyue Zhang,

    Roles Data curation, Formal analysis, Visualization

    Affiliation Tongji Hospital of Tongji Medical College, Huazhong University of Science and Technology, Wuhan, Hubei, China

  • Jingshu Tong,

    Roles Investigation

    Affiliation Department of Hepatopancreatobiliary Surgery, Ningbo Medical Centre Lihuili Hospital, Ningbo University, Ningbo, Zhejiang, China

  • Xingchen Cai,

    Roles Investigation

    Affiliation Department of Hepatopancreatobiliary Surgery, Ningbo Medical Centre Lihuili Hospital, Ningbo University, Ningbo, Zhejiang, China

  • Caide Lu ,

    Roles Resources, Supervision

    yangtze21@sina.com (CL); lucaide@nbu.edu.cn (CL)

    Affiliation Department of Hepatopancreatobiliary Surgery, Ningbo Medical Centre Lihuili Hospital, Ningbo University, Ningbo, Zhejiang, China

  • Changjiang Lu

    Roles Validation, Writing – review & editing

    yangtze21@sina.com (CL); lucaide@nbu.edu.cn (CL)

    Affiliation Department of Hepatopancreatobiliary Surgery, Ningbo Medical Centre Lihuili Hospital, Ningbo University, Ningbo, Zhejiang, China

Abstract

SMARCA1is a mammalian imitation switch (ISWI) gene that encodes for SNF2L. SNF2L is involved in regulating cell transition from a committed progenitor state to a differentiated state. Although many papers have detailed the correlation between SMARCA1 and different cancers, no pan-cancer analysis has been conducted to date. We started by exploring the potential carcinogenic role of SMARCA1 across 33 carcinomas using the cancer genome atlas (TCGA) and the genotype-tissue expression (GTEx) databases. The expression of SMARCA1 was significantly elevated in some tumor types but not in others. There was a distinct relationship between SMARCA1 expression and patient prognosis. S116 phosphorylation levels were up-regulated in both lung adenocarcinoma and uterine corpus endometrial carcinoma. The expression level of SMARCA1 was positively correlated with cancer-associated fibroblasts infiltration in a number of tumors, such as colon adenocarcinoma, cervical squamous cell carcinoma and endocervical adenocarcinoma. It was also associated with CD8+ T-cell infiltration in head and neck squamous cell carcinoma and lung adenocarcinoma. Furthermore, SMARCA1 is involved in chromatin remodeling and protein processing-associated mechanisms. Our study presents an initial assessment and illustration of the carcinogenic role of SMARCA1 in different carcinomas.

1. Introduction

Oncogenesis and tumor progression mechanisms are very complex and involve many different genes. It is therefore essential to perform pan-cancer analysis of the involved genes to investigate any potential relationship between the mechanisms and patient prognosis. The cancer genome atlas (TCGA) project, the genotype-tissue expression (GTEx) and the Oncomine databases are three publicly accessible databases that provide an extensive collection of tumor-related functional genomics datasets [15] for researchers to carry out a pan-cancer analysis.

SWI/SNF-related, matrix-associated, actin-dependent regulator of chromatin, subfamily A, member 1 (SMARCA1) protein is an important subunit of the SWI/SNF complex involved in chromatin remodeling and transcriptional activation [6, 7]. The structure and function of SMARCA1 have been partially explored from an epigenetic and clinicopathological perspective [611]. Although the SMARCA1 protein has only been partially mapped, five conserved domains have been identified which are: DNA-binding domain (DBINO), HAND domain, SLIDE domain, Helicase superfamily c-terminal domain (HELICc) as well as the ’SWI3, ADA2, N-CoR and TFIIIB’ DNA-binding domains (SANT) [1214]. Besides its correlation with non-neoplastic diseases [7, 10, 11, 15], SMARCA1 has also been observed in numerous malignancies, such as soft tissue sarcoma [16], colorectal cancer [17], gastric cancer [18], and urothelial cancer [19]. However, a comprehensive big data-based pan-cancer analysis of the correlation between SMARCA1 and different types of carcinomas is yet to be undertaken. This study outlines the pathways mediated by SMARCA1 in different disorders and cancers based on present cell- or animal-experimental evidence (S1 Fig).

We conducted a pan-cancer analysis of SMARCA1 based on the TCGA and GTEx databases and investigated the underlying molecular mechanisms of SMARCA1 in carcinogenesis, clinical prognosis and targeted therapy of various carcinomas.

2. Materials and methods

The methods are summarized in S2 Fig.

2.1 Genome and proteomic structure analysis

Genome annotations of the SMARCA1 gene were obtained from the University of California Santa Cruz (UCSC) genome browser (http://genome.ucsc.edu/) according to the human Dec 2013 (GRch38/hg28) assembly [20]. Analysis of the conserved domain of SMARCA1 was performed using the HemoloGene database (http://www.ncbi.nlm.nih.gov/homologene/) [21]. In addition, we acquired the SMARCA1 phylogenetic tree for different species based on NCBI’s constraint-based multiple alignment tool (COBALT) which computed protein sequence alignment using conserved domains and local sequence similarity information (http://www.ncbi.nlm.nih.gov/tools/cobalt) [22].

2.2 Gene expression analysis with HPA

The human protein atlas (HPA) database [23], a spatial map of the human proteome (http://www.proteinatlas.org/humanproteome/pathology) was used to ascertain the physiological and pathological expression data of the SMARCA1 gene in different tissues and cells by inputting the term “SMARCA1”. “Low specificity” was considered as “Normalized expression (NX) ≥ 1 in at least one tissue/region/cell type without being elevated. “Enhanced” was when “NX levels of a group (of 1–5 tissues or 1–10 cell types or 1–5 brain regions) elevated to at least four times the mean of other tissue/region/cell types”. Detailed information at: https://www.proteinatlas.org/ENSG00000102038-SMARCA1.

2.3 Gene expression analysis with TIMER2

The tumor immune estimation resource (TIMER), version 2.0 database (http://timer.comp-genomics.org), based on TCGA, is a comprehensive web resource for the systematical evaluation of the clinical relevance of tumor-immune infiltrates and differential gene correlation analysis [24]. We used TIMER 2.0’s “Gene_DE” module to explore the difference in SMARCA1 expression between tumor and adjacent normal tissues.

2.4 Gene expression analysis with GEPIA2

Gene expression profiling interactive analysis (GEPIA), version 2.0 is a public (http://gepia.cancer-pku.cn/index.html) web application based on GTEx and TCGA databases [25]. We used it to assess further the difference in expression of SMARCA1 between tumor and the corresponding normal tissues, for tumors with significantly restricted normal or without normal tissues in TIMER2.0 [e.g., Adrenocortical Carcinoma (ACC), Lower Grade Glioma (LGG), etc.]. Configuration was as follows: the cutoff of Log2FC (fold change) and P-value were 1 and 0.01 respectively, and TCGA normal and GTEx data were matched. Additionally, we evaluated the SMARCA1 expression level in each pathological stage of all TCGA carcinomas using the “Expression DIY-Stage plot” module of GEPIA2. The log2(TPM (Transcripts per million) + 1) for the log-scale was applied in assessments.

2.5 Gene expression analysis with UALCAN

UALCAN (http://ualcan.path.uab.edu/index.html) is a comprehensive and interactive public resource for cancer OMICS data analysis [26], provided by the Clinical proteomic tumor analysis consortium (CPTAC) dataset [27]. We used it to conduct translational level analysis of the SMARCA1 gene. Using “SMARCA1” as the search keyword, we explored and compared the total protein expression levels (NP_001269803.1) between normal and primary tumor tissues of the five available tumors (breast cancer, ovarian cancer, clear cell renal cell carcinoma (RCC), uterine corpus endometrial carcinoma (UCEC) and lung adenocarcinoma (LUAD)).

2.6 Gene expression analysis with Oncomine

For further verification, the Oncomine database, currently the world’s largest database of oncogene chips and features an integrated data mining platform (https://www.oncomine.org) [5], was used to assess the difference in SMARCA1 expression between tumor and normal tissues under the default settings (with a threshold of “FC = 1.5, P-value = 0.05, and Gene rank = Top 10%”). Several comparisons across at least 6 analyses were obtained with the included studies’ median rank, P-value, and legends.

2.7 Prognosis—survival analysis

Correlation maps for SMARCA1 expression, overall survival (OS) and disease-free survival (DFS) for cancers in TCGA were obtained using the “Survival Map” module of GEPIA2 [25]. The median, estimated by the Mantel-Cox test was used as the cutoff value (cutoff-high = 50%, cutoff-low = 50%). Subsequently, relevant Kaplan-Meier curves were obtained using the same module with the log-rank test and default parameters.

The Kaplan-Meier Plotter database [28] (http://kmplot.com/analysis) was constructed based on gene microarray and RNA-seq data from the gene expression omnibus (GEO) [29], European genome-phenome archive (EGA) [30], and TCGA public databases. We conducted a series of survival analyses of the relationships between SMARCA1 and various cancers to determine OS, relapse-free survival (RFS), distant metastasis-free survival (DMFS), post-progression survival (PPS), progress-free survival (PFS), first progression (FP) and disease-specific survival (DSS). The parameters were as follows: Affymetrix id/Gene symbol: use mean expression of selected genes (203873_at, 203874_s_at, 203875_at, 215294_s_at) in breast, ovarian, lung and gastric cancers; use RNA-seq id_6594 in liver cancer; patients split by auto-select best cutoff; HR: yes; 95% confidence interval (CI): yes; with the remaining settings set to their default values.

We also screened the clinical indicators for the subgroup analyses to find out the high-risk factors related to the SMARCA1 expression and prognosis of different cancers [e.g., stage, grade, AJCC_T, gender, race, alcohol consumption, intrinsic subtype, surgery, or chemotherapy, etc.]. Then we imported the metan module of STATA 15.0 software (College Station, TX, StataCorp LP, USA) [31] to perform a meta-analysis of the above data. Z-test was used and a forest plot was plotted.

2.8 Genetic alteration analysis

Using cBioPortal (http://www.cbioportal.org) for cancer genomics visualization [3234], we selected “Quick select; TCGA PanCancer Atlas Studies” on the home page and queried for “SMARCA1” genetic variation characteristics. Data including alteration frequency, mutation type and copy number alteration (CNA) for cancers in TCGA was extracted. Next, mutated site results of SMARCA1 exhibited in the pattern chart of the three-dimensional (3D) or protein structure through the “Mutations” module were obtained. Differences in OS, DFS, PFS and DSS with and without SMARCA1 gene variation were compared using the “Comparison” module.

2.9 DNA methylation analysis

We conducted DNA methylation analysis for the SMARCA1 gene for numerous probes (e.g., cg10076009, cg18751555, etc.) and different cancers using the MEXPRESS database (http://mexpress.be/) [35]. In MEXPRESS, DNA methylation, expression and clinical data was visualized, as well as Pearson correlation coefficient R-values and Benjamini-Hochberg-adjusted P-values were analyzed.

2.10 Phosphorylation feature analysis

We also explored the expression level of SMARCA1 of the phospho-protein (at the S116 site) using UALCAN. We retrieved the predicted phosphorylation features of the site via querying for “SMARCA1” in the PhosphoNET database (http://www.phosphonet.ca/), an open-access human phosphosite knowledgebase [36].

2.11 Tumor-related immune infiltration analysis

We selected cancer-associated fibroblasts and the immune cells of CD8+ T-cells in the “immune-Gene” module of the TIMER2 database to assess the potential correlation between SMARCA1 expression and tumor-related immune infiltrates across 33 cancers in TCGA. We visualized the data with heatmaps [24] and scatter plots via immune infiltration evaluation with the QUANTISEQ, TIMER, MCPCOUNTER, XCELL, CIBERSORT, CIBERSORT-ABS, TIDE and EPIC algorithms applied by purity-adjusted Spearman’s rank correlation test.

2.12 Gene-immune analysis

Sangerbox database (http://www.sangerbox.com), based on TCGA and GEO databases, is a practical bioinformation analysis tool [37]. We used it to analyze the potential relationship between SMARCA1 expression and microsatellite instability (MSI) or tumor mutational burden (TMB). The partial correlation value (cor) and P-value were calculated using Spearman’s rank test and circle plots.

2.13 Gene enrichment analysis

The STRING database (http://string-db.org/) [38], which supports the functional proteins association networks, was used for SMARCA1-related proteomic analysis. We set the main parameters as follows: protein name (“SMARCA1”), organism (“Homo sapiens”), the meaning of network edges (“evidence”), active interaction sources (“Experiments”), the minimum required interaction score [“low confidence (0.150)”] and max number of interactors to show [“1st shell: no more than 50 interactors”]. Finally, we acquired the SMARCA1-related protein networks diagram by enrichment analysis.

First, we retrieved data for the top 100 SMARCA1-related/similar genes via the “similar genes detection” module of GEPIA2 based on all tumor and normal tissues in TCGA. Then we performed a pairwise Pearson correlation analysis of SMARCA1 and the selected genes using the “correlation analysis” module of GEPIA2. Afterward, we applied the “Gene_corr” module of TIMER2 to perform the purity-adjusted Spearman’s rank correlation analysis and the selected genes in a heatmap with statistically significant data. This was followed by interaction analysis of SMARCA1-correlated and interacted genes on Jvenn (http://jvenn.toulouse.inra.fr/app/example.html), which is an interactive Venn diagram viewer [39]. We combined the two gene lists to conduct a Kyoto encyclopedia of genes and genomes (KEGG) pathway analysis visualized with the “tidyr” and “ggplot2” packages in R. The data was obtained from the functional annotation chart using the DAVID Bioinformatics Resources 6.8 [40, 41] by uploading gene lists, followed by its analysis wizard. We also conducted Gene ontology (GO) enrichment analysis using R’s (version 4.0.3, www.r-project.org) [42] “clusterProfiler” package. All the data analysis output visualized cnetplots (node_label = T, colorEdge = T, circular = F), including biological process (BP), cellular component (CC) and molecular function (MF). Two-tailed P value less than 0.05 is regarded as statistically significant.

2.14 Diagnostic capability analysis of SMARCA1

Based on the TCGA and GTEx database data, we conducted a diagnostic capability analysis of SMARCA1 pan-cancer using receiver operating characteristic (ROC) curves. The area under curve (AUC) was calculated for evaluating the diagnostic value of SMARCA1.

2.15 Statistical analysis

All statistical analyses were performed with R (version 4.0.3). Normally distributed variables were analyzed using the t-test and one-way ANOVA test and non-normally distributed variables with nonparametric tests. Log-rank test and cox regression were used for survival analysis, Pearson’s correlation and spearman’s rank correlation test for correlation analysis. p-value < 0.05 was considered statistically significant.

3. Results

3.1 Gene expression analysis

This study aimed to assess the tumorigenic role of human SMARCA1 (NM_003069.5 for mRNA or NP_003060.2 for protein, S3A Fig). The conserved domains of SMARCA1 protein structure are similar among different species (e.g., X.tropicalis, D.rerio, H.sapiens, etc.), such as HAND (pfam09110), HELICc (cd00079) and SANT (cl17250) domains (S3B Fig). The phylogenetic tree (S4 Fig) demonstrated an evolutionary correlation of SMARCA1 in different species.

We obtained the RNA expression pattern of SMARCA1 in different single cell types and normal tissues based on the combined analysis of the GTEx, Human protein atlas (HPA) and Function annotation of the mammalian genome 5 (FANTOM5) datasets. SMARCA1 expression is highest in the adrenal gland, followed by the seminal vesicle, ovary and liver (Fig 1A) by tissue and in Leydig and peritubular cells by cell type (Fig 1B). Although SMARCA1 expression can be observed in nearly all the tissues detected (consensus normalized expression (NX) > 1), low RNA tissue specificity and enhanced RNA cell type specificity appears both in different single cell types and blood cells (Fig 1C). Furthermore, SMARCA1 expression in the cytoplasm and nucleus reaches moderate intensity in glioma, prostate and renal (Fig 1D).

thumbnail
Fig 1. SMARCA1 expression in different cells, tissues in the physiological and pathological status.

(A) Analysis of SMARCA1 gene expression in different tissues by the consensus datasets of HPA, GTEx and FANTOM5; (B) Analysis of SMARCA1 gene expression in different single cell types; (C) Analysis of SMARCA1 gene expression in different blood cells by the consensus datasets of HPA, Monaco and Schmiedel. (D) Analysis of SMARCA1 gene expression in different cancer tissues.

https://doi.org/10.1371/journal.pone.0274823.g001

We analyzed SMARCA1 expression levels in all tumors of TCGA via TIMER2. As shown in Fig 2A, the level of SMARCA1 expression in the cancer tissues of bladder urothelial carcinoma (BLCA), breast invasive carcinoma (BRCA), colon adenocarcinoma (COAD), kidney chromophobe (KICH), kidney renal clear cell carcinoma (KIRC), prostate adenocarcinoma (PRAD), rectum adenocarcinoma (READ), skin cutaneous carcinoma (SKCM), thyroid carcinoma (THCA), uterine corpus endometrial carcinoma (UCEC) (P < 0.001), kidney renal papillary cell carcinoma (KIRP), lung squamous cell carcinoma (LUSC) (P < 0.01), cervical squamous cell carcinoma and endocervical adenocarcinoma (CESC) and cholangiocarcinoma (CHOL) (P < 0.05) is lower than the adjacent normal tissues. On the contrary, the expression in the tumor tissues of lung adenocarcinoma (LUAD) and liver hepatocellular carcinoma (LIHC) is higher.

thumbnail
Fig 2. Comparison of SMARCA1 expression level in different tumor and adjacent normal tissues.

(A) Analysis of SMARCA1 expression level in different tumors or their subtypes and corresponding normal tissues via TIMER2. * P < 0.05, ** P < 0.01, *** P < 0.001. (B) By including the relevant normal tissues of the GTEx database as controls, the SMARCA1 Expression status in ACC, DLBC, GBM, ESCA, LAML, LGG, OV, PAAD, STAD, TGCT, THYM and UCS were analyzed via GEPIA2. ** P < 0.01. (C) Analysis of SMARCA1 expression in total protein between primary tumor and normal tissues of five cancers based on the CPTAC dataset. * P < 0.05, ** P < 0.01, *** P < 0.001.

https://doi.org/10.1371/journal.pone.0274823.g002

We proceeded to include data of the normal tissues from the GTEx database as controls for evaluating the difference in SMARCA1 expression between tumor and normal tissues of diffuse large B cell lymphoma (DLBC), glioblastoma multiforme (GBM), acute myeloid leukemia (LAML), brain lower grade glioma (LGG), ovarian serous cystadenocarcinoma (OV), pancreatic adenocarcinoma (PAAD), testicular germ cell tumors (TGCT), and thymoma (THYM) (P < 0.01). However, we did not observe a significant difference in other cancers, including adrenocortical carcinoma (ACC), esophageal carcinoma (ESCA), stomach adenocarcinoma (STAD) and uterine carcinosarcoma (UCS) (Fig 2B).

According to the analysis using the CPTAC database, we found that SMARCA1 total protein expression was higher in the primary tumor tissues of breast cancer, clear cell RCC, LUAD (P < 0.001), and ovarian cancer (P < 0.05) than in normal tissues except in the case of UCEC (Fig 2C). The results of pooling analysis in the Oncomine database further confirmed the findings for breast cancer, ovarian cancer, and colorectal cancer (all P < 0.01). However higher expression was observed in brain and CNS cancer, sarcoma and lymphoma (all P < 0.001) compared to the control (Fig 3).

thumbnail
Fig 3. Pooling analysis of the difference in SMARCA1 expression between tumor and normal tissues via the Oncomine database.

(A) Breast cancer; (B) Brain and CNS cancer; (C) Ovarian cancer; (D) Sarcoma; (E) Colorectal cancer; (F) Lymphoma.

https://doi.org/10.1371/journal.pone.0274823.g003

We also investigated the relationship between SMARCA1 expression and different pathological stages of various cancers via the “pathological stage plot” module of GEPIA2. Significant differences were observed in BLCA, COAD, KIRP, KIRC, PAAD, READ and THCA (all P < 0.05) (Fig 4A) but not in others (Fig 4B).

thumbnail
Fig 4. Expression of SMARCA1 gene in different pathological stages of diverse cancers in TCGA.

(A) Expression difference in BLCA, COAD, KIRC, KIRP, PAAD, READ and THCA; (B) Expression difference in ACC, BRCA, CESC, CHOL, DLBC, ESCA, HNSC, KICH, LIHC, LUAD, LUSC, OV, SKCM, STAD, TGCT, UCEC and UCS. Log2(TPM+1) was applied for log-scale, * P < 0.05, ** P < 0.01, *** P < 0.001.

https://doi.org/10.1371/journal.pone.0274823.g004

3.2 Survival analysis

We divided the cases into two groups according to high or low expression of SMARCA1 to assess the relationships between SMARCA1 expression and patient prognosis in different cancers, based on the TCGA and GEO databases. High expression of SMARCA1 was related to poor overall survival (OS) in patients with LGG (P = 0.036), LUAD (P = 0.009), and STAD (P = 0.029). Low expression was associated with poor OS in KIRC (P < 0.001) and PAAD (P = 0.024) (Fig 5A). Similarly, high expression of SMARCA1 was associated with poor disease-free survival (DFS) for LUAD (P = 0.037),lung squamous cell carcinoma (LUSC) (P = 0.013), mesothelioma (MESO) (P = 0.041), adrenocortical carcinoma (ACC) (P = 0.0073), COAD (P = 0.034) and LGG (P = 0.035), low expression in THCA (P = 0.0074) and KIRC (P = 0.021) (Fig 5B).

thumbnail
Fig 5. Analysis of SMARCA1 expression associated prognosis different cancers via GEPIA2.

(A) Overall survival (OS) analyses between SMARCA1 expression and all cancers included; (B) disease-free survival (DFS) analyses between SMARCA1 expression and all cancers included with demonstration of survival map and Kaplan-Meier curves.

https://doi.org/10.1371/journal.pone.0274823.g005

We used the Kaplan-Meier plotter to evaluate the correlation between SMARCA1 expression and five cancers. For breast cancer, low SMARCA1 expression was linked to poor OS (P = 0.04), relapse-free survival (RFS) (P = 0.011) and distant metastasis-free survival (DMFS) (P = 0.019) but not progress-free survival (PFS) (Fig 6A). The same was observed in OS (P < 0.001), first progression (FP) (P = 0.006) and post-progression survival (PPS) (P < 0.001) of lung cancer (Fig 6C). In contrast, high SMARCA1 expression was related to poor OS (P = 0.037), PFS (P < 0.001) and PPS (P = 0.047) in ovarian cancer (Fig 6B), as well as to OS (P < 0.001), FP (P < 0.001) in gastric cancer (Fig 6D). No significant difference based on SMARCA1 expression in relation to OS, PFS, RFS and disease-specific survival (DSS) was observed in liver cancer (all P > 0.05) (Fig 6E).

thumbnail
Fig 6. Correlation analysis between SMARCA1 expression and prognosis of different cancers via Kaplan-Meier plotter.

(A) OS, PPS, RFS, DMFS in breast cancer; (B) OS, PFS, PPS in ovarian cancer; (C) OS, FP, PPS in lung cancer; (D) OS, FP, PPS in gastric cancer; (E) OS, PFS, RFS, DSS in liver cancer. OS, overall survival; PPS, post progression survival; RFS, relapse free survival; DMFS, distant metastasis free survival; FP, first progression; PFS, progress free survival; DSS, disease-specific survival.

https://doi.org/10.1371/journal.pone.0274823.g006

We performed a meta-analysis (Fig 7) to validate the correlation observed in breast cancer (P = 0.001), lung cancer, ovarian cancer and gastric cancer (all P < 0.001) again and to identify any potential SMARCA1-prognosis correlation in liver cancer (P = 0.021). We performed a series of subgroup analyses through restricted sub-types or select cohorts and obtained distinct findings (Tables 15). We found SMARCA1-associated prognosis to correlate with tumor type and clinical risk factors.

thumbnail
Fig 7. Meta-analysis on the relationship between SMARCA1 expression and multiple factors of various tumor prognosis.

A series of subgroups survival analyses in breast, ovarian, lung, gastric and liver cancer were conducted with z-test, *P < 0.05, **P < 0.01, ***P < 0.001.

https://doi.org/10.1371/journal.pone.0274823.g007

thumbnail
Table 1. Subgroup analysis on the correlation of SMARCA1 expression and prognosis of breast cancer cases.

https://doi.org/10.1371/journal.pone.0274823.t001

thumbnail
Table 2. Subgroup analysis on the correlation of SMARCA1 expression and prognosis of ovarian cancer cases.

https://doi.org/10.1371/journal.pone.0274823.t002

thumbnail
Table 3. Subgroup analysis on the correlation of SMARCA1 expression and prognosis of lung cancer cases.

https://doi.org/10.1371/journal.pone.0274823.t003

thumbnail
Table 4. Subgroup analysis on the correlation of SMARCA1 expression and prognosis of gastric cancer cases.

https://doi.org/10.1371/journal.pone.0274823.t004

thumbnail
Table 5. Subgroup analysis on the correlation of SMARCA1 expression and prognosis of liver cancer cases.

https://doi.org/10.1371/journal.pone.0274823.t005

3.3 Genetic alteration analysis

Using the cBioPortal tool, we obtained the genetic variation status of SMARCA1 across different cancers in the TCGA database. The highest alteration frequency of SMARCA1 (> 10%) appeared in uterine cancers with “mutation” as the main component. “Deep deletion” showed an alteration frequency of ~4% in the uterine carcinosarcoma as the primary type (Fig 8A). Although “amplification” was predominant in renal clear cell carcinoma and “deep deletion” in mesothelioma, it is inconsequential due to having an alteration frequency < 2% (Fig 8A). Detailed information of mutation types, sites and frequencies of SMARCA1 variation is shown in Fig 8B. Missense alteration of SMARCA1 was the most common form and R993Q/*/P mutation in the SLIDE domain was detected in 4 cases of UCEC, 1 case of PRAD and 1 case of STAD (Fig 8B), which may result in frame-shift mutation of the SMARCA1 gene, translation from R (arginine) to Q (glutamine)/P (proline) at the 993 site of SMARCA1 protein and proceeding protein truncation. Although the complete structure of the SMARCA1 protein is unclear, the potential position of the R993 site can still be speculated in the 3D diagram (Fig 8C).

thumbnail
Fig 8. Genetic variation features of SMARCA1 in different cancers based on TCGA via cBioPortal.

(A) The alteration frequency and types among diverse cancers; (B) Potential sites of mutation; (C) mutation site with highest frequency (R993Q/*/P) in the 3D version of SMARCA1 and related carcinomas; (D) Prognosis analysis between SMARCA1 alteration and OS, DSS, DFS, and PFS of uterine corpus endometrial carcinoma (UCEC) by the cBioPortal. *P < 0.05.

https://doi.org/10.1371/journal.pone.0274823.g008

We assessed the link between SMARCA1 gene variation and clinical prognosis of different cancers. As presented in Fig 8D, UCEC patients with SMARCA1 mutation had better OS, DSS and PFS (all P < 0.05) but worse DFS (P = 0.445), compared to the control.

3.4 DNA methylation analysis

Using MEXPRESS, we looked into the potential relationship between SMARCA1 DNA methylation and oncogenesis in different neoplasms. Despite the insufficient methylation data, we could still observe some significant differences in several probes. With respect to LAML, we found a significant negative correlation at multiple probes (e.g., cg03460558, cg08617833, etc.) of the promoter region (all P < 0.001) (Fig 9A). Similarly, negative correlation was observed at probes (cg27353825, cg17420696, cg18959966) of non-promoter region in DLBC (P < 0.05) and in probes (cg04596655, cg27353825, cg18959966) of non-promoter region in LGG (P < 0.05) (Fig 9A). Interestingly, we not only found negative correlation in probes (cg08617833, cg24447042) of promoter region, but also positive correlation in probes (cg04596655, cg18751555) of non-promoter region appearing in TGCT (P < 0.05) (Fig 9A). Due to limited research on the field, we were unable to further verify the relationship between expression and DNA methylation of SMARCA1.

thumbnail
Fig 9. DNA methylation and phosphorylation analysis of SMARCA1 expression in different cancers of TCGA.

(A) The correlation analysis between SMARCA1 DNA methylation and multiple cancers (LAML, LGG, DLBC, TGCT) of diverse probes was displayed with beta value, Pearson correlation coefficients (R) and Benjamini-Hochberg-adjusted P-value. (B) The structural diagram of SMARCA1 protein with positive phosphoprotein sites and cancers related. (C) comparison of phosphoprotein level between tumor and normal tissues on UCEC and LUAD.

https://doi.org/10.1371/journal.pone.0274823.g009

3.5 Protein phosphorylation analysis

Based on the CPTAC database, we investigated the differences in SMARCA1 protein phosphorylation between primary tumors and normal tissues. Owing to limited data, only UCEC and LUAD associated with S116 site were analyzed. Findings of correlation between SMARCA1 phosphorylation and tumor are summarized in Fig 9B. S116, located in the DB domain of SMARCA1 displays higher phosphorylation levels in primary tumor tissues compared to normal tissues for UCEC and LUAD (all P < 0.001) (Fig 9C). Reference parameters for the CPTAC-identified phosphorylation site of SMARCA1 using PhosphoNet database are listed in Table 6, however, these are not backed with experimental data. These results indicate that further investigation of SMARCA1 phosphorylation is warranted.

thumbnail
Table 6. Analysis of CPTAC-identified phosphorylation sites of SMARCA1 via the PhosphoNET database.

https://doi.org/10.1371/journal.pone.0274823.t006

3.6 Tumor-related immune infiltration analysis

Tumor-related immune infiltration and its association with cancer pathogenesis, progression and metastasis gained increasing interest and focus [43, 44]. Cancer-associated fibroblasts were considered a significant component of the tumor-immunity microenvironment, exhibiting functions such as matrix remodeling, signal interaction and immune infiltration interfering [4548]. Therefore, we used QUANTISEQ, TIMER, MCPCOUNTER, XCELL, CIBERSORT, CIBERSORT-ABS, TIDE and EPIC algorithms to assess the correlation between SMARCA1 expression and the immune infiltration level of different cell types in various of TCGA cancers. We found a statistically significant positive relationship between SMARCA1 and the estimated value of cancer-associated fibroblasts infiltration for BRCA, CESC, COAD, ESCA, HNSC, HNSC-HPV-, PAAD, READ, STAD and TGCT (Fig 10A, P < 0.05) based on all the algorithms. In contrast, a negative relationship was observed between CD8+ T-cells infiltration and SMARCA1 expression for HNSC, HNSC-HPV-, LUAD and SARC (Fig 11A, P < 0.05) with most algorithms [24, 49, 50]. We also produced scatterplots of the above cancers with the most effective algorithm (Figs 10B, 11B).

thumbnail
Fig 10. Correlation analysis between SMARCA1 expression and cancer-associated fibroblasts.

(A) Heatmap of correlation between SMARCA1 and the tumor-related immune infiltration level of cancer-associated fibroblasts among all cancers in TCGA with diverse of algorithms; (B) correlation analysis of BRCA, CESC, COAD, ESCA, HNSC, HNSC-HPV, PAAD, READ, STAD and TGCT.

https://doi.org/10.1371/journal.pone.0274823.g010

thumbnail
Fig 11. Correlation analysis between SMARCA1 expression and CD8+ T-cells infiltration.

(A) Heatmap of correlation between SMARCA1 and the tumor-related immune infiltration level of CD8+ T-cells among all cancers in TCGA with diverse of algorithms; (B) correlation analysis of HNSC, HNSC-HPV, LUAD and SARC.

https://doi.org/10.1371/journal.pone.0274823.g011

Furthermore, we explored the relationship between SMARCA1 and tumor mutational burden (TMB) / microsatellite instability (MSI), which were used for the prediction of immunotherapy efficacy [51, 52]. All the cancers in TCGA were analyzed and results presented in Fig 12. SMARCA1 expression was positively correlated with TMB for KICH (P = 0.0054), LUAD (P = 0.022) and TCGT (P = 0.043) but negatively for BLCA (P = 0.0066) and STAD (P = 0.0069) (Fig 12A). We also observed a positive relationship between SMARCA1 and MSI for OV (P = 0.027), TGCT (P = 0.0021), CESC (P = 0.049) and KIRC (P = 0.0022), but negative for PRAD (P = 0.0014), STAD (P = 0.0025), LAML (P = 0.0024) and DLBC (P = 0,047) (Fig 12B). This aspect needs further research.

thumbnail
Fig 12. Correlation analysis between SMARCA1 and TMB/MSI.

(A) Gene-immune analysis of the potential correlation between SMARCA1 expression and TMB across all cancers in TCGA. The partial correlation values (cor) of +0.34 and -0.34 are marked. (B) Gene-immune analysis of the potential correlation between SMARCA1 expression and MSI across all cancers in TCGA. The partial correlation values (cor) of +0.29 and -0.29 are marked. *P-value < 0.05. TMB: tumor mutational burden; MSI: microsatellite instability.

https://doi.org/10.1371/journal.pone.0274823.g012

3.7 Enrichment analysis of SMARCA1-related genes

For an in-depth exploration of the SMARCA1 molecular mechanism in oncogenicity, we tried to sift out the targeted SMARCA1-interacted genes and SMARCA1-correlated genes for a set of signal pathway enrichment analyses. We acquired 50 SMARCA1-binding proteins certificated by experimental evidence via the STRING instrument and constructed an interaction network diagram of them (Fig 13A) [38]. We then used the “similar genes detection” module of GEPIA2 to obtain the top 100 SMARCA1 expression-correlated genes based on tumor types in TCGA. Generally, the top 5 selected genes exhibited a substantial positive reverence to SMARCA1 in most cancer types in a heatmap (Fig 13B) [24, 49, 50]. Scatterplots showed a strong positive correlation between SMARCA1 expression and dynein light chain 2 (DYNLL2) (R = 0.50), empfindlicher im dunkelroten licht1 (EID1) (R = 0.52), G protein-coupled receptor associated sorting protein 2 (GPRASP2) (R = 0.52), microtubule-associated protein 9 (MAP9) (R = 0.51), oculocerebrorenal syndrome of Lowe (OCRL) (R = 0.54) genes (all P < 0.001) (Fig 13C).

thumbnail
Fig 13. SMARCA1-related gene enrichment analysis.

(A) SMARCA1-related functional proteins association networks with experimental determination via STRING database; (B) The heatmap of correlation analysis between SMARCA1 and the selected targeting genes of top 100 SMARCA1-related genes in all TCGA cancers; (C) The detailed correlation analysis between SMARCA1 and corresponding genes, including DYNLL2, EID1, GPRASP2, MAP9, and OCRL; (D) cross-analysis of the SMARCA1 correlated and interacted genes; (E) KEGG pathway analysis of SMARCA1-related genes.

https://doi.org/10.1371/journal.pone.0274823.g013

However, we failed to screen out the common member genes from an intersection analysis of the two groups above (Fig 13D). We conducted KEGG and Go enrichment analyses by combining the two datasets. The KEGG plot suggested that “pyrimidine metabolism” and “purine metabolism” might be the main pathways involved in SMARCA1 oncogenicity (Fig 13E) [40, 41, 5355]. The Go enrichment analyses further predicted the intimate connection between most of these genes and the pathway or biological progress of chromatin remodeling, including ubiquitin-protein transferase activity, SWI/SNF superfamily-type complex, histone acetyltransferase complex, covalent chromatin modification and others (Fig 14).

thumbnail
Fig 14. Go analyses of SMARCA1-related genes in different cancers.

(A) The cnetplots for the molecular function (MF) data; (B) The cnetplots for the cellular component (CC) data; (C) The cnetplots for the biological process (BP) data.

https://doi.org/10.1371/journal.pone.0274823.g014

3.8 Diagnostic capability analysis

Through the comprehensive analysis, we discovered that SMARCA1 had good diagnostic performance on 29 cancers, especially on OV (AUC = 0.981), LAML (AUC = 0.935), DLBC (AUC = 0.970), CESC (AUC = 0.846), READ (AUC = 0.802), COAD (AUC = 0.807), LGG (AUC = 0.938), THYM (AUC = 0.992), PAAD (AUC = 0.898), GBM (AUC = 0.838) and TGCT (AUC = 0.910) (S5 Fig).

4. Discussion

As reported in prior studies, the main functions of SMARCA1 protein are focused on chromatin structure alteration and transcriptional activation across a wide range of species [611, 56, 57]. Our study demonstrated structure conservation of SMARCA1 translation products across several species through the “HomoloGene” database and phylogenetic tree analysis, indicating that similar biological mechanisms of SMARCA1 might operate in the regular physiological activity across each species. Apart from the potential correlation between SMARCA1 and different diseases, especially cancer [1519] were revealed by numerous studies. Whether SMARCA1 could play an essential role in the carcinogenesis of different cancers through established or undiscovered molecular mechanisms is yet to be determined. At the time of writing, to the best of our knowledge, no pan-cancer analysis of SMARCA1 has been reported in published literature. Hence, we thoroughly evaluated the SMARCA1 gene across all cancers based on several databases including TCGA, GEO, CPTAC and more. The molecular characteristic, survival prognosis, genetic mutation, DNA methylation, protein phosphorylation, immune infiltration, signal pathway and diagnostic value were analyzed.

The expression of SMARCA1 varied with each tumor, and so did diagnosis and prognosis, which probably indicates different tumorigenic mechanisms existed in different cancers. SMARCA1 was previously reported to play a vital role in the maturation of midbrain dopaminergic (mDA) neurons [58] and is related with multiple mechanisms in neurogenesis or schizophrenia [15]. Although it was considered to be correlated with neurodevelopmental disorders, t a lack of studies specially focused on the brain and central neural system (CNS) cancers. This study attempted to use several tools to assess the correlation between SMARCA1 expression and CNS malignant tumors. Despite our failure to find significant differences based on data in the TIMER2 database, we found that SMARCA1 expression was much higher in tumor tissues than in normal tissues in GBM and LGG based on the GEPIA2 data (combined TCGA and GTEx database) (P < 0.001). This inconsistency might be due to the insufficient normal tissues for control in TIMER2. Up on further analysis of prognosis, we found that higher expression of SMARCA1 was correlated with poor OS (P = 0.036) and DFS (P = 0.035) in LGG, but was not statistically significant in GBM (Fig 5). Similar results were found by OncoLnc (http://www.oncolnc.org), with Cox regression survival analysis on LGG (Cox coefficient = 0.373, P = 1.10e-04) and GBM (Cox coefficient = -0.075, P = 4.20e-01). Additionally, the correlation between SMARCA1 and brain and CNS cancer was verified again via the Oncomine database (P = 7.64e-06) (Fig 3B). Great diagnostic value of SMARCA1 was observed in both cancers (S5 Fig). We observed an unexpected association between reduced DNA methylation levels at three sites in the non-promoter region and high expression of SMARCA1 in LGG (Fig 9A). Consequently, the potential mechanisms of DNA methylation and related upstream or downstream pathways remain to be explored and other CNS tumors deserve to be included in any future investigations.

For lung carcinoma, we used TIMER2 to analyze the datasets of TCGA-LUAD (n = 515) and TCGA-LUSC (n = 501) projects and obtained completely opposite results. High SMARCA1 expression in tumor tissues of LUAD was related to poor OS (P = 0.009) and DFS (P = 0.037) (Fig 5). Nevertheless, low SMARCA1 expression in tumor tissues of LUSC conversely correlated with a better DFS (P = 0.013). Furthermore, we analyzed lung adenocarcinoma (n = 865) and squamous cell carcinoma (n = 675) cases from all cohorts, revealing that low SMARCA1 expression was correlated with poor OS, FP and PPS in LUAD, however high expression was associated with poor FP in LUSC (Table 3). The inconsistent in LUAD might due to the sample size and as a result, the opposite role of SMARCA1 in patient prognosis with different lung cancer histopathology needs further investigation with more extensive sample size studies.

SMARCA1 expression has been found to be correlated with liver cancer in a few studies. Therefore, we conducted a series of statistical analyses to examine this relationship. Based on TIMER2 analysis, SMARCA1 expression was high in tumor tissues of LIHC but low in cholangiocarcinoma (CHOL). However, there were no significant differences between SMARCA1 expression and prognosis in liver cancer based on GEPIA2, OncoLnc, and Kaplan-Meier plotter approaches. Although it could be affected by heterogeneity, model, effect size, sample size, bias, etc., the meta-analysis indicated a potentially high risk of poor prognosis in liver cancer with elevated SMARCA1 expression to (P = 0.021). After analyzing the subgroups of LIHC, we found that female patients with high SMARCA1 expression had poor OS, PFS, RFS and DSS (Table 5), indicating that gender might be a potential risk factor. Although big data cannot give sufficient evidence to support the tumor oncogenic role of SMARCA1 in the prognosis of LIHC as yet, further studies might influence the results.

Down-regulated SMARCA1 might contribute to DNA injury, growth restriction and tumor cell apoptosis in breast cancer [59] were previously reported, without providing prognosis analysis and other bioinformation. Our study discovered that a low expression of SMARCA1 was correlated with poor OS, RFS and DMFS in breast cancer based on large sample sizes. However, SMARCA1 protein expression was unexpectedly elevated in tumor tissues. The findings may have been affected by the small sample size of the control group. Extensive sample size studies are needed. Many potential mechanisms intervening the process of RNA to protein should also be considered. Although no significant independent risk factors in subgroups were identified (Table 1), low SMARCA1 expression was generally considered a potential risk indicator related to the poor prognosis of BRCA.

For gastrointestinal tumors, up-regulated SMARCA1 expression was considered to be correlated with colorectal and gastric cancer through different pathways [17, 18]. Interestingly, higher SMARCA1 expression was associated with better DFS based on the GEPIA2 database regardless of the low expression of SMARCA1 in tumor tissues of COAD verified by TIMER2 and Oncomine. Since there were more normal tissues in the GEPIA2 control sample, this may have introduced inconsistencies in the results. Unlike COAD, we observed a consistent relationship between elevated SMARCA1 expression and poor prognosis of STAD across the different databases which indicates that SMARCA1 might be an independent risk factor for STAD. Consequently, the cause of this inconsistency in COAD remains ambiguous.

A similar phenomenon was observed in ovarian cancer: transcriptional SMARCA1 expression was low in tumor tissues accompanied by a high protein expression, with the increased expression correlating with poor OS, PFS, and PPS of OV. These inconsistent conclusions might also be due to the different sample size, data resources and underlying interference mechanisms at the translation level. Chemotherapy regimens might play a critical role in the SMARCA1-related prognosis (Table 2). Although our study discovered that high expression of SMARCA1 is related to poor prognosis of OV, factors contributing to a difference in transcriptional and translational expression need further investigation.

Our analysis of genitourinary cancer achieved comparatively consistent conclusions. The poorer prognosis of KIRC, KIRP, KICH and UCEC were correlated with lower expression of SMARCA1, which was also down-regulated in tumor tissues, indicating that low SMARCA1 expression might be an independent risk factor for genitourinary. Due to insufficient TGCT data, we only evaluated SMARCA1 expression and DNA methylation. SMARCA1 expression levels were low in tumor tissues whereas results from the DNA methylation level analysis showed that two sites (cg08617833and cg24447042) in the promoter region were negatively correlated with SMARCA1 expression. However, another two sites (cg04596655 and cg18751555) in the non-promoter region were positively associated with SMARCA1 expression. This suggests that, some molecular biomarkers or pathways are involved in the progression of SMARCA1 expression and need further evaluation.

In our study, we first explored the potential association between SMARCA1 expression and TMB/MSI across all tumors in TCGA, which needs clinical evidence to verify and validate. We then combined the bioinformation on SMARCA1-associated components and correlated genes covering all cancers for a chain of enrichment analyses to identify the critical role of “pyrimidine/purine metabolism pathways”, “chromatin remodeling and modification” and “ubiquitin-protein transferase activity” in oncogenicity of carcinomas. Furthermore, applying multiple deconvolution algorithms helped us obtain a statistically significant positive correlation between SMARCA1 and cancer-associated fibroblasts in BRCA, CESC, COAD, ESCA, head and neck squamous cell carcinoma (HNSC), HNSC-HPV-, PAAD, READ, STAD and TGCT. Using the same methods, we obtained a statistically significant negative correlation between SMARCA1 and CD8+ T-cells immune-infiltration in the cancers of HNSC, HNSC-HPV-, LUAD and SARC. This indicated that SMARCA1 expression might play a critical role in certain cancers through unknown mechanisms.

We also estimated the correlation between SMARCA1 DNA methylation and protein phosphorylation concerning different neoplasms. Due to insufficient relevant studies on the subject matter, we only observed a significant correlation between DNA methylation in LAML, DLBC, LGG and TGCT and phosphorylation at the S116 site in LUAD and UCEC without subsequent analyses of other biological information. Although the information we revealed was limited, the study still indicated that DNA methylation was related to potential sites and types of carcinomas, providing a direction for further research to some extent.

There are several limitations in our study. First, the SMARCA1 gene was not at the frontier of research, this restricted the relevant studies on it. Little attention has been paid to some types of cancers, this has resulted in limited sample sizes for those cancers which has a negative impact on the findings. Secondly, although we used plenty of databases and tools to elaborate on the role of SMARCA1 on tumorigenesis in many cancers, some of them were still unwell defined, primarily attributing to data updates being out of sync or to each database performing a singular function. We did not conduct the experiments or analyze the clinical data to provide a more concrete basis for the conclusions owing to the massiveness of the data required to complete the experimentation. Such experimentation is not feasible for a single center. Furthermore, as a bioinformatic analysis, although the batch effect of samples and cross-platforms had been corrected by ComBat, the potential interference was common and difficult to completely eliminate. In addition, we did not conduct an in-depth analysis of tumor-related immune infiltration and PPI due to space constrains. Although the analysis was insufficient, it is still meaningful in guiding future research direction to some extent.

SMARCA gene and its subsets play a dominant role in chromatin remodeling and restoration, whose deficiency might induce several malignancies [60]. Following this viewpoint, we integrated the big data of SMARCA1 gene expression, associated diagnosis, prognosis, genetic mutation, DNA methylation, protein phosphorylation, tumor-related immune infiltration, TMB and MSI across a spectrum of cancers to conduct a pan-cancer analysis and determine the potential correlations among them. Although there are some contradictory results, our study contributes to an initial understanding of the function and role of SMARCA1 in oncogenicity. It points out the direction of further prospective research that can provide much need insight and resolve outstanding inconsistencies.

Supporting information

S1 Fig. Correlation between SMARCA1 and different disorders and cancers.

The reported pathogenic pathways mediated by SMARCA1 in different disorders and cancers are displayed in a graphic manner. The relevant references are included.

https://doi.org/10.1371/journal.pone.0274823.s001

(TIF)

S2 Fig. A flowchart for methods and materials.

https://doi.org/10.1371/journal.pone.0274823.s002

(PDF)

S3 Fig. Structural characteristics of SMARCA1 in different species.

(A) Genomic location of human SMARCA1; (B) Conserved domains of SMARCA1 protein among diverse species.

https://doi.org/10.1371/journal.pone.0274823.s003

(TIF)

S4 Fig. Phylogenetic tree of SMARCA1.

We obtained the phylogenetic tree of SMARCA1 in different species via COBALT of NCBI. COBALT, constraint-based multiple alignment tool.

https://doi.org/10.1371/journal.pone.0274823.s004

(TIF)

S5 Fig. Diagnostic value of SMARCA1 among pan-cancer via TCGA + GTEx.

SMARCA1 had different diagnostic abilities in different tumors. AUC, area under curve. * P < 0.05, ** P < 0.01, *** P < 0.001.

https://doi.org/10.1371/journal.pone.0274823.s005

(PDF)

References

  1. 1. Blum A, Wang P, Zenklusen JC. SnapShot: TCGA-Analyzed Tumors. Cell. 2018 Apr 5;173(2):530. pmid:29625059.
  2. 2. Tomczak K, Czerwińska P, Wiznerowicz M. The Cancer Genome Atlas (TCGA): an immeasurable source of knowledge. Contemp Oncol (Pozn). 2015;19(1A):A68–77. pmid:25691825; PMCID: PMC4322527.
  3. 3. Zhang Z, Li H, Jiang S, et al. A survey and evaluation of Web-based tools/databases for variant analysis of TCGA data. Brief Bioinform. 2019 Jul 19;20(4):1524–1541. pmid:29617727; PMCID: PMC6781580.
  4. 4. GTEx project maps wide range of normal human genetic variation: A unique catalog and follow-up effort associate variation with gene expression across dozens of body tissues. Am J Med Genet A. 2018 Feb;176(2):263–264. pmid:29334591.
  5. 5. Rhodes DR, Kalyana-Sundaram S, Mahavisno V, et al. Oncomine 3.0: genes, pathways, and networks in a collection of 18,000 cancer gene expression profiles. Neoplasia. 2007 Feb;9(2):166–80. pmid:17356713; PMCID: PMC1813932.
  6. 6. Eberharter A, Becker PB. ATP-dependent nucleosome remodelling: factors and functions. J Cell Sci. 2004 Aug 1;117(Pt 17):3707–11. pmid:15286171.
  7. 7. Goodwin LR, Picketts DJ. The role of ISWI chromatin remodeling complexes in brain development and neurodevelopmental disorders. Mol Cell Neurosci. 2018 Mar;87:55–64. Epub 2017 Dec 15. pmid:29249292.
  8. 8. Giles KA, Gould CM, Du Q, et al. Integrated epigenomic analysis stratifies chromatin remodellers into distinct functional groups. Epigenetics Chromatin. 2019 Feb 12;12(1):12. pmid:30755246; PMCID: PMC6371444.
  9. 9. Neely KE, Workman JL. The complexity of chromatin remodeling and its links to cancer. Biochim Biophys Acta. 2002 Oct 2;1603(1):19–29. pmid:12242108.
  10. 10. Lopes F, Barbosa M, Ameur A, et al. Identification of novel genetic causes of Rett syndrome-like phenotypes. J Med Genet. 2016 Mar;53(3):190–9. Epub 2016 Jan 6. pmid:26740508.
  11. 11. Zheng B, Chen Q, Wang C, et al. Whole-genome sequencing revealed an interstitial deletion encompassing OCRL and SMARCA1 gene in a patient with Lowe syndrome. Mol Genet Genomic Med. 2019 Sep;7(9):e876. Epub 2019 Aug 3. pmid:31376231; PMCID: PMC6732312.
  12. 12. Hota SK, Bhardwaj SK, Deindl S, et al. Nucleosome mobilization by ISW2 requires the concerted action of the ATPase and SLIDE domains. Nat Struct Mol Biol. 2013 Feb;20(2):222–9. Epub 2013 Jan 20. pmid:23334290; PMCID: PMC3565048.
  13. 13. Mueller-Planitz F, Klinker H, Ludwigsen J, et al. The ATPase domain of ISWI is an autonomous nucleosome remodeling machine. Nat Struct Mol Biol. 2013 Jan;20(1):82–9. Epub 2012 Dec 2. pmid:23202585.
  14. 14. Euskirchen G, Auerbach RK, Snyder M. SWI/SNF chromatin-remodeling factors: multiscale analyses and diverse functions. J Biol Chem. 2012 Sep 7;287(37):30897–905. Epub 2012 Sep 5. pmid:22952240; PMCID: PMC3438922.
  15. 15. Homann OR, Misura K, Lamas E, et al. Whole-genome sequencing in multiplex families with psychoses reveals mutations in the SHANK2 and SMARCA1 genes segregating with illness. Mol Psychiatry. 2016 Dec;21(12):1690–1695. Epub 2016 Mar 22. pmid:27001614; PMCID: PMC5033653.
  16. 16. Patil PA, Lombardo K, Sturtevant A, et al. Loss of Expression of a Novel Chromatin Remodeler SMARCA1 in Soft Tissue Sarcoma. J Cytol Histol. 2018;9(6):524. Epub 2018 Nov 23. pmid:31093468; PMCID: PMC6513346.
  17. 17. Liu T, Han Z, Li H, et al. LncRNA DLEU1 contributes to colorectal cancer progression via activation of KPNA3. Mol Cancer. 2018 Aug 11;17(1):118. pmid:30098595; PMCID: PMC6087004.
  18. 18. Ding L, Zhao Y, Dang S, et al. Circular RNA circ-DONSON facilitates gastric cancer growth and invasion via NURF complex dependent activation of transcription factor SOX4. Mol Cancer. 2019 Mar 28;18(1):45. pmid:30922402; PMCID: PMC6437893.
  19. 19. Maurer A, Ortiz-Bruechle N, Guricova K, et al; German study group of bladder cancer. Comparative genomic profiling of glandular bladder tumours. Virchows Arch. 2020 Sep;477(3):445–454. Epub 2020 Mar 20. pmid:32198650; PMCID: PMC7443184.
  20. 20. Haeussler M, Zweig AS, Tyner C, et al. The UCSC Genome Browser database: 2019 update. Nucleic Acids Res. 2019 Jan 8;47(D1):D853–D858. pmid:30407534; PMCID: PMC6323953.
  21. 21. NCBI Resource Coordinators. Database resources of the National Center for Biotechnology Information. Nucleic Acids Res. 2016 Jan 4;44(D1):D7–19. Epub 2015 Nov 28. pmid:26615191; PMCID: PMC4702911.
  22. 22. Papadopoulos JS, Agarwala R. COBALT: constraint-based alignment tool for multiple protein sequences. Bioinformatics. 2007 May 1;23(9):1073–9. Epub 2007 Mar 1. pmid:17332019.
  23. 23. Thul PJ, Lindskog C. The human protein atlas: A spatial map of the human proteome. Protein Sci. 2018 Jan;27(1):233–244. Epub 2017 Oct 10. pmid:28940711; PMCID: PMC5734309.
  24. 24. Li T, Fu J, Zeng Z, et al. TIMER2.0 for analysis of tumor-infiltrating immune cells. Nucleic Acids Res. 2020 Jul 2;48(W1):W509–W514. pmid:32442275; PMCID: PMC7319575.
  25. 25. Tang Z, Kang B, Li C, et al. GEPIA2: an enhanced web server for large-scale expression profiling and interactive analysis. Nucleic Acids Res. 2019 Jul 2;47(W1):W556–W560. pmid:31114875; PMCID: PMC6602440.
  26. 26. Chandrashekar DS, Bashel B, Balasubramanya SAH, et al. UALCAN: A portal for facilitating tumor subgroup gene expression and survival analyses. Neoplasia. 2017 Aug;19(8):649–658. pmid:28732212
  27. 27. Chen F, Chandrashekar DS, Varambally S, et al. Pan-cancer molecular subtypes revealed by mass-spectrometry-based proteomic characterization of more than 500 human cancers. Nat Commun. 2019 Dec 12;10(1):5679. pmid:31831737; PMCID: PMC6908580.
  28. 28. Nagy Á, Lánczky A, Menyhárt O, et al. Validation of miRNA prognostic power in hepatocellular carcinoma using expression data of independent datasets. Sci Rep. 2018 Jun 15;8(1):9227. Erratum in: Sci Rep. 2018 Jul 26;8(1):11515. pmid:29907753
  29. 29. Clough E, Barrett T. The Gene Expression Omnibus Database. Methods Mol Biol. 2016;1418:93–110. pmid:27008011; PMCID: PMC4944384.
  30. 30. Zhang C, Bijlard J, Staiger C, et al. Systematically linking tranSMART, Galaxy and EGA for reusing human translational research data. F1000Res. 2017 Aug 16;6:ELIXIR-1488. pmid:29123641; PMCID: PMC5657030.
  31. 31. Shim S, Yoon BH, Shin IS, et al. Network meta-analysis: application and practice using Stata. Epidemiol Health. 2017 Oct 27;39:e2017047. pmid:29092392; PMCID: PMC5733388.
  32. 32. Unberath P, Knell C, Prokosch HU, et al. Developing New Analysis Functions for a Translational Research Platform: Extending the cBioPortal for Cancer Genomics. Stud Health Technol Inform. 2019;258:46–50. pmid:30942711.
  33. 33. Gao J, Aksoy BA, Dogrusoz U, et al. Integrative analysis of complex cancer genomics and clinical profiles using the cBioPortal. Sci Signal. 2013 Apr 2;6(269):pl1. pmid:23550210; PMCID: PMC4160307.
  34. 34. Cerami E, Gao J, Dogrusoz U, et al. The cBio cancer genomics portal: an open platform for exploring multidimensional cancer genomics data. Cancer Discov. 2012 May;2(5):401–4. Erratum in: Cancer Discov. 2012 Oct;2(10):960. pmid:22588877; PMCID: PMC3956037.
  35. 35. Koch A, Jeschke J, Van Criekinge W, et al. MEXPRESS update 2019. Nucleic Acids Res. 2019 Jul 2;47(W1):W561–W565. pmid:31114869; PMCID: PMC6602516.
  36. 36. Qiao Z, Shiozawa K, Kondo T. Proteomic approach toward determining the molecular background of pazopanib resistance in synovial sarcoma. Oncotarget. 2017 Nov 28;8(65):109587–109595. pmid:29312631; PMCID: PMC5752544.
  37. 37. Bonneville R, Krook MA, Kautto EA, et al. Landscape of Microsatellite Instability Across 39 Cancer Types. JCO Precis Oncol. 2017;2017:PO.17.00073. Epub 2017 Oct 3. pmid:29850653; PMCID: PMC5972025.
  38. 38. Szklarczyk D, Morris JH, Cook H, et al. The STRING database in 2017: quality-controlled protein-protein association networks, made broadly accessible. Nucleic Acids Res. 2017 Jan 4;45(D1):D362–D368. Epub 2016 Oct 18. pmid:27924014; PMCID: PMC5210637.
  39. 39. Bardou P, Mariette J, Escudié F, et al. jvenn: an interactive Venn diagram viewer. BMC Bioinformatics. 2014 Aug 29;15(1):293. pmid:25176396; PMCID: PMC4261873.
  40. 40. Huang DW, Sherman BT, Lempicki RA. Systematic and integrative analysis of large gene lists using DAVID Bioinformatics Resources. Nature Protoc. 2009;4(1):44–57. pmid:19131956
  41. 41. Huang DW, Sherman BT, Lempicki RA. Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids Res. 2009;37(1):1–13. pmid:19033363
  42. 42. Chan BKC. Data Analysis Using R Programming. Adv Exp Med Biol. 2018;1082:47–122. pmid:30357717.
  43. 43. Kikuchi T, Mimura K, Ashizawa M, et al. Characterization of tumor-infiltrating immune cells in relation to microbiota in colorectal cancers. Cancer Immunol Immunother. 2020 Jan;69(1):23–32. Epub 2019 Nov 26. pmid:31768581.
  44. 44. Steven A, Seliger B. The Role of Immune Escape and Immune Cell Infiltration in Breast Cancer. Breast Care (Basel). 2018 Mar;13(1):16–21. Epub 2018 Feb 2. pmid:29950962; PMCID: PMC6016054.
  45. 45. Sahai E, Astsaturov I, Cukierman E, et al. A framework for advancing our understanding of cancer-associated fibroblasts. Nat Rev Cancer. 2020 Mar;20(3):174–186. Epub 2020 Jan 24. pmid:31980749; PMCID: PMC7046529.
  46. 46. Miyai Y, Esaki N, Takahashi M, et al. Cancer-associated fibroblasts that restrain cancer progression: Hypotheses and perspectives. Cancer Sci. 2020 Apr;111(4):1047–1057. Epub 2020 Mar 10. pmid:32060987; PMCID: PMC7156845.
  47. 47. Chen X, Song E. Turning foes to friends: targeting cancer-associated fibroblasts. Nat Rev Drug Discov. 2019 Feb;18(2):99–115. pmid:30470818.
  48. 48. Kwa MQ, Herum KM, Brakebusch C. Cancer-associated fibroblasts: how do they contribute to metastasis? Clin Exp Metastasis. 2019 Apr;36(2):71–86. Epub 2019 Mar 7. pmid:30847799.
  49. 49. Li Taiwen, Fan Jingyu, Wang Binbin, Traugh Nicole, Chen Qianming, Liu Jun S., et al. TIMER: A web server for comprehensive analysis of tumor-infiltrating immune cells. Cancer Research. 2017;77(21):e108–e110. pmid:29092952
  50. 50. Li Bo, Severson Eric, Pignon Jean-Christophe, Zhao Haoquan, Li Taiwen, Novak Jesse, et al. Comprehensive analyses of tumor immunity: implications for cancer immunotherapy. Genome Biology. 2016;17(1):174. pmid:27549193
  51. 51. Chan TA, Yarchoan M, Jaffee E, et al. Development of tumor mutation burden as an immunotherapy biomarker: utility for the oncology clinic. Ann Oncol. 2019 Jan 1;30(1):44–56. pmid:30395155; PMCID: PMC6336005.
  52. 52. Chang L, Chang M, Chang HM, et al. Microsatellite Instability: A Predictive Biomarker for Cancer Immunotherapy. Appl Immunohistochem Mol Morphol. 2018 Feb;26(2): e15–e21. pmid:28877075
  53. 53. Kanehisa M. and Goto S.; KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucleic Acids Res. 28, 27–30 (2000). pmid:10592173
  54. 54. Kanehisa M; Toward understanding the origin and evolution of cellular organisms. Protein Sci. 28, 1947–1951 (2019). pmid:31441146
  55. 55. Kanehisa M., Furumichi M., Sato Y., Ishiguro-Watanabe M., and Tanabe M.; KEGG: integrating viruses and cellular organisms. Nucleic Acids Res. 49, D545–D551 (2021). pmid:33125081
  56. 56. Yuan J, Ghosal G, Chen J. The HARP-like domain-containing protein AH2/ZRANB3 binds to PCNA and participates in cellular response to replication stress. Mol Cell. 2012 Aug 10;47(3):410–21. Epub 2012 Jun 14. pmid:22705370; PMCID: PMC3601832.
  57. 57. Wu SF, Zhang H, Cairns BR. Genes for embryo development are packaged in blocks of multivalent chromatin in zebrafish sperm. Genome Res. 2011 Apr;21(4):578–89. Epub 2011 Mar 7. pmid:21383318; PMCID: PMC3065705.
  58. 58. Metzakopian E, Bouhali K, Alvarez-Saavedra M, et al. Genome-wide characterisation of Foxa1 binding sites reveals several mechanisms for regulating neuronal differentiation in midbrain dopamine cells. Development. 2015 Apr 1;142(7):1315–24. pmid:25804738; PMCID: PMC4378246.
  59. 59. Ye Y, Xiao Y, Wang W, et al. Inhibition of expression of the chromatin remodeling gene, SNF2L, selectively leads to DNA damage, growth inhibition, and cancer cell death. Mol Cancer Res. 2009 Dec;7(12):1984–99. Epub 2009 Dec 8. pmid:19996304.
  60. 60. Chetty R, Serra S. SMARCA family of genes. J Clin Pathol. 2020 May;73(5):257–260. pmid:32312722.