Figures
Abstract
Background
Colorectal cancer is one of the most deadliest malignancies worldwide. Due to the dearth of appropriate biomarkers, the diagnosis of this mortal disease is usually deferred, in its turn, culminating in the failure of prevention. By the same token, proper biomarkers are at play in determining the quality of prognosis. In other words, the survival rate is contingent upon the regulation of such biomarkers.
Materials and methods
The information regarding expression (GSE41258, and GSE31905), methylation (GSE101764), and miRNA (dbDEMC) were downloaded. MEXPRESS and GEPIA confirmed the validated differentially expressed/methylated genes using TCGA data. Taking advantage of the correlation plots and receiver-operating-characteristic (ROC) curves, expression and methylation profiles were compared. The interactions between validated differentially expressed genes and differentially expressed miRNA were recognized and visualized by miRTarBase and Cytoscape, respectively. Then, the protein-protein interaction (PPI) network and hub genes were established via STRING and Cytohubba plugin. Utilizing R packages (DOSE, Enrichplot, and clusterProfiler) and DAVID database, the Functional Enrichment analysis and the detection of KEGG pathways were performed. Ultimately, in order to recognize the prognostic value of found biomarkers, they were evaluated through drawing survival plots for CRC patients.
Results
In this research, we found an expression profile (with 13 novel genes), a methylation profile (with two novel genes), and a miRNA profile with diagnostic value. Concerning diagnosis, the expression profile was evaluated more powerful in comparison with the methylation profile. Furthermore, a prognosis-related expression profile was detected.
Citation: Mehrgou A, Teimourian S (2022) Update of gene expression/methylation and MiRNA profiling in colorectal cancer; application in diagnosis, prognosis, and targeted therapy. PLoS ONE 17(3): e0265527. https://doi.org/10.1371/journal.pone.0265527
Editor: Eduardo Andrés-León, Institute of Parasitology and Biomedicine, SPAIN
Received: June 22, 2021; Accepted: March 2, 2022; Published: March 25, 2022
Copyright: © 2022 Mehrgou, Teimourian. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: The datasets supporting the conclusions of this article are available in the Gene Expression Omnibus repository, [https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE41258 (GSE41258), https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE101764 (GSE101764), https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE31905 (GSE31905)].
Funding: The author(s) received no specific funding for this work.
Competing interests: The authors have declared that no competing interests exist.
1. Introduction
Nowadays, colorectal cancer is one of the most rampant malignancies in the world [1–7]. Although the rate of mortality and morbidity of CRC is considered high, these rates have been decreased among people age more than 50 and on the other hand, this reduction has been compensated in less-than-50-year old people [2, 3]. Among the risk factors of CRC, we can point out population aging, the dearth of physical activity, obesity, abstinence of fruits and vegetables, alcohol consumption, tobacco, and the like [3]. The existence of heterogeneity leads to rough circumstances deferring the achievement of proper outcomes in CRC patients [2, 8, 9]. The quality of the prognosis of CRC patients is ultimately contingent on the time of diagnosis. The earlier the disease is diagnosed the more chance will be provided for patients to experience 5-year-survival [5, 10]. Since CRC development is a multiple-step phenomenon including, activation of oncogenes and inactivation of tumor suppressor genes, determining the molecular markers can play a main role as an excellent indicator for this disease [2, 5, 11, 12].
Aberrant methylation of DNA is one of the effective and culpable epigenetic phenomena in developing malignancies, such as CRC [13, 14]. Methylation can be observed in promoter, non-promoter, coding regions, and non-coding regions throughout the DNA sequence whereby, creates varied clinical features in CRC patients [13, 15]. Embryonic development, cell proliferation, and gene expression are under the control of methylation. Furthermore, methylation has the ability to dysregulate various cellular processes through silencing tumor suppressor genes culminating in malignancies, eventually [13]. Methylation is involved in controlling gene expression in both healthy and diseased statuses [16–18]. In most cases, methylation occurs in the vicinity of the transcription start site (TSS). However, even in promoters with a low density of CpG islands, methylation can regulate the corresponding gene expression [17, 19–21]. It is noteworthy that the methylation in non-promoter regions can modulate 1) splicing, 2) alternative gene transcript expression through an alternative promoter, and 3) activation of enhancers [22].
In the present study, we utilize microarray information so as to analyze methylation and expression profiles in colorectal cancer. Paving the way to reach our goal, we take advantage of R packages, MEXPRESS, GEPIA, correlation plot, ROC curve, Cytohubba plugin, and DAVID. Then, the possible interactions, which the found genes have with miRNAs, are investigated.
2. Results
2.1. Recognized Differentially Expressed Genes (DEGs), validated Differentially Expressed Genes (vDEGs), and Differentially Methylated Genes (DMGs)
According to our analysis of GSE41258 (Fig 1), 181 DEGs, out of which 124 and 57 were down-regulated and up-regulated respectively, were detected (S1 Table). Furthermore, the analysis of GSE101764 (Fig 2) produced 9331 DMGs, out of which 3758 and 5573 were included in hyper-methylated and hypo-methylated DMGs, respectively (S2 Table). The methylation status of samples in GSE101764 is shown in Fig 2. Additionally, the comparison between M-values and Beta-values of those samples are illustrated in Fig 3. Pursuant to the analysis of GSE31905 (Fig 2), as a validation dataset, 1454 DEGs, out of which 980 down-regulated and 474 up-regulated genes were found (S3 Table). The top-ten validated differentially expressed genes in both under-expressed and over-expressed categories are included in the Table 1. Accordingly, 134 DEGs, out of which 91 under-expressed genes to the accompaniment of 43 over-expressed genes were validated (Table 2) (Fig 4).
A) Box plots of pre-processed samples data in GEO2R. The top Box plots relate to the discovery dataset. The middle Box plots pertain to the validation dataset. The bottom Box plots are for the methylation dataset. B) Post-processed (Normalized) samples data. The top Box plots relate to the discovery dataset. The middle Box plots pertain to the validation dataset. The bottom left plot is for the methylation dataset raw data. The bottom right plot is for the methylation dataset normalized data.
As it is demonstrated the most of methylation alterations have occurred in two extrem sides (hypo-methylation and hyper-methylation).
Beta and M values are usually used for visualization and statistical calculations, respectively.
Top) Over-expressed DEGs. Bottom) Under-expressed DEGs.
2.2. Identification of overlapping validated Differentially Expressed Genes-Differentially Methylated Genes (vDEGs-DMGs)
Among all vDEGs and DMGs, 22 hypo-methylated/over-expressed genes (ASCL2, CCL20, CEMIP, CHI3L1, CLDN1, COL11A1, COL1A1, CST1, DPEP1, FAP, INHBA, ITGBL1, KRT23, MMP1, MMP7, MSLN, SLC6A6, SLC7A5, SLCO1B3, TACSTD2, THBS2, and UBD) and 15 hyper-methylated/under-expressed genes (CA12, CHP2, CXCL12, DES, EPB41L3, FGFR2, GPM6A, KCNMA1, MUC4, MYH11, PYY, SCNN1B, TCF21, TNXB, and UGT2A3) were found to be overlapped (Fig 5). The heatmap plots for these vDEGs-DMGs are depicted based on their expression and methylation values in Figs 6 & 7.
Top) Over-expressed/hypo-methylated vDEGs-DMGs. Bottom) Under-expressed/hyper-methylated vDEGs-DMGs.
The top left corner) Expression values of over-expressed/hypo-methylated vDEGs-DMGs. The bottom left corner) Expression values of under-expressed/hyper-methylated vDEGs-DMGs. The top right corner) Methylation values of over-expressed/hypo-methylated vDEGs-DMGs. The bottom right corner) Methylation values of under-expressed/hyper-methylated vDEGs-DMGs.
The top left corner) Expression values of over-expressed/hypo-methylated vDEGs-DMGs in COAD. The bottom left corner) Expression values of under-expressed/hyper-methylated vDEGs-DMGs in COAD. The top right corner) Expression values of over-expressed/hypo-methylated vDEGs-DMGs in READ. The bottom right corner) Expression values of under-expressed/hyper-methylated vDEGs-DMGs in READ.
2.3. Verification of Differentially Methylated Genes (DMGs) based on MEXPRESS
The results produced by the MEXPRESS demonstrated nine and five significant hyper-methylated/under-expressed genes in COAD and READ cancer types, respectively (Fig 8). By the same token, 13 and 11 significant hypo-methylated/over-expressed genes were observed in COAD and READ cancer types, respectively (Fig 9). Also, we had 16 non-significant hyper-methylated/under-expressed genes and 20 non-significant hypo-methylated/over-expressed genes in both cancer types in total. Considering the r-value in the Figs 8 and 9, it indicates the quantity and type (positive or negative) of correlations between the methylation and expression status of various probes on each gene. The determining criterion that we considered to include the significant genes was the remarkable higher numbers of negative r-values in comparison with positive ones in each gene demonstrating more confident negative correlation between the methylation and expression status on a specific gene.
The first nine results pertain to COAD cancer type and the last five results correspond to READ cancer type.
The first thirteen results pertain to COAD cancer type and the last eleven results correspond to READ cancer type.
2.4. Scrutiny of negative correlation between expression and methylation status with correlation plot
Out of all correlation plots drawn for vDEGs-DMGs, solely two genes had statistically significant inverse/negative correlations between their expression and methylation status. CHP2, a hyper-methylated/under-expressed gene, and MSLN, a hypo-methylated/over-expressed gene, were our findings in this regard. The correlation plots of vDEGs-DMGs are illustrated in Figs 10 & 11.
2.5. Determination of specificity and sensitivity using receiver operating characteristic curves
Based on the expression values and the methylation values, the ROC curves assisted us in determining the specificity and the sensitivity of the inverse/negative correlations between the expression and methylation status in CHP2 and MSLN genes which had been previously confirmed (Fig 12). Taking the AUC and P.values into account, the ROC curves showed that CHP2 and MSLN genes can play role as high-quality markers in colorectal cancer. The similar ROC curves as for other vDEGs-DMGs are supplied in Fig 13. The AUC and P.values pertaining to other vDEGs-DMGs are in Table 3 in order to present them in a recognizable style.
The top curves relate to expression and the bottom curves pertain to methylation.
The top curves) Expression profiles. The bottom curves) methylation profiles. The left curves) Over-expressed/hypo-methylated vDEGs-DMGs. The right curves) Under-expressed/hyper-methylated vDEGs-DMGs.
2.6. Recognition of Differentially Expressed MiRNAs (DEMs) using the dbDEMC2
79 DEMs, out of which 38 up-regulated and 41 down-regulated miRNAs were detected (S4 Table). Moreover, the interactions depicting how the found vDEGs are possibly under the regulatory control of DEMs are shown in Fig 14.
Top) Over-expressed vDEGs- down-regulated DEMs. Bottom) Under-expressed vDEGs- up-regulated DEMs.
2.7. GEPIA-verification of validated Differentially Expressed Genes
The expression changes of 32 and 31 over-expressed vDEGs were verified in COAD and READ cancer types, respectively (Fig 15). Also, the GEPIA confirmed the expression change of 38 and 32 under-expressed vDEGs in COAD and READ cancer types, respectively (Fig 16).
2.8. Protein-protein interactions and determination of hub genes
As shown in Fig 17, the interactions connecting vDEGs-related proteins are produced based on the STRING database. To be more meticulous, the top ten hub genes having been filtered by the Cytohubba plugin are illustrated in Fig 18.
2.9. Functional enrichment analysis and KEGG pathways
The gene-ontology analysis of vDEGs produced three terms in the context of biological processes, molecular functions, and cellular components which are depicted in two styles of boxplot and cnet plot (Fig 19). Extracellular matrix organization, receptor ligand activity, and collagen−containing extracellular matrix are BP, MF, and CC with the most participated genes, respectively. Also, extracellular matrix organization, extracellular matrix structural constituent, and collagen−containing extracellular matrix are the most statistically significant BP, MF, and CC, respectively. Cytokine-cytokine receptor interaction, and pancreatic secretion are two KEGG pathways with the most participated genes (Fig 20).
The left collumn) Biological Processes. The middle collumn) Molecular Functions. The right collumn) Cellular Components.
2.10. Prognostic analysis of validated Differentially Expressed Genes (vDEGs) using Kaplan-Meier plots
This step of the analysis showed that eight over-expressed vDEGs to the accompaniment of 15 under-expressed vDEGs possess statistically significant prognostic value (Figs 21 & 22).
Each pale blue line corresponds to low expression in COAD or READ cancer types. Each pale red line corresponds to high expression in COAD or READ cancer types. The bold blue line represents some kind of average of low expression in two cancer types. The bold red line represents some sort of average of high expression in two cancer types.
Each pale blue line corresponds to low expression in COAD or READ cancer types. Each pale red line corresponds to high expression in COAD or READ cancer types. The bold blue line represents some kind of average of low expression in two cancer types. The bold red line represents some sort of average of high expression in two cancer types.
3. Discussion
CRC is the second cancer-related cause of death in the world [15, 23]. Since there is widespread molecular heterogeneity in CRC various clinical properties can be detected in patients. Accordingly, the routine tests such as histology cannot individually afford these complexities, and molecular markers, the information of which have been gathered in several databases and repositories, are at play [15, 24–29]. Even though the methylation process plays an important role in controlling gene expression in normal circumstances, the disruption of this phenomenon is usually recognized in approximately all malignancies [5]. Methylation modulates gene expression as for cell proliferation and differentiation. Ergo, methylation is engaged in controlling normal cell growth and cancerous cell development [5].
In the current study, we funded an attempt to establish a flowchart through using a bioinformatics approach to arrive at profiles that includes genes being distinct in their methylation and/or expression pattern.
Out of the detected hub genes, six (CCL20, CXCL1, CXCL3, CXCL8, CXCL11, and MMP1) were among over-expressed vDEGs which have been corroborated by the GEPIA in both COAD and READ cancer types. However, only CXCL12 was one of the hub genes whose presence in both under-expressed vDEGs and the GEPIA (COAD, and READ) has been verified. The three remaining hub genes (P2RY14, NPY1R, and PYY) were solely validated by the validation dataset. Two of the last-mentioned hub genes (P2RY14, and NPY1R), to the best of our knowledge, have not already been reported as being under-expressed in colorectal cancer. By the same token, three genes (CXCL3, CXCL8, and TESC) not only have been their diagnostic values confirmed by the validation dataset and the GEPIA (COAD, and READ) but also, their prognostic values are proved by the Kaplan-Meier plots. Accordingly, five hub genes (CCL20, CXCL1, CXCL11, CXCL12, and MMP1) are advantageous solely in diagnosis as being completely verified (validation dataset plus GEPIA). It is noteworthy that each gene whose expressional alteration has been confirmed in both COAD and READ cancer types by the GEPIA were considered 1) to have previously known or 2) to be most contingent for having participation in colorectal cancer. In the present study, we became prosperous to announce 13 vDEGs whose expressional participation in colorectal cancer has not been previously declared, including 1) over-expressed ones (as possible oncogenes in colorectal cancer): MFAP2, CELSR3, and 2) under-expressed ones (as possible tumor suppressor genes in colorectal cancer): P2RY14, NPY1R, ARL14, TNXB, IGLV1-44, UGT2B15, IGHA2, IL1R2, PCK1, SOSTDC1, and UGT2A3. Surprisingly, the specificity and sensitivity of reported novel genes, using the expression values of all 186 patients and 54 healthy samples in GSE41258, were sufficiently high in distinguishing healthy individuals from CRC patients (Fig 23). Regarding the detected novel genes, more details from other research works are presented in the following.
Left curve) Over-expressed novel vDEGs. Right curve) Under-expressed novel vDEGs.
MFAP2 has the potential to be considered an oncogene, triggering EMT and causing cancer progression, with diagnostic capability in gastric cancer. Bearing the prognostic value in mind, MFAP2 serves as a negative indicator for overall survival in gastric cancer. Also, this gene is observed increased in head and neck squamous carcinoma, particularly in lymph node metastasis [30, 31]. MFAP2 gene shares its oncogenic role not only in gastric cancer [32] but also in breast cancer. Restriction of MFAP2 can culminate in inhibition of proliferation, migration, and invasion in BC cells [33]. MFAP2 has the ability to determine the phenotype of cancer cells through altering ECM in cancer microenvironment. By the same token, this gene can lead to unpleasant outcomes among patients with liver cancer, pancreatic cancer, renal cancer, and cervical cancer [34].
The expression of ARL14 corresponds positively to prognosis in bladder cancer [35].
The down-regulation of TNXB is observed in cervical cancer and its up-regulation is known as correlated with the inhibition of EMT, migration, and invasion in this cancer [36]. TNXB is down-regulated in endometrial cancer, as well [37].
UGT2B15, as an oncogene, plays a main art in cancer progression, lymph node metastasis, and resistance to therapeutics in prostate cancer. More specifically, this gene can decrease the chance of responding to tamoxifen [38] and castration in breast and prostate cancers, respectively. Additionally, it is previously reported that UGT2B15 is enhanced in gastric cancer [39]. By the same token, the increased level of UGT2B15 expression exists in some subtypes of breast cancer accompanying a better survival rate [38].
In the light of prognosis, IGHA2 is tied in with basal-like and triple-negative breast cancer [40].
It is already proposed that IL1R2 is potentiated to act as an oncogene in that the high level of its expression, through inflammation and angiogenesis, culminated in CRC progression and development. On the other hand, the reduced level of this gene is reported in atherosclerotic lesions [41, 42].
While the over-expression of PCK1 was declared in colon cancer [43], lung cancer [44], melanoma [45], and lymphoma [43], metastatic breast cancer cells [43, 45, 46], and skin cancer [47], the down-regulation of this gene is present in kidney cancer and hepatocellular carcinoma (HCC) [43, 47, 48]. To be more elaborated, PCK1 promotes apoptosis and restrains both proliferation and tumor growth in HCC [43]. On the other hand, the expression of PCK1 is sometimes considered a factor leading to increased proliferation [44, 47, 49, 50].
SOSTDC1, through its down-regulation, provokes poor prognosis, tumor invasion, and low survival rate in multiple cancer types [51–54]. Furthermore, the raised expression level of SOSTDC1 can inhibit tumor growth in gastric cancer [55]. In thyroid cancer, breast cancer, and non-small cell lung carcinoma (NSCLC), SOSTDC1 being able to restrict proliferation is under-expressed [52, 53, 56–60]. In NSCLC, while the down-regulated SOSTDC1 can produce poor prognosis, its up-regulation is capable of inhibiting proliferation, migration, and EMT [53].
A low survival rate was observed in colon cancer patients bearing a high expression level of UGT2A3 [61].
In the ground of methylation, one hyper-methylated/under-expressed gene (CA12) and two hypo-methylated/over-expressed genes (COL1A1, and TACSTD2) were found which were confirmed both by expression (only by the validation dataset) and methylation (by MEXPRESS in two COAD, and READ cancer types). However, we had a chance of detecting three noticeably hyper-methylated/under-expressed genes (TCF21, MYH1, and DES) and seven outstandingly hypo-methylated/over-expressed genes (CCL20, CEMIP, CLDN1, KRT23, MMP7, SLC6A6, and SLC7A5) which not only were among v-DEGs-DMGs but also, pursuant to the GEPIA and the MEXPRESS they are of verified expressional and methylational value in both COAD and READ cancer types. It is worth mentioning that any gene whose methylation change has been demonstrated in both COAD and READ by the MEXPRESS was considered 1) to have previously known or 2) to be most probable for having a role in colorectal cancer. CHP2, as a hyper-methylated/under-expressed gene, had a statistically significant inverse/negative relationship between its expression and methylation. While the expressional alteration of CHP2 gene has been validated by the validation dataset its methylation status was confirmed only in COAD cancer type by the MEXPRESS. Taking all afore-mentioned issues and the appropriate ROC curve of CHP2 into account, we propose the hyper-methylation of this gene as a novel probable diagnostic marker in colorectal cancer. Furthermore, MSLN, as a hypo-methylated/over-expressed gene, illustrated a statistically significant inverse/negative relationship between its expression and methylation levels. Considering firstly, the GEPIA-verified expression change of MSLN gene in both COAD and READ and secondly, not being MEXPRESS-verified methylation status of this gene and for third, its plausible ROC curve we introduce the hypo-methylation of this gene as a contingent diagnostic marker in colorectal cancer for the first time. In the continuation of this research study, among vDEGs-DMGs, we found five (CHP2, CXCL12, EPB41L3, SCNN1B, and PYY) and one (MUC4) hyper-methylated/under-expressed genes which were corroborated in COAD and READ by the MEXPRESS, respectively. These are noteworthy that CXCL12 and EPB41L3 were verified by the GEPIA in COAD and READ cancer types while SCNN1B and PYY were documented only in COAD cancer type by the GEPIA. MUC4 and CHP2 genes have been validated solely by the validation dataset. By the same token, among vDEGs-DMGs, we discovered four (CST1, CHI3L1, UBD, and ASCL2) and two (INHBA, and ITGBL1) hypo-methylated/over-expressed genes which were recognized in COAD and READ by the MEXPRESS, respectively. These are worthwhile to point out that CST1, CHI3L1, UBD, and INHBA genes were verified by the GEPIA in both COAD and READ cancer types while ASCL2 was verified only in COAD cancer type by the GEPIA. ITGBL1 gene has been validated just by the validation dataset.
Pursuant to the correlation plots in this study, there are several genes with an inverse/inverse correlation between their expression and methylation values which were not statistically significant. We conjecture that the reasons behind this fact include 1) the limited number of patients, and 2) the methylation is not the main regulatory system and instead, the alternative mechanism(s) is/are regulating the expression of these genes. By the same token, changing in methylation does not necessarily end up measurable expression alteration. Even in several studies, the positive or direct correlation between gene expression and methylation in the gene body has been reported [22].
In the comparison between the power of expression and methylation profiles, the ROC curves elucidated that while both profiles are adequately sensitive and specific for diagnosis, the expression profile outruns the methylation profile to some extent.
With respect to the genes having interaction(s) with miRNAs, we realized four down-regulated vDEGs (TMEM100, MYH11, CHRDL1, and FAM107A) to the accompaniment of five up-regulated vDEGs (KLK10, CLDN1, SLC6A6, MMP11, and SLC7A5) being documented by the GEPIA in both COAD and READ cancer types. Ultimately, we ascertained one down-regulated vDEG (MYH11) and three up-regulated vDEGs (SLC7A5, SLC6A6, and CLDN1) which are under the dual regulatory mechanisms; miRNAs, and methylation. The expression and the methylation status of the last-mentioned genes have been corroborated by the GEPIA (COAD and READ) and the MEXPRESS (COAD and READ).
4. Conclusion
In conclusion, we averred four and one findings pertaining to the diagnosis and prognosis of colorectal cancer, respectively. Taking the diagnosis into account, an expression profile containing 43 over-expressed vDEGs and 91 under-expressed vDEGs, two of which and 11 of which were novel in CRC respectively, was detected. It is worth re-announcing that two of 11 novel under-expressed vDEGs were included in the hub genes. By the same token, a methylation profile including 22 hypo-methylated/over-expressed vDEGs-DMGs and 15 hyper-methylated/under-expressed vDEGs-DMGs, in per of which one novel differentially methylated gene was discerned, was introduced. According to the ROC curves, in the comparison between the diagnosis power of the expression profile and that of the methylation profile the more specificity and sensitivity for the expression profile were established as for CRC. The recognized miRNA expression profile demonstrated six over-expressed vDEGs interacting with six under-expressed DEMs and nine down-regulated vDEGs interacting with eight up-regulated DEMs. Taking the issue of prognosis into consideration, eight over-expressed vDEGs to the accompaniment of 15 under-expressed vDEGs were ascertained playing a crucial role in determining the survival time as for CRC patients. An outstanding point is that the declared profiles, particularly their novel genes due to their high enough powers in diagnosis, are capable of serving in targeted therapy. However, our results need to be more confirmed by further research studies, particularly clinically, due to two obstructive circumstances which should be tackled; the confined number of patients, and solely-bioinformatics-based analysis.
5. Materials and methods
5.1. Searching for suitable microarray data
Searching in the Gene Expression Omnibus (GEO, http://www.ncbi.nlm.nih.gov/geo) database, and using “Colorectal Neoplasm”, “Case/Control” or “Primary/Normal”, “Homo Sapiens”, and “Expression Profiling by Array” or “Methylation Profiling by Array” as “MeSH Terms”, “All Fields”, “Organism”, and “Study Type”, respectively, we looked for appropriate array datasets. It is noteworthy that in this step, “Tumor Stage”, “Metastasis”, and “any types of intervention” were not considered the determining criteria. Ultimately, we found three array datasets including, 1) GSE41258 [62, 63] (GPL96 [HG-U133A] Affymetrix Human Genome U133A Array]) as a testing or discovery expression array dataset with tissue samples from 54 healthy participants and 186 colorectal cancer patients, 2) GSE101764 [64] (GPL13534 [Illumina HumanMethylation450 BeadChip (HumanMethylation450_15017482)]) as a testing methylation array dataset with tissue samples having obtained from 149 healthy participants and 112 colorectal cancer patients, and 3) GSE31905 [65] (GPL6480 [Agilent-014850 Whole Human Genome Microarray 4x44K G4112F (Probe Name version)]) as a validating expression array dataset with specimens from seven healthy and 55 colorectal cancer patients.
5.2. Identification of Differentially Expressed Genes (DEGs) and Differentially Methylated Genes (DMGs)
Utilizing GEO2R, an online differentially analyzing tool, and “GEOquery” (Version 2.54.0) [66], and “Limma” (Version 3.42.0) [67] R (Version 3.6.3) packages, we exerted log2 transformation and Benjamini & Hochberg method on GSE41258 to find the differentially expressed genes in a case/control comparison manner. By the same token, |LogFC| ≥ 2 and FDR-adj.P.value < 0.05 were the thresholds to define statistical significant results. In view of the satisfactory distribution of the samples’ box plot’s medians, the discovery expression dataset (GSE41258) did not need to be more normalized. Also, we took advantage of “Limma”, “minfi (Version 1.32.0)” [68–74], missMethyl (Version 1.20.4)” [69, 75–77], “minfiData (Version 0.32.0)”, “DMRcate (Version 2.0.0)” [78], “IlluminaHumanMethylation450kanno.ilmn12.hg19 (Version 0.6.0)”, and “IlluminaHumanMethylation450kmanifest (Version 0.4.0)” R packages in order to detect the differentially methylated genes out of GSE101764. So as to perform DMGs, the raw data (IDAT files) of the methylation array dataset was downloaded. We visualized samples according to their quality and then the samples whose P.values were less than 0.05 were preserved (Fig 24). Through the PreprocessFunnorm function, the normalization was exerted. The probes whose qualities were with P.value < 0.01 were kept in the study. Additionally, since our samples were taken from both sexualities we proceeded to omit the probes located on the sex chromosomes. In the next steps, we excluded the probes contained SNPs in CpG islands and the cross-reactive probes. Finally, an unpaired-analysis was conducted in a case/control comparison manner, and the probes with FDR < 0.05 being statistically significant were maintained in the present research study. Then, we converted the approved probes into the genes on which they are located, and by defining |LogFC| ≥ 1 and adj.P.value < 0.05 as thresholds the statistically significant results were introduced. It is noteworthy, we used false discovery rate adjusted P.values through our differential analyses to avoid type I error in multiple testing.
The right side is the zoomed model of the left side. As it is shown all samples are in statistically significant range.
5.3. Validation of Differentially Expressed Genes
Consistent with analyzing strategy (|LogFC| ≥ 2 and FDR-adj.P.value < 0.05) in the testing or discovery expression array dataset, we validated the differentially expressed genes using GSE31905. It is noteworthy that through using the normalizeQuantiles function, the normalization process has been performed to preserve the comparability status. Through the Venny (Version 2.1.0) (https://bioinfogp.cnb.csic.es/tools/venny/) [79] and plotting Venn diagram, we intersected differentially expressed genes between testing and validating expression array datasets.
5.4. Identification of validated Differentially Expressed Genes (vDEGs)-Differentially Methylated Genes (DMGs)
Taking advantage of the Venny diagrams, we attempted to discover 1) over-expressed vDEGs-hypo-methylated DMGs and 2) under-expressed vDEGs-hyper-methylated DMGs. Furthermore, by plotting heatmap (ClustVis, https://biit.cs.ut.ee/clustvis/) [80] for the above-mentioned gene groups, we investigated the pattern through which the expressional levels of those genes are changed between normal and cancer groups. These are worth mentioning that firstly, the expression values used in heatmap plots were obtained from the testing expression array dataset (GSE41258), and secondly, the heatmap plots were drawn pursuant to correlation. By the same token, we drew another heatmap plot based on the TCGA data using the UALCAN database (http://ualcan.path.uab.edu/index.html) [81].
5.5. Verification of validated Differentially Expressed Genes (DEGs)-Differentially Methylated Genes (DMGs) by MEXPRESS and TCGA data
MEXPRESS (https://mexpress.be/) is an online tool for visualizing expression and methylation using TCGA data. Using the MEXPRESS tool, we tried to validate the vDEGs-DMGs based on TCGA data. The criteria we observed in the MEXPRESS were including 1) Omitting the samples whose methylation levels were characterized as null, 2) ordering the samples according to their expressional levels, 3) existence or non-existence of neoadjuvant therapy (as the sole clinical parameter), and 4) performing analysis on colon (COAD) and rectum (READ) cancer types data in TCGA database. So as to verify the DMGs through the MEXPRESS, we considered the existence of a statistically significant inverse/negative relationship between expression and methylation levels of most of the probes in each individual gene.
5.6. Expression-methylation correlation plots for validated Differentially Expressed Genes (vDEGs)-Differentially Methylated Genes (DMGs)
With the produced expression and methylation values obtained from GSE41258 and GSE101764 datasets, respectively, the correlation plots were drawn for normal and cancer participants, separately in order to perform the comparisons more precisely. Since the numbers of patients and normal samples in the above-mentioned datasets are not identical, the redundant patients and healthy samples were excluded randomly out of GSE41258 and GSE101764, respectively in order to match. The negative correlations with P.value < 0.05 were considered statistically significant.
5.7. Receiver operating characteristic curves for validated Differentially Expressed Genes (vDEGs)-Differentially Methylated Genes (DMGs)
Via the easyROC (Version 1.3.1) (http://www.biosoft.hacettepe.edu.tr/easyROC/), a web tool for ROC curve analysis, and using the expression and methylation values from GSE41258 and GSE101764 datasets, respectively we drew ROC curves for vDEGs-DMGs and for genes whose expression-methylation correlations were inverse/negative in the patients’ population. ROC curves were plotted as well for the efficacy of methylation and expression in this disease, separately. Accordingly, higher values in the differentially hyper-methylated and the differentially over-expressed genes were considered the indicators of higher risk. On the other hand, as for the differentially hypo-methylated and the differentially under-expressed genes, the lower values were assumed as the higher risk predisposing hallmarks. Analogous to the correlation plot, the number of patients and normal samples included in ROC curve analysis were set 112 and 54, respectively. The ultimate interpretation of the statistically significant results was based on the area under the ROC curve (AUC) and P.value < 0.05.
5.8. Identification of Differentially Expressed MicroRNAs (DEMs)
The dbDEMC 2.0 database (https://www.picb.ac.cn/dbDEMC/) [82], provides information regarding differentially expressed microRNAs obtained from the newly-released GEO and TCGA datasets in various cancer types. We utilized dbDEMC 2.0 in order to find DEMs in colorectal cancer patients in comparison with normal participants. Thereby, only datasets which included the comparisons between the studied groups were selected (GSE10289 [59 cases, and 7 controls], GSE30454 [54 cases, and 20 controls], GSE35602 [17 cases, and 4 controls], GSE35982 [8 cases, and 8 controls], GSE41012 [20 cases, and 15 controls], and GSE54088 [9 cases, and 10 controls] with 167 patients and 64 normal samples overall). Then, the DEMs with |LogFC| ≥ 2 were opted for remaining analyses.
5.9. Validated Differentially Expressed Genes (vDEGs)-Differentially Expressed MicroRNAs (DEMs) interactions
The miRTarBase database (http://mirtarbase.cuhk.edu.cn/php/index.php) [83], facilitating our way to detect interactions among genes and microRNAs, assisted us in finding the interactions connecting 1) the over-expressed vDEGs to under-expressed DEMs, and 2) the under-expressed vDEGs to over-expressed DEMs. Via the Cytoscape software (Version 3.7.2) those interactions became visualized.
5.10. More investigation of validated Differentially Expressed Genes (vDEGs) by GEPIA and TCGA data
The Gene Expression Profiling Interactive Analysis (GEPIA) database (http://gepia.cancer-pku.cn/) [84], paved the way for us to scrutinize the detected vDEGs in TCGA database. COAD and READ cancer types in TCGA database comprised our patient group and our normal population consisted of TCGA normal to the accompaniment of GTEx data. Following the LogScale transformation, |LogFC| ≥ 2 and P.value < 0.05 were determined as the statistically significant thresholds.
5.11. PPI networks construction and hub genes
The validated differentially expressed genes were investigated by the STRING database (Version 11.0) (https://string-db.org/) [85] to recognize the protein-protein interactions [4] among them. The median confidence of 0.4 was set and the nodes with no connections were ignored. Then, through using the Cytohubba plugin in the Cytoscape software and considering some pivotal benchmarks including, 1) co-expression, 2) homology, 3) experimentally determined interaction, 4) database annotated, and 5) combined score, the top ten hub genes were revealed and visualized.
5.12. Functional enrichment analysis of validated Differentially Expressed Genes (vDEGs)
The box plots and cnet plots for biological process (BP), molecular function (MF), and cellular component [86] were drawn regarding the vDEGs. The afore-mentioned Gene Ontology (GO) analyses were conducted and visualized by DOSE (Version 3.12.0) [87], enrichplot (Version 1.6.1), and clusterProfiler (Version 3.14.3) [88] packages in R software. Taking advantage of the Database for Annotation, Visualization and Integrated Discovery (DAVID) (Version 6.7) (https://david.ncifcrf.gov/), we discerned the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways with P.value < 0.05 the visualization of which was performed using Microsoft Excel 2016.
5.13. Survival analysis
The GEPIA database has also the ability to demonstrate survival plots for genes in various cancer types based on TCGA database. Accordingly, we combined COAD and READ patients groups to unravel the prognostic values of the detected vDEGs. The main characteristics of the survival plots are 1) using the Hazard Ratio (HR) with 95% Confidence Interval (CI), 2) considering the overall survival for patients, and 3) setting “Median” as the group cut off.
Supporting information
S1 Table. Over-expressed and under-expressed Differentially Expressed Genes obtained from the discovery dataset (GSE41258).
https://doi.org/10.1371/journal.pone.0265527.s001
(XLSX)
S2 Table. Hypo-methylated and hyper-methylated Differentially Methylated Genes obtained from (GSE101764).
https://doi.org/10.1371/journal.pone.0265527.s002
(XLSX)
S3 Table. Over-expressed and under-expressed Differentially Expressed Genes obtained from the validation dataset (GSE31905).
https://doi.org/10.1371/journal.pone.0265527.s003
(XLSX)
S4 Table. Down-regulated Differentially Expressed MiRNAs obtained from the dbDEMC 2.0 database.
https://doi.org/10.1371/journal.pone.0265527.s004
(XLSX)
References
- 1. Siegel RL, Miller KD, Fedewa SA, Ahnen DJ, Meester RGS, Barzi A, et al. Colorectal cancer statistics, 2017. CA: a cancer journal for clinicians. 2017;67(3):177–93. pmid:28248415
- 2. Yu C, Hong H, Zhang S, Zong Y, Ma J, Lu A, et al. Identification of key genes and pathways involved in microsatellite instability in colorectal cancer. Molecular medicine reports. 2019;19(3):2065–76. pmid:30664178
- 3. Castellano-Castillo D, Morcillo S, Clemente-Postigo M, Crujeiras AB, Fernandez-Garcia JC, Torres E, et al. Adipose tissue inflammation and VDR expression and methylation in colorectal cancer. Clinical epigenetics. 2018;10:60. pmid:29719581
- 4. Lippi G, Mattiuzzi C, Cervellin G. Meat consumption and cancer risk: a critical review of published meta-analyses. Critical reviews in oncology/hematology. 2016;97:1–14. pmid:26633248
- 5. Wu J, Wang G, He B, Chen X, An Y. Methylation of the UNC5C gene and its protein expression in colorectal cancer. Tumour biology: the journal of the International Society for Oncodevelopmental Biology and Medicine. 2017;39(4):1010428317697564. pmid:28378635
- 6. Ferlay J, Soerjomataram I, Dikshit R, Eser S, Mathers C, Rebelo M, et al. Cancer incidence and mortality worldwide: sources, methods and major patterns in GLOBOCAN 2012. International journal of cancer. 2015;136(5):E359–86. pmid:25220842
- 7. Vinson KE, George DC, Fender AW, Bertrand FE, Sigounas G. The Notch pathway in colorectal cancer. International journal of cancer. 2016;138(8):1835–42. pmid:26264352
- 8. Markowitz SD, Bertagnolli MM. Molecular origins of cancer: Molecular basis of colorectal cancer. The New England journal of medicine. 2009;361(25):2449–60. pmid:20018966
- 9. Budinska E, Popovici V, Tejpar S, D’Ario G, Lapique N, Sikora KO, et al. Gene expression patterns unveil a new level of molecular heterogeneity in colorectal cancer. The Journal of pathology. 2013;231(1):63–76. pmid:23836465
- 10. Coppedè F, Lopomo A, Spisni R, Migliore L. Genetic and epigenetic biomarkers for diagnosis, prognosis and treatment of colorectal cancer. World journal of gastroenterology. 2014;20(4):943–56. pmid:24574767
- 11. De Roock W, De Vriendt V, Normanno N, Ciardiello F, Tejpar S. KRAS, BRAF, PIK3CA, and PTEN mutations: implications for targeted therapies in metastatic colorectal cancer. The Lancet Oncology. 2011;12(6):594–603. pmid:21163703
- 12. Le DT, Uram JN, Wang H, Bartlett BR, Kemberling H, Eyring AD, et al. PD-1 Blockade in Tumors with Mismatch-Repair Deficiency. The New England journal of medicine. 2015;372(26):2509–20. pmid:26028255
- 13. Sarabi MM, Naghibalhossaini F. The impact of polyunsaturated fatty acids on DNA methylation and expression of DNMTs in human colorectal cancer cells. Biomedicine & pharmacotherapy = Biomedecine & pharmacotherapie. 2018;101:94–9. pmid:29477476
- 14. Esteller M. Epigenetics in cancer. The New England journal of medicine. 2008;358(11):1148–59. pmid:18337604
- 15. Shi L, Li X, Wu Z, Li X, Nie J, Guo M, et al. DNA methylation-mediated repression of miR-181a/135a/302c expression promotes the microsatellite-unstable colorectal cancer development and 5-FU resistance via targeting PLAG1. Journal of genetics and genomics = Yi chuan xue bao. 2018;45(4):205–14. pmid:29735329
- 16. Morgan AE, Davies TJ, Mc Auley MT. The role of DNA methylation in ageing and cancer. The Proceedings of the Nutrition Society. 2018;77(4):412–22. pmid:29708096
- 17. van Vlodrop IJ, Niessen HE, Derks S, Baldewijns MM, van Criekinge W, Herman JG, et al. Analysis of promoter CpG island hypermethylation in cancer: location, location, location! Clinical cancer research: an official journal of the American Association for Cancer Research. 2011;17(13):4225–31. pmid:21558408
- 18. Portela A, Esteller M. Epigenetic modifications and human disease. Nature biotechnology. 2010;28(10):1057–68. pmid:20944598
- 19. Irizarry RA, Ladd-Acosta C, Wen B, Wu Z, Montano C, Onyango P, et al. The human colon cancer methylome shows similar hypo- and hypermethylation at conserved tissue-specific CpG island shores. Nature genetics. 2009;41(2):178–86. pmid:19151715
- 20. Maunakea AK, Nagarajan RP, Bilenky M, Ballinger TJ, D’Souza C, Fouse SD, et al. Conserved role of intragenic DNA methylation in regulating alternative promoters. Nature. 2010;466(7303):253–7. pmid:20613842
- 21. Domann FE, Rice JC, Hendrix MJ, Futscher BW. Epigenetic silencing of maspin gene expression in human breast cancers. International journal of cancer. 2000;85(6):805–10. pmid:10709100
- 22. Koch A, Joosten SC, Feng Z, de Ruijter TC, Draht MX, Melotte V, et al. Analysis of DNA methylation in cancer: location revisited. Nature reviews Clinical oncology. 2018;15(7):459–66. pmid:29666440
- 23. Thanki K, Nicholls ME, Gajjar A, Senagore AJ, Qiu S, Szabo C, et al. Consensus Molecular Subtypes of Colorectal Cancer and their Clinical Implications. International biological and biomedical journal. 2017;3(3):105–11. pmid:28825047
- 24. Guinney J, Dienstmann R, Wang X, de Reyniès A, Schlicker A, Soneson C, et al. The consensus molecular subtypes of colorectal cancer. Nature medicine. 2015;21(11):1350–6. pmid:26457759
- 25. Copija A, Waniczek D, Witkoś A, Walkiewicz K, Nowakowska-Zajdel E. Clinical Significance and Prognostic Relevance of Microsatellite Instability in Sporadic Colorectal Cancer Patients. International journal of molecular sciences. 2017;18(1). pmid:28067827
- 26. Liu W, Li L, Ye H, Tao H, He H. Role of COL6A3 in colorectal cancer. Oncology reports. 2018;39(6):2527–36. pmid:29620224
- 27. Roseweir AK, McMillan DC, Horgan PG, Edwards J. Colorectal cancer subtypes: Translation to routine clinical pathology. Cancer treatment reviews. 2017;57:1–7. pmid:28505475
- 28. Punt CJ, Koopman M, Vermeulen L. From tumour heterogeneity to advances in precision treatment of colorectal cancer. Nature reviews Clinical oncology. 2017;14(4):235–46. pmid:27922044
- 29. Roychowdhury S, Chinnaiyan AM. Translating cancer genomes and transcriptomes for precision oncology. CA: a cancer journal for clinicians. 2016;66(1):75–88. pmid:26528881
- 30. Sun T, Wang D, Ping Y, Sang Y, Dai Y, Wang Y, et al. Integrated profiling identifies SLC5A6 and MFAP2 as novel diagnostic and prognostic biomarkers in gastric cancer patients. International journal of oncology. 2020;56(2):460–9. pmid:31894266
- 31. Silveira NJ, Varuzza L, Machado-Lima A, Lauretto MS, Pinheiro DG, Rodrigues RV, et al. Searching for molecular markers in head and neck squamous cell carcinomas (HNSCC) by statistical and bioinformatic analysis of larynx-derived SAGE libraries. BMC medical genomics. 2008;1:56. pmid:19014460
- 32. Wang JK, Wang WJ, Cai HY, Du BB, Mai P, Zhang LJ, et al. MFAP2 promotes epithelial-mesenchymal transition in gastric cancer cells by activating TGF-β/SMAD2/3 signaling pathway. OncoTargets and therapy. 2018;11:4001–17. pmid:30034240
- 33. Gong X, Dong T, Niu M, Liang X, Sun S, Zhang Y, et al. lncRNA LCPAT1 Upregulation Promotes Breast Cancer Progression via Enhancing MFAP2 Transcription. Molecular therapy Nucleic acids. 2020;21:804–13. pmid:32791452
- 34. Yao LW, Wu LL, Zhang LH, Zhou W, Wu L, He K, et al. MFAP2 is overexpressed in gastric cancer and promotes motility via the MFAP2/integrin α5β1/FAK/ERK pathway. Oncogenesis. 2020;9(2):17. pmid:32054827
- 35. Wang L, Shi J, Huang Y, Liu S, Zhang J, Ding H, et al. A six-gene prognostic model predicts overall survival in bladder cancer patients. Cancer cell international. 2019;19:229. pmid:31516386
- 36. Yan SP, Chu DX, Qiu HF, Xie Y, Wang CF, Zhang JY, et al. LncRNA LINC01305 silencing inhibits cell epithelial-mesenchymal transition in cervical cancer by inhibiting TNXB-mediated PI3K/Akt signalling pathway. Journal of cellular and molecular medicine. 2019;23(4):2656–66. pmid:30697971
- 37. Men C, Chai H, Song X, Li Y, Du H, Ren Q. Identification of DNA methylation associated gene signatures in endometrial cancer via integrated analysis of DNA methylation and gene expression systematically. Journal of gynecologic oncology. 2017;28(6):e83. pmid:29027401
- 38. Hu DG, Selth LA, Tarulli GA, Meech R, Wijayakumara D, Chanawong A, et al. Androgen and Estrogen Receptors in Breast Cancer Coregulate Human UDP-Glucuronosyltransferases 2B15 and 2B17. Cancer research. 2016;76(19):5881–93. pmid:27496708
- 39. Chen X, Li D, Wang N, Yang M, Liao A, Wang S, et al. Bioinformatic analysis suggests that UGT2B15 activates the Hippo‑YAP signaling pathway leading to the pathogenesis of gastric cancer. Oncology reports. 2018;40(4):1855–62. pmid:30066917
- 40. Larsson C, Ehinger A, Winslow S, Leandersson K, Klintman M, Dahl L, et al. Prognostic implications of the expression levels of different immunoglobulin heavy chain-encoding RNAs in early breast cancer. NPJ breast cancer. 2020;6:28. pmid:32656317
- 41. Pou J, Martínez-González J, Rebollo A, Rodríguez C, Rodríguez-Calvo R, Martín-Fuentes P, et al. Type II interleukin-1 receptor expression is reduced in monocytes/macrophages and atherosclerotic lesions. Biochimica et biophysica acta. 2011;1811(9):556–63. pmid:21683158
- 42. Mar AC, Chu CH, Lee HJ, Chien CW, Cheng JJ, Yang SH, et al. Interleukin-1 Receptor Type 2 Acts with c-Fos to Enhance the Expression of Interleukin-6 and Vascular Endothelial Growth Factor A in Colon Cancer Cells and Induce Angiogenesis. The Journal of biological chemistry. 2015;290(36):22212–24. pmid:26209639
- 43. Grasmann G, Smolle E, Olschewski H, Leithner K. Gluconeogenesis in cancer cells—Repurposing of a starvation-induced metabolic pathway? Biochimica et biophysica acta Reviews on cancer. 2019;1872(1):24–36. pmid:31152822
- 44. Leithner K, Hrzenjak A, Trötzmüller M, Moustafa T, Köfeler HC, Wohlkoenig C, et al. PCK2 activation mediates an adaptive response to glucose depletion in lung cancer. Oncogene. 2015;34(8):1044–50. pmid:24632615
- 45. Li Y, Luo S, Ma R, Liu J, Xu P, Zhang H, et al. Upregulation of cytosolic phosphoenolpyruvate carboxykinase is a critical metabolic event in melanoma cells that repopulate tumors. Cancer research. 2015;75(7):1191–6. pmid:25712344
- 46. Chen EI, Hewel J, Krueger JS, Tiraby C, Weber MR, Kralli A, et al. Adaptation of energy metabolism in breast cancer brain metastases. Cancer research. 2007;67(4):1472–86. pmid:17308085
- 47. Liu MX, Jin L, Sun SJ, Liu P, Feng X, Cheng ZL, et al. Metabolic reprogramming by PCK1 promotes TCA cataplerosis, oxidative stress and apoptosis in liver cancer cells and suppresses hepatocellular carcinoma. Oncogene. 2018;37(12):1637–53. pmid:29335519
- 48. Mao Y, Keller ET, Garfield DH, Shen K, Wang J. Stromal cells in tumor microenvironment and breast cancer. Cancer metastasis reviews. 2013;32(1–2):303–15. pmid:23114846
- 49. Montal ED, Dewi R, Bhalla K, Ou L, Hwang BJ, Ropell AE, et al. PEPCK Coordinates the Regulation of Central Carbon Metabolism to Promote Cancer Cell Growth. Molecular cell. 2015;60(4):571–83. pmid:26481663
- 50. Vincent EE, Sergushichev A, Griss T, Gingras MC, Samborska B, Ntimbane T, et al. Mitochondrial Phosphoenolpyruvate Carboxykinase Regulates Metabolic Adaptation and Enables Glucose-Independent Tumor Growth. Molecular cell. 2015;60(2):195–207. pmid:26474064
- 51. Liu L, Wu S, Yang Y, Cai J, Zhu X, Wu J, et al. SOSTDC1 is down-regulated in non-small cell lung cancer and contributes to cancer cell proliferation. Cell & bioscience. 2016;6:24. pmid:27087917
- 52. Liang W, Guan H, He X, Ke W, Xu L, Liu L, et al. Down-regulation of SOSTDC1 promotes thyroid cancer cell proliferation via regulating cyclin A2 and cyclin E2. Oncotarget. 2015;6(31):31780–91. pmid:26378658
- 53. Chen G, Gong H, Wang T, Wang J, Han Z, Bai G, et al. SOSTDC1 inhibits bone metastasis in non-small cell lung cancer and may serve as a clinical therapeutic target. International journal of molecular medicine. 2018;42(6):3424–36. pmid:30320379
- 54. Zhou Q, Chen J, Feng J, Xu Y, Zheng W, Wang J. SOSTDC1 inhibits follicular thyroid cancer cell proliferation, migration, and EMT via suppressing PI3K/Akt and MAPK/Erk signaling pathways. Molecular and cellular biochemistry. 2017;435(1–2):87–95. pmid:28551845
- 55. Cui Y, Zhang F, Jia Y, Sun L, Chen M, Wu S, et al. The BMP antagonist, SOSTDC1, restrains gastric cancer progression via inactivation of c-Jun signaling. American journal of cancer research. 2019;9(11):2331–48. pmid:31815038
- 56. Zhou Q, Chen J, Feng J, Wang J. E4BP4 promotes thyroid cancer proliferation by modulating iron homeostasis through repression of hepcidin. Cell death & disease. 2018;9(10):987. pmid:30250199
- 57. Xiao B, Chen L, Ke Y, Hang J, Cao L, Zhang R, et al. Identification of methylation sites and signature genes with prognostic value for luminal breast cancer. BMC cancer. 2018;18(1):405. pmid:29642861
- 58. Clausen KA, Blish KR, Birse CE, Triplette MA, Kute TE, Russell GB, et al. SOSTDC1 differentially modulates Smad and beta-catenin activation and is down-regulated in breast cancer. Breast cancer research and treatment. 2011;129(3):737–46. pmid:21113658
- 59. Rawat A, Gopisetty G, Thangarajan R. E4BP4 is a repressor of epigenetically regulated SOSTDC1 expression in breast cancer cells. Cellular oncology (Dordrecht). 2014;37(6):409–19. pmid:25338303
- 60. Gopal G, Raja UM, Shirley S, Rajalekshmi KR, Rajkumar T. SOSTDC1 down-regulation of expression involves CpG methylation and is a potential prognostic marker in gastric cancer. Cancer genetics. 2013;206(5):174–82. pmid:23830730
- 61. Pang B, Xu X, Lu Y, Jin H, Yang R, Jiang C, et al. Prediction of new targets and mechanisms for quercetin in the treatment of pancreatic cancer, colon cancer, and rectal cancer. Food & function. 2019;10(9):5339–49.
- 62. Sheffer M, Bacolod MD, Zuk O, Giardina SF, Pincas H, Barany F, et al. Association of survival and disease progression with chromosomal instability: a genomic exploration of colorectal cancer. Proceedings of the National Academy of Sciences of the United States of America. 2009;106(17):7131–6. pmid:19359472
- 63. Martin ML, Zeng Z, Adileh M, Jacobo A, Li C, Vakiani E, et al. Logarithmic expansion of LGR5(+) cells in human colorectal cancer. Cellular signalling. 2018;42:97–105. pmid:28958617
- 64. Barrow TM, Klett H, Toth R, Böhm J, Gigic B, Habermann N, et al. Smoking is associated with hypermethylation of the APC 1A promoter in colorectal cancer: the ColoCare Study. The Journal of pathology. 2017;243(3):366–75. pmid:28791728
- 65. Anders M, Fehlker M, Wang Q, Wissmann C, Pilarsky C, Kemmner W, et al. Microarray meta-analysis defines global angiogenesis-related gene expression signatures in human carcinomas. Molecular carcinogenesis. 2013;52(1):29–38. pmid:22012870
- 66. Davis S, Meltzer PS. GEOquery: a bridge between the Gene Expression Omnibus (GEO) and BioConductor. Bioinformatics (Oxford, England). 2007;23(14):1846–7.
- 67. Ritchie ME, Phipson B, Wu D, Hu Y, Law CW, Shi W, et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic acids research. 2015;43(7):e47. pmid:25605792
- 68. Aryee MJ, Jaffe AE, Corrada-Bravo H, Ladd-Acosta C, Feinberg AP, Hansen KD, et al. Minfi: a flexible and comprehensive Bioconductor package for the analysis of Infinium DNA methylation microarrays. Bioinformatics (Oxford, England). 2014;30(10):1363–9. pmid:24478339
- 69. Maksimovic J, Gordon L, Oshlack A. SWAN: Subset-quantile within array normalization for illumina infinium HumanMethylation450 BeadChips. Genome biology. 2012;13(6):R44. pmid:22703947
- 70. Fortin JP, Labbe A, Lemire M, Zanke BW, Hudson TJ, Fertig EJ, et al. Functional normalization of 450k methylation array data improves replication in large cancer studies. Genome biology. 2014;15(12):503. pmid:25599564
- 71. Triche TJ Jr., Weisenberger DJ, Van Den Berg D, Laird PW, Siegmund KD. Low-level processing of Illumina Infinium DNA Methylation BeadArrays. Nucleic acids research. 2013;41(7):e90. pmid:23476028
- 72. Fortin JP, Hansen KD. Reconstructing A/B compartments as revealed by Hi-C using long-range correlations in epigenetic data. Genome biology. 2015;16(1):180. pmid:26316348
- 73. Andrews SV, Ladd-Acosta C, Feinberg AP, Hansen KD, Fallin MD. "Gap hunting" to characterize clustered probe signals in Illumina methylation array data. Epigenetics & chromatin. 2016;9:56. pmid:27980682
- 74. Fortin JP, Triche TJ Jr., Hansen KD. Preprocessing, normalization and integration of the Illumina HumanMethylationEPIC array with minfi. Bioinformatics (Oxford, England). 2017;33(4):558–60. pmid:28035024
- 75. Phipson B, Oshlack A. DiffVar: a new method for detecting differential variability with application to methylation in cancer and aging. Genome biology. 2014;15(9):465. pmid:25245051
- 76. Maksimovic J, Gagnon-Bartsch JA, Speed TP, Oshlack A. Removing unwanted variation in a differential methylation analysis of Illumina HumanMethylation450 array data. Nucleic acids research. 2015;43(16):e106. pmid:25990733
- 77. Phipson B, Maksimovic J, Oshlack A. missMethyl: an R package for analyzing data from Illumina’s HumanMethylation450 platform. Bioinformatics (Oxford, England). 2016;32(2):286–8.
- 78. Peters TJ, Buckley MJ, Statham AL, Pidsley R, Samaras K, R VL, et al. De novo identification of differentially methylated regions in the human genome. Epigenetics & chromatin. 2015;8:6. pmid:25972926
- 79.
Oliveros JC. Venny. An interactive tool for comparing lists with Venn’s diagrams. https://bioinfogp.cnb.csic.es/tools/venny/index.html. (2007–2015).
- 80. Metsalu T, Vilo J. ClustVis: a web tool for visualizing clustering of multivariate data using Principal Component Analysis and heatmap. Nucleic acids research. 2015;43(W1):W566–70. pmid:25969447
- 81. Chandrashekar DS, Bashel B, Balasubramanya SAH, Creighton CJ, Ponce-Rodriguez I, Chakravarthi B, et al. UALCAN: A Portal for Facilitating Tumor Subgroup Gene Expression and Survival Analyses. Neoplasia (New York, NY). 2017;19(8):649–58. pmid:28732212
- 82. Yang Z, Wu L, Wang A, Tang W, Zhao Y, Zhao H, et al. dbDEMC 2.0: updated database of differentially expressed miRNAs in human cancers. Nucleic acids research. 2017;45(D1):D812–d8. pmid:27899556
- 83. Chou CH, Shrestha S, Yang CD, Chang NW, Lin YL, Liao KW, et al. miRTarBase update 2018: a resource for experimentally validated microRNA-target interactions. Nucleic acids research. 2018;46(D1):D296–d302. pmid:29126174
- 84. Tang Z, Li C, Kang B, Gao G, Li C, Zhang Z. GEPIA: a web server for cancer and normal gene expression profiling and interactive analyses. Nucleic acids research. 2017;45(W1):W98–w102. pmid:28407145
- 85. Szklarczyk D, Gable AL, Lyon D, Junge A, Wyder S, Huerta-Cepas J, et al. STRING v11: protein-protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets. Nucleic acids research. 2019;47(D1):D607–d13. pmid:30476243
- 86. Menigatti M, Di Gregorio C, Borghi F, Sala E, Scarselli A, Pedroni M, et al. Methylation pattern of different regions of the MLH1 promoter and silencing of gene expression in hereditary and sporadic colorectal cancer. Genes, chromosomes & cancer. 2001;31(4):357–61. pmid:11433526
- 87. Yu G, Wang LG, Yan GR, He QY. DOSE: an R/Bioconductor package for disease ontology semantic and enrichment analysis. Bioinformatics (Oxford, England). 2015;31(4):608–9.
- 88. Yu G, Wang LG, Han Y, He QY. clusterProfiler: an R package for comparing biological themes among gene clusters. Omics: a journal of integrative biology. 2012;16(5):284–7. pmid:22455463