Browse Subject Areas

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Inferring the Perturbed microRNA Regulatory Networks in Cancer Using Hierarchical Gene Co-Expression Signatures

  • Jin Gu ,

    Affiliation MOE Key Laboratory of Bioinformatics; Bioinformatics Division/Center for Synthetic and Systems Biology, Tsinghua National Laboratory for Information Science and Technology, Department of Automation, Tsinghua University, Beijing, China

  • Zhenyu Xuan

    Affiliation Department of Molecular and Cell Biology, Center for Systems Biology, University of Texas at Dallas, Richardson, Texas, United States of America

Inferring the Perturbed microRNA Regulatory Networks in Cancer Using Hierarchical Gene Co-Expression Signatures

  • Jin Gu, 
  • Zhenyu Xuan


MicroRNAs (miRNAs), a class of endogenous small regulatory RNAs, play important roles in many biological and physiological processes. The perturbations of some miRNAs, which are usually called as onco-microRNAs (onco-miRs), are significantly associated with multiple stages of cancer. Although hundreds of miRNAs have been discovered, the perturbed miRNA regulatory networks and their functions are still poorly understood in cancer. Analyzing the expression patterns of miRNA target genes is a very useful strategy to infer the perturbed miRNA networks. However, due to the complexity of cancer transcriptome, current methods often encounter low sensitivity and report few onco-miR candidates. Here, we developed a new method, named miRHiC (enrichment analysis of miRNA targets in Hierarchical gene Co-expression signatures), to infer the perturbed miRNA regulatory networks by using the hierarchical co-expression signatures in large-scale cancer gene expression datasets. The method can infer onco-miR candidates and their target networks which are only linked to sub-clusters of the differentially expressed genes at fine scales of the co-expression hierarchy. On two real datasets of lung cancer and hepatocellular cancer, miRHiC uncovered several known onco-miRs and their target genes (such as miR-26, miR-29, miR-124, miR-125 and miR-200) and also identified many new candidates (such as miR-149, which is inferred in both types of cancers). Using hierarchical gene co-expression signatures, miRHiC can greatly increase the sensitivity for inferring the perturbed miRNA regulatory networks in cancer. All Perl scripts of miRHiC and the detailed documents are freely available on the web at


MicroRNAs (miRNAs) are a class of small (∼22 nt) regulatory RNAs, which play important roles in many essential biological and physiological processes, such as embryo development, cancer progression and immune response. About 1,400 miRNAs have been identified in human and more than 30% known protein-coding genes are potentially regulated by evolutionary conserved miRNAs [1], [2]. The perturbations of some miRNAs, usually called as onco-microRNAs (onco-miRs, including both oncogenic and tumor suppressive miRNAs in this study), have been reported to be significantly associated with multiple stages of cancer. But till now, only a few of the hundreds of miRNAs are linked to the complex dys-regulated cellular processes in cancer. There is a great need for inferring the perturbed miRNA regulatory networks and their functions in cancer [3].

To infer the perturbed miRNA regulatory network, one popular strategy is to analyze miRNA target gene set enrichments in differentially expressed gene signatures. This includes many developed methods, such as gene set analysis by hyper-geometric test (HG-test, or Fisher's exact test); GSEA (gene set enrichment analysis) [4], [5]; FAME (functional assignment of miRNAs via enrichment) [6]; and miRBridge [7], which assume that the target gene set enrichments reflect the perturbations of their upstream miRNA regulation strengths. But due to the complexity of cancer transcriptome, these methods usually show low sensitivity of inferring onco-miR candidates (here, the “sensitivity” mainly means the number of inferred onco-miR candidates under a given statistical significance level).

Cancer is a multi-stage and mixed process, usually involving many hierarchically organized sub-processes regulated at multiple scales [8]. The miRNA regulations also show the property of multi-scale [9]: a few miRNAs, which help determine cell types or cellular states, suppress hundreds of target gene expressions to maintain cell type or cellular state specific expression profiles, such as miR-124 in brain and miR-1, miR-133 in muscle [10], [11], [12]; however, many other miRNAs may only regulate some specific processes by targeting a small group of closely-related genes. The former kind of candidate onco-miRs can be easily identified by analyzing the enrichment of their target genes in the whole set of the differentially expressed genes, but the latter ones are frequently missed by existing methods due to insufficient target gene enrichments in the differentially expressed genes or in the co-expression signatures using pre-defined similarity cutoffs.

In this study, we proposed a new strategy to infer the onco-miRs and their perturbed regulatory networks. This strategy takes consideration of the multi-scale and hierarchically organized regulatory structures in the differentially expressed genes using gene co-expression information, and fine-tunes the scales in the gene co-expression hierarchy to analyze the miRNA target gene set enrichment. Our method, named as miRHiC (enrichment analysis of miRNA targets in Hierarchical gene Co-expression signatures), can infer the perturbed miRNA regulatory networks in cancer by analyzing the enrichments of miRNA target gene sets in the hierarchical gene co-expression signatures. These gene signatures were established by hierarchical gene co-expression clustering, one common way to separate the mixed signals in gene expression profiles at different correlation levels. In miRHiC, the miRNA target gene set is not required to be enriched in the whole set of the differentially expressed genes but within any signature at the fine scale of the gene co-expression hierarchy. Besides the higher sensitivity for inferring the onco-miR candidates, another advantage of considering the gene co-expression information is to reduce the noises of inferring the corresponding perturbed target genes: the “scattered” differentially expressed genes with little expression pattern similarity to other genes, which are more likely to be “false” miRNA targets due to expression noises [13], are excluded during the analysis. On two large-scale cancer gene expression datasets, miRHiC successfully identified several known onco-miRs and also inferred many new candidates.

Materials and Methods

MiRNA target genes

miRNAs and their target genes (the miRNAs from the same family are merged as a single item) were extracted from TargetScan database (v6.2) [1], [2]. A gene was regarded as a target of one miRNA, if the gene contains at least one conserved predicted miRNA binding site in its 3′-UTR. And the summarized context score (a negative score measuring miRNA-target regulation strength or confidence, provided by TargetScan) was recorded for each miRNA-target pair. Then, we discretized the context scores into K levels: all miRNA-target pairs were sorted according to their context scores in decreasing order (the pairs ranked on the top have the lowest regulation strength) and the discretized score for the miRNA-target pair with rank r was defined as: s = 1+b[rK/N]. It means the first 1/K miRNA-target pairs have lowest score 1, while the last 1/K pairs have highest score 1+b(K-1). According to ref.[6], K is set as 5 and b as 3 in this study.

The control miRNA target gene sets were generated by bipartite graph based random permutation of the miRNA-target pairs with the same discretized scores but keeping the sizes of all target gene sets. This kind of stringent permutation procedure can generate the control miRNA target gene sets which preserve the statistical properties much better than the randomization without restriction [6].

Cancer gene expression data

We test miRHiC on two large-scale cancer gene expression datasets downloaded from NCBI GEO database: 1) lung cancer (LUC) dataset, GSE19804 including 60 paired cancer and para-cancer samples; and 2) hepatocellular cancer (HCC) dataset, GSE22058 including 96 paired cancer and para-cancer samples. To avoid the noises in lowly expressed genes, we only kept the genes whose expression values rank at top 10,000 in at least 30% samples in each dataset. Then, the differentially expressed genes were identified with p-value <0.0001 using t-test (the p-values were multiple testing adjusted by BH correction). We identified 3,397 and 5,699 differentially expressed genes for LUC and HCC datasets, respectively.

miRHiC: enrichment analysis of miRNA targets in Hierarchical gene Co-expression signatures

miRHiC was proposed to infer the perturbed miRNA regulatory networks in cancer by incorporating the hierarchically organized co-expression information of the differentially expressed genes: firstly, the hierarchical gene co-expression signatures were established by clustering the differentially expressed genes based on pairwise gene co-expression correlations; then the miRNA target gene set enrichment was analyzed across the hierarchical co-expression signatures; and finally, a permutation test was used to estimate the statistical significance of the enrichment (Figure 1).

Figure 1. The flowchart of miRHiC.

In the first step, the differentially expressed genes were clustered as hierarchical gene co-expression signatures; then, the most significant enrichment of the miRNA target gene set was found across the hierarchical signatures; and finally, a permutation test was used to estimate the empirical p-value of the enrichment.

1) Get the hierarchical gene co-expression signatures.

Firstly, average linkage hierarchical clustering is implemented to cluster the differentially expressed genes based on their pairwise co-expression correlations. To reduce the noises caused by poorly correlated genes, the hierarchical clustering is stopped if the gene co-expression correlation is too low: we used the correlation with z-score 0.52 as the cutoff in this study (about p-value 0.3; z-score of any given correlation level is calculated using Fisher's transformation). This cutoff shows few influences on the results: for the LUC dataset, when the z-score cutoff changed from 0.3 to 0.9 by step 0.1, the hierarchical clustering was stopped at almost the same place. Then, we extracted the gene co-expression signatures (stable gene co-expression clusters) at different correlation scales by traversing the co-expression hierarchy from leaf to root (the correlation is decreasing and the size of signatures is increasing when traversing the hierarchy from leaf to root). The details of the signature extraction algorithm are given in user manual via miRHiC website.

2) Analyze the miRNA target gene set enrichments in the hierarchical gene co-expression signatures.

For the j-th gene co-expression signature in the hierarchy, we can find the overlapped genes between the signature (denoted as Sj) and the i-th miRNA target gene set (denoted as Ti), and then calculate the raw enrichment score by summing the discretized TargetScan scores (see the details of the score discretization in the above section) of the overlapped genes for i-th miRNA:

The p-value pij for this enrichment was estimated by examining the enrichment scores ESij(r) of 10,000 size-matched random control miRNA target gene sets:

After getting the enrichments in all hierarchical gene co-expression signatures (j = 1, 2, …), the P-score Pi for the i-th miRNA was calculated as the p-value of the most significant enrichment:

The P-score was used to measure the miRNA target gene enrichment across the whole gene co-expression hierarchy.

3) Calculate the statistical significance of the P-score based enrichments.

The P-score is the minimal of a set of p-values, so it is not uniformly distributed along 0∼1 (biased to 0). It cannot be directly used to measure the statistical significance of enrichment. Again, we used permutation test to estimate the statistical significance of the P-score: the P-scores Pi(r) of 10,000 size-matched control miRNA target gene sets were calculated according to above steps; and the empirical p-value pi for the P-score Pi was calculated as:

The empirical p-value pi was used to measure the statistical significance of miRNA target gene set enrichment across the whole hierarchical gene co-expression signatures. To correct the multiple test, fdrtool was used to calculate the q-values according to the empirical p-values [14].

Comparisons with other methods

miRHiC was compared with Gene Set Enrichment Analysis (GSEA) and gene set analysis by hyper-geometric test (HG-test). GSEA is a widely used method for inferring the perturbed gene sets by taking the continuous values and the rank information of differential gene expressions [5]. When comparing miRHiC with GSEA, the fold changes of gene expressions between cancer and normal samples were used in GSEA and the same miRNA target gene set permutation method was used to calculate the empirical p-values.

GSEA and HG-test use different computational models to measure gene set enrichments with miRHiC. For directly testing the advantage of using the hierarchical gene co-expression information, we used the differentially expressed genes as the only signature and ran miRHiC on it. For clear presentation, we named this approach as miRDeG (enrichment analysis of miRNA targets in Differentially Expressed Genes).

Except hierarchical clustering, k-means clustering is another commonly used algorithm to generate gene co-expression signatures. The algorithm can partition all differentially expressed genes into k non-overlapped clusters. Unlike hierarchical clustering, k-means is hard to exclude the poorly correlated genes by setting any threshold. In the comparison, we used k-means (k is set to 5 or 10) to get the gene co-expression signatures. Then we run the same procedure to analyze the miRNA target enrichments in the generated signatures with different k. We named this approach as miRKM (miRKM5 and miRKM10) in the below section.


Estimating the empirical p-values with no bias by miRHiC

To demonstrate that miRHiC did not have the problem of over-estimating the statistical significances, we generated 100 size-matched control target gene sets for each miRNA, and then calculated distributions of the empirical p-values for their enrichments in the hierarchical gene co-expression signatures using miRHiC. If miRHiC has no bias for estimating the empirical p-values, the p-values of these control miRNA target gene sets should be uniformly distributed between 0∼1. As expected, results showed that the empirical p-values are uniformly distributed (Figure 2). Another possible bias affecting empirical p-value is caused by different sizes of miRNA target gene sets: some miRNAs have more than 1,000 target genes while some only have less than 50 target genes. We calculated the Spearman's rank correlation between the sizes and the corresponding empirical p-values of the gene sets. The correlation is −0.015 (p-value of this correlation >0.05), which suggested that the empirical p-values are not affected by the sizes of gene sets. Based on these analyses, we can conclude that miRHiC has no bias for estimating the empirical p-values.

Figure 2. The histogram of the empirical p-values of the control miRNA target gene sets.

Inferring the perturbed miRNA regulatory networks in cancer

miRHiC can infer onco-miRs and their perturbed target regulatory networks by analyzing the miRNA target gene set enrichment in hierarchical gene co-expression signatures in cancer. On the two large-scale gene expression datasets of lung cancer (LUC) and hepatocellular cancer (HCC), miRHiC inferred 9 and 8 perturbed miRNAs or onco-miRs, respectively, with q-value <0.1. Under the same q-value cutoff, the three compared methods, GSEA, HG-test and miRDeG did not infer any candidate. Although miRKM inferred some candidates (for LUC dataset, miRKM5/10 inferred 3/4 candidates; and for HCC dataset, miRKM5/10 inferred 6/3 candidates), these numbers are still less than miRHiC and most of the miRKM inferences are covered by miRHiC. The details results are provided in Table S1. Among all 17 inferences from miRHiC, 9 are supported by direct functional evidences in literatures (LUC: miR-26, miR-29, miR-125, miR-130, miR-145 and miR-200; HCC: miR-21, miR-124 and miR-125). These results indicate that miRHiC can greatly improve the sensitivity of onco-miR inferences (Table 1). Considering the heterogeneity of cancer transcriptome, bootstrapping resampling was implemented to check the stability of the inferences. For LUC, 6 out of 9 candidates can be repeatedly inferred in more than 50% resampling experiments (miR-125, miR-149, miR-340 and miR-200 are stably inferred in more than 80% experiments). For HCC, 5 out of 8 candidates can be repeated inferred (miR-125 and miR-149 are stably inferred in more than 60% experiments).

By looking at the targeted signatures of the inferred onco-miRs, we found that they have different levels of gene co-expressions in the hierarchies (Figure 3). The functions associated with these signatures (the enriched GO terms of the signatures were annotated by DAVID web tool [15]) are significantly related to different hallmarks of cancer, including cell cycle, oxidation reduction, immune response, DNA repair, cell adhesion and vasculature development (Table 2). These results indicate that many miRNAs are linked to cancer through different sub regulatory programs. For example, miR-200 is known as an important regulator of angiogenesis (a child term of “vasculature development”). There are several experimental validated target genes for angiogenesis, including ZEB1 and KDR [16], [17], [18], existing in the inferred perturbed miR-200 regulatory networks in LUC dataset. MiR-200 may regulate the angiogenic switch in lung cancer through these target genes. In hepatocellular cancer, miR-21 was predicted to regulate “immune response” by targeting CD69, STAT3, CCL20 and SMAD7, in which STAT3 and SMAD7 are important signaling molecules for immune response.

Figure 3. The perturbed miRNA regulatory networks in the two types of cancers inferred by miRHiC.

A) is for lung cancer and B) for hepatocellular cancer. The circle nodes represent the gene co-expression signatures (ClusterID:Size). The diamond nodes represent the inferred onco-miRs. The numbers on the edges represent the sizes of the miRNA target genes overlapped with the corresponding gene co-expression signatures.

Table 2. The selected target genes and the related signature functions of the inferred onco-miR perturbed target networks.

Perturbed miR-149 sub-networks shared by the two types of cancers

The onco-miRs inferred in multiple cancers may play more important roles in cancer initiation and development. Two miRNAs, miR-125 and miR-149 were inferred by miRHiC in both types of cancers. For the inferred perturbed miR-125 regulatory networks, there are only three common targets (CDK16, TOMM40 and KIAA1522), which suggests that miR-125 may regulate different pathways in the two types of cancers. While for miR-149, its perturbed regulatory networks show significant target overlapping with a shared sub-network including 14 common targets. And the 14 targets are consistently over-expressed in cancer tissues (Figure 4).

Figure 4. The perturbed miR-149 sub-networks shared by LUC and HCC.

The average log-transformed fold changes of the shared target genes are also shown in the below table.

MiR-149 is a mammal conserved miRNA. A few studies show that miR-149 genetic polymorphisms are associated with the risk of cancer [19], [20]. Its expression is epigenetic silenced by DNA hyper-methylation in colorectal cancer [21]. But the miR-149 regulatory networks are still poorly understood in cancer. The inferred perturbed networks provide important insight of miR-149 regulations: most of the high confidence targets (with high TargetScan scores) in the shared sub-network are related to some essential biological processes, such as SRPK1 (serine/arginine-rich splicing factor kinase 1) and CCT3 (chaperonin containing TCP1, subunit 3). SRPK1 encodes a serine/arginine protein kinase specific for the SR (serine/arginine-rich domain) family of splicing factors. SRPK1 is upregulated in lung cancer and many other cancer types [22], [23]. CCT3 is a subunit of a molecular chaperone protein (chaperonin containing TCP1 complex) helping fold actin/tubulin and it can positively regulate cell cycle [24], [25]. CCT3 over-expression is also reported to be related to colorectal cancer [26] and liver cancer [27]. So, miR-149 may work as a cancer suppressor by targeting these oncogenes.


Analyzing miRNA target gene set enrichment in differentially expressed genes of large-scale gene expression profiles can greatly advance our understandings of the perturbed miRNA regulations. But due to the complexity of cancer transcriptome, it is challenging to infer the perturbed miRNA regulations by simply analyzing miRNA target gene set enrichment in the whole differentially expressed genes. In this study, we developed miRHiC to infer the perturbed miRNA regulatory networks in cancer by incorporating the hierarchical gene co-expression information into miRNA target gene set enrichment analysis. Results showed that miRHiC have much higher sensitivity for the inferences than the commonly used methods, such as HG-test, GSEA and miRDeG (FAME), of which all do not use the hierarchical gene co-expression information. Over 50% of the inferred onco-miRs have extensive literature supports and the gene co-expressions signatures targeted by these miRNAs are significantly related to multiple hallmarks of cancer. Recent studies also show that gene co-expressions can provide important information to identify the “real” target genes of miRNAs in the corresponding biological process [13], [28], which suggest that the target genes overlapped with the enriched co-expression signatures are more likely the real targets in cancer. Although miRHiC improved the sensitivity for inferring the onco-miRs and their perturbed target networks, a few known onco-miRs, such as miR-126 in lung cancer and miR-122 in hepatocellular carcinoma, were missed. These missed cases suggest that other computational models need to be developed for identifying the onco-miRs whose regulatory networks cannot be explained by the target gene enrichments in differential gene expression signatures.

The lengths of 3′-UTRs are strongly correlated with the number of targeted miRNAs and the context scores. The enriched signatures may significantly be biased to the ones with longer 3′-UTRs. When using hyper-geometry test to analyze the enrichments of miRNA target gene sets, we found that the signatures targeted by the inferred miRNAs have much longer average lengths of 3′-UTRs. However, like FAME [6], miRHiC used the bi-partite graph based permutation method, which can largely reduce this bias: the average lengths of 3′-UTRs of the genes in the signatures targeted by the inferred onco-miRs are 1314 nt and 1449 nt for the LUC and HCC datasets, respectively, not longer than these lengths of the differentially expressed genes (1424 nt and 1470 nt, respectively).

miRHiC provides a general strategy to analyze miRNA regulations using hierarchical signatures. Different hierarchical clustering methods can be used to get the hierarchical gene co-expression signatures. Besides the gene co-expression, the functional and regulatory interactions between genes (such as protein-protein interactions, transcriptional regulations and literature co-occurrences) can further be integrated to establish the hierarchical gene signatures. We will continuously test miRHiC strategy using different kinds of implementations.

To get better control sets of the miRNA target gene sets, miRHiC used the bi-partite graph based permutation. But this permutation method is time-consuming. Also, the computational burden is high for calculating the empirical p-values in a nested manner across the hierarchical gene co-expression signatures. We plan to develop faster algorithm to reduce the redundant calculations for estimating the p-values in future.

Supporting Information

Table S1.

The detailed results of miRHiC, GSEA, HG-test, miRDeG and miRKM.




We thank Dr Xiaotu Ma and Professor Yanda Li for extensive discussions. We thank Rui Fu and Chao He for software development and validation.

Author Contributions

Conceived and designed the experiments: JG ZX. Performed the experiments: JG. Analyzed the data: JG. Contributed reagents/materials/analysis tools: JG. Wrote the paper: JG ZX.


  1. 1. Friedman RC, Farh KK, Burge CB, Bartel DP (2009) Most mammalian mRNAs are conserved targets of microRNAs. Genome Res 19: 92–105.
  2. 2. Lewis BP, Burge CB, Bartel DP (2005) Conserved seed pairing, often flanked by adenosines, indicates that thousands of human genes are microRNA targets. Cell 120: 15–20.
  3. 3. Ventura A, Jacks T (2009) MicroRNAs and cancer: short RNAs go a long way. Cell 136: 586–591.
  4. 4. Hung JH, Yang TH, Hu Z, Weng Z, DeLisi C (2012) Gene set enrichment analysis: performance evaluation and usage guidelines. Brief Bioinform 13: 281–291.
  5. 5. Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, et al. (2005) Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A 102: 15545–15550.
  6. 6. Ulitsky I, Laurent LC, Shamir R (2010) Towards computational prediction of microRNA function and activity. Nucleic Acids Res 38: e160.
  7. 7. Tsang JS, Ebert MS, van Oudenaarden A (2010) Genome-wide dissection of microRNA functions and cotargeting networks using gene set signatures. Mol Cell 38: 140–153.
  8. 8. Hanahan D, Weinberg RA (2011) Hallmarks of cancer: the next generation. Cell 144: 646–674.
  9. 9. Makeyev EV, Maniatis T (2008) Multilevel regulation of gene expression by microRNAs. Science 319: 1789–1790.
  10. 10. Farh KK, Grimson A, Jan C, Lewis BP, Johnston WK, et al. (2005) The widespread impact of mammalian MicroRNAs on mRNA repression and evolution. Science 310: 1817–1821.
  11. 11. Lim LP, Lau NC, Garrett-Engele P, Grimson A, Schelter JM, et al. (2005) Microarray analysis shows that some microRNAs downregulate large numbers of target mRNAs. Nature 433: 769–773.
  12. 12. Stark A, Brennecke J, Bushati N, Russell RB, Cohen SM (2005) Animal MicroRNAs confer robustness to gene expression and have a significant impact on 3′UTR evolution. Cell 123: 1133–1146.
  13. 13. Gennarino VA, D'Angelo G, Dharmalingam G, Fernandez S, Russolillo G, et al. (2012) Identification of microRNA-regulated gene networks by expression analysis of target genes. Genome Res 22: 1163–1172.
  14. 14. Strimmer K (2008) fdrtool: a versatile R package for estimating local and tail area-based false discovery rates. Bioinformatics 24: 1461–1462.
  15. 15. Huang da W, Sherman BT, Lempicki RA (2009) Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc 4: 44–57.
  16. 16. Roybal JD, Zang Y, Ahn YH, Yang Y, Gibbons DL, et al. (2011) miR-200 Inhibits lung adenocarcinoma cell invasion and metastasis by targeting Flt1/VEGFR1. Mol Cancer Res 9: 25–35.
  17. 17. Choi YC, Yoon S, Jeong Y, Yoon J, Baek K (2011) Regulation of vascular endothelial growth factor signaling by miR-200b. Mol Cells 32: 77–82.
  18. 18. Panda H, Pelakh L, Chuang TD, Luo X, Bukulmez O, et al. (2012) Endometrial miR-200c is altered during transformation into cancerous states and targets the expression of ZEBs, VEGFA, FLT1, IKKbeta, KLF9, and FBLN5. Reprod Sci 19: 786–796.
  19. 19. Ahn DH, Rah H, Choi YK, Jeon YJ, Min KT, et al.. (2012) Association of the miR-146aC>G, miR-149T>C, miR-196a2T>C, and miR-499A>G polymorphisms with gastric cancer risk and survival in the Korean population. Mol Carcinog.
  20. 20. Mw Z, Mj J, Yx Y, Sc Z, B L, et al. (2012) Associations of lifestyle-related factors, hsa-miR-149 and hsa-miR-605 gene polymorphisms with gastrointestinal cancer risk. Mol Carcinog 51 Suppl 1E21–31.
  21. 21. Wang F, Ma YL, Zhang P, Shen TY, Shi CZ, et al.. (2012) SP1 mediates the link between methylation of the tumour suppressor miR-149 and outcome in colorectal cancer. J Pathol.
  22. 22. Hayes GM, Carrigan PE, Miller LJ (2007) Serine-arginine protein kinase 1 overexpression is associated with tumorigenic imbalance in mitogen-activated protein kinase pathways in breast, colonic, and pancreatic carcinomas. Cancer Res 67: 2072–2080.
  23. 23. Gout S, Brambilla E, Boudria A, Drissi R, Lantuejoul S, et al. (2012) Abnormal expression of the pre-mRNA splicing regulators SRSF1, SRSF2, SRPK1 and SRPK2 in non small cell lung carcinoma. PLoS One 7: e46539.
  24. 24. Dekker C (2010) On the role of the chaperonin CCT in the just-in-time assembly process of APC/CCdc20. FEBS Lett 584: 477–481.
  25. 25. Grantham J, Brackley KI, Willison KR (2006) Substantial CCT activity is required for cell cycle progression and cytoskeletal organization in mammalian cells. Exp Cell Res 312: 2309–2324.
  26. 26. Lee SC, Chan J (2012) Proteomic identification of chaperonin-containing tail-less complex polypeptide-1 gamma subunit as a p53-responsive protein in colon cancer cells. Cancer Genomics Proteomics 9: 101–108.
  27. 27. Wong N, Chan A, Lee SW, Lam E, To KF, et al. (2003) Positional mapping for amplified DNA sequences on 1q21-q22 in hepatocellular carcinoma indicates candidate genes over-expression. J Hepatol 38: 298–306.
  28. 28. Plaisier CL, Pan M, Baliga NS (2012) A miRNA-regulatory network explains how dysregulated miRNAs perturb oncogenic processes across diverse cancers. Genome Res.
  29. 29. Wang G, Mao W, Zheng S, Ye J (2009) Epidermal growth factor receptor-regulated miR-125a-5p—a metastatic inhibitor of lung cancer. Febs J 276: 5571–5578.
  30. 30. Jiang L, Huang Q, Chang J, Wang E, Qiu X (2011) MicroRNA HSA-miR-125a-5p induces apoptosis by activating p53 in lung cancer cells. Exp Lung Res 37: 387–398.
  31. 31. Jiang L, Huang Q, Zhang S, Zhang Q, Chang J, et al. (2010) Hsa-miR-125a-3p and hsa-miR-125a-5p are downregulated in non-small cell lung cancer and have inverse effects on invasion and migration of lung cancer cells. BMC Cancer 10: 318.
  32. 32. Acunzo M, Visone R, Romano G, Veronese A, Lovat F, et al. (2012) miR-130a targets MET and induces TRAIL-sensitivity in NSCLC by downregulating miR-221 and 222. Oncogene 31: 634–642.
  33. 33. Liu B, Wu X, Wang C, Liu Y, Zhou Q, et al. (2012) MiR-26a enhances metastasis potential of lung cancer cells via AKT pathway by targeting PTEN. Biochim Biophys Acta 1822: 1692–1704.
  34. 34. Dang X, Ma A, Yang L, Hu H, Zhu B, et al. (2012) MicroRNA-26a regulates tumorigenic properties of EZH2 in human lung carcinoma cells. Cancer Genet 205: 113–123.
  35. 35. Rothschild SI, Tschan MP, Federzoni EA, Jaggi R, Fey MF, et al. (2012) MicroRNA-29b is involved in the Src-ID1 signaling pathway and is dysregulated in human lung adenocarcinoma. Oncogene 31: 4221–4232.
  36. 36. Muniyappa MK, Dowling P, Henry M, Meleady P, Doolan P, et al. (2009) MiRNA-29a regulates the expression of numerous proteins and reduces the invasiveness and proliferation of human carcinoma cell lines. Eur J Cancer 45: 3104–3118.
  37. 37. Fabbri M, Garzon R, Cimmino A, Liu Z, Zanesi N, et al. (2007) MicroRNA-29 family reverts aberrant methylation in lung cancer by targeting DNA methyltransferases 3A and 3B. Proc Natl Acad Sci U S A 104: 15805–15810.
  38. 38. Feng B, Wang R, Song HZ, Chen LB (2012) MicroRNA-200b reverses chemoresistance of docetaxel-resistant human lung adenocarcinoma cells by targeting E2F3. Cancer 118: 3365–3376.
  39. 39. Zhu W, Xu H, Zhu D, Zhi H, Wang T, et al. (2012) miR-200bc/429 cluster modulates multidrug resistance of human cancer cell lines by targeting BCL2 and XIAP. Cancer Chemother Pharmacol 69: 723–731.
  40. 40. Ceppi P, Mudduluru G, Kumarswamy R, Rapa I, Scagliotti GV, et al. (2010) Loss of miR-200c expression induces an aggressive, invasive, and chemoresistant phenotype in non-small cell lung cancer. Mol Cancer Res 8: 1207–1216.
  41. 41. Zhang S, Wu Y, Feng D, Zhang Z, Jiang F, et al. (2011) [miR-145 inhibits lung adenocarcinoma stem cells proliferation by targeting OCT4 gene]. Zhongguo Fei Ai Za Zhi 14: 317–322.
  42. 42. Yin R, Zhang S, Wu Y, Fan X, Jiang F, et al. (2011) microRNA-145 suppresses lung adenocarcinoma-initiating cell proliferation by targeting OCT4. Oncol Rep 25: 1747–1754.
  43. 43. Cho WC, Chow AS, Au JS (2011) MiR-145 inhibits cell proliferation of human lung adenocarcinoma by targeting EGFR and NUDT1. RNA Biol 8: 125–131.
  44. 44. Chen Z, Zeng H, Guo Y, Liu P, Pan H, et al. (2010) miRNA-145 inhibits non-small cell lung cancer cell proliferation by targeting c-Myc. J Exp Clin Cancer Res 29: 151.
  45. 45. Cho WC, Chow AS, Au JS (2009) Restoration of tumour suppressor hsa-miR-145 inhibits cancer cell growth in lung adenocarcinoma patients with epidermal growth factor receptor mutation. Eur J Cancer 45: 2197–2206.
  46. 46. Kim JK, Noh JH, Jung KH, Eun JW, Bae HJ, et al. (2013) Sirtuin7 oncogenic potential in human hepatocellular carcinoma and its regulation by the tumor suppressors MiR-125a-5p and MiR-125b. Hepatology 57: 1055–1067.
  47. 47. Gong J, Zhang JP, Li B, Zeng C, You K, et al.. (2012) MicroRNA-125b promotes apoptosis by regulating the expression of Mcl-1, Bcl-w and IL-6R. Oncogene.
  48. 48. Alpini G, Glaser SS, Zhang JP, Francis H, Han Y, et al. (2011) Regulation of placenta growth factor by microRNA-125b in hepatocellular cancer. J Hepatol 55: 1339–1345.
  49. 49. Liang L, Wong CM, Ying Q, Fan DN, Huang S, et al. (2010) MicroRNA-125b suppressesed human liver cancer cell proliferation and metastasis by directly targeting oncogene LIN28B2. Hepatology 52: 1731–1740.
  50. 50. Tomimaru Y, Eguchi H, Nagano H, Wada H, Tomokuni A, et al. (2010) MicroRNA-21 induces resistance to the anti-tumour effect of interferon-alpha/5-fluorouracil in hepatocellular carcinoma cells. Br J Cancer 103: 1617–1626.
  51. 51. Meng F, Henson R, Wehbe-Janek H, Ghoshal K, Jacob ST, et al. (2007) MicroRNA-21 regulates expression of the PTEN tumor suppressor gene in human hepatocellular cancer. Gastroenterology 133: 647–658.
  52. 52. Selaru FM, Olaru AV, Kan T, David S, Cheng Y, et al. (2009) MicroRNA-21 is overexpressed in human cholangiocarcinoma and regulates programmed cell death 4 and tissue inhibitor of metalloproteinase 3. Hepatology 49: 1595–1601.
  53. 53. Liu C, Yu J, Yu S, Lavker RM, Cai L, et al. (2010) MicroRNA-21 acts as an oncomir through multiple targets in human hepatocellular carcinoma. J Hepatol 53: 98–107.
  54. 54. Si ML, Zhu S, Wu H, Lu Z, Wu F, et al. (2007) miR-21-mediated tumor growth. Oncogene 26: 2799–2803.
  55. 55. Zhu Q, Wang Z, Hu Y, Li J, Li X, et al. (2012) miR-21 promotes migration and invasion by the miR-21-PDCD4-AP-1 feedback loop in human hepatocellular carcinoma. Oncol Rep 27: 1660–1668.
  56. 56. Zheng F, Liao YJ, Cai MY, Liu YH, Liu TH, et al. (2012) The putative tumour suppressor microRNA-124 modulates hepatocellular carcinoma cell aggressiveness by repressing ROCK2 and EZH2. Gut 61: 278–289.
  57. 57. Furuta M, Kozaki KI, Tanaka S, Arii S, Imoto I, et al. (2010) miR-124 and miR-203 are epigenetically silenced tumor-suppressive microRNAs in hepatocellular carcinoma. Carcinogenesis 31: 766–776.