Figure 1.
miRs and Target Genes in the TargetScan Dataset
(A) Distribution of the number of different miRs regulating each target gene in the TargetScan dataset. The thick red line represents the distribution in the original datasets, while each of the thin blue lines represents the distribution in one of the column-randomized matrices. The matrix contains only genes with at least one predicted site in their 3′ UTR. In each randomization, we shuffled the assignment of miRs to their targets, keeping constant the number of targets per miR.
(B) Distribution of number of targets per miR in the TargetScan dataset. In the thick red line we depicted the original distribution, while each blue thin line represents the distribution in one of the 100 row-randomized matrices, which preserve the distribution of number of miRs targeting each gene.
Table 1.
TargetScan Target Hubs GO Functional Enrichment
Figure 2.
Distribution of the density of miRs in the 3′ UTRs of target hubs (thick red line) and all the genes (thin blue line) in the TargetScan dataset (all genes included in this figures have at least one miR site predicted in their 3′ UTR). The log10 densities were binned into bins of 0.1, and relative frequencies were plotted. Same analysis for the PicTar dataset is in Figure S2.
Table 2.
Top 20 Most Significant Pairs of Coregulating miRNAs in the TargetScan Network
Figure 3.
miR Co-Occurrence Network in the TargetScan Dataset
(A) The TargetScan miR co-occurrence network, at FDR level of 0.05. A node represents a miR and an edge connects between pairs of miRs with significant rate of co-occurrence. The nodes in the figure are arranged from most highly connected on the top, to most lowly connected, on the bottom. For interactive viewing of the network, using Pajek (http://vlado.fmf.uni-lj.si/pub/networks/pajek/), see Datasets S1 and S2.
(B) Degree distribution in the TargetScan miR combinatorial regulation network (co-occurring miR pairs that passed FDR of 0.05).
Table 3.
Top 20 Most Significant Pairs of Coregulating miRNAs and TFs in the TargetScan and PicTar Networks
Figure 4.
Network Designs in the miR–TF Coregulation Network
The figure depicts the analyzed network motifs in the TargetScan and PicTar dataset, and with the use of TF binding sites in RefSeq genes promoters of 10 kb for both networks, and 5 kb for the PicTar network. The figure depicts, for each network motif, its architecture, the number of times it appears in each of the networks, the p-value and z-score for its over-representation in the network (as described in Materials and Methods), the total number of RefSeq genes that are regulated by this type of network design, and an example.
*For the first design, the coregulating miR–TF pair, we state the range of hypergeometric p-values of pairs that passed FDR and are considered significant, and in brackets the FDR p-value of these pairs using the randomization co-occurrence test.
**In addition, z-scores for significant pairs were calculated based on the co-occurrence edge-swapping randomization model (see Materials and Methods).
Figure 5.
Tissue Expression Correlations between miRs and TFs
miR tissue expression in brain, liver, thymus, testes, and placenta were taken from [34]. mRNA tissue expression was taken from [35].
(A) Background distribution of all possible miR–TF pairs for which expression profiles can be derived.
(B,C) Normalized histograms of correlation coefficients; the same distribution as in (A) was made, yet only for significantly co-occurring miR-TF pairs (red), and FFLs (green) in the PicTar (B) and TargetScan (C) networks. The figure shows the proportion of the various correlation coefficients divided by the background distribution depicted in (A).
Figure 6.
Analysis of miR Clusters in the Human Genome
(A) Distribution of distances between all neighboring pre-miR genes in the human genome.
(B) Distribution of tissue expression correlations between pairs of miRs: all possible pairs in the data (thin blue line) and pairs of miRs which reside in shared clusters (thick red line). In the inset are shown tissue expression correlations between pairs of miRs in the same genomic clusters versus distances between them.
(C) Distribution of number of conserved TFBS 30 kb upstream of the 5′ most nucleotide in each miR clusters. Conserved TFBSs were taken from UCSC hg17.