MicroRNAs (miRNAs) are short non-coding RNAs that regulate differentiation and development in many organisms and play an important role in cancer.
Using a public database of mapped retroviral insertion sites from various mouse models of cancer we demonstrate that MLV-derived retroviral inserts are enriched in close proximity to mouse miRNA loci. Clustered inserts from cancer-associated regions (Common Integration Sites, CIS) have a higher association with miRNAs than non-clustered inserts. Ten CIS-associated miRNA loci containing 22 miRNAs are located within 10 kb of known CIS insertions. Only one CIS-associated miRNA locus overlaps a RefSeq protein-coding gene and six loci are located more than 10 kb from any RefSeq gene. CIS-associated miRNAs on average are more conserved in vertebrates than miRNAs associated with non-CIS inserts and their human homologs are also located in regions perturbed in cancer. In addition we show that miRNA genes are enriched around promoter and/or terminator regions of RefSeq genes in both mouse and human.
Citation: Makunin IV, Pheasant M, Simons C, Mattick JS (2007) Orthologous MicroRNA Genes Are Located in Cancer-Associated Genomic Regions in Human and Mouse. PLoS ONE 2(11): e1133. doi:10.1371/journal.pone.0001133
Academic Editor: Christopher Arendt, Sanofi-Aventis, United States of America
Received: July 26, 2007; Accepted: October 15, 2007; Published: November 7, 2007
Copyright: © 2007 Makunin et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This work was supported by the Australian Research Council. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
MicroRNAs (miRNAs) are short RNA molecules, ∼22 nucleotides long, capable of performing regulatory functions. In particular, miRNAs can suppress translation by non-perfect pairing to 3′ UTRs and/or cause degradation of mRNAs in the case of a perfect match between the miRNA and target mRNA . It seems that miRNAs do not have any catalytic activity but rather act as sequence-specific guides for associated protein complexes which are responsible for translation suppression or degradation of mRNA . The number of known miRNAs is growing rapidly, and hundreds of verified miRNAs are annotated in human, mouse, and other organisms (miRBase, http://microrna.sanger.ac.uk).
A range of observations point to a link between miRNAs and cancer, which is not surprising given their central role in many cellular and developmental processes (for reviews see –). A large number of human and mouse miRNAs have also been shown to be located in regions associated with cancer , . The expression of various miRNAs is altered in cancer and miRNA profiling can be used for precise cancer classification . Ectopic expression of the mir-17-19b cluster accelerates tumor formation in mice and has been accordingly classified as a potential oncogene .
Retroviruses, such as the murine leukemia virus (MLV), can cause tumor formation in mammals. Proviral insertions may activate proto-oncogenes or lead to inactivation of tumor suppressor genes in the vicinity of the insertion sites. Retroviral integration sites can be determined in animals with cancer using inverse PCR or similar techniques, and mapped to the genome sequence. Regions harboring multiple insertion sites in close proximity to each other are the most obvious candidates for cause of cancer development and are often named Common Integration Sites (CISs). In general, candidates for tumor-suppressor genes or proto-oncogenes are selected from protein-coding genes based on proximity to CISs.
Numerous genome-wide screens have been undertaken in mouse to identify genes involved in carcinogenesis, especially hematopoietic tumors –. Data obtained from various screens on different cancer models (including studies with genetically modified mice) has been compiled into the Retroviral Tagged Cancer Gene Database (RTCGD, http://rtcgd.ncifcrf.gov)  which also provides annotations for the UCSC genome browser . In the RTCGD, the CISs are defined as regions containing two inserts located within 20 kb, 3 inserts within 50 kb, and four or more inserts within 100 kb in the same cancer model (see FAQ section of RTCGD on http://rtcgd.ncifcrf.gov) (for details see ).
However, some CISs do not map near any known or annotated protein-coding sequence. It was shown that insertions of retroviruses in the vicinity of the mir-17-92 miRNA polycistron cause tumor formation and increase miRNA expression, indicating that retroviral mutagenesis can be a potent tool for discovery of oncogenic miRNAs , . Considering the emerging regulatory role of microRNAs in cell differentiation and cancer we analyzed the association between publicly available retroviral integration sites and known miRNA loci in the mouse genome. We found that miRNA loci are significantly enriched in the vicinity of CISs which suggests that some of these miRNA loci may also be considered as candidate proto-oncogenes or tumor-suppressor genes.
Murine miRNA loci associate with retroviral common integration sites
We analyzed co-localization between mouse miRNAs and retroviral integration sites determined from mice that developed cancer. For this analysis we used the 363 miRNAs from the miRNA registry (http://microrna.sanger.ac.uk) that have been mapped to 381 locations within the well-assembled fraction of the mouse genome (four miRNAs mapped to more than one location). The locations correspond to the genomic positions of miRNA precursor sequences. We used the RTCGD database containing 2373 retroviral integration sites within Common Integration Sites (CIS inserts) and 3119 retroviral integration sites mapped outside of CISs (non-CIS inserts). We excluded from our analysis integration sites of Sleeping Beauty transposons because non-CIS insertions of Sleeping Beauty are non-randomly distributed among the chromosomes: chromosomes 1, 4, 6 and 15 harbor more than half of all Sleeping Beauty non-CIS integration sites.
Using a local mirror of the UCSC genome browser we found that CIS inserts are located within 5 kb of 17 murine miRNAs and within 10 kb of 22 miRNAs. Examples of co-localization between miRNAs and CIS inserts are shown in Fig. 1 and a full list of CIS-associated miRNAs is given in Table 1. Further increasing the distance (past 10 kb) did not result in a significant increase of miRNA numbers associated with CIS inserts (Fig. 2) indicating that the association between miRNAs and CIS inserts is maximal at short distances.
Each panel represents 20 kb of genomic DNA. Blue triangles indicate retroviral integration sites, red ticks represent miRNAs, black boxes and lines show the position of spliced transcripts. (A) chr11:87,562,001–87,582,000. (B) chrX:48,977,001–48,997,000. (C) chr14:113,914,001–113,934,000. Genome assembly mm8.
Blue diamonds represent the number of miRNAs located at the indicated distance or less from CIS inserts, and red diamonds correspond to miRNAs located at the indicated distance or less from non-CIS inserts. The trendline for miRNAs associated with non-CIS inserts is shown as red dotted line.
We used a bootstrap simulation to estimate the statistical significance of the co-localization between retroviral integration sites and miRNAs (see Materials and Methods). Because some miRNAs are clustered in the genome and hence distributed non-uniformly, we grouped miRNAs into loci by adding 5, 10, 20 or 30 kb to each side of the miRNA location and combining the overlapping regions. This grouping is necessary to maintain clustered and tandemly repeated miRNAs as single units (loci) during the bootstrap procedure. Regions of the same sizes were randomly placed on the mouse genome and cases of overlap with retroviral integration sites were counted. The number of miRNA loci that are located 10 kb or less from CIS inserts is approximately 5.5 times higher than that observed for randomly placed loci, and the probability of obtaining such a number by chance is estimated as 1.7×10−5. The enrichment declines with the length of the miRNA loci, and the probability of obtaining a similar overlap between miRNA loci and CIS inserts by chance is higher for longer distances (Table 2). The bootstrap data indicate that the strongest association between miRNA loci and CIS inserts is at short distances, up to 10 kb. In agreement with this, the number of CIS retroviral inserts near to individual miRNAs is also highly enriched at short distances, and the enrichment declines with distance.
Non-CIS inserts are enriched in the vicinity of miRNA loci
We analyzed the co-localization between miRNAs and retroviral inserts mapped outside of CISs. These non-CIS inserts are non-clustered retroviral integration sites obtained in cancer screens. Their role (if any) in tumorigenesis is unclear and so these inserts are generally omitted from analysis. Low saturation in some cancer screens suggests that some non-CIS inserts might be located in regions involved in tumorigenesis, but on the other hand it is possible to speculate that some are just by-products of the cancer screen.
The number of miRNA located at a given distance from non-CIS inserts increases proportionally to the distance between the miRNA and non-CIS insert (R2 = 0.9396), whereas the number of miRNAs associated with CIS inserts does not display such a strong linear dependence (Fig. 2) (R2 = 0.7831). The association of miRNA with CIS inserts is better described by a logarithmic trendline with R2 = 0.945 (see Materials and Methods). The bootstrap simulation showed a significant enrichment for miRNA loci associated with non-CIS inserts, especially for distances less than 5 kb (Table 3). The number of non-CIS retroviral integration sites in the vicinity of miRNAs has a slightly higher enrichment than the enrichment for miRNA loci. Close examination of miRNA loci associated with non-CIS inserts revealed several loci with two independent non-CIS inserts located very close to each other. For example, among 13 miRNA loci (16 miRNAs) with non-CIS inserts within 10 kb, ten loci have a single insert, and three loci have two inserts. These closely located inserts were isolated from different cancer models, which is why these inserts were classified as non-CIS. We analyzed the distribution of distances between non-CIS inserts in the genome. There is significant increase in the number of non-CIS inserts located within 3 kb of each other, whereas the number of non-CIS insertions at longer distances is more or less uniform when measured in 1 kb bins. A total of 332 out of 3119 inserts outside of CISs are located within 3 kb of each other.
We removed all non-CIS inserts located within 3 kb of each other and repeated the bootstrap analysis. Nevertheless, even this dataset shows approximately two-fold enrichment for miRNA loci associated with non-CIS inserts separated by 3 kb or more (Table 4). The enrichment is similar for all distances analyzed but the association at longer distances is statistically more significant.
Based on this bootstrap analysis we conclude that miRNA loci show the strongest association with CIS inserts at short distances (less than 10 kb). At distances less than 10 kb the enrichment of miRNA loci overlapping with CIS inserts is two times higher than the enrichment of miRNA loci overlapping with non-CIS inserts. At longer distances, such as 30 kb, miRNA loci show a similar association both with CIS and non-CIS inserts.
MicroRNA loci are enriched around starts and ends of protein-coding genes
It is known that integration of murine leukemia viruses and MLV-derived vectors preferentially occurs around promoter regions . Indeed, out of 3119 non-CIS retroviral sites from RTCGD database, 1190 (38%) are located within 5 kb of annotated transcription start sites of RefSeq genes (occupying ∼6.8% of the genome). This represents more than 5-fold enrichment, higher than the enrichment of non-CIS insertions around miRNA loci (Tables 3 and 4). Out of 2373 CIS inserts, 989 (42%) are located within 5 kb from annotated transcription start sites of RefSeq genes.
We analyzed the distribution of miRNA loci in the mouse genome with respect to RefSeq genes. Out of 381 miRNA locations, 155 (41%) overlap RefSeq Genes which occupy 32% of the mouse genome. Among these, 22 (6%) miRNAs overlap exons (2% of the genome), and 133 (35%) are located in introns (30% of the genome). Somewhat surprisingly, we found that miRNAs are enriched close to the start or the end of genes: 69 (18%) and 72 (19%) miRNAs are located within 5 kb of RefSeq gene transcription start or end sites, respectively (∼2.6 and ∼2.8 fold enrichment). In total, 105 (51%) miRNA locations are located within 5 kb either from the start, end, or both (>3-fold enrichment). Moreover, miRNAs show slightly higher enrichment in regions where gene start sites are separated from gene end sites by less than 10 kb (data not shown).
MicroRNAs associated with non-CIS inserts tend to be close to promoters of RefSeq genes while CIS-associated miRNAs tend to be distant from promoters. For example, 13 miRNA loci (16 miRNAs) have non-CIS inserts mapped within 10 kb. Out of these, 9 loci (69%) are located within 10 kb from RefSeq gene starts, while out of 10 CIS-associated miRNA loci only 4 are less than 10 kb from RefSeq gene starts. Six CIS-associated miRNA loci containing 17 miRNAs are located more than 10 kb away from promoters of RefSeq genes, indicating that the observed association between miRNAs and CISs is not explained by co-localization of miRNAs near genes. A similar tendency is observed for miRNA 5k loci (miRNAs within 5 kb of each other): out of 10 miRNA 5k loci with non-CIS RIS within 5 kb (Table 3), 7 overlap RefSeq gene starts. Out of 7 miRNA 5k loci with CIS RIS within 5k (Table 2), 3 overlap RefSeq gene starts.
Interestingly, human miRNAs are also enriched around transcription start or end sites of RefSeq genes. There are 543 annotated miRNAs in the human genome mapped to 474 unique miRNA precursor locations. Out of these 474 miRNA locations 108 (23%) are located within 5 kb of RefSeq gene transcription start or/and end sites (2.2-fold enrichment). We merged these 474 locations into 311 miRNA 5k loci by adding 5 kb on each side of the precursor and then created the base-pair-wise union (OR) of locations. Out of 311 miRNA 5k loci 92 (30%) overlap either transcription start or end sites of RefSeq genes, or both. These 92 loci contain 110 (23%) miRNA precursors. It seems that miRNA loci in the vicinity of transcription start or end sites contain less miRNA precursors than miRNA loci located farther from genes (1.2 and 1.7 miRNA precursors per loci, respectively).
CIS-associated miRNAs are conserved and their human orthologs are located in cancer-associated regions
CIS-associated miRNAs have some common features. Most (21 out of 22) CIS-associated miRNAs are located outside of RefSeq protein-coding genes. The exception, mir-135a-1, is located in the 3′ UTR of the gene 6230410P16Rik. In contrast, 5 out of 16 miRNAs having non-CIS inserts within 10 kb are located within RefSeq genes. On average, CIS-associated miRNA loci contain more miRNAs than non-CIS associated miRNA loci (2.2 and 1.2 miRNA per locus, respectively).
CIS-associated miRNAs tend to be more conserved than miRNAs associated with non-CIS inserts, both in alignments of multiple vertebrate species and in human and mouse pairwise comparisons. First, we used pre-calculated phastCons  conservation scores based on 17 vertebrates for whole miRNA precursor sequences. The average conservation score for 22 CIS-associated miRNA is 0.941 versus 0.739 for 16 non-CIS-associated miRNA (see Materials and Methods for details). Second, we compared the sequence identity between mouse miRNA precursors and their orthologous human sequences. CIS-associated mouse miRNA precursors have 96% identity with human sequences whereas non-CIS-associated miRNAs display 91% identity, slightly lower than the average identity level for all mouse miRNA precursors (92%). In addition, CIS-associated miRNA precursors have significantly less indels between mouse and human sequences.
Considering the high conservation of CIS-associated miRNAs we looked for the involvement of human homologs in cancer development. Human homologs of eight CIS-associated miRNA loci are located in fragile regions and regions involved in cancer  (Table 1). It has been shown that miRNAs are generally down-regulated in cancer . We therefore compared the available data on tissue-specific expression of human miRNAs  and the type of cancer associated with the CISs (blood or brain cancer) located in the vicinity of homologous murine miRNAs. Eight CIS-associated miRNA loci are co-localized with insertions determined from various types of blood cancer (Table 1). Human expression data in 24 different human organs and cell types was available for orthologs of members of 7 these loci . There is a high correspondence between the miRNA expression pattern and the type of cancer induced by retroviral insertion near these miRNAs. Five loci have human miRNA orthologs whose expression is highest in tissues associated with blood development such as bone marrow or thymus. Another miRNA, miR-22, is enriched in thymus although it is most abundant in skeletal muscles .
Here we demonstrate that murine miRNAs are associated with CISs. The enrichment of miRNAs loci in proximity to CIS inserts is higher than the enrichment of miRNAs around non-clustered retroviral insertions located outside of CISs. All but one CIS-associated miRNA are located outside RefSeq genes and some of them may be classified as proto-oncogenes or tumor-suppressor genes. Indeed, the well-characterized oncogenic miRNA cluster mir-17-92 ,  is associated with CIS inserts (Table 1). Ectopic expression of the mir-17–92 cluster accelerates tumor development in a mouse B-cell lymphoma model . Another example is the mir-106a miRNA cistron which shows numerous retroviral integrations in a lymphocyte tumor screen: tumors containing inserts close to the mir-106a miRNA cluster exhibit up to a 20-fold higher expression of these miRNAs . There is also some evidence of the involvement of mir-142 in cancer development .
Formally, we cannot exclude the possibility that insertions affect (distant) regulatory elements situated in cis to protein-coding genes, especially when the CISs occur relatively close to genes, e.g., in HOX clusters (Fig. 1c). However, an equally if not more plausible explanation is that retroviral insertions are causing changes in the expression of miRNA genes, especially in those cases where the insertions occur in close proximity to miRNAs, rather than affecting more distant protein-coding genes. For example, all five CIS insertions in region XqA5 are less than 5 kb from the three miRNAs listed but more than 120 kb from the assigned RIS gene Gpc3, and four out of seven CIS insertions in region 11qC are closer to mir-142 than to the nearest assigned gene Supt4h2. It is also possible that that (some) miRNA genes have chromatin structure open to retroviral integration similar to promoter regions of protein coding genes. The CIS-associated miRNAs represent prospective candidates for further experimental studies in the context of cancer association.
It appears that miRNAs are enriched around the transcription start and/or end sites of protein-coding genes both in human and mouse. Considering the strong bias of MLV integrations around promoter regions  it is possible to speculate that the observed enrichment of non-CIS retroviral insertions near miRNAs may be at least partially due to the preferential location of miRNAs around promoter regions. Interestingly, miRNAs located close to transcription start or end sites tend to be non-clustered indicating a different type of organization of these miRNA genes.
Materials and Methods
We used miRBase release 9.0 (October 2006) which contains 373 mouse miRNAs, 363 of which were mapped to 381 locations within the Mouse Feb 2006 (mm8) genome assembly (Build 36 “essentially complete” assembly by NCBI).
The July 2006 version of the Retroviral Tagged Cancer Gene Database (RTCGD, http://rtcgd.ncifcrf.gov)  contains 2373 retroviral integration sites classified as belonging to Common Integration Sites (CISs), and 3119 retroviral integration sites classified as located outside of CISs.
All analysis was done on a local mirror of the UCSC Genome Browser .
Bootstrap analysis was done similarly to . Briefly, miRNAs were merged into miRNA loci by adding 5, 10, 20 or 30 kb on each side with subsequent base-pair-wise union (OR) on the UCSC Genome Browser. The resulting loci were randomly placed on the mouse genome with mapped CIS inserts or inserts outside of CISs and any cases of overlap were counted. Genome assembly gaps were removed from the analysis. Overlaps between randomly placed loci were prohibited. For each bootstrap test we performed 107 iterations.
The conservation of mouse miRNAs was estimated using phastCons conservation scores  for whole miRNA precursors. The phastCons conservation scores were based on mouse-centric alignments of 17 species and were obtained from the UCSC genome browser . The average phastCons score for bases within each known miRNA was calculated using the UCSC hgWiggle utility and the -doStats flag. Mouse-human base-pair identity scores were calculated using pairwise genome alignments obtained from the UCSC genome browser and the UCSC utilities axtAndBed and axtCalcMatrix.
The enrichment of miRNAs around transcription start or end sites was calculated as follows: all RefSeq gene transcription start and end sites were extracted from the genome browser. Annotations were created by adding 5 or 10 kb to every transcription start or end site. The enrichment was calculated as the ratio between the fraction of miRNAs overlapping these annotations and the fraction of the genome occupied by the corresponding annotations.
The linear and logarithmic trendlines and R2 values were calculated using Microsoft Excel 2003. The linear trendline for miRNAs associated with non-CIS inserts was described by y = 1.2286x+6.3333 with R2 = 0.9396. The linear trendline for miRNAs associated with CIS inserts was described by y = 0.3371x+17.6 and R2 = 0.7831. The logarithmic trendline for miRNAs associated with CIS inserts was described by y = 5.2282Ln(x)+9.3527 with R2 = 0.945.
We would like to thank the Retroviral Tagged Cancer Gene Database (RTCGD) for their assistance in providing access to the mapped retroviral insertion positions. We would like to thank anonymous reviewers for constructive comments and suggestions.
Conceived and designed the experiments: JM IM. Performed the experiments: MP IM CS. Analyzed the data: MP IM CS. Contributed reagents/materials/analysis tools: MP. Wrote the paper: JM MP IM. Other: Laboratory Head: JM. Provided support and facilities: JM.
- 1. Bartel DP (2004) MicroRNAs: genomics, biogenesis, mechanism, and function. Cell 116: 281–297.
- 2. Mattick JS, Makunin IV (2005) Small regulatory RNAs in mammals. Hum Mol Genet 14: R121–132.
- 3. Zhang B, Pan X, Cobb GP, Anderson TA (2007) MicroRNAs as oncogenes and tumor suppressors. Dev Biol 302: 1–12.
- 4. Xi Y, Edwards JR, Ju J (2007) Investigation of miRNA biology by bioinformatic tools and impact of miRNAs in colorectal cancer - regulatory relationship of c-Myc and p53 with miRNAs. Cancer Informatics 3: 245–253.
- 5. Gregory RI, Shiekhattar R (2005) MicroRNA biogenesis and cancer. Cancer Res 65: 3509–3512.
- 6. Sevignani C, Calin GA, Nnadi SC, Shimizu M, Davuluri RV, et al. (2007) MicroRNA genes are frequently located near mouse cancer susceptibility loci. Proc Natl Acad Sci USA 104: 8017–8022.
- 7. Calin GA, Sevignani C, Dumitru CD, Hyslop T, Noch E, et al. (2004) Human microRNA genes are frequently located at fragile sites and genomic regions involved in cancers. Proc Natl Acad Sci USA 101: 2999–3004.
- 8. Lu J, Getz G, Miska EA, Alvarez-Saavedra E, Lamb J, et al. (2005) MicroRNA expression profiles classify human cancers. Nature 435: 834–838.
- 9. He L, Thomson JM, Hemann MT, Hernando-Monge E, Mu D, et al. (2005) A microRNA polycistron as a potential human oncogene. Nature 435: 828–833.
- 10. Bijl J, Sauvageau M, Thompson A, Sauvageau G (2005) High incidence of proviral integrations in the Hoxa locus in a new model of E2a-PBX1-induced B-cell leukemia. Genes Dev 19: 224–233.
- 11. Hwang HC, Martins CP, Bronkhorst Y, Randel E, Berns A, et al. (2002) Identification of oncogenes collaborating with p27Kip1 loss by insertional mutagenesis and high-throughput insertion site analysis. Proc Natl Acad Sci USA 99: 11293–11298.
- 12. Johansson FK, Brodd J, Eklof C, Ferletta M, Hesselager G, et al. (2004) Identification of candidate cancer-causing genes in mouse brain tumors by retroviral tagging. Proc Natl Acad Sci USA 101: 11334–11337.
- 13. Lund AH, Turner G, Trubetskoy A, Verhoeven E, Wientjens E, et al. (2002) Genome-wide retroviral insertional tagging of genes involved in cancer in Cdkn2a-deficient mice. Nat Genet 32: 160–165.
- 14. Kim R, Trubetskoy A, Suzuki T, Jenkins NA, Copeland NG, et al. (2003) Genome-based identification of cancer genes by proviral tagging in mouse retrovirus-induced T-cell lymphomas. J Virol 77: 2056–2062.
- 15. Mikkers H, Allen J, Knipscheer P, Romeijn L, Hart A, et al. (2002) High-throughput retroviral tagging to identify components of specific signaling pathways in cancer. Nat Genet 32: 153–159.
- 16. Suzuki T, Minehata K, Akagi K, Jenkins NA, Copeland NG (2006) Tumor suppressor gene identification using retroviral insertional mutagenesis in Blm-deficient mice. EMBO J 25: 3422–3431.
- 17. Suzuki T, Shen H, Akagi K, Morse HC, Malley JD, et al. (2002) New genes involved in cancer identified by retroviral tagging. Nat Genet 32: 166–174.
- 18. Yamashita N, Osato M, Huang L, Yanagida M, Kogan SC, et al. (2005) Haploinsufficiency of Runx1/AML1 promotes myeloid features and leukaemogenesis in BXH2 mice. Br J Haematol 131: 495–507.
- 19. Akagi K, Suzuki T, Stephens RM, Jenkins NA, Copeland NG (2004) RTCGD: retroviral tagged cancer gene database. Nucleic Acids Res 32: D523–527.
- 20. Kent WJ, Sugnet CW, Furey TS, Roskin KM, Pringle TH, et al. (2002) The human genome browser at UCSC. Genome Res 12: 996–1006.
- 21. Wang CL, Wang BB, Bartha G, Li L, Channa N, et al. (2006) Activation of an oncogenic microRNA cistron by provirus integration. Proc Natl Acad Sci USA 103: 18680–18684.
- 22. Slape C, Hartung H, Lin YW, Bies J, Wolff L, et al. (2007) Retroviral insertional mutagenesis identifies genes that collaborate with NUP98-HOXD13 during leukemic transformation. Cancer Res 67: 5148–5155.
- 23. Bushman F, Lewinski M, Ciuffi A, Barr S, Leipzig J, et al. (2005) Genome-wide analysis of retroviral DNA integration. Nat Rev Microbiol 3: 848–858.
- 24. Siepel A, Bejerano G, Pedersen JS, Hinrichs AS, Hou M, et al. (2005) Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes. Genome Res 15: 1034–1050.
- 25. Baskerville S, Bartel DP (2005) Microarray profiling of microRNAs reveals frequent coexpression with neighboring miRNAs and host genes. RNA 11: 241–247.
- 26. O'Donnell KA, Wentzel EA, Zeller KI, Dang CV, Mendell JT (2005) c-Myc-regulated microRNAs modulate E2F1 expression. Nature 435: 839–843.
- 27. Lum AM, Wang BB, Li L, Channa N, Bartha G, et al. (2007) Retroviral activation of the mir-106a microRNA cistron in T lymphoma. Retrovirology 4: 5.
- 28. Calin GA, Croce CM (2006) MicroRNAs and chromosomal abnormalities in cancer cells. Oncogene 25: 6202–6210.
- 29. Simons C, Pheasant M, Makunin IV, Mattick JS (2006) Transposon-free regions in mammalian genomes. Genome Res 16: 164–172.