Xenotropic murine leukemia virus (MLV)-related virus (XMRV) is a new human retrovirus associated with prostate cancer and chronic fatigue syndrome. The causal relationship of XMRV infection to human disease and the mechanism of pathogenicity have not been established. During retrovirus replication, integration of the cDNA copy of the viral RNA genome into the host cell chromosome is an essential step and involves coordinated joining of the two ends of the linear viral DNA into staggered sites on target DNA. Correct integration produces proviruses that are flanked by a short direct repeat, which varies from 4 to 6 bp among the retroviruses but is invariant for each particular retrovirus. Uncoordinated joining of the two viral DNA ends into target DNA can cause insertions, deletions, or other genomic alterations at the integration site. To determine the fidelity of XMRV integration, cells infected with XMRV were clonally expanded and DNA sequences at the viral-host DNA junctions were determined and analyzed. We found that a majority of the provirus ends were correctly processed and flanked by a 4-bp direct repeat of host DNA. A weak consensus sequence was also detected at the XMRV integration sites. We conclude that integration of XMRV DNA involves a coordinated joining of two viral DNA ends that are spaced 4 bp apart on the target DNA and proceeds with high fidelity.
Citation: Kim S, Rusmevichientong A, Dong B, Remenyi R, Silverman RH, et al. (2010) Fidelity of Target Site Duplication and Sequence Preference during Integration of Xenotropic Murine Leukemia Virus-Related Virus. PLoS ONE 5(4): e10255. doi:10.1371/journal.pone.0010255
Editor: Reuben S. Harris, University of Minnesota, United States of America
Received: February 6, 2010; Accepted: March 28, 2010; Published: April 20, 2010
Copyright: © 2010 Kim et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This work was supported by a National Institutes of Health (NIH) Grant CA68859 and The Margaret E. Early Medical Research Trust Grant to S.A.C., and by grant number W81XWH-07-1-338 from the U.S. Department of Defense Prostate Cancer Research Program, NIH Grant CA103943, the Charlotte Geyer Foundation, and the Mal and Lea Bank Chair to R.H.S. S.K. is partly supported by a Dissertation Year Fellowship Award from the UCLA Graduate Division. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Xenotropic murine leukemia virus (MLV)-related virus (XMRV) is a new human retrovirus having a 8.65 kbp genome and shares up to 95% overall nucleotide sequence identity with other known MLVs . XMRV was first reported to be associated with prostate cancer from patients homozygous for a defective variant of RNase L (R462Q), a regulated endoribonuclease for single-stranded RNA that functions in the antiviral action of interferon (IFN) , . The Arg to Gln substitution at amino acid position 462 (R462Q) of RNase L is a common missense variant (35% allelic frequency), resulting in a 3-fold decrease in catalytic activity compared with the wild-type enzyme , . Consistent with the observation that the virus is associated with patients having the homozygous mutant RNASEL genotype, XMRV replication in vitro is sensitive to IFN-β inhibition . The link between XMRV and prostate cancer suggests that inherited defects of RNase L may enhance susceptibility to XMRV, leading to tumorigenesis. However, detection of XMRV has recently been reported in prostate samples independent of the RNASEL genotype . XMRV has also been detected in the blood of patients with chronic fatigue syndrome . The causal relationships of XMRV infection to prostate cancer and chronic fatigue syndrome, as well as the mechanism for virus pathogenicity, have yet to be established. Additionally, several studies have failed to detect XMRV in different European cohorts of patients with either prostate cancer  or with chronic fatigue syndrome , , , suggesting that either population differences or environmental factors may modulate the incidence of XMRV infections.
Integration of the cDNA copy of the viral RNA genome is essential for retroviruses to establish a productive infection (for reviews, see reference ). However, because of its nonspecific nature, retroviral DNA integration is inherently a mutagenic event. Many retroviruses, especially members of the gammaretrovirus genus, can induce tumors as a consequence of integrating their viral genome into the host cell chromosome and activating proto-oncogenes via promoter or enhancer insertion, a mechanism referred to as proviral insertional mutagenesis . XMRV is a member of the gammaretrovirus family, and does not encode host-derived oncogenes . Genome-wide analyses of XMRV integration sites in a human prostate cell line, DU145, and prostate cancer tissues showed that XMRV integration favors gene-dense regions and genomic features frequently associated with structurally open, transcriptional regulatory regions of a chromosome, such as transcription start sites, CpG islands, and DNase hypersensitive sites . The XMRV integration sites in prostate cancer tissues are further associated with cancer breakpoints, common fragile sites, and microRNA genes. However, no common integration site or integration hotspot has been detected within or near known proto-oncogenes and tumor suppressor genes in both acutely infected cells and cancer tissues . Due to the relatively few integration sites (a total of 14) analyzed thus far in prostate cancer tissues, the role of XMRV infection in causing prostate cancer by insertional mutagenesis is still unclear.
Integration of retroviral DNA is catalyzed by the viral enzyme integrase (IN) and involves sequential steps of DNA breaking and joining reactions (; and see Fig. 1A). During integrative recombination, the two ends of the linear viral DNA genome are joined in a concerted fashion to staggered sites on the opposite strands of the target DNA. Gap repair of the integration intermediate results in the formation of a provirus that is flanked by short direct repeats of target DNA, a hallmark of retroviral DNA integration , . The length of the direct repeats, which varies from 4 to 6 bp among the retroviruses but is invariant for each particular retrovirus, presumably corresponds to the spacing of the staggered target DNA sites that are attacked by IN during integration. Analyses of various proviruses together with the associated flanking DNA sequences have revealed high integration fidelity. For instance, 15 of 15 human immunodeficiency virus type 1 (HIV-1) integration sites , , , 8 of 8 MLV integration sites , , , and 7 of 7 spleen necrosis virus integration sites ,  have the correct length of the target site duplication. However, certain mutations of the viral genome or reaction conditions can lead to uncoordinated integration of the two viral ends and result in deletions, insertions, or other rearrangements of the host DNA , , , , . Therefore, in addition to insertional mutagenesis, uncoordinated integration of the two viral ends during integrative recombination may constitute another mechanism that can cause genomic alterations and initiate deleterious events in the infected cell. In this study, we have cloned and determined host DNA sequences flanking XMRV proviruses. We found that integration of XMRV DNA proceeds with high fidelity, and consistently produces a 4-bp direct repeat at the virus-target DNA junctions. Analysis of the 4-bp direct repeats reveals a weak consensus integration sequence.
(A) DNA breaking and joining steps during integration. Viral and target DNA strands are represented by thick black and parallel lines, respectively, and the viral long terminal repeats (LTRs) are depicted as grey boxes. Nucleotides at the top and bottom strands are denoted by uppercase and lowercase letters, respectively. During 3′-end processing, IN removes two nucleotides from the 3′ end of each strand of linear viral DNA so that the viral 3′ ends terminate with a conserved CA dinucleotide. Closed arrowheads denote the positions of strand transfer, a concerted cleavage-ligation reaction during which IN makes a staggered break in the target DNA. Host DNA repair enzymes fill in the resulting single-stranded gaps, denoted by D1 to D4 in the upper strand and d1 to d4 in the lower strand of target DNA, and remove the two unpaired nucleotides at the 5′ ends of the viral DNA (open arrowheads), thereby generating the short direct repeats flanking the provirus. (B) A potential pathway for generating a base transversion in the short direct repeat during XMRV integration. A coordinated integration of the two viral ends occurred at the 4-bp staggered positions as depicted by the closed arrowheads. During repair of the single-stranded gap adjacent to the upstream LTR, an adenine nucleotide was introduced at the D4 position either by misincorporation or aberrant processing of the unpaired AA-dinucleotide at the viral 5′ end. Subsequent repair of the mismatch resulted in the observed transversion (denoted by bold types).
Fidelity and length of target site duplication during XMRV integration
The IN-catalyzed integration of retroviral DNA involves sequential DNA breaking and joining steps (Fig. 1A). To determine the length of the target-site duplication during XMRV integration, we sequenced the stretches of host cell DNA flanking the long terminal repeat (LTR) at each end of a given provirus, and then searched for these flanking sequences within the human genome. To facilitate the analysis, a human prostate cancer cell line DU145 was infected by XMRV and then clonally expanded. Ten infected cell clones were analyzed, and a total of 15 integration site sequences flanking both ends of the XMRV provirus were determined and mapped (Table 1). Three cell clones (C-6, -7, and -8) contained multiple XMRV proviruses, which may have resulted from multiple integration events within the same cell clone or from mixed clonal populations.
Of the 15 XMRV integration sites analyzed, 13 had a 4-bp target site duplication, one site had a 5-bp duplication (clone C-10), and one had a 273-bp duplication (clone C-3). Examination of the viral DNA sequence of the provirus with the 273-bp target site duplication revealed that the left LTR contained a 5-bp deletion at the U3 end that includes a CA dinucleotide that is highly conserved in retroviruses . Deletion or mutation of the CA-dinucleotide in the viral donor DNA substrates significantly reduces the efficiency of coordinated integration of two donor molecules into a target DNA , , , , . The U3 end deletion in the left LTR might cause an uncoordinated integration of the two XMRV DNA ends, resulting in staggered breaks that were 273-bp apart. For the 13 proviral integration sites with a 4-bp duplication, 12 had duplication sequences that matched correctly with human genomic DNA sequences. The remaining integration site (from clone C-8) contained a T to A transversion at the position 4 within the direct repeat flanking the left LTR (5′-TAAA), while the direct repeat flanking the right LTR (5′-TAAT) matched correctly with the human genomic DNA (5′-TAAT). Since mismatches in the genome would most likely be repaired by host enzymes before integration, we speculate that the transversion was produced by base misincorporation during gap filling or aberrant processing of the unpaired nucleotides at the viral 5′ end, followed by mismatch repair that fixed the mutation (Fig. 1B).
In addition to the length of the direct repeats, analysis of the 15 integration site sequences showed that all viral sequences, with the exception of the left LTR end of the proviral clone C-3, were terminated with the conserved CA dinucleotide at the 3′ end (data not shown), indicating that the viral DNA ends were correctly cleaved by IN . Based on our analysis that 87% (13 of 15) of the proviruses had a correct 4-bp direct repeat at the integration site, we conclude that the majority of XMRV integration reactions involve a concerted joining of two viral DNA ends that are spaced 4 bp apart on the target DNA.
Base composition surrounding XMRV integration sites
Genome-wide analyses of virus-target DNA junctions reveal a weak consensus integration sequence that is nonetheless unique for each retrovirus examined , , , , . This consensus integration sequence is generally palindromic. For instance, the consensus integration sequence for HIV-1 and MLV are 5′-GTWAC and 5′-VTAB, respectively (using standard International Union of Biochemistry base codes: B = C, G, or T; V = A, C, or G; W = A or T) , , , . To determine the base composition surrounding the XMRV integration site, the target DNA sequences flanking the proviruses were aligned relative to the integration site (between position −1 and D1; Fig. 2), and the nucleotide frequency of the 4-bp direct repeat (positions D1 to D4; Fig. 2) and the positions 10 bp upstream (positions −1 to −10) and 10 bp downstream (positions +1 to +10) of the direct repeat were calculated. In addition to the 13 integration site sequences from the cell clones, the analysis included a dataset containing 472 XMRV integration sites from acutely infected DU145 cells and 14 integration sites from human prostate cancer tissues .
Base compositions of the 4-bp target site duplication (positions D1 to D4; demarcated by the thick vertical lines) and 10 bp upstream (positions −1 to −10) and downstream (positions +1 to +10) of the direct repeat were calculated. The datasets include the 13 integration sites with correct 4-bp direct repeat (Table 1), 472 integration sites from acutely infected DU145 cells (GenBank accession numbers EU981292 to EU981799) and 14 integration sites from human prostate cancer tissues (GenBank accession numbers EU981800 to EU981813) . Integration occurs between positions −1 and D1 on the top strand, and between positions D4 and +1 on the bottom strand (blue arrows). Any base in a position that is significantly overrepresented than the random dataset (P<0.0001) is highlighted in green, while any base in a position that is significantly underrepresented than the random dataset (P<0.0001) is highlighted in red.
Comparison of the nucleotide frequency at each position to the value of a random dataset generated in silico led to identification of a 5′-CTVB consensus sequence (P<0.0001). Among all the retroviruses analyzed, the consensus integration site sequence of XMRV is most similar to that of MLV , , . Both XMRV and MLV generate a 4-bp target site duplication with thymine favored at the D2 position and adenine disfavored at the D4 position. In addition, thymine was disfavored at the D1 position for both XMRV and MLV. At position D3 of the XMRV integration site sequence, although the only statistical significance at P<0.0001 was the underrepresentation of thymine, adenine was significantly favored at P<0.005. In addition to the 4-bp direct repeat, many positions upstream and downstream of the direct repeat had nucleotide frequencies that were significantly overrepresented (e.g. cytosine and guanine at positions +3 and +9, respectively) or underrepresented (e.g. guanine at position −2) when compared to the random in silico control. Furthermore, some of the positions with significantly different representation showed symmetry, such as adenine being favored at position +2 and the corresponding thymine being favored at position −2. Other positions exhibiting a distinct nucleotide preference, however, did not show this symmetry; for example, cytosine was favored at position +3, but guanine was not favored at position −3.
XMRV is a newly discovered gammaretrovirus that has been associated with prostate cancer and chronic fatigue syndrome in humans . An important question is whether XMRV has a causal role in initiation or progression of either of these two diseases. In this study, we investigated if integration of XMRV DNA into the host cell chromosome can cause genetic alterations that may subsequently lead to human disease. During integration, the two ends of the linear viral DNA are joined to staggered sites on the opposite strands of the target DNA . Subsequent strand separation and gap repair lead to the presence of short direct repeats flanking the proviral DNA , . Therefore, the length of the direct repeats presumably corresponds to the spacing of the two viral ends on target DNA during integrative recombination catalyzed by IN. Analyses of various proviruses have revealed that the length of target site duplication, though varying from 4 to 6 bp among the different retroviruses examined, is invariant for each particular retrovirus , , . The high fidelity of the direct repeat length supports the notion that IN multimers form a stable complex with viral and target DNA and catalyze coordinated processing and integration of the two viral DNA ends , , , , . In addition, reaction conditions in vitro and in vivo that promote uncoordinated integration of the two ends often produce deletions and duplications of various lengths in the target DNA , , , , , , , . Since the majority of the integrated XMRV contain viral sequences that terminate with the conserved CA dinucleotide and are flanked by a 4-bp direct repeat of target DNA sequence, we conclude that the two viral DNA ends are correctly processed and joined in a coordinated manner to target DNA by IN during XMRV integration.
Although retroviruses can access most of the host genome for integration, selection of particular target sites is not random, and the frequency of use of specific sites varies considerably, with some sites being preferred up to several hundred times greater than random , , . The mechanism that determines target site specificity is not well understood, and is likely affected by multiple factors , . Both in vitro and in vivo studies have implicated IN as one important determinant in specifying a chromosomal or DNA site for integration. INs of different retroviruses exhibit significant differences in the distribution and preference of integration into an identical target substrate in vitro , , , and in vivo, a chimeric HIV that encodes IN from MLV integrates preferentially into chromosomal features favored by MLV (i.e. transcription start sites and CpG islands) instead of transcription units as favored by HIV-1 . Although primary DNA sequence is likely not a dominant factor in determining target site specificity, genome-wide analyses of virus-target DNA junctions reveal the presence of weak consensus integration sequences, which are generally palindromic and unique for each retrovirus , , , , , , , , . A weak palindromic consensus sequence is also detected among the XMRV integration sites. We hypothesize that integration of retroviral DNA into a host DNA site depends on the specific interaction between IN and target DNA sequences, resulting in each retrovirus having its own unique, though weak, consensus sequence. The consensus sequence for each retrovirus may be a result of favorable interactions between the DNA bases and certain amino acid residues of IN, or may reflect the amenability of the sequence in adopting particular DNA structures favorable for IN binding. For instance, a common mechanism for stimulating HIV-1 integration is DNA bending, which creates a widened major groove at the outer curved face that is favorable for integration , , , , .
The site and fidelity of integration have significant implications for the fate of both the virus and the host cell. Although the present study shows that XMRV integration proceeds with high fidelity, further analysis of additional XMRV integration sites in human tissues would be necessary to clarify whether insertional mutagenesis plays a pathogenic role during XMRV infection. Many viruses from the gammaretrovirus genus of the Retroviridae family, such as MLV, feline leukemia virus, and koala retrovirus, are responsible for leukemogenesis and other diseases in their respective host species . Therefore, the recent evidence of authentic infections of humans by XMRV and the association of XMRV infection with prostate cancer and chronic fatigue syndrome , ,  are alarming and warrant further investigations to determine the causal relationship and pathogenic mechanisms.
Materials and Methods
Host DNA sequences flanking the XMRV provirus
To determine the length and base composition of the target sequence duplication produced by XMRV integration, ten single-clonal (isogenic) populations of XMRV-infected cells were prepared. Plasmid VP62/pcDNA3.1(−) containing the molecular clone of XMRV  was transfected with Lipofectamine 2000 (Invitrogen) into DU145 cells. The transfected cells were cultured with complete RPMI 1640 media for 3 weeks, trypsinized, diluted, and plated in 96-well plates so that the calculated number of cells per well on average would be 0.15, 0.45, 1.5, 4.5 and 15. The media from wells with a single colony were assayed for reverse transcriptase (RT) activities after 17 to 24 days. Based on high RT activities, ten clones were chosen for integration site analysis. For each clonal population, the cellular DNA sequence at the right LTR-host DNA junction was determined using the linker ligation-mediated PCR assay as described below. Based on the sequence information of the right LTR-host DNA junction, the left LTR-host DNA junction was amplified by nested PCR using forward primers that anneal to positions upstream of the left LTR-host DNA junction and reverse primers that anneal to sequences downstream and within the left LTR. XMRV613R (5′-GATCGCCGGCCGGCTTA), which is complementary to nt positions 597 to 613 of XMRV, and XMRV165R (5′-CCTGACTACAGATATCCTGTTT), which is complementary to nt positions 143 to 165, were used as reverse primers for the first and second PCRs, respectively. The PCR product was electrophoresed on a 1.5% agarose gel, and the expected size of DNA band was excised from the gel and extracted using a gel extraction kit (Qiagen). Extracted DNA was cloned into a pCR-Blunt vector using a Zero Blunt PCR Cloning Kit (Invitrogen).
Linker ligation-mediated PCR assay for cloning XMRV integration sites
The genomic DNA from XMRV-infected cells was isolated with a QIAamp DNA Mini Kit (Qiagen) following the manufacturer's instruction. The assay for determining XMRV integration sites in DU145 cells was performed as described previously . Briefly, genomic DNA from XMRV-infected DU145 cells was digested with Pst I, which cuts once in the XMRV genome at nucleotide (nt) position 7,534 and produces on average 4-kbp DNA fragments. After digestion, DNA was denatured and annealed with a biotinylated primer, bXMRV7938 (5′-biotin-ATCCTACTCTTCGGACCCTGT), which is complementary to nt positions 7,938 to 7,958 within the env gene (about 160 bp upstream of the right LTR). The annealed primer was extended using the PicoMaxx High Fidelity PCR system (Stratagene) to produce biotinylated double-stranded DNA containing the viral-human DNA junction region. The biotinylated DNA product was then isolated by binding to streptavidin-agarose Dynabeads (Dynal), and digested with TaqαI (5′-T↓CGA), a 4-bp cutter that does not cleave the viral DNA portion of the biotinylated DNA. Digestion of the human genomic DNA with TaqαI produces on average 1.9-kbp DNA fragments . After digestion, the integration site-containing DNA was ligated with TaqLinker, which was prepared by annealing BHLinkA (5′-CGGATCCCGCATCATATCTCCAGGTGTGACAGTTT) with TaqLinkS (5′-CACCTGGAGATATGATGCGGGATC). The TaqLinker contains a 2-nt 5′-overhang (in bold type) complementary to the TaqαI -digested biotinylated DNA. The linker-ligated DNA product was amplified by a two-step PCR process. The first PCR was carried out using primers XMRV8415F (5′-AACCAATCAGCTCGCTTCTC) and Linker1 (5′-TAACTGTCACACCTGGAGATA) in a final volume of 300 µl with 0.5 µM of each primer, 0.2 mM of dNTPs, and 12 U Pfu DNA polymerase (Stratagene) under the following condition: 2 min of preincubation at 94°C, followed by 29 cycles at 94°C for 30 s, 58°C for 30 s, and 72°C for 4 min. The PCR product was purified using a PCR Purification Kit (Qiagen), and was used as the template for the second PCR with two nested primers, XMRV8535F (5′-CGGGTACCCGTGTTCCCAATA) and Linker2 (5′-TAGATATGATGCGGGATCCG), which anneal downstream of XMRV8415F and Linker1 binding sites, respectively. The condition for the second PCR was identical to the first PCR except being conducted with only 18 cycles. The second PCR product was electrophoresed on a 1.5% agarose gel and DNA bands between 200 bp to 2 kbp were extracted and cloned into a pCR-Blunt vector using a Zero Blunt PCR Cloning Kit (Invitrogen).
Integration site sequence determination and data analysis
The sequence of the cloned DNA was determined by dideoxy sequencing, and sequencing ambiguities were resolved by repeated sequencing on both strands. The authenticity of the integration site sequence were verified by the following criteria: (i) the sequence contained both XMRV LTR and linker sequence, (ii) a match to the human genome begining after the end of the LTR (5′-…CA-3′) and ending with the linker sequence, and (iii) the host DNA region (containing 20 or more nucleotides) from the putative integration site sequence showed 96% or greater identity to the human genomic sequence. The authenticated integration site sequences were then mapped to the human genome hg18 [University of California, Santa Cruz (UCSC) March 2006 freeze; National Center for Biotechnology Information (NCBI) Build 36.1] using BLASTN program (http://www.ensembl.org) or BLAT (UCSC; http://genome.ucsc.edu).
To determine nucleotide preference at integration sites, the target DNA sequences flanking the viral-host DNA junctions were aligned relative to the point of viral DNA integration. The XMRV integration site datasets used to determine nucleotide preference include the 13 correct integration sites listed in Table 1 (GenBank accession numbers GU816075, GU816076, GU816079 to GU816100, GU816103, GU816104), 472 integration sites from acutely infected DU145 cells (GenBank accession numbers EU981292 to EU981799) , and 14 integration sites from human prostate cancer tissues (GenBank accession numbers EU981800 to EU981813) . The nucleotide frequency at each position was calculated and compared to values obtained from a set of 10,000 random positions generated in silico by choosing a random number between 1 and 3,093,120,360, which represents the total length of the 22 autosomal chromosomes plus the X-sex chromosome of the human genome. The nucleotide frequencies of the random dataset are 29.8%, 20.4%, 20.5%, and 29.3% for A, C, G, and T, respectively. Statistical difference of nucleotide frequency between XMRV integration site sequences and the random dataset was analyzed at each position using a chi-square test at P<0.0001.
Nucleotide sequences accession numbers
The GenBank accession numbers for integration site sequences from the ten XMRV-infected cell clones listed in Table 1 are GU816075 to GU816104.
Conceived and designed the experiments: RHS SAC. Performed the experiments: SK AR BD RR. Analyzed the data: SK AR SAC. Contributed reagents/materials/analysis tools: BD RHS. Wrote the paper: SK RHS SAC.
- 1. Urisman A, Molinaro RJ, Fischer N, Plummer SJ, Casey G, et al. (2006) Identification of a novel Gammaretrovirus in prostate tumors of patients homozygous for R462Q RNASEL variant. PLoS Pathog 2: e25.
- 2. Silverman RH (2003) Implications for RNase L in prostate cancer biology. Biochemistry 42: 1805–1812.
- 3. Casey G, Neville PJ, Plummer SJ, Xiang Y, Krumroy LM, et al. (2002) RNASEL Arg462Gln variant is implicated in up to 13% of prostate cancer cases. Nat Genet 32: 581–583.
- 4. Xiang Y, Wang Z, Murakami J, Plummer S, Klein EA, et al. (2003) Effects of RNase L mutations associated with prostate cancer on apoptosis induced by 2′,5′-oligoadenylates. Cancer Res 63: 6795–6801.
- 5. Dong B, Kim S, Hong S, Das Gupta J, Malathi K, et al. (2007) An infectious retrovirus susceptible to an IFN antiviral pathway from human prostate tumors. Proc Natl Acad Sci USA 104: 1655–1660.
- 6. Schlaberg R, Choe DJ, Brown KR, Thaker HM, Singh IR (2009) XMRV is present in malignant prostatic epithelium and is associated with prostate cancer, especially high-grade tumors. Proc Natl Acad Sci USA 106: 16351–16356.
- 7. Lombardi VC, Ruscetti FW, Das Gupta J, Pfost MA, Hagen KS, et al. (2009) Detection of an infectious retrovirus, XMRV, in blood cells of patients with chronic fatigue syndrome. Science 326: 585–589.
- 8. Hohn O, Krause H, Barbarotto P, Niederstadt L, Beimforde N, et al. (2009) Lack of evidence for xenotropic murine leukemia virus-related virus(XMRV) in German prostate cancer patients. Retrovirology 6: 92.
- 9. Erlwein O, Kaye S, McClure MO, Weber J, Wills G, et al. (2010) Failure to detect the novel retrovirus XMRV in chronic fatigue syndrome. PLoS ONE 5: e8519.
- 10. Groom H, Boucherit V, Makinson K, Randal E, Baptista S, et al. (2010) Absence of xenotropic murine leukaemia virus-related virus in UK patients with chronic fatigue syndrome. Retrovirology 7: 10.
- 11. van Kuppeveld FJM, de Jong AS, Lanke KH, Verhaegh GW, Melchers WJG, et al. (2010) Prevalence of xenotropic murine leukaemia virus-related virus in patients with chronic fatigue syndrome in the Netherlands: retrospective analysis of samples from an established cohort. BMJ 340: c1018.
- 12. Brown PO (1997) Integration. In: Coffin JM, Hughes SH, Varmus HE, editors. Retroviruses. Cold Spring Harbor: Cold Spring Harbor Laboratory Press. pp. 161–203.
- 13. Mikkers H, Berns A (2003) Retroviral insertional mutagenesis: tagging cancer pathways. Adv Cancer Res 88: 53–99.
- 14. Kim S, Kim N, Dong B, Boren D, Lee SA, et al. (2008) Integration site preference of xenotropic murine leukemia virus-related virus, a new human retrovirus associated with prostate cancer. J Virol 82: 9964–9977.
- 15. Yoder KE, Bushman FD (2000) Repair of gaps in retroviral DNA integration intermediates. J Virol 74: 11191–11200.
- 16. Li L, Olvera JM, Yoder KE, Mitchell RS, Butler SL, et al. (2001) Role of the non-homologous DNA end joining pathway in the early steps of retroviral infection. Embo J 20: 3272–3281.
- 17. Muesing MA, Smith DH, Cabradilla CD, Benson CV, Lasky LA, et al. (1985) Nucleic acid structure and expression of the human AIDS/lymphadenopathy retrovirus. Nature 313: 450–458.
- 18. Vincent KA, York HD, Quiroga M, Brown PO (1990) Host sequences flanking the HIV provirus. Nucleic Acids Res 18: 6045–6047.
- 19. Vink C, Groenink M, Elgersma Y, Fouchier RA, Tersmette M, et al. (1990) Analysis of the junctions between human immunodeficiency virus type 1 proviral DNA and human DNA. J Virol 64: 5626–5627.
- 20. Shoemaker C, Goff S, Gilboa E, Paskind M, Mitra SW, et al. (1980) Structure of a cloned circular Moloney murine leukemia virus DNA molecule containing an inverted segment: implications for retrovirus integration. Proc Natl Acad Sci USA 77: 3932–3936.
- 21. Shoemaker C, Hoffman J, Goff SP, Baltimore D (1981) Intramolecular integration within Moloney murine leukemia virus. J Virol 40: 164–172.
- 22. Horowitz JM, Holland GD, King SR, Risser R (1987) Germ line integration of a murine leukemia provirus into a retroviruslike sequence. J Virol 61: 701–707.
- 23. Shimotohno K, Temin HM (1980) No apparent nucleotide sequence specificity in cellular DNA juxtaposed to retrovirus proviruses. Proc Natl Acad Sci USA 77: 7357–7361.
- 24. Shimotohno K, Mizutani S, Temin HM (1980) Sequence of retrovirus provirus resembles that of bacterial transposable elements. Nature 285: 550–554.
- 25. Moreau K, Torne-Celer C, Faure C, Verdier G, Ronfort C (2000) In vivo retroviral integration: fidelity to size of the host DNA duplication might be reduced when integration occurs near sequences homologous to LTR ends. Virology 278: 133–136.
- 26. Taganov K, Daniel R, Katz RA, Favorova O, Skalka AM (2001) Characterization of retrovirus-host DNA junctions in cells deficient in nonhomologous-end joining. J Virol 75: 9549–9552.
- 27. Oh J, Chang KW, Alvord WG, Hughes SH (2006) Alternate polypurine tracts affect Rous sarcoma virus integration in vivo. J Virol 80: 10281–10284.
- 28. Oh J, Chang KW, Hughes SH (2006) Mutations in the U5 sequences adjacent to the primer binding site do not affect tRNA cleavage by Rous sarcoma virus RNase H but do cause aberrant integrations in vivo. J Virol 80: 451–459.
- 29. Vatakis DN, Kim S, Kim N, Chow SA, Zack JA (2009) Human immunodeficiency virus integration efficiency and site selection in quiescent CD4+ T cells. J Virol 83: 6222–6533.
- 30. Aiyar A, Hindmarsh P, Skalka AM, Leis J (1996) Concerted integration of linear retroviral DNA by the avian sarcoma virus integrase in vitro: dependence on both long terminal repeat termini. J Virol 70: 3571–3580.
- 31. Goodarzi G, Pursley M, Felock P, Witmer M, Hazuda D, et al. (1999) Efficiency and fidelity of full-site integration reactions using recombinant simian immunodeficiency virus integrase. J Virol 73: 8104–8111.
- 32. Hindmarsh P, Ridky T, Reeves R, Andrake M, Skalka AM, et al. (1999) HMG protein family members stimulate human immunodeficiency virus type 1 and avian sarcoma virus concerted DNA integration in vitro. J Virol 73: 2994–3003.
- 33. Li M, Craigie R (2005) Processing of viral DNA ends channels the HIV-1 integration reaction to concerted integration. J Biol Chem 280: 29334–29339.
- 34. Holman AG, Coffin JM (2005) Symmetrical base preferences surrounding HIV-1, avian sarcoma/leukosis virus, and murine leukemia virus integration sites. Proc Natl Acad Sci USA 102: 6103–6107.
- 35. Lewinski MK, Bisgrove D, Shinn P, Chen H, Hoffmann C, et al. (2005) Genome-wide analysis of chromosomal features repressing human immunodeficiency virus transcription. J Virol 79: 6610–6619.
- 36. Berry C, Hannenhalli S, Leipzig J, Bushman FD (2006) Selection of target sites for mobile DNA integration in the human genome. PLoS Comput Biol 2: e157.
- 37. Kim S, Kim Y, Liang T, Sinsheimer JS, Chow SA (2006) A high-throughput method for cloning and sequencing HIV-1 integration sites. J Virol 80: 11313–11321.
- 38. Derse D, Crise B, Li Y, Princler G, Lum N, et al. (2007) Human T-cell leukemia virus type 1 integration target sites in the human genome: comparison with those of other retroviruses. J Virol 81: 6731–6741.
- 39. Lewinski MK, Yamashita M, Emerman M, Ciuffi A, Marshall H, et al. (2006) Retroviral DNA integration: viral and cellular determinants of target-site selection. PLoS Pathog 2: 611–622.
- 40. Murphy JE, Goff SP (1992) A mutation at one end of Moloney murine leukemia virus DNA blocks cleavage of both ends by the viral integrase in vivo. J Virol 66: 5092–5095.
- 41. Moreau K, Faure C, Violot S, Gouet P, Verdier G, et al. (2004) Mutational analyses of the core domain of avian leukemia and sarcoma viruses integrase: critical residues for concerted integration and multimerization. Virology 318: 566–581.
- 42. Li M, Mizuuchi M, Burke TR Jr, Craigie R (2006) Retroviral DNA integration: reaction pathway and critical intermediates. Embo J 25: 1295–1304.
- 43. Sinha S, Pursley MH, Grandgenett DP (2002) Efficient concerted integration by recombinant human immunodeficiency virus type 1 integrase without cellular or viral cofactors. J Virol 76: 3105–3113.
- 44. Craigie R (1992) Hotspots and warm spots: integration specificity of retroelements. Trends Genet 8: 187–190.
- 45. Withers-Ward ES, Kitamura Y, Barnes JP, Coffin JM (1994) Distribution of targets for avian retrovirus DNA integration in vivo. Genes Dev 8: 1473–1487.
- 46. Schroder AR, Shinn P, Chen H, Berry C, Ecker JR, et al. (2002) HIV-1 integration in the human genome favors active genes and local hotspots. Cell 110: 521–529.
- 47. Holmes-Son ML, Appa RS, Chow SA (2001) Molecular genetics and target site specificity of retroviral integration. Adv Genet 43: 33–69.
- 48. Bushman F, Lewinski M, Ciuffi A, Barr S, Leipzig J, et al. (2005) Genome-wide analysis of retroviral DNA integration. Nat Rev Microbiol 3: 848–858.
- 49. Pryciak PM, Varmus HE (1992) Nucleosomes, DNA-binding proteins, and DNA sequence modulate retroviral integration target site selection. Cell 69: 769–780.
- 50. Katzman M, Sudol M (1995) Mapping domains of retroviral integrase responsible for viral DNA specificity and target site selection by analysis of chimeras between human immunodeficiency virus type 1 and visna virus integrases. J Virol 69: 5687–5696.
- 51. Shibagaki Y, Chow SA (1997) Central core domain of retroviral integrase is responsible for target site selection. J Biol Chem 272: 8361–8369.
- 52. Hacker CV, Vink CA, Wardell TW, Lee S, Treasure P, et al. (2006) The integration profile of EIAV-based vectors. Mol Ther 14: 536–545.
- 53. Kang Y, Moressi CJ, Scheetz TE, Xie L, Tran DT, et al. (2006) Integration site choice of a feline immunodeficiency virus vector. J Virol 80: 8820–8823.
- 54. Moalic Y, Blanchard Y, Felix H, Jestin A (2006) Porcine endogenous retrovirus integration sites in the human genome: features in common with those of murine leukemia virus. J Virol 80: 10980–10988.
- 55. Nowrouzi A, Dittrich M, Klanke C, Heinkelein M, Rammling M, et al. (2006) Genome-wide mapping of foamy virus vector integrations into a human cell line. J Gen Virol 87: 1339–1347.
- 56. Muller HP, Varmus HE (1994) DNA bending creates favored sites for retroviral integration: an explanation for preferred insertion sites in nucleosomes. EMBO J 13: 4704–4714.
- 57. Taganov KD, Cuesta I, Daniel R, Cirillo LA, Katz RA, et al. (2004) Integrase-specific enhancement and suppression of retroviral DNA integration by compacted chromatin structure in vitro. J Virol 78: 5848–5855.
- 58. Bor Y-C, Bushman FD, Orgel L (1995) In vitro integration of human immunodeficiency virus type 1 cDNA into targets containing protein-induced bends. Proc Natl Acad Sci USA 92: 10334–10338.
- 59. Wang GP, Ciuffi A, Leipzig J, Berry CC, Bushman FD (2007) HIV integration site selection: Analysis by massively parallel pyrosequencing reveals association with epigenetic modifications. Genome Res 17: 1186–1194.
- 60. Rosenberg N, Jolicoeur P (1997) Retroviral pathogenesis. In: Coffin JM, Hughes SH, Varmus HE, editors. Retroviruses. Cold Spring Harbor: Cold Spring Harbor Laboratory Press. pp. 475–586.
- 61. Gabriel R, Eckenberg R, Paruzynski A, Bartholomae CC, Nowrouzi A, et al. (2009) Comprehensive genomic access to vector integration in clinical gene therapy. Nat Med 15: 1431–1436.