Gene amplification of specific loci has been described in all kingdoms of life. In the protozoan parasite Leishmania, the product of amplification is usually part of extrachromosomal circular or linear amplicons that are formed at the level of direct or inverted repeated sequences. A bioinformatics screen revealed that repeated sequences are widely distributed in the Leishmania genome and the repeats are chromosome-specific, conserved among species, and generally present in low copy number. Using sensitive PCR assays, we provide evidence that the Leishmania genome is continuously being rearranged at the level of these repeated sequences, which serve as a functional platform for constitutive and stochastic amplification (and deletion) of genomic segments in the population. This process is adaptive as the copy number of advantageous extrachromosomal circular or linear elements increases upon selective pressure and is reversible when selection is removed. We also provide mechanistic insights on the formation of circular and linear amplicons through RAD51 recombinase-dependent and -independent mechanisms, respectively. The whole genome of Leishmania is thus stochastically rearranged at the level of repeated sequences, and the selection of parasite subpopulations with changes in the copy number of specific loci is used as a strategy to respond to a changing environment.
Variations in the copy number of DNA segments account for a substantial amount of genome diversity of most organisms. DNA amplification, a contributor to copy number variation, can occur in response to various stresses or after altered growth conditions, leading to extensive and often reversible genetic variation. DNA amplification in the parasite Leishmania occurs outside the normal chromosomes and arises by DNA rearrangements involving homologous repeated sequences. We show here that such repeated sequences are widespread in the Leishmania genome and that most of the Leishmania genome is subject to stochastic gene rearrangements mediated by these low-copy repeat sequences. Thus, although cells in the population have a common core genome, many individual cells will differ from the rest of the population by carrying one or more distinct extrachromosomal amplicon. Upon selection with either drugs or culture conditions, a subpopulation can emerge where the amplicon copy number per cell increases, and this clone of cells can then expand to dominate the population. We propose that Leishmania uses adaptive gene amplification at a genome-wide scale as one strategy to adapt to a changing environment.
Citation: Ubeda J-M, Raymond F, Mukherjee A, Plourde M, Gingras H, Roy G, et al. (2014) Genome-Wide Stochastic Adaptive DNA Amplification at Direct and Inverted DNA Repeats in the Parasite Leishmania. PLoS Biol 12(5): e1001868. doi:10.1371/journal.pbio.1001868
Academic Editor: Philip J. Hastings, Baylor College of Medicine, United States of America
Received: December 17, 2013; Accepted: April 11, 2014; Published: May 20, 2014
Copyright: © 2014 Ubeda et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This work was funded in part by the CIHR group grant to MO, BP, and JC and by CIHR operating grants 13233 and 15501 to MO. JMU and AM were Training Fellows of the Strategic Training Program in Microbial Resistance, a partnership of the CIHR Institute of Infection and Immunity and the Fonds de Recherche en Santé du Québec. FR was the recipient of a CIHR studentship. JC holds the Canada Research Chair in Medical Genomics. MO is a Burroughs Wellcome Fund Scholar in Molecular Parasitology and holds the Canada Research Chair in Antimicrobial Resistance. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Abbreviations: CNV, copy number variation; DRs, direct repeated sequences; FBS, fetal bovine serum; HR, homologous recombination; IRs, inverted repeats; MTX, methotrexate; RAG, repeat alignment group; SIDERs, short interspersed degenerate retroposons
Copy number variations (CNVs) account for a substantial amount of genomic variability in mammalian genomes (reviewed in ). DNA amplification, a contributor of CNVs, has been reported in response to various stresses or after altered growth conditions, and can lead to extensive and often reversible genetic variations (reviewed in ,). Several models have been proposed to explain DNA amplification mechanisms –. Extrachromosomal circular DNAs can be the products of gene amplification in mammalian cells and in the protozoan parasite Leishmania (reviewed in ,). In Leishmania, DNA circles are generated by homologous recombination (HR) between direct repeated sequences (DRs) (Figure 1A) ,. DNA amplification can also lead to palindrome formation. Increasing evidence suggests that palindromes are initiated at the level of inverted repeats (IRs). Indeed IRs are known to increase chromosome instability during replication, leading to hairpin formation and representing a substantial source of DNA breakage and rearrangement (Figure 1B). IRs have been shown to initiate inverted duplications in yeast cells –, in protozoa ,,, and in mammalian cells .
(A) HR between DRs can lead to circular amplification (i) or to tandem duplication of the locus by nonequal crossing at sister chromatids (ii). The amplification can be nonconservative, leading to the deletion of the region amplified, or conservative (e.g., within replication forks), without genomic deletion. C1 and C2 are primers used to detect novel junctions formed by recombination between DRs. Arrows Δ1 and Δ2 are primers used to demonstrate a locus deletion between the DRs. (B) Role of IRs in the formation of linear amplicons. One model (i) suggests that IRs cause hairpin formation during replication and trigger the replication fork to stall and dissociate ,. The annealing of the repeats is used to prime DNA synthesis, leading to locus duplication up to the telomeric end . Other mechanisms described in yeast and/or mammalian cells can also explain duplication events: (b) a single-strand break (SSB) during replication close to IRs or (ii) a double-strand break (DSB) initiating a linear duplication. The black bars represent the DNA segments that are amplified. L1 and L2 are primers used to detect novel junctions formed by rearrangements at IRs. The arrows in boxes indicate repeated sequences.
Leishmania is an early diverging eukaryote whose genes are expressed constitutively as part of long polycistronic units where the RNAs are matured by coupled transsplicing and polyadenylation (reviewed in ) and by epigenetic marks –. Gene regulation occurs mostly at the posttranscriptional and (post)translational levels  with no control at the level of transcription initiation, in part due to the lack of several general transcription factors . Leishmania display, however, additional strategies to modulate the expression of specific genes when selective pressure is applied. For example, Leishmania cells selected for resistance to cytotoxic compounds often amplify or delete a number of specific loci coding for either drug targets or drug transporters (reviewed in ). These amplified DNAs are generally extrachromosomal and found either as circular elements – or as linear minichromosomes –. These elements do not appear to have a specific origin of replication and are usually lost when drug pressure is removed ,,, although some extrachromosomal elements are maintained in absence of drug pressure ,,,.
The dihydrofolate reductase-thymidylate synthase gene DHFR-TS is amplified as part of extrachromosomal circles in Leishmania cells resistant to the antifolate methotrexate (MTX) ,,. In the few cases studied, these circles were formed by HR between DRs ranging in size, depending on the species, from 575 to 837 bp (Figure 1A) . Similarly, the gene coding for the ABC protein MRPA can be amplified in Leishmania cells resistant to antimonials (SbIII), the first-line antileishmanial drug, by HR between DRs ranging in size from 198 to 1,389 bp (Figure 1A) ,,. In contrast, minichromosomes containing large inverted duplications ,,, are generated by the annealing of IRs  followed by duplications extending to telomeric ends (Figure 1B).
Repeated sequences used for DNA amplification are generally noncoding and are interestingly highly conserved between different Leishmania species . These intergenic sequences may have been maintained to facilitate the amplification of key genomic loci essential to respond to changing growth conditions. Alternatively, such low-copy repeated sequences may be more abundant than previously thought and could be used by Leishmania as a platform to amplify several segments of its genome. We present here both bioinformatics and functional analyses revealing that homologous repeated sequences are widespread in the Leishmania genome and that most of the Leishmania genome is stochastically subjected to gene rearrangements at the level of these low-copy repeated sequences. Subpopulations of parasites with amplified DNA segments can be selected and we propose that Leishmania uses adaptive gene amplification at a genome-wide scale as one strategy to adapt to its changing environment.
Genome-Wide Distribution of Repeated Sequences in Leishmania
The noncoding DRs used for DHFR-TS amplification are highly conserved (86% identical) between Leishmania major and Leishmania infantum . This interspecies conservation led us to hypothesize that either specific loci are subjected to considerable CNVs or alternatively that the Leishmania genome has more repeated sequences than initially anticipated. Indeed, the published analyses of the genome sequences of several Leishmania species did not reveal the presence of extensive repeated sequences ,. The genome sequence of L. major Friedlin was screened for noncoding repeated sequences. To this end, intergenic regions were aligned to their respective entire chromosome sequences using blastn. Repeated sequences were filtered for lengths between 0.2 and 2.5 kbp with a minimum identity of 85%, as a high level of homology is required for recombination in Leishmania . There is no evidence for non-HR in Leishmania and HR is likely to require up to 95% of homologous sequences . According to these criteria, we identified 1,926 repeats in the L. major genome (Figure 2A) for which we assigned a unique identifier (Tables S1 and S2). These repeats represent 5% of the Leishmania genome and are scattered throughout all the chromosomes and are either in a direct or in an inverted orientation, as illustrated for chromosomes 6 and 23 in Figure 2B.
(A) The number of pairs of repeat sequences (RSs) is indicated for each of the 36 L. major chromosomes. The repeats are part of distinct RAGs. The repeats belonging to the SIDER retroposon family are the most abundant and are represented by yellow (SIDER1) or blue bars (SIDER2). Other repeated sequences are represented by orange bars. (B) Repeats on chromosomes 6 and 23 of L. major. Arrows indicate the orientation and the approximate location of the repeats. Repeats of the same RAG for each chromosome are displayed with the same color.
We clustered the repeated sequences into Repeat Alignment Groups (RAGs), each RAG being composed of all the members of a same repeat family (with 85% minimum identity). Using these criteria, we assembled 507 RAGs in the L. major genome by BLAST. Most RAGs, 490 out of 507, have fewer than 10 members, with 242 RAGs consisting of only two unique repeated sequences (Tables S1 and S2). RAGs are usually confined to one specific chromosome, with two exceptions, RAG17 with its 33 repeats distributed on 13 chromosomes and RAG418 with its 29 repeats present on two chromosomes (Tables S1 and S2).
Within several of the 507 RAGs, we detected the presence of sequences part of short interspersed degenerate retroposons (SIDERs), which are distributed in the Leishmania genome , and proven to regulate gene expression at both the posttranscriptional (SIDER2) or translational (SIDER1) level ,. SIDERs are degenerate and thus do not fall into a single RAG but are included in several of the RAGs (Table S1). We found 359 SIDER1 repeats and 948 SIDER2 repeats, accounting for 67.9% of the 1,926 repeated sequences of L. major. Because several RAGs have more than one pair of repeats per chromosome (e.g., see Figure 2B and Table S1), we estimated that the 1,926 repeats in the 507 RAGs can lead to 4,601 potential amplicons in L. major (Table S2). The distances between DRs (Figure S1A) or IRs (Figure S1B) are on average between 1 and 100 kb. IRs are found in general closer to telomeres, whereas DRs appear more evenly distributed along the chromosomes (Figure S1C and S1D; see also Figures 2B and 3A).
(A) Chromosome 23 with its three large polycistronic units is shown. Each vertical bar represents a gene. Intergenic regions of chromosome 23 were scanned for repeated sequences, and 15 RAGs (194–208) theoretically leading to 95 amplicons were detected (Table S1). RAG 203 (and its 29 putative amplicons) is not represented because it corresponds to complex duplicated regions difficult to investigate by PCR. We tested for the presence of 48 amplicons by PCR. PCR fragments of the expected size provided evidence for 25 amplicons formed at the level of DRs (upper map) and 15 amplicons formed at the level of IRs (lower map). PCR products of the right size detected (filled circle and filled square) or not detected (open circle and open square) in L. major LV39 (circles) and Friedlin (squares), respectively. (B) Selected examples of the detection of PCR products compatible with DNA rearrangements mediated at the level of either DRs or IRs in L. major strains LV39 or Friedlin. (C) Detection of PCR products diagnostic of deletion of three loci at the level of DRs using primers equivalent to Δ1 and Δ2 of Figure 1A.
Similar bioinformatics analyses were performed on the L. infantum and L. braziliensis genomes. Blastn screens revealed a total of 1,886 repeats in L. infantum and 2,058 in L. braziliensis that were assembled, respectively, into 513 and 619 RAGs, and could potentially lead to 3,165 and 3,183 amplicons (Tables S3 and S4). L. major and L. infantum share over 60% of their RAGs with a strong conservation in synteny. L. braziliensis, a representative of the Viannia subgenus, is distant from the two other species, and with the search criteria at 85% identity, only 10% and 16% of RAGs were conserved within L. major and L. infantum, respectively.
Stochastic Genome-Wide Gene Amplification in Leishmania
We have shown previously that PCR allows the detection of novel junctions formed after genomic rearrangement triggering circular (C1 and C2 primers of Figure 1A) or linear (primers L2 and L1 in Figure 1B) amplifications in drug-resistant Leishmania cells ,. We also observed the amplification of a GSH1 locus in a wild-type background by using a PCR assay with higher sensitivity . We thus tested whether other rearrangements could be detected in wild-type cells using similar sensitive PCR assays. Chromosome 23 contains a minimum of 15 RAGs (RAG194–RAG208), with a total of 54 repeats that could theoretically lead to the formation of 95 amplicons. We tested the rearrangement of 48 of these amplicons and detected a PCR product of the expected size for 40 of them. Twenty-five amplicons were produced by HR between DRs and 15 after annealing of IRs (Figure 3A). For example, RAG195 includes four repeats in direct and inverted orientations (in orange in Figure 2B). Out of the six potential amplicons, five were detected in at least one of the two strains, the amplicon 195d was generated by HR between DRs, whereas the amplicons 195a,b,c,e are consistent with a rearrangement at IRs (Figure 3A,B). Selected PCR products were sequenced, and these rearrangements were confirmed.
We were able to consistently detect 60% to 80% of the predicted amplicons in any given population of late-log phase cultures for two strains of L. major, LV39 and Friedlin (see for examples Figure 3A and 3B). A similar frequency of rearrangements was observed in L. infantum JPCM5 and in our L. infantum lab strain 263. Although most work was conducted with promastigotes, the amplicons could also be detected in intracellular Leishmania inside macrophages (Figure S2). These genomic rearrangements in unselected strains are both global and stochastic, as the sets of amplicons are not always identical (Figure 3A,B). Through dilution experiments and using the L. major MTX60.4 line with amplification of the DHFR-TS locus, we estimated the frequency of loci rearrangements to be approximately 10−6 to 10−7/cell (see Materials and Methods and Figure S3).
Gene amplification in Leishmania can be either conservative, where the genomic region amplified remains intact, or nonconservative, where the chromosomal locus amplified is deleted (Figure 1A). Conservative amplification is observed more often in highly drug-resistant mutants, most likely because deletion of large genomic regions is associated with a cost, but nonconservative amplification has been observed previously with arsenite-  and MTX-resistant mutants . We thus tested whether locus deletion was detectable in populations using appropriate primers (the equivalent of Δ1 and Δ2 in Figure 1A). We tested for the deletion of the chromosomal segments between the DRs of the RAGs 196, 208b, and 204b, and obtained PCR products of the appropriate size and sequence consistent with a locus deletion (Figure 3C).
Adaptive Gene Amplification in Leishmania
We hypothesized that cells carrying advantageous preexisting amplifications could be positively selected to eventually represent a larger proportion of the population. Additionally, amplicons should be lost when selection is removed, confirming that adaptive gene amplification is reversible. We therefore exposed L. major LV39 and L. infantum 263 to MTX pressure, a drug known to select either for DHFR-TS or PTR1 amplification in Leishmania ,,. Using a semiquantitative PCR approach with appropriate primers (Figure 4A), we detected the DHFR-TS circular amplicon in the untreated wild type L. major LV39 population (Figure 4A, lane 1) at an estimated frequency of 10−6 to 10−7/cells (Figure S3). After only two passages (~18 generations) with 0.2 µM MTX (the EC50 value), we estimated that approximately 1% of the population carried the DHFR-TS amplicon (Figure 4A, lane 2), and after six passages with 0.2 µM MTX, this proportion increased to 10% of the population (Figure 4C). Inversely, when the drug pressure was removed, the estimated proportion of the cells containing the DHFR-TS amplicon decreased rapidly (Figure 4A,C). In the unstressed wild-type L. infantum 263 population, the DHFR-TS circular amplicon could not be detected; hence, no positive selection could be observed after MTX pressure. Instead, we detected the rearrangement resulting in a PTR1 linear amplicon in the unstressed L. infantum 263 population (Figure 4B) and monitored its positive selection under 0.2 µM MTX pressure. The proportion of the PTR1 linear amplicon-containing cells increased to reach 1% of the population after four passages with the drug and again decreased when the selection was removed, albeit less rapidly than for DHFR-TS circular amplicons (Figure 4B,C).
(A) Schematic representation of the DHFR-TS locus and the circular amplicon generated by HR between DRs (black arrows) on chromosome 6 in L. major. Genomic DNA was extracted from WT and MTX stressed promastigotes. Amplicon detection in the unstressed WT population (right panel, lane 1), after two passages with MTX (at EC50, 0.2 µM) (right panel, lane 2), and after removing MTX pressure (lanes 3–7). Ca and Cb are the primers used for detecting the rearrangements. (B) Schematic representation of the PTR1 and MRPA locus and the linear amplicon with inverted duplications generated after annealing of IRs (black arrows) on chromosome 23 in L. infantum (left panel). Detection of the PTR1 amplicons by PCR using primers La and Lb in the unstressed WT 263 population (right panel, lane 1), after four passages with MTX (at EC50, 0.2 µM) (lanes 2–4), and after removing MTX pressure (lanes 5–8). Lane 9 is a no DNA template control. (C) Kinetics of the selection and loss of the DHFR circular amplicon containing cells in LV39 (filled circle) and PTR1 linear amplicon containing cells in L. infantum (filled square) under MTX pressure (continuous line) and after drug removal (dashed lines). Average of three biological independent experiments is shown. (D) Adaptive MRPA gene amplification upon SbIII selection. Semiquantitative PCR was performed with primers La and Lb. Genomic DNA was extracted from L. infantum WT (lane 2) cells stressed with SbIII at 160 µM (4×EC50) (lanes 3–5) and from cultures after drug removal (lanes 6–8). Lane 1 is a no DNA template control. Densitometric ratios of PCR band intensities are indicated at the bottom. One representative experiment out of four is shown. Amplification of the chromosomal GAPDH gene was used as a reference to normalize the amount of template DNA loaded.
We also analyzed the adaptive gene amplification in L. infantum 263 population selected with SbIII, for which amplification of the MRPA gene has been previously reported . The MRPA gene is located near PTR1 and can be amplified as part of a linear amplicon mediated by rearrangements at the level of the same IRs as for PTR1 (Figure 4B). We indeed observed a gradual increase in the proportion of the MRPA linear amplicon-containing cells in the population following five passages with 160 µM of SbIII (Figure 4D, lanes 3–5). When the SbIII stress was removed (Figure 4D, lanes 6–8), we observed a decrease of the PCR product indicative of the loss of the MRPA linear amplicon in the population.
The increased intensities observed by PCR upon selection possibly reflect a combination of both the successive divisions of the amplicon-containing cells but also of an increased copy number of these amplicons in a given number of cells. To test this, we selected L. major for MTX resistance both by keeping cells at 0.5×EC50 for up to eight passages but also by incremental increase of the drug (Figure S4A). Amplification of the DHFR-TS locus was tested by both Southern blot analyses (Figure 5A) and by PCR (Figure 5B). An EcoRV-BglII digest was hybridized to a probe that can discriminate the 10 kb chromosomal locus and the 4 kb rearranged circle's band (Figure S4B). In cells grown without MTX (P0) and cells grown in the presence of 0.5×EC50 MTX for up to four passages (P4), we observed the 10 kb chromosomal band but not the 4 kb band diagnostic of the circle (Figure S4B, Figure 5A). However, at P5 (faintly) and at passages 6, 7, and 8, we see a gradual increase hybridization intensity of the 4 kb band (Figure 5A). A paralleled and expected increase was also observed when the blot was hybridized with a DHFR-TS probe, whereas the signal with the control GAPDH probe remained constant (Figure 5A). Selection with higher drug concentration enriched more rapidly for more circles [see P4 at 1×EC50 (P41), Figure 5A], although at one point a plateau is reached in terms of amplified circles, as an average of 20 copies of DHFR-TS was observed in comparison to control probes at either 2× or 8×EC50 (Figure S4C). The next step was to test whether these amplicons are evenly distributed in each cell within the population. At P6, the 4 kb rearranged band is of similar intensity than the chromosomal band (Figure 5A), whereas at P8 and P41 the hybridization ratio between the rearranged bands and the chromosomal bands (or between DHFR-TS and GAPDH) are 4 and 8, respectively (Figure 5A). This suggests that in P6, P8, and P41, there is an average of one, four, and eight circles per cell, although this could also be unevenly distributed. To test this, we complemented the Southern blots and the semiquantitative PCR with a colony-based PCR assay. We plated cultures of P0, P3, P6, P8, and P41, and 10 colonies for each population culture were immediately processed and tested directly by PCR for the rearranged fragment diagnostic of the DHFR-TS locus amplification. Using this qualitative colony-based PCR approach, we could not detect PCR products in any of the clones derived from P0 and P3. A PCR fragment diagnostic of the DHFR-TS amplicon was observed in 1 out of 10 clones derived from P6 (Figure 5C), suggesting that not all cells within the population had the amplicon, but several cells had many copies of the amplicon to provide the hybridization signals observed in Figure 5A. Similarly, when P8 with its average of four circles per cell (Figure 5A) was cloned, only four out of 10 colonies had the amplicon; thus, several cells must have more than four copies to lead to the hybridization signals shown in Figure 5A. The average number of circles per cell in P41 was determined to be eight (Figure 5A), and all 10 clones derived from P41 had the amplicon, suggesting that the copy number of the amplicon must be in average eight to explain the hybridization results of Figure 5A. However, there could be variation in the copy number in individual cells of the population (as our assay in Figure 5C cannot discriminate for this). When the selection is pursued, the copy number of the amplified locus further increases to an average of 20 (Figure S4C). The results of Figure 5 support the concept that few cells within the population have initially the rearranged structure (P0 to P5 in Figure 5A and 5B). Selective pressure will lead to an increase in the proportion of the rearranged structure within the population (Figure 5). The experiment of Figure 5C is showing that an increased number of cells within the population is emerging with the amplicon, and experiments of Figure 5A and Figure S4C are showing that upon selection there is also an increase in the copy number of amplicons within the cells.
L. major was either cultured in the presence of 0.5× MTX EC50 for eight continuous passages (P1–P8) or cultured for one passage at 0.5×EC50 and then shifted for four passages at 1× MTX EC50 (P41) (see Figure S4A for the selection scheme). (A) DNAs extracted from the population of parasites grown at 0.5×EC50 MTX for up to eight passages (P1–P8) and for four passages at 1×EC50 MTX (P41) were analyzed by Southern blot for the presence of DHFR-TS circles. The blot was hybridized to a probe corresponding to the first 1,000 bp of LinJ.06.0830 that discriminate the chromosomal loci at 10 kb from its amplified region at 4 kb (see Figure S4B). The blot was also hybridized with a probe covering the coding sequence of DHFR-TS (DHFR) and to a GAPDH probe to monitor the DNA loaded in each lane. Fold increases were normalized with the GAPDH signal. (B) DNAs extracted from the population of parasites at every 0.5×EC50 passages (P1–P8) were further analyzed by semiquantitative PCR for the presence of the new junction derived from HR between the direct repeats flanking the DHFR-TS locus (see Figure 4A). The number of PCR cycles for the DHFR-TS amplicon and the GAPDH control is indicated. For passages P6–P8, the PCR signal was saturated at cycle 35 and is thus not shown. (C) Parasites from five selected passages (boxed in A and B) were cloned on plates. For each condition, total DNA was extracted from 10 randomly selected colonies and tested for the presence of DHFR-TS amplification by PCR. The number of PCR cycles for the DHFR-TS amplicon and the GAPDH control is indicated on the left of each panel.
Gene amplification unrelated to drug selection has also been reported in Leishmania . Interestingly, pulse-field gels revealed the presence of an amplified linear DNA in our L. infantum 263 strain grown in vitro for several years (Figure 6A, lane 1). We discovered that the selection of this linear amplicon is linked to the supplementation of FBS to the medium. Indeed, the minichromosome is positively selected when FBS is added to the medium (Figure 6A, lanes 1 and 3), whereas removing the FBS leads to its gradual loss (Figure 6A, lanes 2 and 4). Selective loss or gain of these amplicons was most clearly observed after 15 passages. Next generation sequencing (NGS) of the purified minichromosome using Illumina paired-end reads revealed that it is derived from chromosome 6 (Figure 6D), which was confirmed by Southern blot hybridization (Figure 6B). The amplicon extended from a region between LinJ06.1150 and LinJ06.1160 to one telomeric end of the chromosome (Figure 6D). Interestingly, a sequence that was repeated several times on chromosome 6 either in a direct or inverted fashion was found at one breakpoint between LinJ06.1150 and LinJ06.1160 (Figure 6D). Sequence coverage was consistent with a rearrangement between LinJ06.1150 and LinJ06.1160 and between LinJ06.1240 and LinJ06.1250, although no obvious repeated sequences were apparent in the latter region (Figure 6D). A PCR assay (Figure 6C, lane 1) confirmed that this rearrangement occurred at the position suggested by NGS analysis (see the primers used in Figure 6D). The PCR fragment was only observed in cells with the linear amplicon (Figure 6C). The absence of IRs at the rearrangement that would have explained the formation of this linear amplicon may be due to secondary rearrangements due to the long culture history of the strain. This is consistent with the primers failing to detect amplification in cells without the amplicon (Figure 6C, lanes 2 and 4), as there are no continuous rearrangements that can take place. There must nonetheless be a low number of cells remaining with this amplicon, and this may explain the slow re-emergence (15 passages) of cells with amplicons upon growth with FBS. In future studies, it would be of interest to link the gene within the amplicon and the substrate within FBS driving the selection.
(A) The small chromosomes of L. infantum were separated by pulse-field gel electrophoresis. Total DNA was isolated from cells cultured with FBS (lane 1) and from cells cultured in serum-free medium for 15 passages (lane 2), FBS was then added for 15 passages (lane 3), and removed again for 15 passages (lane 4). Lane M, marker of S. cerevisiae chromosomes. (B) Southern blot hybridized to a probe derived from LinJ06.1210. (C) PCR assay mapping the rearrangement leading to the linear amplicon shown in (D). The primers used are indicated by small arrows located between LinJ06.1150 and LinJ06.1160 and LinJ06.1240 and LinJ06.1250. A line indicates nonadjacent lanes that have been brought together for producing the figure. (D) Sequence coverage depth as determined by NGS of the linear amplicon. Below are the genomic organization of the subtelomeric region of chromosome 6 in L. infantum and the structure of the linear amplicon with its inverted duplication. Large arrows indicate the presence of repeated sequences where one of the rearrangements leading to extrachromosomal linear amplification occurred, whereas thin arrows indicate the location of PCR primers.
Formation of Extrachromosomal Circles But Not of Linear Amplicons Is Facilitated by the RAD51 Recombinase
We hypothesized that RAD51, the main recombinase in eukaryotes, would have an important role in the formation of circular amplicons, where a crossover is necessary between two homologous DRs (Figure 1A). The gene LinJ.28.0580 encodes the closest homolog of RAD51 in Leishmania, sharing 70% homology with the yeast RAD51 protein. To test its role in HR and extrachromosomal circular amplification, we generated a L. infantum RAD51 null mutant. A PvuII digest of L. infantum genomic DNA hybridized to a RAD51 5′flank probe should lead to a 1.2 kb fragment in WT (Figure 7A, lane 1). Integration of NEO and HYG expression cassettes should lead to 0.9 and 3.2 kb fragments, respectively, and the integrations were verified by Southern blots (Figure 7A). Moreover, no copy of RAD51 was detected in the null mutant by PCR (Figure S5A,B). As expected, the RAD51 null mutant was sensitive to the double-strand break inducing agent methyl methanesulfonate (Figure S5C), and the phenotype could be reversed by transfecting the null mutant with an episomal vector harboring the RAD51 gene (Figure S5C). We investigated whether the Leishmania RAD51 null mutant exhibited a decreased efficiency of HR by double crossover using a BLA gene replacement cassette tested previously to inactivate the GSH1 gene . Transfection of this cassette showed that the RAD51 null mutant had a decreased ability to yield BLA-resistant colonies in comparison to wild-type cells or to the add-back revertant (Figure S5D).
(A) Diagram of the RAD51 locus in L. infantum WT with the NEO and HYG disruption cassettes (left panel). Southern blot analysis (right panel) of WT (lane 1); RAD51/RAD51::NEO (lane 2) and RAD51::NEO/RAD51::HYG (lane 3) genomic DNAs digested with PvuII (P) and hybridized with the 5′ UTR RAD51 probe. (B and C) Circular amplicon selection in RAD51−/− parasites. L. infantum WT pSPαZEOα (1 and 4), RAD51−/− pSPαZEOα (2 and 5), and RAD51−/− pSPαZEOαRAD51 (3 and 6) were either cultured in the absence of drug (−) or in the presence of 0.2 µM MTX for five passages. Total DNA was extracted from cells, and semiquantitative PCRs were performed to detect DHFR-TS (B) or PTR1 (C) circular amplicons. Amplification of the chromosomal GAPDH gene was used as reference to normalize the amount of template DNA. The data shown are averages of three independent experiments (*p≤0.05; **p≤0.005; ***p≤0.0005, two-tailed Student's t test).
The stochastic and adaptive gene amplification abilities of the RAD51 null mutant were analyzed. The DHFR-TS circular amplicon was detected in L. infantum wild-type cells, but its level was significantly reduced in the unselected RAD51 null mutant (Figure 7B, lanes 1 and 2), whereas wild-type level was observed in the add-back revertant (Figure 7B, lane 3). Following MTX pressure, an increased level of DHFR-TS circular amplicons was observed in the wild-type cells but not in the RAD51−/− mutant cells (Figure 7B, lanes 4 and 5). Similar observations of reduced circle formation in the RAD51−/− mutant were obtained for a PTR1 circular amplicon (Figure 7C, lanes 2 and 5). The phenotype was reverted in the RAD51 add-back rescued cells (Figure 7B,C, lanes 3 and 6).
Further evidence for the role of RAD51 in circular amplification was obtained by selecting wild-type and RAD51 null mutant cells for arsenite resistance, a metal related to SbIII and a potent selector of circular MRPA gene amplification ,. Seven to nine independent clones of wild-type L. infantum, RAD51 null mutant, and RAD51 add-back revertant lines were obtained after in vitro step-by-step selection for high arsenite resistance. We analyzed the clones for MRPA gene amplification with a MRPA probe hybridized to pulse-field gels. Among the nine resistant clones derived from the wild-type background, eight had MRPA circles (Figure 8B, lanes 3–10) and one exhibited a MRPA containing linear amplicon (Figure 8B, lane 2) as deduced by their characteristic hybridization patterns. The extent of gene amplification was estimated in selected mutants by standard Southern blots (Figure S6). Seven resistant clones were obtained with the RAD51 null mutant, of which only two clones harbored MRPA circular amplicons (Figure 8C, lanes 2 and 7). Four other clones presented MRPA linear amplifications (Figure 8C, lanes 3–6) and the remaining clone did not exhibit any extrachromosomal MRPA amplification, although hybridization at the chromosomal locus was more intense (Figure 8C, lane 8). A statistical analysis demonstrated that significantly fewer circles were observed (p<0.05) in the RAD51 null mutant compared to wild-type cells. This is RAD51-specific, as MRPA coding circles were observed in the RAD51 add-back rescue cells selected for arsenite resistance (Figure 8D).
(A) The MRPA locus and the repeats allowing its amplifications are shown; DRs are A–A′, A–A″, B–B′, and C–C′ and IRs are D–D′. Primers 1–9 were used to determine the rearrangements that occurred in L. infantum WT cells (B), in RAD51−/− (C), and in RAD51 add-back revertant parasites (D) all selected for arsenite resistance (up to 250 µM = 5×EC50). The DNA of resistant clones was separated by pulse-field gel electrophoresis and hybridized to a MRPA probe. The rearrangement point for each amplicon was determined by PCR (Figure S7) and is indicated below for each lane. Lane 1, unselected population; lanes 2–10, independent arsenite-resistant clones.
Two different pairs of DRs can promote circular amplification of the MRPA gene (B–B′ and C–C′ in Figure 8A). Moreover, the ABCC1, ABCC2, and MRPA genes encode three members of the ABCC family of transporter proteins. They share a nucleotide binding site with high homology and these conserved sequences, although coding can be the substrate for HR and can thus support circular amplification (A, A′, and A″ in Figure 8A). PCR analyses of the arsenite-resistant clones containing MRPA circular amplicons revealed different MRPA circles depending on the pair of DRs used for rearrangement, and MRPA circular amplicons were formed and selected at the level of all of these repeats, although only one type of circle is usually found in one clone (Figures 8B and S7). The MRPA linear amplicons were all generated after annealing of the IRs D–D′ despite their size differences (Figure 8B, lane 2; Figure 8C, lanes 3–6; and Figure S7). This difference could be explained by secondary internal rearrangements and/or differences in telomeric sequences. In the RAD51 add-back revertant clones resistant to arsenite, we observed a higher level of amplicons as deduced from the hybridization signals and a higher diversity of MRPA circles, as four clones exhibited at least two circular amplicons with different rearrangement points (Figure 8D, lanes 3–6 and Figure S7). Intriguingly, one RAD51 null mutant clone (Figure 8C, lane 7 and Figure S7B) had evidence for at least three amplicons with different rearrangements. The RAD51 null mutant resistant to arsenite with a stronger hybridization at the chromosomal locus had a positive PCR reaction between repeats A–A′ (Figure 8C, lane 8), suggesting a tandem intrachromosomal duplication (see Figure 1A).
Bioinformatics analyses indicated that repeated sequences are spread throughout the Leishmania genome (Figure 2, Tables S1–S4). Our screen was focused on noncoding sequences of at least 200 bp. We used this cutoff as HR in Leishmania was found to be more efficient with regions of homology of at least 200 bp . However, shorter repeats or repeated coding sequences (see Figure 8) could also be used for genomic rearrangements. RAGs are usually in low copy, but many contain the extinct SIDER retroposons. These SIDER retroposons are too degenerated to be included into one RAG, but they appear to have a dual role: one structural, allowing DNA rearrangements, and a functional role, as SIDER-1 and -2 were found to modulate gene expression at the level of translation and mRNA stability, respectively ,.
At the genome scale, these repeated sequences provide a platform for gene rearrangements. This was illustrated for chromosome 23, where we obtained experimental evidence that the entire length of the chromosome is subjected to rearrangements in unselected wild-type cells (Figure 3). These genome-wide rearrangements led to either circular or linear extrachromosomal elements, depending on whether rearrangements occurred at the level of DRs or IRs, respectively. Because the extrachromosomal nature of the amplified DNA cannot be tested in wild-type cells, we subjected cells to growth conditions that allowed the detection of amplification by nonsensitive techniques (such as in Figures 5A, 6, or 8). These were invariably extrachromosomal (reviewed in ). One exception is shown in Figure 8C (lane 7), suggesting a tandem intrachromosomal duplication event. Thus, we cannot exclude that some of the PCR products observed in unselected lines could be due to intrachromosomal tandem duplication (Figure 1A), but we favor the model of extrachromosomal amplification because experimental evidence for the latter abound in Leishmania. Our data suggest that extrachromosomal elements are initially present in a limited number of cells. Upon division and selection, the copy number of these elements increases, possibly by unequal segregation during cell division, as suggested for double-minute chromosomes (reviewed ). Cells with higher copy of the extrachromosomal elements have a selective advantage and can expand to become a majority of the population during further selection (Figure 5C).
DNA replication in Leishmania is not well understood but because plasmids with no Leishmania DNA sequences can replicate , it does suggest permissiveness. The fact that amplified DNAs are usually lost when selection is removed (Figures 4, 6, and 7) implies that they do not encode the sequences necessary to be maintained stably. The apparent permissiveness of Leishmania in DNA replication may explain why amplification can be observed so frequently. The phenomenon of stochastic gene amplification may also happen in other organisms, but this may not have been observed because these extrachromosomal DNAs are in very few cells and cannot increase further because of lack of replication. Trypanosoma brucei, a parasite related to Leishmania, has a much stricter replication system where episomal vector maintenance is the exception , and no gene amplification has been described in this parasite, except for changes in ploidy . We searched for the presence of repeated sequences in T. brucei and found 773 DRs or IRs that could potentially lead to 1,848 amplifications. We tested the rearrangements at the level of selected repeats and obtained PCR products (Figure S8) whose sequencing confirmed the predicted rearrangements. Thus, T. brucei has repeated sequences where gene rearrangements can occur but further amplification is not observed, possibly because these extrachromosomal DNAs cannot replicate. Additional organisms need to be tested for the presence of repeated sequences and for DNA rearrangements by sensitive PCR. Although this may not be as adaptive and flexible as in Leishmania, it may provide transient selective advantage to a population. It is salient to point out that long repeated sequences were found in large plasmids of Rhizobium and rearrangements mediated by HR at the level of the repeats were evidenced by a PCR-based method similar to the one presented here .
Leishmania is known to have extensive plasticity in its genome. Previous studies revealed that Leishmania chromosomes can be aneuploid, being either monosomic, disomic, or polysomic ,,,. Evidence for mosaic aneuploidy was also provided in single cells by FISH analysis . In a population, the average copies of a chromosome may be two, but individual cells may have one, two, or three copies of one specific chromosome . Here, we provide an additional layer of complexity and show an extreme example of genome plasticity. Cells in the population have a common core genome, but single cells differ from the rest of the population by carrying one or several circular and/or linear extrachromosomal amplicons. Individual cells differ in their amplicon complement, and upon selection with either drugs (Figures 4, 7, and 8) or culture conditions (Figure 6), a subpopulation would emerge where the amplicon copy number per cell would increase and then expand to become a majority of the population (Figure 5). Extrachromosomal elements have been found in field isolates ,,, suggesting that this is not only found in cells cultured in vitro.
The phenomenon of DNA amplification is reversible, as removal of the selective pressure leads to the population reverting to equilibrium (Figures 4 and 6). Whole genome rearrangements is thus a highly dynamic process that continuously takes place in Leishmania. We propose that this plasticity is used by Leishmania as one strategy to respond to its changing environment. Indeed, within the whole population, a subpopulation having a specific amplicon may be sufficient to provide the necessary advantage to allow the population to thrive. Subpopulations with amplified DNAs thrive during selective conditions and can help (possibly by secreting bioactive molecules or detoxifying the milieu) remaining cells of the population that lack the specific locus amplified. Our assay of locus deletion revealed that this type of rearrangement seems to be as frequent as those leading to extrachromosomal elements (Figure 3). The cost of deleting several genes between repeats may be high, but it is possible that within a population, a subpopulation with genomic deletion (e.g., a drug transporter) may help the population to grow in a selective environment, and upon removal of the selection, these specific cells would be lost from the population.
Leishmania is an early divergent eukaryote that does not regulate gene expression at the level of transcription initiation. Undoubtedly, there are several layers of regulation including posttranscriptional and translational mechanisms . Along with chromosome aneuploidy ,,, stochastic gene rearrangement is a strategy that Leishmania evolved to respond to its environment. At the population level, we have seen that most regions bordered by repeats are rearranged and subpopulations of these cells can be selected. The mechanisms leading to circular and linear amplicons are likely to differ (Figure 1). Indeed, RAD51 is important for the formation of circular amplicons but not linear ones (Figure 8). RAD51 is not essential, however, as two out of seven RAD51 mutants selected for arsenite resistance had circular amplicons (Figure 8C, lanes 2 and 7), suggesting that other recombinases are involved. As the Leishmania genome displays high plasticity, the targeting of enzymes involved in gene rearrangements may prevent the parasite from adapting to drugs and hinder the emergence of the most prevalent resistance mechanisms. This is in line with the concept of targeting components of the mutagenesis pathways leading to adaptive mutations in response to stressful environments (reviewed in ). The use of such inhibitors (also known as anti-evolvability drugs) would represent novel therapeutic strategies for preventing the evolution of antimicrobial resistance –.
Materials and Methods
Bioinformatic Analyses and Primer Design
Genome sequences and annotations were obtained from GeneDB in XML or Artemis format. Analyses were performed on the genome sequence of L. major Friedlin version 5.2, L. infantum JPCM5 version 3, and L. braziliensis clone M2904 version 2. Intergenic sequences were extracted and blasted against their respective chromosome. Blast hits were filtered for identities higher than 85% and lengths between 200 and 2,500 nucleotides. Redundant hits were removed, and repeated sequences were given unique identifiers. For each chromosome, repeated sequences were clustered by sequence homology. Repeated regions were screened for SIDERs as described . Primers were designed for all putative recombination events using Primer3 and designed within 150 nucleotides from the repeated sequence with the orientations shown in Figure 1A and 1B. Optimal primer length was 23 nucleotides, and optimal Tm was 58°C. Input files for Primer 3 were created using in-house perl scripts. A list of the primers used can be found in Table S5.
Pulse-Field Gel Electrophoresis and Southern Blot Hybridization
Intact chromosomes were prepared from late log phase cultures of Leishmania promastigotes and separated by pulse-field gel electrophoresis using a Bio-Rad CHEF-DR III apparatus at 5 V/cm and 120° separation angle as described . Gels were transferred and hybridized with [α-32P]dCTP-labeled DNA probes according to standard protocols.
DNA Preparation for Sensitive PCR Assays
Late log phase promastigotes (15–20 ml) were pelleted at 3,000 rpm for 5 min, and pellets were washed with HEPES-NaCl, resuspended in suspension buffer (100 mM EDTA, 100 mM NaCl, 10 mM TRIS pH 8.0), and lysed in 1% SDS with 50 µg/ml proteinase K for 2 h at 37°C. The DNA was extracted with 1 volume of phenol, precipitated with 2 volumes of 99% ethanol, washed with 70% ethanol twice, and dissolved in 1 ml TE. RNAse (20 µg/ml) was added and incubated for 30 min at 37°C, followed by addition of 50 µg/ml proteinase K and 0.1% SDS at 37°C for 30 min. DNA was extracted with 1 volume of phenol, precipitated, and washed as above, then dissolved in MilliQ water. DNA was quantified using a Nanodrop spectrophotometer.
Semiquantitative Polymerase Chain Reaction and Estimate of the Rearrangement Frequency
The PCR products, which needed to be longer than the size of the repeats used for recombination, required optimizations. PCR reaction mixture consisted of 100 ng of phenol-purified genomic DNA isolated as described above, 4 µM of forward and reverse primers, 2 mM dNTPs, 1.25 U of FastStart Taq DNA polymerase (Roche), 1×PCR buffer + MgCl2, and 3.33 mg/ml BSA. The total reaction mixture was made up to 25 µl by addition of the genomic DNA. Each PCR reaction was standardized as follows: an initial denaturation at 94°C for 4 min, denaturation at 95°C for 15 s, annealing for 30 s, elongation at 72°C for 1 min, and a final extension at 72°C for 5 min. For each PCR reaction, the annealing temperature was optimized as well as the number of cycles to prevent saturation of the amplification. The housekeeping gene GAPDH was used as a control to normalize the amount of DNA loaded in each reaction. Saturation of band intensities of the amplified PCR products was determined using the AlphaImager 2000 software. We used L. major MTX60.4 mutant, which contains the DHFR amplicon , to make a standard curve and determine the frequencies of rearrangement. Prior to DNA extraction, the MTX60.4 cells were mixed with L. major wild-type cells from 101- to 1010-fold, resulting in the dilution of the amplicon in DNA extracts. The semiquantitative PCR described above revealed a relative decrease in amplification frequency for the dilution 10−1 to 10−6, consistent with the dilution of the amplicons (Figure S3). Higher dilutions of the targets (10−7 to 10−10) did not further decrease the signal intensities after PCR quantitation, which suggests that the 10−6 dilution represents the basal rearrangement frequency (Figure S3). Considering that MTX60.4 cells have an average of 10 amplicons per cells , we estimated the rearrangement frequency at 10−6 to 10−7 amplicon per cell. We carried out similar dilution experiments with L. infantum MTX20.5, which contains both the DHFR-TS circular amplicon and the PTR1 linear amplicon , and found similar rearrangement rates of 10−6 to 10−7. To quantitate the amplicons in the stressed samples, PCRs were done with the same dilutions and optimized cycle numbers as for the standard curve. The same amount of the PCR amplification reactions was loaded on an agarose gel, and the signals were quantified and normalized.
Parasites from passages P3, P6, and P8 at 0.5×MTX EC50 and from passage P4 at 1×MTX EC50 (P41) were plated on SDM agar for 10 d. Single colonies were picked from the plates and independently lysed in 50 µl of InstaGen Matrix solution (BioRad) according to the manufacturer's recommendations. PCR reaction mixture and conditions were the same as described above but with 1 µl of lysed colony-derived parasites instead of 100 ng phenol-purified genomic DNA. The optimized number of cycles for the detection of the novel junction originating from the DHFR-TS amplicon was set to 30 cycles. For the unselected parasites (P0 passage), no amplification was observed even when the number of cycles was 40 cycles. The housekeeping gene GAPDH was used as a control.
Generation of a L. infantum RAD51 Null Mutant
The first allele of LinJ.28.0580 (RAD51) of L. infantum was deleted by HR using the noncoding flanking regions of the gene fused to the marker neomycin phosphotransferase (NEO). The deletion of the second allele could not be obtained by HR using the same flanking regions. An insertional inactivation strategy was performed where the hygromycin phosphotransferase gene (HYG) was inserted within the ORF. The primers used for inactivation of both alleles are listed in Table S5. Transfectants were selected in the presence of 40 µg/ml of G418 (Geneticin) and 300 µg/ml of hygromycin B, and integrations were confirmed by Southern blots and PCR.
Recombination assay of 5′GSH1-BLA-3′ GSH1 was measured as described . Briefly, 5 µg of the blasticidin (BLA) inactivation construct was transfected in 4×107 WT or RAD51 null mutant cells. After 24 h, cells were preselected with 40 µg/ml blasticidin, and after another 24 h, 5–7×106 cells were plated in the presence of 50 µg/ml blasticidin (Invitrogen) in triplicates and the number of colonies per plate was counted after 10–15 d.
Library was prepared using the Nextera library preparation kit (Illumina, Carlsbad, CA) following the manufacturer's protocol. Library was quantified with Picogreen, and 20 µl of 2.5 nM diluted DNA was used for library preparation. Library was denatured and diluted to 20 pM following the protocol recommended for sequencing of Nextera libraries on the MiSeq sequencer (Illumina). Sequencing was performed on the MiSeq system using paired-end reads of 150 nucleotides. Genomes were assembled using Ray 2.0.0 . Reads were aligned to L. infantum JPCM5 version 4 using the bwa aligner , and sequencing coverage was assessed for each position of chromosome 6. The sequencing data for the FCPO amplicon are available at the EMBL European Nucleotide Archive (http://www.ebi.ac.uk/ena) under the study accession number ERP002431, sample accession ERS227354.
Distance between repeated sequences within the same chromosome for (A) DRs and (B) IRs. The abscissa is a log10 scale of kbp, and the y-axis corresponds to the number of couples of repeats in the range defined on the abscissa. Most of the couples of DRs and IRs are 1 to 100 kbp distant. Distribution of IRs (C) and DRs (D) along the chromosomes. IRs are enriched in regions near the telomeres, whereas DRs are more uniformly dispersed. The abscissa corresponds to the distance between the repeats and the closest telomere end, and the y axis to the number of couples in the range defined on the abscissa.
Amplicon detection in intracellular Leishmania. J774 murine macrophage cells were infected with L. infantum (MHOM/MA/67/ITMAP-263) promastigotes at a parasite/macrophage ratio of 10:1 for 3 h. Noninternalized parasites were removed by several washes. After 4 d, the cells were collected, washed with HEPES-NaCl, and homogenized in resuspension buffer (100 mM EDTA, 100 mM NaCl, 10 mM TRIS pH 8.0). The cells were lysed and DNA extracted as described in Materials and Methods. PCR reaction mixture for detection of the PTR1 circular and linear amplicons consisted of 500 ng of the prepared genomic DNA, and sensitive PCR was carried out as described under Materials and Methods. The housekeeping chromosomal gene glyceraldehyde-3-phosphate dehydrogenase (GAPDH) was used as an internal control. All amplicons were sequenced to confirm their identity. Lane 1, intracellular amastigotes; lane 2, no template control.
Determination of frequency of gene rearrangements in Leishmania. L. major MTX60.4 cells were diluted 101- to 1010-fold with L. major LV39 wild-type cells, and PCR to detect DHFR amplicons was conducted. PCR products were loaded on a 1% agarose gel (A) and quantified. The chromosomal locus GAPDH was used for normalization. The semiquantitative PCRs were realized with an increasing number of cycles until a clear PCR product was detected for the wild-type strain. The dilutions 10−8 to 10−10 gave the same amplification rate than the wild-type strain, indicating that those samples contained the same number of amplicons. Because the MTX60.4 cells contain at least 10 amplicons per cell (Ubeda et al., 2008) , these data indicate that the rate of the DHFR genomic rearrangements is higher than 10−7. The PCR quantitation data at cycle 25 were used to draw a standard curve (B), which was then used to determine, after 25 PCR cycles, the quantity of amplicons in the MTX stressed population.
Selection for MTX resistance and DHFR-TS amplification in L. major. (A) Selection scheme of L. major for eight continuous passages at 0.5× MTX EC50 or by 2-fold increments from 0.5× MTX EC50 up to 8× MTX EC50. DNAs extracted at selected passages were further analyzed by PCR and Southern blots for the presence of the DHFR-TS amplicon in Figure 5 (*) and Figure S4C (+). P1–P8 refers to the number of passages performed at each MTX concentration. (B) Map of the DHFR-TS locus and its amplified region. The 10 kb EcoRV–BglII chromosomal fragment and the 4 kb BglII–BglII rearranged fragment diagnostic of amplification are indicated. Black arrows represent the DRs involved in HR. The probe used for differentiating the chromosomal and amplified loci by Southern blots corresponds to the first 1,000 bp of the gene LinJ.06.0830 and is indicated by the line under the map. (C) Southern blots of DNAs isolated from L. major selected step-by-step for MTX resistance by 2-fold MTX increments until they grew at 8× EC50 (these DNAs were extracted from passages with a ‘+’ sign in Figure S4A). The blot was hybridized to the probe indicated in Figure S4B, allowing the discrimination of the rearranged and chromosomal bands. The blot was rehybridized with a probe covering the coding sequence of DHFR-TS (DHFR) and to a GAPDH probe to monitor the DNA loaded in each lane. Fold increases were normalized with the GAPDH signal.
L. infantum RAD51 null mutant and associated phenotypes. (A) Diagram of the RAD51 locus in L. infantum WT, and with the NEO deletion and HYG disruption cassettes, with localization of primers a–f and size of the PCR products. (B) PCR fragments with primers a–f confirming inactivation cassettes insertion in the RAD51 locus. Molecular weights are indicated on the left, and the various alleles are pinpointed on the right. Lane 1, L. infantum WT; lane 2, RAD51/RAD51::NEO; lane 3, RAD51::NEO/RAD51::HYG. White lines indicate nonadjacent lanes that have been brought together for producing the figures. (C) Effect of methylmethane sulphonate (MMS) on cell growth. L. infantum WT PSPαZEOα (filled triangle), RAD51−/− PSPαZEOα (filled square), and RAD51−/− PSPαZEOαRAD51 (filled circle) were passaged in various concentrations of MMS for 3 d, after which the growth was monitored at 600 nm. Average of three independent experiments is shown. (D) Transformation efficiency of RAD51−/− mutants. L. infantum WT PSPαZEOα, RAD51−/−PSPαZEOα, and RAD51−/−PSPαZEOαRAD51 were transfected with a linear DNA fragment containing a selectable marker (BLA) flanked by the 5′ and 3′ GSH1 flanking sequences. Transformation efficiency was calculated by plating an equal number of cells in triplicate on plates containing the selection drug and counting the number of colonies per plate. The graph represents triplicate from two independent experiments.
MRPA gene amplification in selected arsenite-resistant L. infantum mutants. The DNAs of L. infantum WT cells and of the arsenite-resistant mutants (Figure 8B, lanes 7, 8, and 10) were isolated and digested with NcoI and hybridized with MRPA and GAPDH probes. Dilution experiments with the DNA of As250.8 have indicated that the copy number of the MRPA amplicon is higher than 20 copies in this cell line.
Identification of the MRPA locus rearrangements in the SbIII-resistant clones by PCR. (A) The map of the MRPA locus is shown with the repeats A–D and the primers 1–9. (B) L. infantum WT cells (left panel), RAD51 null mutant (middle panel), and RAD51 revertant cells (right panel) were selected for arsenite resistance, and all the arsenite-resistant clones selected independently displayed MRPA amplifications (Figure 8). By PCR, we determined the rearrangements that occurred using specific primer pairs for each couple of repeats. Fewer efforts were made for having sensitive PCR as we were dealing with highly amplified DNAs.
Evidence for gene rearrangements at the level of DRs and IRs in Trypanosoma brucei. A bioinformatics analysis revealed at least 773 intergenic repeated sequences in T. brucei. Four examples are shown here. Evidence for rearrangements at the level of these repeats was obtained by PCR using appropriate primers. PCR products of the right size were obtained for each reaction and sequenced. P, PCR product; M, molecular weight marker. White lines indicate nonadjacent lanes that have been brought together for producing the figure.
L. major pairs of repeated sequences with putative gene amplification products.
Properties of L. major RAGs.
L. infantum pairs of repeated sequences with putative gene amplification products.
Leishmania braziliensis pairs of repeated sequences with putative gene amplification products.
PCR primers used in this study.
We thank Dr. Andrée Maheux for fruitful discussions for optimizing the PCR assays, Dr. Rubens do Monte for helping with statistical analyses, and members of the M.O. lab for critical reading of the manuscript.
The author(s) have made the following declarations about their contributions: Conceived and designed the experiments: JMU FR AM PL JC MO. Performed the experiments: JMU FR AM MP HG GR AL. Analyzed the data: JMU FR AM PL BP JC MO. Contributed reagents/materials/analysis tools: FR JC. Wrote the paper: JMU PL BP JC MO.
- 1. Freeman JL, Perry GH, Feuk L, Redon R, McCarroll SA, et al. (2006) Copy number variation: new insights in genome diversity. Genome Res 16: 949–961.
- 2. Andersson DI, Hughes D (2009) Gene amplification and adaptive evolution in bacteria. Annu Rev Genet 43: 167–195.
- 3. Hastings PJ, Lupski JR, Rosenberg SM, Ira G (2009) Mechanisms of change in gene copy number. Nat Rev Genet 10: 551–564.
- 4. Tanaka H, Yao MC (2009) Palindromic gene amplification—an evolutionarily conserved role for DNA inverted repeats in the genome. Nat Rev Cancer 9: 216–224.
- 5. Wahl GM (1989) The importance of circular DNA in mammalian gene amplification. Cancer Res 49: 1333–1340.
- 6. Beverley SM (1991) Gene amplification in Leishmania. Annu Rev Microbiol 45: 417–444.
- 7. Ouellette M, Hettema E, Wust D, Fase-Fowler F, Borst P (1991) Direct and inverted DNA repeats associated with P-glycoprotein gene amplification in drug resistant Leishmania. Embo J 10: 1009–1016.
- 8. Grondin K, Roy G, Ouellette M (1996) Formation of extrachromosomal circular amplicons with direct or inverted duplications in drug-resistant Leishmania tarentolae. Mol Cell Biol 16: 3587–3595.
- 9. Butler DK, Yasuda LE, Yao MC (1996) Induction of large DNA palindrome formation in yeast: implications for gene amplification and genome stability in eukaryotes. Cell 87: 1115–1122.
- 10. Narayanan V, Mieczkowski PA, Kim HM, Petes TD, Lobachev KS (2006) The pattern of gene amplification is determined by the chromosomal location of hairpin-capped breaks. Cell 125: 1283–1296.
- 11. Mizuno K, Lambert S, Baldacci G, Murray JM, Carr AM (2009) Nearby inverted repeats fuse to generate acentric and dicentric palindromic chromosomes by a replication template exchange mechanism. Genes Dev 23: 2876–2886.
- 12. Tanaka H, Cao Y, Bergstrom DA, Kooperberg C, Tapscott SJ, et al. (2007) Intrastrand annealing leads to the formation of a large DNA palindrome and determines the boundaries of genomic amplification in human cancer. Mol Cell Biol 27: 1993–2002.
- 13. Haile S, Papadopoulou B (2007) Developmental regulation of gene expression in trypanosomatid parasitic protozoa. Curr Opin Microbiol 10: 569–577.
- 14. Anderson BA, Wong IL, Baugh L, Ramasamy G, Myler PJ, et al. (2013) Kinetoplastid-specific histone variant functions are conserved in Leishmania major. Mol Biochem Parasitol 191: 53–57.
- 15. Thomas S, Green A, Sturm NR, Campbell DA, Myler PJ (2009) Histone acetylations mark origins of polycistronic transcription in Leishmania major. BMC Genomics 10: 152.
- 16. van Luenen HG, Farris C, Jan S, Genest PA, Tripathi P, et al. (2012) Glucosylated hydroxymethyluracil, DNA base J, prevents transcriptional readthrough in Leishmania. Cell 150: 909–921.
- 17. Ivens AC, Peacock CS, Worthey EA, Murphy L, Aggarwal G, et al. (2005) The genome of the kinetoplastid parasite, Leishmania major. Science 309: 436–442.
- 18. Beverley SM, Coderre JA, Santi DV, Schimke RT (1984) Unstable DNA amplifications in methotrexate-resistant Leishmania consist of extrachromosomal circles which relocalize during stabilization. Cell 38: 431–439.
- 19. Garvey EP, Santi DV (1986) Stable amplified DNA in drug-resistant Leishmania exists as extrachromosomal circles. Science 233: 535–540.
- 20. Petrillo-Peixoto ML, Beverley SM (1988) Amplified DNAs in laboratory stocks of Leishmania tarentolae: extrachromosomal circles structurally and functionally similar to the inverted-H-region amplification of methotrexate-resistant Leishmania major. Mol Cell Biol 8: 5188–5199.
- 21. White TC, Fase-Fowler F, van Luenen H, Calafat J, Borst P (1988) The H circles of Leishmania tarentolae are a unique amplifiable system of oligomeric DNAs associated with drug resistance. J Biol Chem 263: 16977–16983.
- 22. Beverley SM, Coburn CM (1990) Recurrent de novo appearance of small linear DNAs in Leishmania major and relationship to extra-chromosomal DNAs in other species. Mol Biochem Parasitol 42: 133–141.
- 23. Tripp CA, Myler PJ, Stuart K (1991) A DNA sequence (LD1) which occurs in several genomic organizations in Leishmania. Mol Biochem Parasitol 47: 151–156.
- 24. Papadopoulou B, Roy G, Ouellette M (1993) Frequent amplification of a short chain dehydrogenase gene as part of circular and linear amplicons in methotrexate resistant Leishmania. Nucleic Acids Res 21: 4305–4312.
- 25. Navarro M, Liu J, Muthui D, Ortiz G, Segovia M, et al. (1994) Inverted repeat structure and homologous sequences in the LD1 amplicons of Leishmania spp. Mol Biochem Parasitol 68: 69–80.
- 26. Olmo A, Arrebola R, Bernier V, Gonzalez-Pacanowska D, Ruiz-Perez LM (1995) Co-existence of circular and multiple linear amplicons in methotrexate- resistant Leishmania. Nucleic Acids Res 23: 2856–2864.
- 27. Grondin K, Kundig C, Roy G, Ouellette M (1998) Linear amplicons as precursors of amplified circles in methotrexate- resistant Leishmania tarentolae. Nucleic Acids Res 26: 3372–3378.
- 28. Ubeda JM, Legare D, Raymond F, Ouameur AA, Boisvert S, et al. (2008) Modulation of gene expression in drug resistant Leishmania is associated with gene amplification, gene deletion and chromosome aneuploidy. Genome Biol 9: R115.
- 29. Leprohon P, Legare D, Raymond F, Madore E, Hardiman G, et al. (2009) Gene expression modulation is associated with gene amplification, supernumerary chromosomes and chromosome loss in antimony-resistant Leishmania infantum. Nucleic Acids Res 37: 1387–1399.
- 30. Dubessay P, Ravel C, Bastien P, Lignon MF, Ullman B, et al. (2001) Effect of large targeted deletions on the mitotic stability of an extra chromosome mediating drug resistance in Leishmania. Nucleic Acids Res 29: 3231–3240.
- 31. Kündig C, Leblanc E, Papadopoulou B, Ouellette M (1999) Role of the locus and of the resistance gene on gene amplification frequency in methotrexate resistant Leishmania tarentolae. Nucleic Acids Res 27: 3653–3659.
- 32. Grondin K, Papadopoulou B, Ouellette M (1993) Homologous recombination between direct repeat sequences yields P-glycoprotein containing amplicons in arsenite resistant Leishmania. Nucleic Acids Res 21: 1895–1901.
- 33. Peacock CS, Seeger K, Harris D, Murphy L, Ruiz JC, et al. (2007) Comparative genomic analysis of three Leishmania species that cause diverse human disease. Nat Genet 39: 839–847.
- 34. Papadopoulou B, Dumas C (1997) Parameters controlling the rate of gene targeting frequency in the protozoan parasite Leishmania. Nucleic Acids Res 25: 4278–4286.
- 35. Coelho AC, Leprohon P, Ouellette M (2012) Generation of Leishmania hybrids by whole genomic DNA transformation. PLoS Negl Trop Dis 6: e1817.
- 36. Bringaud F, Muller M, Cerqueira GC, Smith M, Rochette A, et al. (2007) Members of a large retroposon family are determinants of post-transcriptional gene expression in Leishmania. PLoS Pathog 3: 1291–1307.
- 37. Smith M, Bringaud F, Papadopoulou B (2009) Organization and evolution of two SIDER retroposon subfamilies and their impact on the Leishmania genome. BMC Genomics 10: 240.
- 38. Boucher N, Wu Y, Dumas C, Dube M, Sereno D, et al. (2002) A common mechanism of stage-regulated gene expression in Leishmania mediated by a conserved 3′-untranslated region element. J Biol Chem 277: 19511–19520.
- 39. Mukherjee A, Langston LD, Ouellette M (2011) Intrachromosomal tandem duplication and repeat expansion during attempts to inactivate the subtelomeric essential gene GSH1 in Leishmania. Nucleic Acids Res 39: 7499–7511.
- 40. Haimeur A, Brochu C, Genest P, Papadopoulou B, Ouellette M (2000) Amplification of the ABC transporter gene PGPA and increased trypanothione levels in potassium antimonyl tartrate (SbIII) resistant Leishmania tarentolae. Mol Biochem Parasitol 108: 131–135.
- 41. Rovai L, Tripp C, Stuart K, Simpson L (1992) Recurrent polymorphisms in small chromosomes of Leishmania tarentolae after nutrient stress or subcloning. Mol Biochem Parasitol 50: 115–125.
- 42. Detke S, Katakura K, Chang KP (1989) DNA amplification in arsenite-resistant Leishmania. Exp Cell Res 180: 161–170.
- 43. Hahn PJ (1993) Molecular biology of double-minute chromosomes. Bioessays 15: 477–484.
- 44. Papadopoulou B, Roy G, Ouellette M (1994) Autonomous replication of bacterial DNA plasmid oligomers in Leishmania. Mol Biochem Parasitol 65: 39–49.
- 45. Patnaik PK, Bellofatto V, Hartree D, Cross GA (1994) An episome of Trypanosoma brucei can exist as an extrachromosomal element in a broad range of trypanosomatids but shows different requirements for stable replication. Mol Biochem Parasitol 66: 153–156.
- 46. Wilson K, Berens RL, Sifri CD, Ullman B (1994) Amplification of the inosinate dehydrogenase gene in Trypanosoma brucei gambiense due to an increase in chromosome copy number. J Biol Chem 269: 28979–28987.
- 47. Flores M, Mavingui P, Perret X, Broughton WJ, Romero D, et al. (2000) Prediction, identification, and artificial selection of DNA rearrangements in Rhizobium: toward a natural genomic design. Proc Natl Acad Sci U S A 97: 9138–9143.
- 48. Rogers MB, Hilley JD, Dickens NJ, Wilkes J, Bates PA, et al. (2011) Chromosome and gene copy number variation allow major structural change between species and strains of Leishmania. Genome Res 21: 2129–2142.
- 49. Downing T, Imamura H, Decuypere S, Clark TG, Coombs GH, et al. (2011) Whole genome sequencing of multiple Leishmania donovani clinical isolates provides insights into population structure and mechanisms of drug resistance. Genome Res 21: 2143–2156.
- 50. Sterkers Y, Lachaud L, Crobu L, Bastien P, Pages M (2011) FISH analysis reveals aneuploidy and continual generation of chromosomal mosaicism in Leishmania major. Cell Microbiol 13: 274–283.
- 51. Mukherjee A, Padmanabhan PK, Singh S, Roy G, Girard I, et al. (2007) Role of ABC transporter MRPA, gamma-glutamylcysteine synthetase and ornithine decarboxylase in natural antimony-resistant isolates of Leishmania donovani. J Antimicrob Chemother 59: 204–211.
- 52. Galhardo RS, Hastings PJ, Rosenberg SM (2007) Mutation as a stress response and the regulation of evolvability. Crit Rev Biochem Mol Biol 42: 399–435.
- 53. Cirz RT, Romesberg FE (2007) Controlling mutation: intervening in evolution as a therapeutic strategy. Crit Rev Biochem Mol Biol 42: 341–354.
- 54. Rosenberg SM, Shee C, Frisch RL, Hastings PJ (2012) Stress-induced mutation via DNA breaks in Escherichia coli: a molecular mechanism with implications for evolution and medicine. Bioessays 34: 885–892.
- 55. Genois MM, Mukherjee A, Ubeda JM, Buisson R, Paquet E, et al. (2012) Interactions between BRCA2 and RAD51 for promoting homologous recombination in Leishmania infantum. Nucleic Acids Res 2012: 13.
- 56. Boisvert S, Raymond F, Godzaridis E, Laviolette F, Corbeil J (2012) Ray meta: scalable de novo metagenomic assembly and profiling. Genome Biol 13: R122.
- 57. Li H, Durbin R (2009) Fast and accurate short read alignment with Burrows-Wheeler Transform. Bioinformatics 25: 1754–1760.