The Leishmania genus comprises up to 35 species, some with status still under discussion. The multilocus sequence typing (MLST)—extensively used for bacteria—has been proposed for pathogenic trypanosomatids. For Leishmania, however, a detailed analysis and revision on the taxonomy is still required. We have partially sequenced four housekeeping genes—glucose-6-phosphate dehydrogenase (G6PD), 6-phosphogluconate dehydrogenase (6PGD), mannose phosphate isomerase (MPI) and isocitrate dehydrogenase (ICD)—from 96 Leishmania (Viannia) strains and assessed their discriminatory typing capacity. The fragments had different degrees of diversity, and are thus suitable to be used in combination for intra- and inter-specific inferences. Species-specific single nucleotide polymorphisms were detected, but not for all species; ambiguous sites indicating heterozygosis were observed, as well as the putative homozygous donor. A large number of haplotypes were detected for each marker; for 6PGD a possible ancestral allele for L. (Viannia) was found. Maximum parsimony-based haplotype networks were built. Strains of different species, as identified by multilocus enzyme electrophoresis (MLEE), formed separated clusters in each network, with exceptions. NeighborNet of concatenated sequences confirmed species-specific clusters, suggesting recombination occurring in L. braziliensis and L. guyanensis. Phylogenetic analysis indicates L. lainsoni and L. naiffi as the most divergent species and does not support L. shawi as a distinct species, placing it in the L. guyanensis cluster. BURST analysis resulted in six clonal complexes (CC), corresponding to distinct species. The L. braziliensis strains evaluated correspond to one widely geographically distributed CC and another restricted to one endemic area. This study demonstrates the value of systematic multilocus sequence analysis (MLSA) for determining intra- and inter-species relationships and presents an approach to validate the species status of some entities. Furthermore, it contributes to the phylogeny of L. (Viannia) and might be helpful for epidemiological and population genetics analysis based on haplotype/diplotype determinations and inferences.
Leishmania is a protozoan genus comprising many species, some associated with a human neglected disease called leishmaniasis. This parasite is found worldwide and is transmitted by sand flies, having numerous domestic and sylvatic animals as reservoirs. Leishmania is genetically and ecologically diverse and it has been argued that this has an impact on the epidemiology of the disease. Many typing methods have been proposed for the study of this diversity, although a generally agreed methodology is still required. Also, there is still a lack of consensus on the validity of some species. Multilocus sequence typing (MLST) is a method for studying the population structure and diversity of pathogens, but before an MLST scheme can be proposed it is essential to undertake a detailed analysis and selection of the sequences that are to be included in the system. Here, we sequenced four gene fragments of 96 L. (Viannia) strains, representing most species from this subgenus. Our results showed a good agreement between the current species assignment and the multilocus sequence analysis. Evidence of genetic recombination was found and the phylogenetic relationships were determined. Overall the results point to the feasibility of an MLST scheme for Leishmania and indicate that the four gene fragments analyzed could form part of this typing system. This will certainly be a valuable approach for taxonomy, population genetics, and epidemiological studies of this pathogen.
Citation: Boité MC, Mauricio IL, Miles MA, Cupolillo E (2012) New Insights on Taxonomy, Phylogeny and Population Genetics of Leishmania (Viannia) Parasites Based on Multilocus Sequence Analysis. PLoS Negl Trop Dis 6(11): e1888. doi:10.1371/journal.pntd.0001888
Editor: Christian Tschudi, Yale School of Public Health, United States of America
Received: June 15, 2012; Accepted: September 17, 2012; Published: November 1, 2012
Copyright: © 2012 Boité et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: This work was supported by Fundação Oswaldo Cruz (FIOCRUZ) - http://portal.fiocruz.br, Fundação de Amparo a Pesquisa do Estado do Rio de Janeiro (FAPERJ) - http://www.faperj.br, Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq) - http://www.cnpq.br. This research was also supported by a grant from the European Union (LeishEpinetSA, INCO-CT2005-015407). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Leishmania are the causative agents of leishmaniasis, which can present in different forms, from simple cutaneous to the deadly visceral disease, and are found in most tropical and sub-tropical regions. In spite of their morphological homogeneity, more than 20 species have been described for the Leishmania genus. Phenotypic diversity is observed not only between species, but also even in virulence levels among clones  which, upon interaction with the host's immunological response, contributes to determining the observed clinical pleiotropy and affects the efficiency of therapy applied . Leishmania genetic diversity may also compromise vaccine development, although key antigen genes are highly conserved. Understanding leishmaniasis and the development of measures to counter its spread depend on the ability to identify Leishmania species and characterize genetic variants.
Multilocus enzyme electrophoresis (MLEE) is still considered by many to be the gold standard for Leishmania identification, but several DNA based methods have proven useful to study Leishmania genetic diversity , . DNA sequencing and PCR-RFLP of hsp70 genes have been shown to be promising for the identification of Leishmania parasites , , although too conserved for intra-specific diversity studies. Highly polymorphic markers, such as microsatellites, perform poorly at taxonomic levels higher than species, whilst most other genotyping methods rely on multicopy genes that are more difficult to analyze.
A standardized, sensitive, practical and reproducible typing method, such as multilocus sequence typing (MLST), must form the basis for a robust classification of Leishmania species, which is achievable in most laboratories even in the advent of fast and cheaper genome sequencing.
MLST, as proposed in 1998 for bacterial pathogens , provides a portable, reproducible, and quantitative typing system. It has since been applied to diploid organisms –, including Leishmania, with ten markers described for the L. donovani complex – and four targets proposed for L. (Viannia) spp. . MLST databases for Leishmania have not yet been implemented and the published number of strains and species typed is relatively low. Alongside MLST, multilocus sequence analysis (MLSA) has been shown to be a good tool for strain characterization and epidemiological surveillance, as well as population structure and evolutionary studies , , , . The implementation of an MLST system demands careful evaluation of markers beforehand to study their diversity and phylogenetic consistency , .
We report here the evaluation of four candidate coding regions, located in different chromosomes, to be included in an MLSA system of the subgenus L. (Viannia). These regions are part of the genes for the metabolic enzymes glucose-6-phosphate dehydrogenase (G6PDH, EC 22.214.171.124), phosphogluconate dehydrogenase (6PGD, EC 126.96.36.199), mannose phosphate isomerase (MPI, EC 188.8.131.52) and isocitrate dehydrogenase (ICD, EC 184.108.40.206). Three of these genes had been studied previously, but mainly for Peruvian isolates of the L. braziliensis complex. Herein a large number of strains and most species of the subgenus L. (Viannia) were included. The sequences were used to conduct haplotype analysis, including the construction of haplotype networks. Diploid multilocus analyses were performed and a concatenated tree was defined, as well as clonal complexes (CC). Using an MLSA approach, we report the genetic diversity of the four gene fragments evaluated in the L. (Viannia) group and demonstrate their value for taxonomic, phylogenetic and population genetic studies of Leishmania parasites.
Research in this study was subject to ethical review by the European Commission and approved as part of contract negotiation for Project LeishEpiNetSA (contract 01547): the work conformed to all relevant European regulations. The research was also reviewed and approved by the ethics committee of the London School of Hygiene and Tropical Medicine (approval 5092). The Leishmania strains analyzed were principally strains derived from international cryobanks such as Coleção de Leishmania do Instituto Oswaldo Cruz (CLIOC) and the cryobank of the London School of Hygiene and Tropical Medicine (LSHTM). In all cases Leishmania were isolated from patients as part of normal diagnosis and treatment with no unnecessary invasive procedures and with written and/or verbal consent recorded at the time of clinical examination. Data on isolates were coded and anonymised. The Leishmania strains are deposited either at the LSHTM (n = 9) or at CLIOC (n = 87) as open data. CLIOC is a Depository Authority by the Ministry of the Environment [Fiel Depositária pelo Ministério do Meio Ambiente, MMA] (D.O.U. 05.04.2005). All samples were used for research purposes only and the data were analyzed anonymously in the scope of resolution 21 (August 31, 2006 – CGEN/MMA), for which authorization is not required.
Samples and GenBank available sequences
Ninety six strains were obtained from frozen stocks from CLIOC (n = 87), and from the London School of Hygiene and Tropical Medicine (LSHTM) (n = 9) (Table S1). Strains were chosen to be representative of the zymodeme and geographical diversity of species of the subgenus Leishmania (Viannia) in South America, and in particular Brazil: L. (V) braziliensis, L. (V) guyanensis, L. (V) naiffi, L. (V) lainsoni, L. (V) shawi, L. (V) utingensis and L. (V) lindenbergi. Most strains had been previously characterized by MLEE . DNA sequences were retrieved from GenBank of L. (Viannia) species: 14 of G6PD; 24 of 6PGD; 20 of MPI and 1 of ICD (Table S2). Only those presenting full coverage with the alignment of sequences obtained in the present study were included in the analysis: for G6PD: 3 L. braziliensis, 2 L. guyanensis, 1 L. peruviana, 1 L. panamensis, 1 L. lainsoni; for 6PGD: 10 L. peruviana, 07 L. braziliensis, 1 L. panamensis; for MPI: 11 L. peruviana, 9 L. braziliensis; for ICD: 1 L. braziliensis.
DNA extraction and PCR
Promastigotes were grown at 25°C in Schneider's medium supplemented with 20% (v/v) heat-inactivated fetal bovine serum to a density of 1×109 cells/mL (late log phase), as estimated by counting in a Neubauer chamber. DNA extraction was performed using the Wizard DNA purification Kit (Promega, Madison, USA) following the manufacturer's instructions.
The chosen loci are distributed on different chromosomes according to L. (Viannia) braziliensis genome sequencing data: 6PGD, on chromosome 34; G6PD, on chromosome 20; MPI, on chromosome 32; ICD, on chromosome 33.
Primers (Table 1) were designed in conserved regions of gene sequences from the published L. major and the L. braziliensis genomes in Genbank (www.ncbi.nlm.nih.gov/Genbank) to amplify sections of the coding regions of the genes that would be amenable to full sequencing using the PCR primers and that include putative species-specific polymorphisms as well as singleton SNPs. Amplification reactions had, for 50 µl total volume, 0,1 mM of each primer, reaction buffer (100 mM Tris–HCl, pH 8.8; 500 mM KCl, 1% Triton X-100; 15 mM MgCl2), 0.25 mM deoxyribonucleotide triphosphate (dNTPs), 0.025 U FideliTaq/GoTaq polymerase and approximately 50 ng DNA. Amplification conditions were: 94°C for 2 minutes, followed by 35 cycles at 94°C for 30 seconds, 55°C (for ICD, 6PGD and G6PD) or 58°C (for MPI) for 30 seconds and 68°C for 1 minute, with a final extension at 68°C for 5 min.
PCR products were purified with the Wizard SV Clean-up System (Promega). The final DNA concentration was estimated by comparison with a DNA Ladder Marker (Promega) in 2% agarose gel. Sequencing was performed with the same primers used for amplification, using the ABI PRISM BigDyeTerminator v3.1 Cycle Sequencing Kit, and products analyzed in an automated DNA sequencer (ABI-3730). Consensus sequences were generated and edited in Phred/Phrap/Consed Version: 0.020425.c  from two forward and two reverse strands. Sequences with Phred values below ten over their extent were discarded and only sequence segments with values above twenty were used for contig construction. Ambiguous (heterozygous) sites were coded using the standard IUPAC codes for combinations of two or more bases. Contigs from all samples were manually assembled and aligned in MEGA4 . The homologous sequences available in GeneBank for other L. (Viannia) isolates were obtained using the Basic Local Alignment Search Tool (BLAST) algorithm hosted by NCBI, National Institute of Health, USA (http://www.ncbi.nlm.nih.gov) and included in the alignments.
Haplotype reconstruction was done through DNAsp5  using the PHASE algorithm, which automatically assigns a haplotype number (H) for each unique haploid sequence. Haplotype data was then used in DNAsp to calculate the synonymous and non-synonymous substitution rates and haplotype diversity (HD). Haplotype diversity (HD) was calculated as: HD = N(1−Σxi2)/(N−1) where xi is the haplotype frequency of each haplotype in the sample and N is the sample size. The average haplotype diversity for species was compared through ANOVA.
The discriminatory capacity of each marker employed was calculated through Simpson's diversity index (D), as follows: D = Σ n(n−1)/N(N−1), where n = the total number of strains of a particular haplotype and N = total number of strains analyzed. Phylogenetic congruence between markers was assessed by comparison of maximum parsimony (MP) and median-joining (MJ) networks generated by the Network free software , using individual haplotype data.
Diploid multilocus analysis
For each sample a sequence type number (ST) was defined with each marker, which in the homozygous strains was identical to the haplotype number (H), and for the heterozygous strains was a combination of the two possible alleles. A diploid sequence type (DST) was defined for the final combination of STs of the four markers. The sequences from the four genes were concatenated using BioEdit v7.0.9 and unique concatenated sequences were identified with the DST previously assigned. To evaluate the phylogenetic information provided by the four markers and to investigate the presence of recombination signatures, a NeighborNet network was built in SplitsTree 4.0  based on genetic distances calculated according to the Kimura-2 parameter model of nucleotide substitutions from concatenated data.
ST data combinations (with the exception of heterozygous strains) were analyzed in e-BURST v.3 (http://eburst.mlst.net/v3/enter_data/single/) to define clonal complexes (CC), which are sets of related strains containing pairs of strains that share at least (L-1) identical alleles at the L loci with at least one other member of the CC.
Results and Discussion
We report here the most comprehensive multilocus sequence analysis study to date of the subgenus Leishmania (Viannia), based on fragments of four different metabolic enzyme coding genes from a wide range of species, zymodemes and geographical origins.
With the aim of developing a working MLST scheme, we selected polymorphic coding regions that could be sequenced with forward and reverse primers. The full coding regions of three of the studied genes (6PGD, G6PD and MPI) had already been sequenced and characterized for some L. (Viannia) strains, mainly of Peruvian L. braziliensis and L. peruviana. However, we also partially analyzed the coding region for ICD and studied a large number of strains (96), to include most known species or complexes of L. (Viannia).
Genetic diversity of the four housekeeping genes used in the MLSA of Leishmania (Viannia): Polymorphic, ambiguous and multi-allelic sites
The sequences obtained for each gene varied between 589 (MPI) and 914 bp (ICD) (Table 2) and covered between 38% (G6PD) and 71% (ICD) of the corresponding gene. Sequences were deposited in GenBank (www.ncbi.nlm.nih.gov/Genbank) with accession numbers JN996517–JN996708 and JQ181608–JQ181801 and are also available at the CLIOC website (htpp://clioc.fiocruz.br).
Gene fragments had 50–78 polymorphic sites (respectively, 7.3–8.7%) for the strains studied here, of which 46 (G6PD) to 76 (ICD) were parsimony informative (PI) sites (Table 2). 6PGD had the highest percentage of PI sites (8.4%), with the lowest found in G6PD (6.7%). The studied G6PD region had a much higher number and percentage of singletons, 4 (0.6%), in relation to the respective total template length than the other three gene regions (0.1–0.2%; Table 2).
The four sequenced fragments, representing different loci, had different degrees of diversity between species groups as seen by the haplotype diversity (Figure 1) and are thus suitable to be used in combination for intra- and inter-specific inferences. Although theoretically including more loci improves discriminatory capacity, it has been described that increasing the number of genes above four did not increase discrimination of MLST . Furthermore, the index of diversity  for the targets included in the present study implies that the typing results can be interpreted with confidence.
ICD was the only polymorphic marker for L. shawi strains. * G6PD and MPI were the least polymorphic markers (P<0.001) within species for L. lainsoni and L. guyanensis respectively.
Overall, after including published gene sequences for the same regions as studied here (except for ICD, which had no available sequences) gene fragments provided 51 (MPI) and 64 (6PGD) PI sites (Table 3).
Most polymorphic sites were bi-allelic, although sites with three variants occurred in three genes, G6PD, 6PGD and in ICD (Table S3). Recently, the genome plasticity of some Leishmania species was analyzed . The evaluation of the L. braziliensis genome showed that 30 of 35 chromosomes are clearly trisomic; three are tetrasomic (chromosomes 4, 5 and 29) and one hexasomic (chromosome 31). Moreover, the same study showed that multicopy genes are found preferentially on non-supranumerary chromosomes. Such peculiarities are of great importance in all molecular analyses proposed for the Leishmania genus. In the present work, all markers used are located on different chromosomes, which are described as non-supranumerary . However, the description of a trisomic genome for L. braziliensis indicates that multiallelic possibilities might be quite frequent when polymorphic sites are analyzed in DNA sequences of Leishmania (Viannia) species. Moreover, gene copy numbers might also change between species and generations of the same strain.
Twenty-six isolates had at least one locus with double peaks in the chromatograms (Table 2 and Table S4). The number of sites showing double peaks was similar in three gene regions, but much smaller for MPI, which also had a much smaller number of strains with double peaks in their sequences compared to the other targets (Table 2). Different strains presenting double peaks at the same site were observed in all markers, except MPI (Table S4).
Although most strains were homozygous, almost a third had at least one locus with double peaks, except L. shawi. MPI was the most homozygous gene, even though it was not the most conserved. All the other markers presented more than one double peak per strain and more than one strain with double peaks. It is possible that this is a sampling effect of the sequenced region in MPI, or that gene conversion in this gene is higher .
The presence of ambiguous sites and possible parental alleles among the samples studied strongly suggests real heterozygosis and also the occurrence of some level of recombination. Ambiguous sites were observed in twenty positions over the four gene fragments analyzed (Table S4), and putative homozygous parental were observed in eight of them. Many strains had two or more heterozygous sites. Heterozygosis can be caused by mutation in one allele but it can also be caused by genetic exchange between strains with different alleles. Mutation is more likely for single heterozygous sites but recombination is a more parsimonious explanation for two or more sites . The majority of heterozygous alleles among our sample presented just one ambiguous site. Exceptions were encountered in one L. braziliensis strain showing HP 8/17 in 6PGD and 6/11 in ICD, with two and three ambiguous sites respectively. This observation reinforces the fact that real heterozygosis might be occurring, because these two loci are on different chromosomes and in both sequence alignments at least two ambiguous sites were observed in that sample. Moreover, after determination of the minimum number of recombination events by using DNAsp, recombination between sites was detected for the four markers, with highest frequency for 6PGD and least for G6PD (data not shown).
The assumption that ambiguities in the chromatograms are the result of heterozygosis can only be reliably postulated if the target is a single copy gene. In our case, the four selected genes are single copy genes, as estimated by in silico analysis of the four available Leishmania species reference genomes (data not shown). Heterozygous samples would need confirmation through biological cloning of isolates, to exclude polyclonal populations. The strains we have used were not cloned, except for the reference strains. As far as we know, none of the studies aiming to construct an MLST system for Leishmania have used biologically and/or genetically cloned the samples. Following an overview of sample profiles, without cloning, if ambiguous sites are detected, the strains containing them should be selected for deeper analysis and cloning.
Putative species-specific SNPs, as determined through the data presented here and available in GenBank, were detected for L. guyanensis in 6PGD (A105); for L. naiffi in MPI (15T, 82A, 98A, 270A, 306C, 498C, 543T) and G6PD (320C, 341C, 432A, 467A); for L. lainsoni in MPI (33G, 105G, 294T) and G6PD (116A, 239T, 247G, 326A, 464C, 482C, 572A, 577T) and for L. shawi in MPI (135G). No species-specific SNPs were detected for L. braziliensis. ICD was the only marker that did not present species-specific SNPs.
In a previous study, species-specific SNPs for all species, including L. braziliensis were shown . This incongruence between the two studies might be a consequence of differences in the gene fragments analyzed as well as in the strains studied. SNP markers should thus be used for species identification with great care, even when more strains are studied in future. Either full sequences should be obtained, or a large panel of SNP markers should be used for reliable identification and characterization.
Identification of assorted Leishmania (Viannia) haplotypes
Here we found a large number of unique haplotypes, which are likely to be rare or recent in the population. In contrast, some haplotypes were shared across species, such as haplotype H1 of 6PGD, which was detected in strains from different species: almost all L. braziliensis (n = 40), nine L. peruviana as well as two previously published L. braziliensis sequences, one L. lainsoni (IOCL 2500 from Acre) and one L. naiffi (IOCL 855 from Amazonas) strain. This could be due to recombination or convergent evolution, but it could also represent the most likely ancestral haplotype, as depicted from the haplotype network constructed for this gene (Figure 2). Remarkably, those species present different electrophoretic mobility for the 6PGDH enzyme system. Such incongruence might be due to the fact that the entire coding region was not sequenced here, so sequence sections coding for differences in MLEE might not be present in the current analysis, or it might be due to post-transcriptional and post-translational modifications .
A) G6PD; B) 6PGD; C) MPI; D) ICD. The 96 strains selected for the study (Table S1) plus sequences of Leishmania (Viannia) strains retrieved from GenBank (Table S2) were included. Haplotype frequency is represented by the size of each node and the numbers of polymorphisms are indicated in the branches by dashes: one = 02 to 05 polymorphism; two = 06 to 10; three = more than 10 polymorphisms. Each species, considering the MLEE characterization, were assigned by different colors as coded by the legend.
Upon haplotype assignment it was found that among the 96 studied strains there were 25 (MPI) to 43 (ICD) different haplotypes, of which 16 (MPI) to 31 (ICD) were represented by only one strain. Nevertheless, ICD had the highest percentage of exclusive haplotypes per strain (72.1%) and G6PD had the lowest (63.6%; Table 4).
Strains identified as L. braziliensis represented more than 50% of the strains studied. The greatest number of haplotypes was observed within the L. braziliensis group for all markers except G6PD for which the greatest number of haplotypes was observed within the L. guyanensis group (Table 3). Around 30% of the haplotypes in each gene were detected in L. braziliensis (Table 3). However, almost no singletons and no species-specific SNPs were observed among the L. braziliensis strains, but some ambiguous sites were seen (Table S4). L. utingensis and L. lindenbergi had unique haplotypes for all markers, with the exception of 6PDG for L. lindenbergi, as mentioned above.
Regarding G6PD, 12 haplotypes were shared by at least two strains. The most common haplotype, H6, was found in 45 strains, characterized as L. braziliensis or L. peruviana. Four additional haplotypes were found in previously published sequences, one in L. panamensis (H36), two L. braziliensis (H34, from Peru, and H35, from Brazil) and one L. lainsoni (H37, from Peru). One published L. guyanensis sequence had haplotype H4, the most common L. guyanensis/L. shawi haplotype in this analysis (Table S2). Except for L. shawi and L. peruviana, which shared haplotypes with L. guyanensis and L. braziliensis respectively, the network depicted from this target presented the best clustering accordingly to species (Figure 2A).
Concerning 6PGD, haplotype H15 was only found in one isolate of L. braziliensis from Acre, northern Brazil, but it was identical to four previously published L. braziliensis strains from Peru. Similarities among strains from these two adjacent areas were reported previously based on MLMT , corroborating the suggestion that a geographic cluster and probably a hierarchical population structure of L. braziliensis exist in this area. L. guyanensis had 10 haplotypes, of which H9 was the most frequent, and L. shawi had one distinct haplotype (H3). Only two 6PGD haplotypes found in previously published sequences (one L. braziliensis from Peru and one L. panamensis) were not detected among the strains studied here, indicating a good coverage of haplotype diversity by our study.
The two most frequent alleles observed, corresponding to 6PGD H1 and H8 (Figure 2B), were also observed combined in one heterozygous L. braziliensis strain (IOCL 2833). These haplotypes were also observed in one (H1; IOCL 2494) and two (H8 IOCL 918 and 2538) other heterozygous strains, combined with other haplotypes (Table S4). Some 6PGD haplotypes were unique to heterozygous strains, as H4/H5 and H30/H31 (Table S1).
MPI L. braziliensis haplotype H8 was also found in one L. guyanensis strain. Haplotype H7 was the most common in L. braziliensis (41 strains) and was also found in eight previously published sequences of L. peruviana and eight of L. braziliensis (Table S2). MPI was previously reported as a good target to discriminate between L. braziliensis and L. peruviana , . Our conflicting results could be related to differences in the fragment regions analyzed, or might reflect a bias in the sample analyzed previously. Two published sequences of L. peruviana strains presenting an ambiguous site had the most common H7 and a new haplotype 26 after allele reconstruction. One available L. peruviana and one L. braziliensis sequence from Peru presented additional haplotypes H27 and H28, respectively (Figure 2C; Table S2).
The greatest number of haplotypes was found in ICD, for almost all species, although there were similar levels for G6PD and 6PGD in L. naiffi. ICD H6 was by far the most common L. braziliensis haplotype, including one previously published sequence (Table S2). ICD H9, the most common L. guyanensis haplotype, was also found in one L. shawi (IOCL 3200) (Figure 2D; Table S1).
Overall, and including published gene sequences for the same regions as studied here, there are 28 to 43 different described alleles in L. (Viannia), of which 11 to 16 are in the L. braziliensis complex.
Haplotype diversity and haplotype network selection of marker combinations for resolution of inter- and intra-species relationships
Overall, haplotype diversity (HD) was higher for ICD (0.85), similar among the other gene regions (0.76–0.78) and as compared in ANOVA not significantly different (P>0.01). However, upon analysis by species, MPI was the least polymorphic marker for L. guyanensis (P<0.001) and G6PD was the least polymorphic locus for L. lainsoni (P<0.001). Among the three L. shawi isolates, ICD was the only polymorphic marker (Figure 1).
The Simpson index of diversity (D), which provides an indication of the discriminatory capacity of each marker, was similar for MPI, G6PD and 6PGD, (0.77, 0.77 and 0.78 respectively), but higher for ICD (D = 0.89). Analyses of the discriminatory capacity indicate a high level of strain discrimination, of almost 90%, using these four loci. However, higher values were observed when applied to L. braziliensis and L. guyanensis (96% and 100%, respectively). Other species were represented by few strains and were not analyzed. This suggests that, even though the number of markers is smaller than that usually used in MLST (7), these four markers are sufficient for studies in L. (Viannia). However, more detailed population genetics studies may require more markers. Regarding the discriminatory power of each marker, the Simpson index showed that ICD had the highest diversity. Indeed, this is a very polymorphic locus in MLEE, which is able to detect intra-species variation .
MPI was here found to be a good marker to distinguish between the species, although HP8 was shared among L. braziliensis and one L. guyanensis strain. This enzyme is used as a marker to differentiate L. peruviana from L. braziliensis, which was supported by DNA sequencing in a previous study  by the detection of a specific SNP, as well as others useful to differentiate closely related L. (Viannia) species. We could not confirm this, given that our sequences were shorter and did not include that SNP locus.
Maximum parsimony-based haplotype networks built for each gene (Figure 2) showed that species, as identified by MLEE, clearly formed separated clusters in each gene network, with a few exceptions confined to some strains that for some markers were not grouped in their species haplogroups. Reticulate patterns were observed in some clusters for all loci studied. Even with markers for which L. shawi presented different haplotypes from L. guyanensis, these two species always clustered close together.
Haplogroups were, in general, consistent with the species (color-coded nodes), although exceptions occurred for all markers: G6PD (H4 and H6; Figure 2A); 6PGD (H1; Figure 2B); MPI (H7, H8 and H19; Figure 2C) and in the ICD network (H9, H22, H23, H24; Figure 2D). A single haplotype MPI (H19) comprising an L. braziliensis strain (IOCL 2541, L. braziliensis from Pernambuco) was at the base of the L. lainsoni cluster, but for all the other markers this strain clustered within the L. braziliensis species group. One L. lainsoni strain (IOCL 2500 from Acre) was part of the L. braziliensis cluster for ICD (H24) and 6PGD (H1). Conversely, the L. lainsoni cluster included two haplotypes exclusively from L. braziliensis strains (haplotypes H22 and H23, IOCL 2498 and IOCL 2499 respectively, from Acre), even though both clusters were located in opposite sides of the network.
The most frequent haplotypes (node size) were often the founding haplotype for a given haplogroup, as clearly observed for L. braziliensis (MPI H7, ICD H6, G6PD H6, 6PGD H1) and L. guyanensis (MPI H9, ICD H9, G6PD H4, 6PGD H9). Eighteen out of 55 L. braziliensis strains were frequently observed in the most common haplotypes, but for L. guyanensis the composition of the most common haplotypes was different between the markers. The most frequent haplotypes were also usually those in which inter-specific sharing of sequences was observed (Figure 2).
L. guyanensis formed a diverse cluster, whereas L. shawi strains presented a profile coherent with a subpopulation of the L. guyanensis group for all markers, commonly sharing the most common L. guyanensis haplotype or differing from it in at most two polymorphisms (Figure 2C).
Although we used a relevant number of strains, the data analysis by Network software generated median vectors. The presence of median vectors in the networks might indicate that: i) intermediate haplotypes were present in lost populations; ii) haplotypes from populations that not included in this analysis; iii) the ancestors of these strains suffered rapid adaptive evolution with expansion of these extant strains. Therefore, even without sequencing strains representing all genetic diversity, statistical tools may predict the variability.
Neighbour Joining (NJ) trees were built for each marker to evaluate the phylogenetic relationship between the haplotypes. Almost no incongruence was observed between the markers. Four monophyletic groups were clearly observed for each marker, representing basically L. lainsoni and L. naiffi, the most divergent groups, and L. braziliensis and L. guyanensis (in this case, including L. shawi), which were very closely related for any marker (data not shown), corroborating the genetic distance MLEE-based tree . L. lindenbergi (except in 6PGD) and L. utingensis were each in a separate and independent branch, but grouping closer to L. naiffi and L. guyanensis respectively.
Available sequences of L. peruviana for G6PD, 6PGD and MPI, and for L. panamensis for G6PD and MPI, were included in the respective network constructions. L. peruviana sequences either presented the most frequent haplotype for L. braziliesnis strains (in G6PD H6, 6PGD H1 MPI H7; Figure 2) or differed from it in one polymorphic site (in the MPI H26 and H27; Figure 2C; Table S2). A previous study using random amplification of polymorphic DNA (RAPD) and MLEE  reported that L. peruviana and L. braziliensis corresponded to two closely related, but distinct monophyletic lines, which was not corroborated by hsp70 gene sequence analysis . L. panamensis sequences presented new haplotypes (G6PD H36 and 6PGD H36) within the L. guyanensis and L. shawi haplogroup. Previously  MLEE and RAPD analysis questioned the distinction between L. panamensis and L. guyanensis, since data did not indicate distinct monophyletic lines. In individual NJ trees for the gene fragments studied here (data not shown), the L. panamensis sequences clustered within the L. guyanensis/L. shawi group and L. peruviana clustered within the L. braziliensis group. It was not possible to include L. panamensis and L. peruviana in the final MLSA conclusions since there were no available sequences for all genes studied. Furthermore, more strains from both species should be sequenced for the four gene targets to infer properly on the monophyletic origin of them.
Concatenated NeighborNet confirms species-specific clusters and suggests relatively frequently recombination occurring in L. braziliensis and L. guyanensis
Among the 96 L. (Viannia) strains, 75 final diploid sequence types (DSTs) were assigned. Although we detected a high number of DSTs, many DSTs are unique, while others are more prevalent, widely distributed and presenting temporal stability, which might reflect limited genetic recombination involving these DSTs . The only species with strains sharing the same DST was L. braziliensis. DST12, for example, was found in 18 strains of L. braziliensis from different Brazilian endemic regions related to the Atlantic rain forest, except one, and present zymodeme diversity although most are classified in zymodeme 27 (Table S1). DST12 is not only the most frequent but also shows temporal stability, as the strains included in this DST had been isolated between 1987 and 2001. Strains typed as DST12 were isolated from patients presenting distinct clinical manifestations. This raises the intriguing proposition that the apparent dominance of DST12 in endemic locations associated with urban areas of the Atlantic rain forest region may be a consequence of higher fitness of this DST to the modified environment.
Three other L. braziliensis DSTs (16, 27, and 35) comprised two or three strains. All the other L. braziliensis were assigned to unique DSTs (n = 30). DST16 comprised two heterozygous L. braziliensis strains from Peru, whilst DST27 comprised two strains (zymodemes 27 and 74) from different localities (Pernambuco and Bahia, respectively), and DST35 three strains from the same zymodeme (IOC/Z27) and geographic origin (Bahia, Northern Brazil).
This demonstrates that the MLSA approach allows both detection of different genotypes and the level of separation between strains through the number of polymorphic sites.
A NeighborNet network was obtained with the concatenated sequences represented by the DSTs (Figure 3). The clusters were in agreement with MLEE for species groups, with the exception of L. shawi, which clusters together with all L. guyanensis strains. The reticulate aspects of the L. guyanensis group suggest recombination events occurring among the strains, including L. shawi (Figure 3). This same aspect was observed for the L. braziliensis cluster, but two strains were more divergent (IOC/L2498 and IOC/L2499, DSTs 32 and 57). Both were isolated from Acre state, a region bordering Peru. Recently we have demonstrated that these two strains clustered together with L. peruviana and Peruvian L. braziliensis by microsatellite typing . However, upon removal of ICD sequences from this analysis these two strains grouped very closed to the other L. braziliensis strains (data not shown).
A bushy network structure is observed indicating homologous recombination. # = DSTs presenting ambiguous sites (possible heterozygous) in at least one locus; DSTs gathered in clonal complex are represented by different colored circles depending on the species: yellow - L. braziliensis; black – L. guyanensis; light blue – L. shawi; blue – L. lainsoni; red – L. naiffi. Singletons are represented by DST number only.
It is clear that L. braziliensis, L. guyanensis, L. naiffi and L. lainsoni all represent distinct species, forming monophyletic groups in the NeighborNet. L. lindenbergi and L. utingensis were placed close to the monophyletic groups corresponding to L. naiffi and L. guyanensis respectively, corroborating MLEE and MLMT data (unpublished). More isolates from both species should be studied to infer their taxonomic status. However, it is important to mention that these two species shared no alleles with all the other species/strains, except for L. lindenbergi, which shared alleles in 6PGD with L. braziliensis (Figure 2B). The estimate of the average genetic distance between L. lindenbergi or L. utingensis and the other L. (Viannia) species is comparable to those observed between each of the species analyzed. The highest values were observed between L. lainsoni and any other species.
Corroborating the results observed for the haplotype network, L. naiffi and L. lainsoni were the most divergent and the large splits observed suggesting no influence of genetic exchange. DST58, represented by a single L. lainsoni strain was positioned in a split between L. lainsoni and L. naiffi. The extensive reticulation suggests that recombination has occurred relatively frequently in L. braziliensis and L. guyanensis, as proposed by other authors . As expected, L. braziliensis and L. naiffi presented higher within group average genetic distance values than L. guyanensis, even including L. shawi strains in the L. guyanensis group, corroborating previous studies on the genetic diversity of both L. braziliensis and L. naiffi , , , . However, as far as we know, this is the first report of a high level of genetic diversity within L. lainsoni. This was the species giving the highest within group average genetic distance value, although it consists of only two zymodemes. Reports on the occurrence of L. lainsoni and L. naiffi in distinct endemic areas have constantly increased, indicating that both species are adapted to several environments where different sand fly species participate in parasite transmission. Notably, L. lainsoni was the most divergent species, followed by L. naiffi, which might contribute to the widespread dispersion of these two species –.
BURST analysis contributes to the understanding of relatedness among DSTs
Considering that a large number of unique DSTs were detected, the BURST algorithm was employed to evaluate the relationships between 52 homozygous DSTs, which corresponded to 70 strains. Here we applied eBURST to analyze the allele profiles as an efficient method to recognize natural discontinuities that would merit taxonomic status as species or subspecies. The BURST algorithm first identifies mutually exclusive groups of related genotypes in the population (typically an MLST database), and attempts to identify the founding genotype (sequence type, ST) of each group. The algorithm then predicts the descent from the predicted founding genotype to the other genotypes in the group, displaying the output as a radial diagram, centered on the predicted founding genotype. The procedure was developed for use with the data produced by MLST (STs and their allelic profiles).
The clonal complex model has proved to be valuable in analyzing many MLST datasets, making the results more amenable for epidemiological analysis. A clonal complex (CC) comprises genetically related, but not identical, organisms. In this study six CCs were observed (Figure 4); each one represented by strains classified as the same Leishmania species. Strains typed as L. braziliensis were grouped in two clonal complexes. The CCs observed were: CC1, L. lainsoni (comprising 2 of 6 strains; 2/6); CC2, L. naiffi (3/5); CC3, L. shawi (3/3); CC4, L. braziliensis (30/37); CC5, L. braziliensis (2/37); CC6, L. guyanensis (13/17). CC6 included 13 L. guyanensis DSTs from 17 analyzed, indicating DST37 as the predict founder genotype. CC4 included 12 out of 19 L. braziliensis DSTs analyzed and the predicted founder genotype was DST12, the most common DST observed for this species.
Six clonal complexes were found: CC1: L. lainsoni; CC2 L. naiffi; CC3 L. shawi; CC4 L. braziliensis; CC5 L. braziliensis; CC6 L. guyanensis. Predicted founders were identified for CC4 (DST12) and CC6 (DST37; IOC/L565). Only homozygous DSTs were used (Table S1).
Most of the isolates were not grouped in any clonal complex, showing a high prevalence of singletons (n = 15; 4 L. lainsoni, 2 L. naiffi, 5 L. braziliensis, 4 L. guyanensis), differing at two or more loci from all other DSTs (Table S1), typical of populations with a high rate of recombination relative to mutation and for which eBURST does not reliably indicate ancestry . Although eBURST does not contribute to phylogenetic analysis in Leishmania, this is a helpful clustering tool to analyze MLST results from this organism, which minimizes the differences observed between each Leishmania strain. For example, although all L. guyanensis strains were assigned to a distinct DST, almost all of them clustered in the same clonal group (CC5), corroborating the already demonstrated homogeneity of the L. guyanensis population in Brazil  and contrasting with the results with microsatellites showing diversity within the L. guyanensis complex . We cannot infer the dispersion capacity of this group, but temporal stability is certainly a characteristic of this clonal complex.
Another scenario was observed for L. braziliensis. Out of 55 L. braziliensis, we obtained groups of 18, 3, 2 and 2 strains that were assigned to the same DSTs. However, two clonal complexes were observed for this species, one represented by 57.5% of the strains analyzed (CC4), the other one by only two strains (CC5) and 27.5% representing singletons. This result would fit a hypothesis of L. braziliensis having better fitness than L. guyanensis, leading to an improved dispersion. However, more L. guyanensis isolates from other regions not located in Brazil and its ‘sister’ species, L. panamensis, would benefit this discussion.
Although the main purpose of this study was not to investigate the epidemiological findings from MLSA of L. (Viannia), it is important to mention some aspects. The capacity of dispersion and adaptation of L. braziliensis clonal complex CC4 is evident as well as its temporal stability. Although formed by only two L. braziliensis strains, CC5 represents an interesting clonal complex. The two strains were isolated from the same locality, a municipality in Pernambuco state, in northeastern Brazil, where a unique zymodeme is found. Furthermore, it is interesting that a high level of genetic diversity was already demonstrated for the L. braziliensis population in Pernambuco, depicted by MLEE analysis. It seems that in this region two transmission cycles co-exist; involving or not involving the sylvatic environment , . This could influence or favor the maintenance of two clonal complexes in the area.
The putative founder for the clonal cluster CC4 (L. braziliensis) and CC6 (L. guyanensis) were determined. However, it is not possible to infer if they represent the ancestor of each group. In fact, for L. braziliensis CC4 the putative founder is the most prevalent DST, but for L. guyanensis this is not true as for this species each strain was assigned as one DST.
To evaluate the robustness of each CC and the relationship between them, a NJ tree was built using the concatenated sequences of DSTs included by eBURST in clonal complexes, excluding the singletons (Figure 5). Clusters with high bootstrap supports were observed, corresponding to each CC. DST24 and DST30, from CC5, were the only two DSTs inside the L. braziliensis cluster forming a group supported by the bootstrap value. Inside the L. guyanensis group a high supported cluster was observed composed of all CC3 DSTs, corresponding to L. shawi. Low bootstrap values were observed among almost all DSTs from CC4 and CC6, corroborating the fact that they belong to the same complex. Evidence was again found for the divergence of L. lainsoni and L. naiffi clonal complexes.
The phylogenetic tree was constructed using number of differences and the NJ method. Only DSTs comprised in any clonal complexes (Figure 4 and Table S1) were included. Bootstrap values (percentages of 500 replicates) above 50% are indicated at the nodes.
The results obtained here strongly supported the established taxonomy of L. (Viannia), considering the species that have been found in circulation in Brazil. Specifically, our data support monophyly of all but one Brazilian L. (Viannia) species analyzed here and highlight the close relationships between L. braziliensis and L. guyanensis and the recombination events occurring in both species. Some aspects merit special mention, however. The taxonomic validity of L. shawi has been questioned and indeed the markers studied here suggested that L. shawi isolates were closely related to or were part of the L. guyanensis group. The same was observed in the NeighborNet network with concatenated sequences. However, the eBURST analysis indicated that these isolates form a distinct clonal complex from L. guyanensis, although closely related as observed in the NeighborNet network. Recently, hsp70 gene sequence analysis has indicated that L. shawi is a subgroup of L. guyanensis . In addition, L. shawi and L. guyanensis had the lowest average genetic distance of the L. (Viannia) studied here. Our results, thus, indicate that L. guyanensis has different clonal populations, such as those observed for L. braziliensis, with L. shawi being one of them.
It is noteworthy that the MLSA clades derived here are in good agreement with MLEE clusters reported previously. Taken together, our data point to the combined four gene scheme used here as a reasonable approach that provides extensive differentiation and offers evolutionarily accurate clustering. The analysis performed herein should be extended to species other than those studied here and should be used as a starting point to develop an MLST scheme for Leishmania spp. genetic typing. While providing complete genome sequences is not possible as a routine approach, MLST generates evidence for similarities and differences between Leishmania species and/or strains, offering a number of advantages over most typing methods and providing results helpful for taxonomic, population genetics, evolutionary and, in consequence, epidemiological studies .
Such data are likely to revolutionize the systematics of Leishmania, consolidate our view of what constitutes a Leishmania ‘species’, provide evidence concerning the epidemiology of these pathogens and might even be applicable directly to clinical samples.
Definition of Leishmania species and knowledge of the genetic structure of all Leishmania species will provide a useful framework for exploring the evolutionary dynamics and phylogenetic distribution of relevant strain properties. Recognizing the urgent need for a standardized globally acceptable species definition and typing method for Leishmania, we are now sequencing other genes and including more species and strains in our analysis aiming to propose that species within the Leishmania genus could be defined as a group of strains that share a determined level of similarity in the concatenated nucleotide sequences of the genes selected. To achieve this, establishment of a consensus MLST gene set that provides optimum differentiation for Leishmania species and/or strains is required.
Taxonomic and collection data (clinical, geographical, biochemical and molecular Information) for L. (Viannia) strains used for MLSA.
Sequences retrieved from GenBank and those included in the analysis performed in the present study, indicating the assigned sequence type (ST) for each marker.
Multi-alleles sites observed for each fragment-gene alignment and the distribution of each allele in the studied strains.
Strains presenting ambiguous sites (IUPAC symbols) for the targets with the respective site position in length alignment and the most common nucleotide.
We thank the Plataforma de Sequenciamento do Instituto Oswaldo Cruz (PDTIS) for performing all the sequencing. We also would like to thank Gabriel Eduardo Melim Ferreira for his contribution on the figures and for his critical review on the text and for Global Science Editing, UK.
Conceived and designed the experiments: EC MCB MAM ILM. Performed the experiments: MCB. Analyzed the data: MCB EC. Contributed reagents/materials/analysis tools: EC ILM MAM. Wrote the paper: EC MCB MAM ILM.
- 1. Garin YJ, Sulahian A, Pratlong F, Meneceur P, Gangneux JP, et al. (2001) Virulence of Leishmania infantum is expressed as a clonal and dominant phenotype in experimental infections. Infect Immun 69: 7365–7373. doi: 10.1128/iai.69.12.7365-7373.2001
- 2. Arevalo J, Ramirez L, Adaui V, Zimic M, Tulliano G, et al. (2007) Influence of Leishmania (Viannia) species on the response to antimonial treatment in patients with American tegumentary leishmaniasis. J Infect Dis 195: 1846–1851. doi: 10.1086/518041
- 3. Banuls AL, Hide M, Prugnolle F (2007) Leishmania and the leishmaniases: a parasite genetic update and advances in taxonomy, epidemiology and pathogenicity in humans. Adv Parasitol 64: 1–109. doi: 10.1016/s0065-308x(06)64001-3
- 4. Schonian G, Mauricio I, Cupolillo E (2010) Is it time to revise the nomenclature of Leishmania? Trends Parasitol doi: 10.1016/j.pt.2010.06.013
- 5. da Silva LA, de Sousa CdS, da Graça GC, Porrozzi R, Cupolillo E (2010) Sequence analysis and PCR-RFLP profiling of the hsp70 gene as a valuable tool for identifying Leishmania species associated with human leishmaniasis in Brazil. Infect Genet Evol 10: 77–83. doi: 10.1016/j.meegid.2009.11.001
- 6. Montalvo AM, Fraga J, Monzote L, Montano I, De Doncker S, et al. (2010) Heat-shock protein 70 PCR-RFLP: a universal simple tool for Leishmania species discrimination in the New and Old World. Parasitology 137: 1159–1168. doi: 10.1017/s0031182010000089
- 7. Maiden MC, Bygraves JA, Feil E, Morelli G, Russell JE, et al. (1998) Multilocus sequence typing: a portable approach to the identification of clones within populations of pathogenic microorganisms. Proc Natl Acad Sci USA 95: 3140–3145. doi: 10.1073/pnas.95.6.3140
- 8. Debourgogne A, Gueidan C, Hennequin C, Contet-Audonneau N, de Hoog S, et al. (2010) Development of a new MLST scheme for differentiation of Fusarium solani Species Complex (FSSC) isolates. J Microbiol Methods 82: 319–323. doi: 10.1016/j.mimet.2010.07.008
- 9. Odds FC (2010) Molecular phylogenetics and epidemiology of Candida albicans. Future Microbiol 5: 67–79. doi: 10.2217/fmb.09.113
- 10. Odds FC, Bougnoux ME, Shaw DJ, Bain JM, Davidson AD, et al. (2007) Molecular phylogenetics of Candida albicans. Eukaryot Cell 6: 1041–1052. doi: 10.1128/ec.00041-07
- 11. Mauricio IL, Gaunt MW, Stothard JR, Miles MA (2001) Genetic typing and phylogeny of the Leishmania donovani complex by restriction analysis of PCR amplified gp63 intergenic regions. Parasitology 122: 393–403. doi: 10.1017/s0031182001007466
- 12. Mauricio IL, Yeo M, Baghaei M, Doto D, Pratlong F, et al. (2006) Towards multilocus sequence typing of the Leishmania donovani complex: resolving genotypes and haplotypes for five polymorphic metabolic enzymes (ASAT, GPI, NH1, NH2, PGD). Int J Parasitol 36: 757–769. doi: 10.1016/j.ijpara.2006.03.006
- 13. Zemanova E, Jirku M, Mauricio IL, Horak A, Miles MA, et al. (2007) The Leishmania donovani complex: genotypes of five metabolic enzymes (ICD, ME, MPI, G6PDH, and FH), new targets for multilocus sequence typing. Int J Parasitol 37: 149–160. doi: 10.1016/j.ijpara.2006.08.008
- 14. Tsukayama P, Lucas C, Bacon DJ (2009) Typing of four genetic loci discriminates among closely related species of New World Leishmania. Int J Parasitol 39: 355–362. doi: 10.1016/j.ijpara.2008.08.004
- 15. Jacobsen MD, Boekhout T, Odds FC (2008) Multilocus sequence typing confirms synonymy but highlights differences between Candida albicans and Candida stellatoidea. FEMS Yeast Res 8: 764–770. doi: 10.1111/j.1567-1364.2008.00392.x
- 16. Jacobsen MD, Rattray AM, Gow NA, Odds FC, Shaw DJ (2008) Mitochondrial haplotypes and recombination in Candida albicans. Med Mycol 46: 647–654. doi: 10.1080/13693780801986631
- 17. Lauthier JJ, Tomasini N, Barnabe C, Rumi MM, D'Amato AM, et al. (2012) Candidate targets for Multilocus Sequence Typing of Trypanosoma cruzi: validation using parasite stocks from the Chaco Region and a set of reference strains. Infect Genet Evol 12: 350–358. doi: 10.1016/j.meegid.2011.12.008
- 18. Yeo M, Mauricio IL, Messenger LA, Lewis MD, Llewellyn MS, et al. (2011) Multilocus sequence typing (MLST) for lineage assignment and high resolution diversity studies in Trypanosoma cruzi. PLoS Negl Trop Dis 5: e1049. doi: 10.1371/journal.pntd.0001049
- 19. Cupolillo E, Grimaldi G, Momen H (1994) A General Classification of New-World Leishmania Using Numerical Zymotaxonomy. Am J Trop Med Hyg 50: 296–311.
- 20. Gordon D (2002) Viewing and Editing Assembled Sequences Using Consed. Current Protocols in Bioinformatics: John Wiley & Sons, Inc.
- 21. Tamura K, Dudley J, Nei M, Kumar S (2007) MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) Software Version 4.0. Molecular Biology and Evolution 24: 1596–1599. doi: 10.1093/molbev/msm092
- 22. Librado P, Rozas J (2009) DnaSP v5: a software for comprehensive analysis of DNA polymorphism data. Bioinformatics 25: 1451–1452. doi: 10.1093/bioinformatics/btp187
- 23. Bandelt HJ, Dress AW (1992) Split decomposition: a new and useful approach to phylogenetic analysis of distance data. Mol Phylogenet Evol 1: 242–252. doi: 10.1016/1055-7903(92)90021-8
- 24. Huson DH, Bryant D (2006) Application of phylogenetic networks in evolutionary studies. Mol Biol Evol 23: 254–267. doi: 10.1093/molbev/msj030
- 25. Hunter PR, Gaston MA (1988) Numerical index of the discriminatory ability of typing systems: an application of Simpson's index of diversity. J Clin Microbiol 26: 2465–2466.
- 26. Rogers MB, Hilley JD, Dickens NJ, Wilkes J, Bates PA, et al. (2011) Chromosome and gene copy number variation allow major structural change between species and strains of Leishmania. Genome Res 21: 2129–2142. doi: 10.1101/gr.122945.111
- 27. Mauricio IL, Gaunt MW, Stothard JR, Miles MA (2007) Glycoprotein 63 (gp63) genes show gene conversion and reveal the evolution of Old World Leishmania. Int J Parasitol 37: 565–576. doi: 10.1016/j.ijpara.2006.11.020
- 28. Cuervo P, Domont GB, De Jesus JB (2010) Proteomics of trypanosomatids of human medical importance. J Proteomics 73: 845–867. doi: 10.1016/j.jprot.2009.12.012
- 29. Oddone R, Schweynoch C, Schonian G, de Sousa CS, Cupolillo E, et al. (2009) Development of a multilocus microsatellite typing approach for discriminating strains of Leishmania (Viannia) species. J Clin Microbiol 47: 2818–2825. doi: 10.1128/jcm.00645-09
- 30. Zhang WW, Miranda-Verastegui C, Arevalo J, Ndao M, Ward B, et al. (2006) Development of a genetic assay to distinguish between Leishmania (Viannia) species on the basis of isoenzyme differences. Clin Infect Dis 42: 801–809. doi: 10.1086/500326
- 31. Banuls AL, Dujardin JC, Guerrini F, De Doncker S, Jacquet D, et al. (2000) Is Leishmania (Viannia) peruviana a distinct species? A MLEE/RAPD evolutionary genetics answer. J Eukaryot Microbiol 47: 197–207. doi: 10.1111/j.1550-7408.2000.tb00039.x
- 32. Fraga J, Montalvo AM, De Doncker S, Dujardin JC, Van der Auwera G (2010) Phylogeny of Leishmania species based on the heat-shock protein 70 gene. Infect Genet Evol 10: 238–245. doi: 10.1016/j.meegid.2009.11.007
- 33. Banuls AL, Jonquieres R, Guerrini F, Le Pont F, Barrera C, et al. (1999) Genetic analysis of leishmania parasites in Ecuador: are Leishmania (Viannia) panamensis and Leishmania (V.) guyanensis distinct taxa? Am J Trop Med Hyg 61: 838–845.
- 34. Rougeron V, De Meeus T, Hide M, Waleckx E, Dereure J, et al. (2010) A battery of 12 microsatellite markers for genetic analysis of the Leishmania (Viannia) guyanensis complex. Parasitology 137: 1879–1884. doi: 10.1017/s0031182010000776
- 35. Brito ME, Andrade MS, Mendonca MG, Silva CJ, Almeida EL, et al. (2009) Species diversity of Leishmania (Viannia) parasites circulating in an endemic area for cutaneous leishmaniasis located in the Atlantic rainforest region of northeastern Brazil. Trop Med Int Health 14: 1278–1286. doi: 10.1111/j.1365-3156.2009.02361.x
- 36. Rougeron V, De Meeus T, Hide M, Waleckx E, Bermudez H, et al. (2009) Extreme inbreeding in Leishmania braziliensis. Proc Natl Acad Sci USA 106: 10224–10229. doi: 10.1073/pnas.0904420106
- 37. Cupolillo E, Brahim LR, Toaldo CB, de Oliveira-Neto MP, de Brito ME, et al. (2003) Genetic polymorphism and molecular epidemiology of Leishmania (Viannia) braziliensis from different hosts and geographic areas in Brazil. J Clin Microbiol 41: 3126–3132. doi: 10.1128/jcm.41.7.3126-3132.2003
- 38. Cupolillo E, Aguiar Alves F, Brahim LR, Naiff MF, Pereira LO, et al. (2001) Recent advances in the taxonomy of the New World leishmanial parasites. Med Microbiol Immunol 190: 57–60.
- 39. Kato H, Gomez EA, Yamamoto Y, Calvopina M, Guevara AG, et al. (2008) Natural infection of Lutzomyia tortura with Leishmania (Viannia) naiffi in an Amazonian area of Ecuador. Am J Trop Med Hyg 79: 438–440.
- 40. van der Meide WF, Jensema AJ, Akrum RA, Sabajo LO, Lai AFRF, et al. (2008) Epidemiology of cutaneous leishmaniasis in Suriname: a study performed in 2006. Am J Trop Med Hyg 79: 192–197.
- 41. Simon S, Veron V, Carme B (2010) Leishmania spp. identification by polymerase chain reaction-restriction fragment length polymorphism analysis and its applications in French Guiana. Diagn Microbiol Infect Dis 66: 175–180. doi: 10.1016/j.diagmicrobio.2009.08.013
- 42. Turner KM, Feil EJ (2007) The secret life of the multilocus sequence type. Int J Antimicrob Agents 29: 129–135. doi: 10.1016/j.ijantimicag.2006.11.002
- 43. Brandao-Filho SP, Brito ME, Carvalho FG, Ishikawa EA, Cupolillo E, et al. (2003) Wild and synanthropic hosts of Leishmania (Viannia) braziliensis in the endemic cutaneous leishmaniasis locality of Amaraji, Pernambuco State, Brazil. Trans R Soc Trop Med Hyg 97: 291–296. doi: 10.1016/s0035-9203(03)90146-5