The genotypic structure of parasite populations is an important determinant of ecological and evolutionary dynamics of host-parasite interactions with consequences for pest management and disease control. Genotypic structure is especially interesting where multiple hosts co-exist and share parasites. We here analyze the natural genotypic distribution of Crithidia bombi, a trypanosomatid parasite of bumblebees (Bombus spp.), in two ecologically different habitats over a time period of three years. Using an algorithm to reconstruct genotypes in cases of multiple infections, and combining these with directly identified genotypes from single infections, we find a striking diversity of infection for both data sets, with almost all multi-locus genotypes being unique, and infer that around half of all infections result from multiple strains. Our analyses further suggest a mixture of clonality and sexuality in natural populations of this parasite species. Finally, we ask whether parasite genotypes are associated with host species (the phylogenetic hypothesis) or whether ecological factors (niche overlap in flower choice) shape the distribution of parasite genotypes (the ecological hypothesis). Redundancy analysis demonstrates that in the region with relatively high parasite prevalence, both host species identity and niche overlap are equally important factors shaping the distribution of parasite strains, whereas in the region with lower parasite prevalence, niche overlap more strongly contributes to the distribution observed. Overall, our study underlines the importance of ecological factors in shaping the natural dynamics of host-parasite systems.
Citation: Salathé RM, Schmid-Hempel P (2011) The Genotypic Structure of a Multi-Host Bumblebee Parasite Suggests a Role for Ecological Niche Overlap. PLoS ONE 6(8): e22054. doi:10.1371/journal.pone.0022054
Editor: Dirk Steinke, Biodiversity Institute of Ontario - University of Guelph, Canada
Received: February 16, 2011; Accepted: June 16, 2011; Published: August 10, 2011
Copyright: © 2011 Salathé, Schmid-Hempel. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Funding: The study was supported by Swiss NSF (grant nr. 3100A0-116057) and Australian ARC (grant nr. DP0209447). It was also supported by the Genetic Diversity Centre of ETH Zurich (GDC) and CCES. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
J.B.S. Haldane suggested in 1949 that parasites might be responsible for the maintenance of genotypic variation in natural populations of hosts. In fact, the population genetic structure of micro-parasites is an important element of the ecological dynamics of a host-parasite system and matters for practical problems, too. Examples include the conditions under which an epidemic can be controlled, the efficacy of interventions undertaken in agriculture, animal husbandry and human public health, and the general observation that host-parasite interactions are based on genotypic variation in both parasite infectivity and host susceptibility. There is also a widespread consensus that genotype-genotype interactions have a major influence on host-parasite co-evolution.
So far, a number of ecological factors have been identified that affect parasite population genetic structure. For example, specific selection by the immune system can structure both parasite populations and host heterogeneity, and host vaccination can potentially select for certain parasite types rather than others. Co-infections of hosts by parasites also affect the parasite genetic structure in the host and, similarly, structure depends on whether the parasites reproduce clonally or sexually, or show at least episodic genetic exchange among “strains”. This issue has been the focus of many debates over the past decades. The range of these considerations is certainly relevant for the trypanosomatids, a representative of which is studied here. The group contains the agents of some major human diseases (e.g. sleeping sickness, Chagas disease, leishmaniasis) as well as important agricultural diseases, such as bovine Nagana fever and several plant diseases (caused by Phytomonas). Furthermore, in this group, genetic exchange has been demonstrated for Trypanosoma brucei, T. cruzi, and Leishmania major. Recently, we have been able to find experimental evidence for genetic exchange in C. bombi.
We here study the genotypic distribution of a contagious protozoan micro-parasite, the trypanosomatid Crithidia bombi, in field populations of its hosts, bumblebees (Bombus spp.), in two distinct regions of Switzerland. Previous studies have repeatedly demonstrated the tight genotypic interactions between C. bombi and its model host B. terrestris. For instance, experimental transmission between colonies is affected by the genotype of the parasite as well as the genetic identities of both the targeted and the donor colony, and quantitative genetic studies have shown that host genotype is a major determinant of susceptibility to C. bombi. As far as we know to date, C. bombi is restricted to hosts of the genus Bombus. Within Bombus spp., C. bombi can be transmitted to conspecifics inside the colony but also to any other potential congeneric host via visits to the same flowers. Every locally present host species can potentially be infected, although the infection prevalence varies among species at a given site.
Based on these data, we seek to shed light on the following questions: What is the basic genetic structure of the parasite populations, i.e. are populations locally structured and distinct from each other? How common are multiple infections in the field, and how genetically diverse are infections in general? Can we infer clonality or sexuality from analyzing the genotypic structure of these field infections? And finally, do parasite types associate with host species (the phylogenetic hypothesis); that is, is there segregation of parasite genotypes across different host species in the field, or are the parasite genotypes randomly distributed over all hosts that can harbor infections? Alternatively, can ecological factors like resource partitioning of host species, i.e. the dietary overlap at any given floral resource, affect the distribution of the parasite genotypes (the ecological hypothesis)? If the latter were true, we would expect different hosts to share parasites not only because of their genetic background but also because they visit the same flowers; the overlap in their infections would then follow the overlap of their ecological niches. The reverse would hold if host species were the crucial factor. Note that flower preferences by foraging bees can be affected by a number of factors (e.g. tongue length). Yet here, we observe the realized flower preferences regardless of how they come about in order to assess possible pathways of transmission as a factor explaining the structure of the parasite populations. We collected data over a period of three years and in two regions representing typical bumblebee habitats of Central Europe. In particular, one region represents a habitat of relatively high infection prevalence of C. bombi, and the other represents a habitat with low parasite prevalence.
Bumblebees were collected in two regions in Switzerland over a period of three years (2003, 2004, and 2005): (1) region “Basler Jura”, a hilly area (elevation 441–654 m) in Northwestern Switzerland, with the sites Röschenz (47°25′18″N, 7°27′22″E, 537 m), Soyhières (47°23′57″N, 7°22′13″E, 441 m), and Movelier (47°24′06″N, 7°19′15″E, 654 m); (2) region “Lower Engadin”, i.e. the Lower Engadin valley of the Southeastern Swiss Alps at elevations around the timberline (1'800–1'900 m), with the sites Buffalora (46°38′52″N, 10°16′00″E, 1'926 m), La Munt (46°38′20″N, 10°19′42″E, 2'205 m), Lavin (46°46′28″N, 10°6′36″E, 1'736 m), and Stabelchod (46°38′44″N, 10°14′26″E, 1'945 m), and one stretch just above the timberline, site La Schera (46°38′23″N to 46°38′42″N, 10°12′41″E to 10°14′48″E, at elevations of 2'300–2'584 m).
To cover an adequate range of infections throughout the season, we sampled several times per season where possible. In the Basler Jura, the longer flowering season allowed sampling dates to be spread further apart, i.e. late May, mid July and mid August. On the mountain meadows we sampled in mid July and mid August, as the flowering season at those altitudes is shorter by at least one month. For the sampling, arbitrary transects were defined in a site and all bees visiting a flower within a certain distance (2 m) to the left and right were collected. This approach was chosen to reduce sampling bias by randomly including all local landscape components instead of focusing on specific ones, e.g. a hedge or a hiking trail, so that the collected C. bombi strains would adequately represent the circulating parasites. After collection, all bees were freeze-killed and later screened for infections in the laboratory. To investigate the overlap of ecological niches for the different host species, the flower the collected bee had just been visiting was identified and registered; plant species were identified following the key of Hess et al.
C. bombi DNA extraction and microsatellite typing
We first dissected the bumblebees to check for parasites more generally under the stereo and light microscopes. Then, the bee guts (the site of infection of C. bombi) were extracted and individually placed into 1.5 ml Eppendorf tubes, together with 100 µl of Ringer solution (Merck); an equal volume of 10% Chelex (Bio-Rad) was added. The samples were heated to 95°C for 15 min, cooled on ice, vortexed and centrifuged for two minutes at 15'000 g. PCR was performed for five C. bombi specific microsatellite loci (labeled 4.G9, 4, 16, 2.F10, 1.B6) according to Schmid-Hempel and Reber Funk, with a standard PCR protocol: denaturation at 95°C for 5 min; 37 cycles of 1 min at 95°C, 30 sec at the respective annealing temperature, 30 sec extension at 72°C; final extension at 72°C for 10 min. All products were visualized on Spreadex gels EL400 (Elchrom Scientific) and, for increased accuracy, later loaded onto the 3130 ABI Prism Fragment Analyzer. Allele lengths were calibrated to the same standards across platforms.
Note that a new, co-infecting species of Crithidia, named C. expoeki, was recently described and is known to occur in the same regions . However, the microsatellite primers used here do not amplify this new species. Therefore, we can safely say that our analysis only refers to populations of C. bombi.
Algorithm to find C. bombi strains
An allele combination at a single locus defines a single-locus genotype. The combination of alleles over all typed loci is the multi-locus genotype, which is here heuristically considered a “strain”. A host can be infected with a number of genetically different strains at the same time. In this case it is not straightforward to separate these infections, i.e. to associate alleles with strains, and so to assess the number of concurrent infections. The reconstruction of constituent genotypes from multiple infections is a non-trivial problem, which has been considered not only in the field of biology, but also, for example, in applied forensic medicine. However, in situations like those analyzed here, where multiple host species and multiple infections by different strains of a parasitic species are involved, one cannot a priori rely on assumptions of Hardy-Weinberg frequencies or on knowledge of the genotypes of some suspects and reference persons, as is done in forensic applications. Hence, for the time being, the problem of how to correctly reconstruct the constituent genotypes from field infections is not solved satisfactorily. To nevertheless use the available data, and given that almost half of the infected hosts are multiply infected, we have used a conservative approach to reconstruct genotypes and calculate allele frequencies, and will report the results from single infections and all infections separately where appropriate. The algorithm used here is conservative because it gives priority to genotypes that are known from single infections, and because it assumes that infections with a maximum of two alleles at any one locus represent single infections. Schmid-Hempel and Reber Funk employed this algorithm for the analysis of infections in spring queens (i.e. young queens emerging from hibernation and starting their colony); it is based on the diploid genotype of C. bombi and can be described as follows:
First, for each locus in a given bee, a minimum number of independent infections (nmin) is calculated from the number of alleles observed at that locus. The minimum number of strains infecting this bee is then the largest value of nmin over all loci. In a next step, the most probable circulating strains in the population are derived. These strains are identified from the singly infected bees in the sample (having no more than 2 alleles at any one locus). The respective multi-locus genotype is reconstructed from the unique combinations of alleles at all loci. These candidate single strains are then ranked according to their observed frequencies in the population, with the most abundant strain being the most likely candidate to be considered part of an observed multiple infection, and so forth for the less abundant strains, until the multiple infection is fully resolved. With this procedure, multiple infections can be partitioned into conservative combinations of strains that are most likely to make up the mixed infection in a given bee. This requires calculating all possible combinations of multi-locus types, and ranking them according to the ranking of their constituent single-locus types (for details, see the original description of the algorithm). We here used this algorithm to retrieve the strains from multiple infections.
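The first step of the procedure, the lower bound on the number of co-infecting strains, can be sketched as follows. This is a minimal illustration, not the published implementation: it assumes that a diploid strain carries at most two alleles per locus, so that k distinct alleles at a locus require at least ceil(k/2) strains. The function names and the allele sizes in the example are hypothetical.

```python
from math import ceil

PLOIDY = 2  # C. bombi is treated as diploid in the text


def min_strains_per_locus(alleles):
    """Lower bound on the number of diploid strains needed to carry
    all distinct alleles observed at one locus (assumed: ceil(k / 2))."""
    return ceil(len(set(alleles)) / PLOIDY)


def min_strains(infection):
    """infection: dict mapping locus name -> alleles observed in one bee.
    The bound for the bee is the largest per-locus bound."""
    return max(min_strains_per_locus(a) for a in infection.values())


def is_single_infection(infection):
    """Conservative rule from the text: at most two alleles at every locus."""
    return min_strains(infection) <= 1


# Hypothetical allele sizes (bp), for illustration only
bee = {"4.G9": [120, 124, 130], "4": [98, 102], "16": [110], "2.F10": [140, 144]}
print(min_strains(bee))          # 2: three alleles at 4.G9 need two strains
print(is_single_infection(bee))  # False
```

The later steps (ranking candidate strains by their frequency among single infections and resolving each mixed infection) are not shown here, since the text does not specify them in enough detail to reproduce.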
It is not yet straightforward to partition a mixed infection into all its constituent, independent infection genotypes with certainty unless the infection is cloned and all clones are typed. Previous studies have pointed out the extreme difficulties of investigating complex mixtures of DNA probes, and we emphasize that the algorithm used here provides only an approximate tool for the estimation of the constituent genotypes of mixed infections, and is purposely kept conservative to avoid overestimation. Hence, the true number of strains making up mixed infections is likely to be higher; for the same reasons, the true prevalence of multiple infections and the number of unique genotypes are also likely to be higher.
Population genetic analysis and analysis of genetic distance
We first analyzed our data for the set of singly infected bees alone. In a second step, all infections were included, i.e. including the reconstructed strains retrieved from multiple infections as identified by the above algorithm. The overall population genetic structure was calculated using the software Genepop. Pairwise FST-values and their significance levels were calculated using Arlequin 3.1. Each directly identified or reconstructed multi-locus genotype was entered as one independent data point; populations with sample sizes smaller than 5 were not included in any of these analyses, and those with sample sizes lower than 10 are marked in italics. Locus 1.B6 was excluded from the final analysis due to ambiguities in genotyping on the ABI Prism Fragment Analyzer (scattered ranges instead of clear peaks). We also tested for linkage disequilibrium according to the method of Black and Krafsur implemented in the software GENETIX.
In a further step, C. bombi populations from each site where bees had been sampled in consecutive years were pooled into one data set for that site. The resulting three parasite populations in the region Basler Jura (Röschenz, Movelier, Soyhières) and four parasite populations in the Lower Engadin (La Munt, Buffalora, Lavin, Stabelchod) were analyzed with respect to their phylogeography using allele frequencies. Relationships were based on Cavalli-Sforza's and Edwards' chord distance DC. According to Takezaki and Nei, this distance is among the more reliable measures for recovering the correct tree topology from microsatellite allele frequencies. Allele frequencies were bootstrapped 1'000 times, using the subprogram SEQBOOT within the program package PHYLIP 3.1; the consensus tree was assembled in NEIGHBOR and CONSENSE, and finally visualized in TREEVIEW.
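For orientation, the chord distance between two populations can be computed from their allele frequencies as sketched below. The formula is the commonly cited multi-locus form, DC = (2 / (π r)) Σ_j sqrt(2 (1 − Σ_i sqrt(x_ij y_ij))), with r loci and allele frequencies x_ij and y_ij; the function name, the data layout and the example frequencies are assumptions made for illustration, and the actual analysis was run in PHYLIP.

```python
from math import pi, sqrt


def chord_distance(pop_x, pop_y):
    """Cavalli-Sforza and Edwards chord distance between two populations.

    pop_x, pop_y: dict locus -> dict allele -> frequency, where the
    frequencies at each locus sum to 1. Only loci typed in both
    populations are used.
    """
    loci = sorted(set(pop_x) & set(pop_y))
    if not loci:
        raise ValueError("no shared loci")
    total = 0.0
    for locus in loci:
        fx, fy = pop_x[locus], pop_y[locus]
        alleles = set(fx) | set(fy)
        overlap = sum(sqrt(fx.get(a, 0.0) * fy.get(a, 0.0)) for a in alleles)
        # max() guards against tiny negative values from rounding
        total += sqrt(2.0 * max(0.0, 1.0 - overlap))
    return 2.0 / (pi * len(loci)) * total


# Hypothetical allele frequencies at one locus, for illustration only
site_a = {"4.G9": {120: 0.5, 124: 0.5}}
site_b = {"4.G9": {130: 0.5, 134: 0.5}}
print(chord_distance(site_a, site_a))  # 0.0 for identical frequencies
print(chord_distance(site_a, site_b))  # 2*sqrt(2)/pi when no alleles are shared
```

Note that the distance depends only on allele frequencies and not on allele sizes, so it makes no use of the repeat-length differences between microsatellite alleles.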
Finally, G∶N-ratios (number of different multi-locus genotypes, G, over sample size, N) were calculated as a measure of genetic diversity and clonality of C. bombi according to Ivey and Richards. Calculations were done for all successfully typed multi-locus (4 loci) genotypes. Note that diversity at these four loci was high enough for the resolution to be appropriate for our analyses.
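The G∶N-ratio itself is a simple count, as the following sketch shows. The representation of a multi-locus genotype as a tuple of per-locus allele pairs, the helper `normalise`, and the example data are illustrative assumptions.

```python
def normalise(genotype):
    """Sort the two alleles within each locus so that (120, 124) and
    (124, 120) count as the same single-locus genotype."""
    return tuple(tuple(sorted(locus)) for locus in genotype)


def gn_ratio(genotypes):
    """G:N = number of distinct multi-locus genotypes / number of typed infections."""
    n = len(genotypes)
    if n == 0:
        raise ValueError("no genotypes supplied")
    return len({normalise(g) for g in genotypes}) / n


# Hypothetical two-locus genotypes from four infections, for illustration only
sample = [
    ((120, 124), (98, 102)),
    ((124, 120), (98, 102)),  # same genotype as the first, alleles swapped
    ((120, 130), (98, 98)),
    ((124, 124), (102, 102)),
]
print(gn_ratio(sample))  # 0.75: three distinct genotypes among four infections
```

A value of 1 means that every typed infection carries a different genotype, whereas values near 0 indicate that a few clones dominate the sample.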
Canonical redundancy analysis (RDA)
A redundancy analysis (RDA) was performed in the statistics package R version 2.10.1 (from the R Foundation for Statistical Computing) to look for correspondence between C. bombi alleles, i.e. parasite genotypes, and overlap in flower visits among different host species, and for the effect of host species itself. RDA is a constrained principal component analysis and is related to canonical correspondence analysis (CCA). It can be used to compute a multifactorial and multivariate analysis of variance and is the direct extension of multiple regression to the modeling of multivariate response data. RDA examines how one set of variables (Y, for example, genetic variables such as the genotypes of the infection strains) may be explained by another set of variables (X, e.g. environmental variables such as overlap in flower visits and host species). In other words, the outcome of the analysis expresses how much of the variance in one set of variables can be explained by another set of variables. The input for the analysis is a set of transformed quantitative variables (Y) and any other variable (X), in our case a set of binary matrices for each set of variables, and the output consists of principal components of the residuals of X.
Results can be presented in a bi-plot, where the two dimensions represent the Y-variables, while vectors represent the X-variables. The longer a vector in the plot, the more variance is explained by the respective variable, X. Accordingly, variables that contribute little to explaining the variance of the dependent data set are located in the center of the plot. The matrices we used are defined by (i) single alleles typed from the C. bombi infections, (ii) the independent single-locus genotypes (multi-locus genotypes could not be used, since each genotype so defined was almost unique in the sample), (iii) the bumblebee host species, and (iv) the flowers that these hosts had visited. Direct redundancy analysis was performed for each region as a whole, and for the single sites Movelier, Röschenz and Soyhières from the Basler Jura with the data set of 2004; these data sets produced enough residual components for complete analysis, which was not the case for all other data sets due to limited sample sizes for infected individuals per year and site. Since we have more than two sets of variables, a partial redundancy analysis had to be performed in addition to the direct redundancy analysis in order to control for the effect of a third set (Z) and to isolate the effect of data set X alone (for example, to account only for the effect of flower visits while filtering out any effect of host identity). Finally, we used the ANOVA implemented for the RDA-analysis in R to test the output of the analysis for significance.
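In its textbook form, the direct and partial redundancy analyses described above can be summarized as follows. This is a sketch in standard notation rather than a description of the exact R calls used; Y, X and Z are assumed to be column-centered, and the projection matrices P_X and P_Z, the sample size n and the statistic R² are symbols introduced here for illustration.

```latex
\begin{align}
\hat{Y} &= X (X^{\top} X)^{-1} X^{\top} Y = P_X Y, \\
R^2_{Y \mid X} &= \frac{\operatorname{tr}\!\left(\hat{Y}^{\top} \hat{Y}\right)}{\operatorname{tr}\!\left(Y^{\top} Y\right)}, \\
P_Z &= Z (Z^{\top} Z)^{-1} Z^{\top}, \qquad
Y_{\mathrm{res}} = (I - P_Z)\, Y, \qquad
X_{\mathrm{res}} = (I - P_Z)\, X .
\end{align}
```

The constrained axes are the principal components of the fitted values, i.e. the eigenvectors of \hat{Y}^{\top}\hat{Y}/(n-1), while the unconstrained (residual) axes are the principal components of Y - \hat{Y}. A partial RDA that controls for Z is then an ordinary RDA of Y_res on X_res. With binary indicator matrices, X^{\top}X may be singular, in which case a generalized inverse (or dropping one indicator column per factor) is needed.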
A total of n = 2'267 bees were sampled and scrutinized for all regions and years. A synopsis of data is given in Table 1 for each sampling site and year with the respective number of specimens sampled, the number of bumblebee species identified, the percentage of species infected at the respective site, the relative frequency of infections overall and of multiple infections, the number of typed alleles for the four loci considered in the analysis, and the number of rare alleles found in the respective subpopulations (defined by a frequency ≤0.05). The observations were very variable and distinct for each site, but in general infections in the lower, hilly region of the Basler Jura were clearly more common and genetically more diverse than in the alpine region of the Lower Engadin.
Among all bees, a total of 310 individuals were infected by C. bombi in the Basler Jura (prevalence 30.60%, n = 1'013 bees sampled in total), and a total of 82 infected bees were found in the samples from the Lower Engadin (prevalence 6.54%; n = 1'254 bees) for the entire period from 2003 to 2005. This data set includes all Bombus species found in the sampled communities. Typing the four chosen loci for C. bombi resulted in a total of 39 distinct alleles, 149 single-locus genotypes among all infections, and 213 distinct strains (i.e. distinct multi-locus genotypes). Of the total of 392 infections, at least 173 (i.e. a fraction of 44%) were infections by multiple strains of C. bombi as revealed by the allelic pattern of the infection (i.e. the presence of more than two alleles for at least one locus). Note that the error rate in our typing of alleles due to PCR errors is small enough to be neglected: common Taq DNA polymerases are generally expected to have a base pair (bp) substitution error rate on the order of 8×10^−6 point mutations per bp per PCR cycle, which cannot account for the diversity of alleles found here.
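The percentages quoted above follow directly from the reported counts, as this short check illustrates. Only the counts are taken from the text; the function name is a hypothetical helper.

```python
def percentage(part, whole):
    """Share of `part` in `whole`, in percent."""
    return 100.0 * part / whole


jura = percentage(310, 1013)          # infected / sampled, Basler Jura
engadin = percentage(82, 1254)        # infected / sampled, Lower Engadin
multiple = percentage(173, 310 + 82)  # multiple infections / all infections

print(f"{jura:.2f}% {engadin:.2f}% {multiple:.1f}%")  # 30.60% 6.54% 44.1%
```

Because an infection counts as multiple only when more than two alleles appear at some locus, the 44% is a lower bound, as the text states.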
Because of possible uncertainty involved in identifying the genotypes in multiple infections, we here present our analyses separately for single and for the pooled infections (single and reconstructed multiple infections). Our algorithm to reconstruct genotypes from multiple infections is conservative and will thus likely underestimate measures of FST. Due to the limitations set by any marker-based system, only a finite number of different genotypes can be detected and additional genotypes will go unnoticed. We assessed the effect of such hidden diversity of genotypes by re-running the analyses including the first occurrence of the unique genotypes only. The results were virtually the same as when all cases were included (i.e. including the duplicate genotypes); this reflects the fact that almost every new infection contained a new genotype anyway (cf. Fig. 1).
Each dot is a different population. The typed infections on the x-axis (sample size N) represent the number of infections that were found and genotyped in the respective population.
(a) Population genetics of single infections.
Population structure. For all analyses we calculated F-values for the overall population of infections. Over all sites and years, the F-values indicated a slightly structured overall population with a likely global excess of heterozygosity across sites and years (FST = 0.0285, p<0.001; FIS = −0.0301, p = 0.064). The calculated number of migrants for the single infections is 16.77 (after correction for sample size), indicating appreciable exchange between sites and regions. Considering the pairwise differentiation among the C. bombi populations at different sites for each year, we found widely varying FST-values, including significant separation of populations from one another, with pairwise FST-values for all loci ranging from −0.146 to 0.1374 (see Table S1 in the Supporting Information). Together these results indicate that a substantial part of the variation lies within the populations but that there is also variation between years of sampling at the same site. When testing for Hardy-Weinberg equilibrium (using the Markov chain method), two of the 12 populations that were successfully analyzed deviated significantly from equilibrium (Table 2). Furthermore, there was no significant excess of heterozygotes over all populations (p = 0.146).
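To make the sign convention of FIS concrete, the following sketch computes a naive single-locus estimate, FIS = 1 − Ho/He, from a list of diploid genotypes. It is an illustration only: it applies no small-sample correction and is not the estimator implemented in Genepop; the function name and example genotypes are hypothetical.

```python
from collections import Counter


def f_is(genotypes):
    """Naive inbreeding coefficient at one locus.

    genotypes: list of (allele_a, allele_b) tuples.
    Returns 1 - Ho/He, where Ho is the observed fraction of heterozygotes
    and He = 1 - sum(p_i^2) is the fraction expected under Hardy-Weinberg.
    """
    n = len(genotypes)
    counts = Counter(allele for g in genotypes for allele in g)
    h_exp = 1.0 - sum((c / (2 * n)) ** 2 for c in counts.values())
    if h_exp == 0.0:
        raise ValueError("locus is monomorphic; FIS is undefined")
    h_obs = sum(1 for a, b in genotypes if a != b) / n
    return 1.0 - h_obs / h_exp


# Hypothetical genotypes, for illustration only
print(f_is([(1, 2)] * 4))                        # -1.0: heterozygote excess
print(f_is([(1, 1), (1, 2), (1, 2), (2, 2)]))    #  0.0: Hardy-Weinberg proportions
print(f_is([(1, 1), (1, 1), (2, 2), (2, 2)]))    #  1.0: heterozygote deficiency
```

Negative values, such as those reported here, therefore indicate more heterozygotes than expected under Hardy-Weinberg proportions.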
Seasonal effects. To assess seasonal effects, we divided our data sets into seasonal subpopulations whenever possible, and performed the same Hardy-Weinberg tests as mentioned above. Not all data could be used for this part, because we excluded subpopulations with sample sizes of N<5. There was no substantial difference from the pooled data set (including all seasons) except that global heterozygote excess became slightly less supported (p = 0.175). Of the 15 successfully analyzed populations, only two (same as above) significantly deviated from Hardy-Weinberg equilibrium (at p<0.05). Seasonal subpopulations deviated from each other at various levels of significance.
Linkage. Among the total of 18 populations and 6 pairwise locus combinations, and using the complete data set for single infections only, we found 10 significant values for linkage (at p<0.05, Fisher's method; 6.5% of cases). Across all populations, linkage was significant for the locus pairs 4.G9 and 4 (p<0.001), and for loci 4 and 16 (p<0.01).
(b) Population genetics of all infections.
Here, we make use of the reconstructed genotypes from multiple infections. As mentioned in the Methods section, the algorithm we used is conservative, i.e. it underestimates the number of different genotypes in the population. As a result, the measures of population structure (FST) are likely to be underestimated, too. This needs to be appreciated in addition to an already inherent underestimation of differentiation by F-statistics found for data sets with high genotypic diversity.
Population structure. Over all sites and years, and when including all infections, the F-values indicated a slightly structured population with global excess of heterozygosity across sites and years (FST = 0.0266, p<0.01; FIS = −0.2050, p<0.01). The calculated number of migrants was 28.10, after correction for sample size as above. Considering the pairwise differentiation among populations at different sites for each year, we found widely varying FST-values, ranging from −0.746 to 0.3639 (see Table S2 in the Supporting Information). When considering all infections, 9 out of the 14 analyzed populations deviated from Hardy-Weinberg equilibrium (p<0.05) (Table 2), and there was strong heterozygote excess (p<0.001), in line with the negative FIS.
Seasonal patterns. We observed clear seasonal changes when considering all infections. For example, whereas the early subpopulation at Buffalora in 2004 significantly deviated from Hardy-Weinberg equilibrium (p<0.01), the late subpopulation was in perfect equilibrium (p close to one). For the subpopulations of La Munt in 2004 the situation was the opposite with a population starting out close to equilibrium in the early season (p = 0.98) and veering away from it in the later season (p<0.01). At Roeschenz in 2004 the subpopulation started out far from equilibrium (p = 0.01) and remained there throughout the season (p<0.001). In 2005 a different pattern could be observed at the very same site, with the subpopulations starting out far from equilibrium (p = 0.04), but slowly approaching it towards mid- (p = 0.10) and late season (p = 0.27). In the same year but at a different site (Soyhieres 2005), the pattern was again opposite, with the subpopulation starting out close to Hardy-Weinberg equilibrium early in the season (p = 0.91) and strongly deviating later in the year (p<0.01). Clearly, the genotypic infection patterns of these C. bombi field populations reveal a good deal of dynamic turnover varying among sites and from year to year. The study of multiple infections is thus especially helpful when elucidating seasonal patterns.
Linkage. For all infections, 81 possible tests produced 13 significant linkages (p<0.05, Fisher's method; 16.0% of cases). Across all populations, linkage was significant for the locus pairs 4.G9 and 4 (p<0.01), 4 and 16 (p<0.001), and 4.G9 and 2.F10 (p<0.001). However, given that a total of 81 statistical tests are possible in this matrix, only the highly significant values are likely to have a real basis.
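The caveat about the number of tests can be illustrated with a small calculation. The counts (81 tests, 13 significant at p<0.05) are taken from the text; the Bonferroni threshold is shown only as one common, conservative yardstick and is an assumption here, since the text does not state that any formal correction was applied.

```python
def expected_false_positives(n_tests, alpha=0.05):
    """Number of tests expected to be significant by chance alone
    if every null hypothesis were true."""
    return n_tests * alpha


def bonferroni_threshold(n_tests, alpha=0.05):
    """Per-test significance threshold under a Bonferroni correction."""
    return alpha / n_tests


n_tests, n_significant = 81, 13
print(round(100 * n_significant / n_tests, 1))       # 16.0 (% of tests significant)
print(round(expected_false_positives(n_tests), 2))   # 4.05 expected by chance
print(bonferroni_threshold(n_tests))                 # about 0.0006
```

With roughly four significant results expected by chance, a per-test p-value well below 0.001 is needed before a single linkage can be taken at face value, which matches the cautious reading given above.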
(c) Genotypic correlations.
Here, we include data from all infections because the bias towards lower diversity of genotypes in the set of reconstructed multiple infections does not change the conclusions, given that there is considerable diversity even among the single infections. Using all infections, we found that the number of genotypes relative to the number of samples (G∶N-ratios) varied from 0.853 to 1.0 for the 4-loci-genotypes (Figure 1). Most of the data points are above the 0.5-level, which according to Ivey and Richards (2001) is indicative of a bias towards sexual reproduction in the parasite population. The correlation between G∶N-ratios and the host species diversity at a given site, Hs, showed a negative but not significant trend (Spearman's rho = −0.451, p = 0.164). Figure 1 also shows that G∶N decreases only slightly as sample size increases, indicating that infections by C. bombi are extremely diverse in natural populations. The same holds true if only single infections are considered.
Not surprisingly, there is a positive correlation between the percentage of infected individuals in a population and the number of C. bombi genotypes found in the respective population, both for each locus (4.G9: R² = 0.555, p<0.001; 4: R² = 0.662, p<0.001; 2.F10: R² = 0.635, p<0.001; 16: R² = 0.235, p = 0.035) and for all loci pooled together (R² = 0.344, p<0.001). Hence, the higher the infection load (prevalence of infections), the more genotypic diversity of the parasite population can be found (especially in Figure 2a). Different loci show this increase in different ways, with saturation quickly reached for the least polymorphic locus 16. When correlating the number of genotypes found with the proportion of multiple infections among all infections, there is a positive correlation for the more polymorphic loci, 4.G9 (R² = 0.388, p<0.001), 4 (R² = 0.384, p = 0.006), and 2.F10 (R² = 0.352, p = 0.010), but no significant increase in genotypic diversity with increasing multiple infection for locus 16 (R² = 0.115, p = 0.169) (Figure 2b). The regression over all loci together was positive (R² = 0.203, p<0.001). The proportion of the total of rare alleles found (with a frequency <0.05) also increases with increasing infection load in a respective population (R² = 0.827, p<0.001) (Figure 2c).
(d) Genetic distance.
For the purpose of measuring genetic distances, we defined seven distinct geographic populations of C. bombi by pooling data sets from consecutive years for the same sites. This set was analyzed with respect to mutual genetic relationships between populations, i.e. their phylogeography. The resulting (un-rooted) tree corresponds to the geographic separation, with neighboring populations also being genetically close on the tree (Figure 3). The most supported branch in the tree (bootstrap value = 531/1'000) separates the two geographically distinct regions, the alpine (Lower Engadin) and the lower, hilly (Basler Jura) region. However, this support is still weak by the accepted standards. Among sites of the alpine region, C. bombi populations are genetically less divergent from each other than they are among sites of the Jura region. This may reflect the higher genetic diversification of C. bombi observed in the Jura region compared to the Alps.
1'000 bootstrap runs of an unrooted neighbor-joining tree were performed using Cavalli-Sforza's and Edwards' genetic distance. Small numbers indicate bootstrap values. The red line on the tree represents the geographic separation of the two regions Basler Jura and Lower Engadin. (Basler Jura: Röschenz, Movelier, Soyhières; Lower Engadin: La Munt, Buffalora, Stabelchod, Lavin.)
Niche overlap versus phylogeny
We tested whether host species identity (the phylogenetic hypothesis), the overlap of flower visits (the ecological hypothesis), or both explain the genotypic composition of the infecting parasite populations. We characterized the parasite population genetically either by the alleles at the various loci considered separately, or by genotypes (single infections combined with reconstructed multiple infections). Figure 4 depicts a subset of RDA plots from the direct and partial redundancy analyses for which the ANOVA outcome was significant. The results varied among regions and sites, and significance levels sometimes depended on whether the allele matrix (i.e. independent alleles at loci) or the genotype matrix (i.e. single-locus genotypes, and all reconstructed genotypes) was used for the analysis (Table 3). For example, in the Basler Jura region host species identity was a stronger predictor than flower visits for both the parasite alleles and the genotypes. In the alpine Lower Engadin valley, host species identity explained most of the variation for the parasite alleles but not for the genotypes. On the other hand, when species identity is ignored, flower visits are a good predictor for parasite alleles and genotypes in the Lower Engadin, and for the genotypes, but not for the alleles, in the Basler Jura. When sites and years were analyzed separately, the patterns were not consistent within the respective region, which shows how dynamically the infections can change at any one location. With the exception of the alpine sites Lavin and La Munt, where flower visits were a successful predictor, species identity was a good predictor elsewhere. Note that, as mentioned above, only the data sets with sufficient residual components could be used for the complete analysis.
The two axes illustrate the two principal components of the residuals of the multiple linear regression of X onto Y. The blue arrows designate the vectors for the constraining variable.
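The logic of a direct and a partial redundancy analysis can be sketched as follows. This is a generic textbook-style formulation (multivariate regression of the response matrix on the constraining variables, followed by a principal component analysis of the fitted values), not the software used in the study. All names (`rda`, `permutation_p`, `host`, `flowers`, `alleles`) and the simulated data are hypothetical, and the permutation test is a simple stand-in for the significance test reported in Table 3.

```python
import numpy as np

def _center(M):
    """Return M as a column-centered 2-D float array."""
    M = np.asarray(M, dtype=float)
    if M.ndim == 1:
        M = M[:, None]
    return M - M.mean(axis=0)

def _residualize(M, Z):
    """Residuals of M after least-squares regression on Z."""
    return M - Z @ np.linalg.lstsq(Z, M, rcond=None)[0]

def rda(Y, X, Z=None):
    """Redundancy analysis of responses Y on constraints X.

    If Z is given, its effect is first removed from Y and X (partial RDA).
    Returns (fraction of total variance in Y explained by X,
             canonical eigenvalues, sample scores on the canonical axes).
    """
    Y, X = _center(Y), _center(X)
    total_ss = np.sum(Y ** 2)
    if Z is not None:
        Z = _center(Z)
        Y, X = _residualize(Y, Z), _residualize(X, Z)
    fitted = X @ np.linalg.lstsq(X, Y, rcond=None)[0]     # regression step
    U, s, _ = np.linalg.svd(fitted, full_matrices=False)  # PCA of fitted values
    eigenvalues = s ** 2 / (Y.shape[0] - 1)
    return np.sum(fitted ** 2) / total_ss, eigenvalues, U * s

def permutation_p(Y, X, n_perm=999, seed=0):
    """Permutation p-value for the explained fraction (rows of Y shuffled)."""
    rng = np.random.default_rng(seed)
    Y = np.asarray(Y, dtype=float)
    observed = rda(Y, X)[0]
    hits = sum(rda(Y[rng.permutation(len(Y))], X)[0] >= observed
               for _ in range(n_perm))
    return (hits + 1) / (n_perm + 1)

# Simulated data for illustration only (not the study's data)
rng = np.random.default_rng(1)
n_bees = 30
host = np.eye(3)[rng.integers(0, 3, n_bees)][:, :2]  # dummy-coded host species
flowers = rng.random((n_bees, 2))                    # flower-visit proportions
alleles = (rng.random((n_bees, 5)) < 0.3 + 0.4 * host[:, [0]]).astype(float)

frac_host = rda(alleles, host)[0]                        # direct RDA
frac_flowers_given_host = rda(alleles, flowers, Z=host)[0]  # partial RDA
print(f"host: {frac_host:.3f}, flowers | host: {frac_flowers_given_host:.3f}")
print(f"p (host) = {permutation_p(alleles, host, n_perm=199):.3f}")
```

In this sketch the partial analysis answers the question of how much variation flower visits explain once host species identity has been accounted for, which mirrors the comparison between the ecological and the phylogenetic hypothesis.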
Our data yield the first quantitative estimate of the proportion of multiple infections of Crithidia bombi among all sampled infected bumblebee hosts. Of the 392 infected hosts whose infections were genotyped over the three years of sampling, 44% carried multiple infections. This number is likely an underestimate, considering that we had to reconstruct a portion of the genotypes and potentially missed some actual diversity. Interestingly, the proportion of multiple infections presented here is similar to what Schmid-Hempel and Reber Funk had reported for colonies of B. terrestris founded in the lab by spring queens caught in the field early in the season. A remarkable diversity of parasite genotypes had already been reported in that earlier study and, at least where more than two loci could be genotyped, no multi-locus genotype (“strain”) overlapped between colonies, indicating a very strong association of the distinguishable parasite genotypes with the different host colonies. Our data now demonstrate that in a field situation with several co-existing host species and the possibility of transmission among them, the diversity of parasite genotypes and the fraction of multiple infections are similar to those in the samples analyzed in the laboratory. This is remarkable because, whereas the window for new infections had closed for the lab colonies at the moment of their isolation, the bees that forage in the field remain continuously exposed to new infections. Note that the fraction of multiple infections is higher in the denser parasite population in the Basler Jura than at the higher elevations, demonstrating the same effect.
The FST-values reported here support the idea that the parasite genotypes are segregated across host individuals in the populations. Overall, however, each sampled C. bombi population at a given site in a given year appears to have its own dynamics. A recurrent observation is that the higher the prevalence of infection in a bumblebee community, the more alleles are found in the infecting population (Figure 2). These high-infection populations also contain more multiple infections, have a higher abundance of rare alleles, and deviate more from Hardy-Weinberg equilibrium. The variation in p-values resulting from the exact Hardy-Weinberg probability test (Table 2) also demonstrates the different dynamics in the different subpopulations, even more so when they are measured at different time points, i.e. at different stages in a season that reflect the buildup of the epidemic.
The situation in C. bombi is perhaps not unusual, as varying population structures have also been reported, for example, for Plasmodium falciparum, where epidemiological effects are suspected to affect the distribution of genotypes, and where the degree of clonality vs. sexual reproduction may vary considerably among vectors and populations. Similarly, parasite population structure can vary with infection cycles (e.g. in Trypanosoma cruzi), and subtypes of a parasite can be associated with different phenologies and effects on the host (e.g. in Leishmania infantum or Trypanosoma brucei). Furthermore, strong selection by the variable host types can shape the population genetic structure of parasites in important ways (e.g. in L. donovani). Indeed, given the strong host-parasite interactions known from the Bombus-Crithidia system, a good (and as yet unknown) proportion of the parasite's population genetic structure should be determined by strong selection by the host, leading to a kind of ‘iceberg effect’ in which only the small and varying parts of the entire genotypic variation that are contained in single hosts become visible to analyses.
As far as we understand the system, a bottleneck is imposed on the parasite population every autumn by the death of the worker and male host bees at the end of the season and by the fact that only few colonies manage to produce young queens that carry the infection through hibernation. Furthermore, the young queens vary considerably in their chance of successful hibernation. Therefore, we can expect that the genetic compositions of the spring parasite populations also vary considerably among species, colonies, and years. We also find negative correlations between species abundance and the prevalence of C. bombi within and between years of sampling, and report more detailed ecological results elsewhere (Salathé & Schmid-Hempel, in prep). The buildup of infections over the season might in turn depend on a number of parameters, such as weather, host density, host species composition, environmental disturbance, and floral abundance, and may specifically be affected by flower aggregation sizes, which determine the foraging behavior of the hosts. In the year 2004, for example, an exceptionally high prevalence of C. bombi infection was reported for all Jura sites (Movelier 54%, Röschenz 55%, Soyhières 55%). It was followed in the next season by a “crash” in infection (Movelier 28%, Röschenz 11%, Soyhières 12%) and by a trend towards less deviation from Hardy-Weinberg equilibrium, possibly indicating increased rates of genetic exchange among co-infecting parasite strains.
Unfortunately, it is almost impossible to track entire multi-locus genotypes of C. bombi across host species and years because the same multi-locus genotype is hardly ever found twice. Yet, it is feasible to analyze the individual alleles. For the three polymorphic microsatellite loci 4.G9, 4 and 2.F10, about half of the alleles found in this study are rare. Of the remaining alleles, one is usually the most common, although no systematic changes over the years can be confirmed, perhaps due to limited sample size. The increased occurrence of rare alleles with increasing infection prevalence (Figure 2) may indicate that more new combinations of alleles are generated by genetic exchange as instances of co-infection increase. Linkage between alleles for each pair of loci varies from population to population and from year to year, and even differs within locus pairs depending on whether only single infections or all infections are considered. This pattern may result from a mixture of clonal and sexual reproduction, similar to the occasionally observed mixture of outcrossed and selfed offspring in hermaphrodites. In fact, the diversity of the multi-locus genotypes found in the field (see Figure 1) could arise in a clonal population only with unusually high mutation rates, which we do not expect here. Furthermore, the G∶N values for C. bombi are clearly biased towards what is expected from sexual reproduction (Figure 1). We now know, in fact, that C. bombi strains regularly exchange genetic material, mostly following patterns of Mendelian segregation of parental alleles. That study showed that in some cases alleles are lost or gained, leading to an entirely new genotype different from either parent.
The phylogenetic tree based on the typed microsatellites reflects the effect of geography on the distribution of C. bombi genotypes. Although the sites within each of the two regions were farther away from one another than a typical bumblebee would fly to forage, they roughly cluster together to form a geographic region. Considering that the bumblebee communities differ substantially in species composition between the two regions (personal observation), it is interesting to note that the distances in the tree do not correlate directly with the differences in host community structure. In La Munt, for example, the host species composition was similar to that in Stabelchod and Buffalora. Nevertheless, the distances in the tree between the sites La Munt and Buffalora are larger than between the two larger regions (see Figure 3). The proximity of Stabelchod and Buffalora on the parasite tree must therefore have a different cause than geography alone, and might be attributable to differences in the composition of the plant communities at these sites. Interestingly, the C. bombi populations of the alpine region are genetically less divergent from each other than are those among the sites of the Jura region. Whether this is due to a higher genetic diversification of C. bombi in the Jura region compared to the Alps, or whether it is a sample size effect, would need to be investigated.
The importance of ecological factors for the population structure of the parasite is supported by the redundancy analysis (RDA), in which both the species identity of the host and the overlap of foraging niches between hosts (shared flower visits) have an important effect on the distribution of C. bombi genotypes. The conclusions differ somewhat depending on whether the allele matrix or the genotype matrix is used for the analysis, but the general message remains the same. In the Lower Engadin region, only the flower visits were a significant constraining factor for genotype distribution. Host identity does not seem to play such an important role in this region, despite the strong link between bumblebee species and the flowers themselves, and the fact that infected bees potentially carry the infection back to the colony. In all, it appears therefore that the population structure of infections might be driven by dietary overlap (the ecological hypothesis, which sets transmission patterns) that in turn is affected by species identity.
In comparable cases, niche overlap between hosts has frequently been reported to enhance between-species transmission. C. bombi is a good representative of this scenario as it is readily transmitted horizontally via shared use of flowers. In our study the ecological hypothesis is more strongly supported, presumably because transmission becomes more limiting. Studies including ecological parameters are therefore of great value for a better understanding of the dynamics of infections in the field, especially in a highly diversified system such as Bombus spp. and its parasite C. bombi.
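Two of the summary quantities discussed above can be illustrated with a short sketch: the fraction of alleles at a locus that are rare (frequency <0.05, the threshold used in this study) and a G∶N ratio. The sketch assumes that G∶N denotes the number of distinct multi-locus genotypes divided by the number of typed infections; this definition, the function names, and the example data are assumptions made for illustration and are not taken from the study.

```python
from collections import Counter

def rare_allele_fraction(alleles, threshold=0.05):
    """Fraction of distinct alleles at one locus with frequency below threshold.

    alleles: one entry per observed allele copy, e.g. fragment lengths.
    """
    counts = Counter(alleles)
    n = sum(counts.values())
    rare = [a for a, c in counts.items() if c / n < threshold]
    return len(rare) / len(counts)

def g_to_n(genotypes):
    """Distinct multi-locus genotypes (G) divided by typed infections (N).

    genotypes: one sequence of per-locus calls per infection.
    Values near 1 mean that almost every infection carries a unique genotype.
    """
    genotypes = [tuple(g) for g in genotypes]
    return len(set(genotypes)) / len(genotypes)

# Hypothetical observations for illustration only
locus_alleles = ["a"] * 50 + ["b"] * 48 + ["c", "d"]
infections = [(1, 2), (1, 2), (3, 4), (5, 6)]

print(rare_allele_fraction(locus_alleles))  # two of four alleles are rare
print(g_to_n(infections))                   # three distinct genotypes in four infections
```

Under strict clonality the same multi-locus genotype would be expected to recur often, giving a low ratio, whereas frequent genetic exchange pushes the ratio towards 1.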
Pairwise FST-values for single infections only, for all sites and over all loci considered in the population genetics test (Arlequin 3.1, Excoffier et al. 2005). Significant p-values (p≤0.05) are marked in bold. Populations with sample sizes lower than 5 were not included in the test; values for data sets with n<10 are marked in italics. Population key: 1) Buffalora 2003, 2) Buffalora 2004, 3) La Munt 2004, 4) Lavin 2004, 5) Stabelchod 2004, 6) Movelier 2004, 7) Movelier 2005, 8) Roeschenz 2003, 9) Roeschenz 2004, 10) Roeschenz 2005, 11) Soyhieres 2004, 12) Soyhieres 2005.
Pairwise FST-values for all infections, for all sites and over all loci considered in the population genetics test (Arlequin 3.1, Excoffier et al. 2005). Significant p-values (p≤0.05) are marked in bold. Populations with sample sizes lower than 5 were not included in the test; values for data sets with n<10 are marked in italics. Population key: 1) Buffalora 2003, 2) Buffalora 2004, 3) La Munt 2003, 4) La Munt 2004, 5) Lavin 2004, 6) Stabelchod 2004, 7) Movelier 2004, 8) Movelier 2005, 9) Roeschenz 2003, 10) Roeschenz 2004, 11) Roeschenz 2005, 12) Soyhieres 2003, 13) Soyhieres 2004, 14) Soyhieres 2005.
We thank Asta Audzijonyte and Regula Schmid-Hempel for comments on the manuscript. Further, we thank the Swiss National Park and the community of Röschenz, BL for sampling permission in their protected wildlife areas.
Conceived and designed the experiments: RS. Performed the experiments: RS. Analyzed the data: RS. Contributed reagents/materials/analysis tools: PS-H. Wrote the paper: RS. Designed the software used in the analysis: PS-H. Commented on and corrected the manuscript: PS-H.
- 1. Haldane JBS (1949) Disease and evolution. La Ricerca Scientifica Suppl A 19: 68–76.
- 2. Halkett F, Simon J-C, Balloux F (2005) Tackling the population genetics of clonal and partially clonal organisms. Trends Ecol Evol 20: 194–201.
- 3. Boyle JP, Rakjasekar B, Saeij JPJ, Ajioka JW, Berriman M, et al. (2006) Just one cross appears capable of dramatically altering the population biology of a eukaryotic pathogen like Toxoplasma gondii. Proc Nat Acad Sci USA 103: 10514–10519.
- 4. Heitman J (2006) Sexual reproduction and the evolution of microbial pathogens. Curr Biol 16: R711–R725.
- 5. Read AF, Taylor LH (2001) The ecology of genetically diverse infections. Science 292: 1099–1102.
- 6. Tibayrenc M (2005) Bridging the gap between molecular epidemiologists and evolutionists. Trends Microbiol 13: 575–580.
- 7. Wakelin D, Apanius V (1997) Immune defence: genetic control. In: Clayton DH, Moore J, editors. Host-parasite evolution: principles and avian models. Oxford University Press. pp. 30–58.
- 8. Carius HJ, Little TJ, Ebert D (2001) Genetic variation in a host-parasite association: potential for coevolution and frequency-dependent selection. Evolution 55: 1136–1145.
- 9. Schmid-Hempel P, Ebert D (2003) On the evolutionary ecology of specific immune defence. Trends Ecol Evol 18: 27–32.
- 10. Lythgoe KA (2002) Effects of acquired immunity and mating strategy on the genetic structure of parasite populations. Am Nat 159: 519–529.
- 11. De Roode JC, Culleton R, Cheesman SJ, Carter R, Read AF (2004) Host heterogeneity is a determinant of competitive exclusion or coexistence in genetically diverse malaria infections. Proc Roy Soc B 271: 1073–1080.
- 12. Gandon S, Mackinnon MJ, Nee S, Read AF (2001) Imperfect vaccines and the evolution of pathogen virulence. Nature 414: 751–756.
- 13. Seppälä O, Karvonen A, Valtonen ET, Jokela J (2009) Interactions among co-infecting parasite species: a mechanism maintaining genetic variation in parasites? Proc R Soc Lond B 276: 691–697.
- 14. Tibayrenc M (1999) Toward an integrated genetic epidemiology of parasitic protozoa and other pathogens. Annu Rev Genet 33: 449–477.
- 15. Tibayrenc M, Ayala FJ (1999) Evolutionary genetics of Trypanosoma and Leishmania. Microbes Infect 1: 465–472.
- 16. Tibayrenc M, Ayala FJ (2002) The clonal theory of parasitic protozoa: 12 years on. Trends Parasitol 18: 405–410.
- 17. Jenni L, Marti S, Schweitzer J, Betschart B, LePage RWF, et al. (1986) Hybrid formation between African trypanosomes during cyclical transmission. Nature 322: 173–175.
- 18. MacLeod A, Tweedie A, McLellan S, Taylor S, Cooper A, et al. (2005) Allelic segregation and independent assortment in T. brucei crosses: Proof that the genetic system is Mendelian and involves meiosis. Mol Biochem Parasitol 143: 12–19.
- 19. Gaunt MW, Yeo M, Frame IA, Stothard JR, Carrasco HJ, et al. (2003) Mechanism of genetic exchange in American trypanosomes. Nature 421: 936–939.
- 20. Akopyants NS, Kimblin N, Secundino N, Patrick R, Peters N, et al. (2009) Demonstration of genetic exchange during cyclical development of Leishmania in the sand fly vector. Science 324: 265–268.
- 21. Schmid-Hempel R, Salathé R, Tognazzo M, Schmid-Hempel P (2011) Genetic exchange and emergence of novel strains in directly transmitted, monoxenic trypanosomatids. Infect Genet Evol 11: 564–571.
- 22. Lipa JJ, Triggiani O (1980) Crithidia bombi sp. n. a flagellated parasite of a bumblebee Bombus terrestris L. (Hymenoptera, Apidae). Acta Protozool 27: 287–290.
- 23. Schmid-Hempel P, Schmid-Hempel R (1993) Transmission of a pathogen in Bombus terrestris, with a note on division of labour in social insects. Behav Ecol Sociobiol 33: 319–327.
- 24. Schmid-Hempel P, Puhr K, Krüger N, Reber C, Schmid-Hempel R (1999) Dynamic and genetic consequences of variation in horizontal transmission for a micro-parasite infection. Evolution 53: 426–434.
- 25. Wilfert L, Gadau J, Baer B, Schmid-Hempel P (2006) Natural variation in the genetic architecture of a host-parasite interaction in the bumblebee, Bombus terrestris. Mol Ecol 16: 1327–1339.
- 26. Durrer S, Schmid-Hempel P (1994) Shared use of flowers leads to horizontal pathogen transmission. Proc Roy Soc B 258: 299–302.
- 27. Shykoff JA, Schmid-Hempel P (1991) Incidence and effects of four parasites in populations of bumble bees in Switzerland. Apidologie 22: 117–125.
- 28. Korner P, Schmid-Hempel P (2005) Correlates of parasite load in bumblebees in an Alpine habitat. Entomological Science 8: 151–160.
- 29. Brian AD (1957) Differences in the flowers visited by four species of bumble-bees and their causes. J Anim Ecol 26: 71–98.
- 30. Inouye DW (1980) The effect of proboscis and corolla tube lengths on patterns and rates of flower visitation by bumblebees. Oecologia 45: 197–201.
- 31. Goulson D, Darvill B (2004) Niche overlap and diet breadth in bumblebees; are rare species more specialized in their choice of flowers? Apidologie 35: 55–63.
- 32. Halkett F, Simon JF, Balloux F (2005) Tackling the population genetics of clonal and partially clonal organisms. Trends Ecol Evol 20: 194–201.
- 33. Hess AE, Landolt E, Hirzel R (1991) Bestimmungsschlüssel zur Flora der Schweiz. Birkhäuser Verlag, Basel.
- 34. Schmid-Hempel P, Reber Funk C (2004) The distribution of genotypes of the trypanosome parasite, Crithidia bombi, in populations of its host, Bombus terrestris. Parasitology 129: 147–158.
- 35. Schmid-Hempel R, Tognazzo M (2010) Molecular divergence defines two distinct lineages of Crithidia bombi (Trypanosomatidae), parasites of bumblebees. J Eukaryot Microbiol 57: 337–345.
- 36. Ladd C, Lee HC, Yang N, Bieber FR (2001) Interpretation of Complex Forensic DNA Mixtures. Croat Med J 42: 244–246.
- 37. Mortera J, Dawid AP, Lauritzen SL (2003) Probabilistic expert systems for DNA mixture profiling. Theor Pop Biol 63: 191–205.
- 38. Raymond M, Rousset F (1995) GENEPOP (version 1.2): population genetics software for exact tests and ecumenicism. J Hered 86: 248–249.
- 39. Excoffier L, Laval G, Schneider S (2005) Arlequin (version 3.0): an integrated software package for population genetics data analysis. Evol Bioinform Online 1: 47–50.
- 40. Black WC IV, Kraftsur ES (1985) A FORTRAN program for the calculation and analysis of two-locus linkage disequilibrium coefficients. Theor Appl Genet 70: 491–496.
- 41. Belkhir K, Borsa P, Goudet J, Chikhi L, Bonhomme F (1998) GENETIX, logiciel sous Windows pour la génétique des populations. CNRS UPR 9060, Université de Montpellier II, France.
- 42. Cavalli-Sforza LL, Edwards AWF (1967) Phylogenetic analysis: models and estimation procedures. Am J Hum Genet 19: 233–257.
- 43. Takezaki N, Nei M (1996) Genetic distances and reconstruction of phylogenetic trees from microsatellite DNA. Genetics 144: 389–399.
- 44. Felsenstein J (1993) Phylogenetic inference package. PHYLIP 3.1. University of Washington, Seattle. Available at http://evolution.genetics.washington.edu/phylip.html.
- 45. Ivey CT, Richards JH (2001) Genetic diversity of everglades sawgrass, Cladium jamaicense (Cyperaceae). Int J Plant Sci 162: 1327–1335.
- 46. Rao CR (1964) The use and interpretation of principal component analysis in applied research. Sankhyā: Indian J Stat, Series A 26: 329–359.
- 47. terBraak CJF (1986) Canonical correspondence analysis: a new eigenvector technique for multivariate direct gradient analysis. Ecology 67: 1167–1179.
- 48. Legendre P, Anderson DJ (1999) Distance-based redundancy analysis: testing multispecies responses in multifactorial ecological experiments. Ecol Monogr 69: 1–24.
- 49. Legendre P, Legendre L (1998) Numerical Ecology. 2nd Edition. Elsevier Science B.V. Amsterdam.
- 50. Cline J, Braman JC, Hogrefe HH (1996) PCR fidelity of Pfu DNA polymerase and other thermostable DNA polymerases. Nucleic Acids Res 24: 3546–3551.
- 51. Slatkin M (1985) Rare alleles as indicators of gene flow. Evolution 39: 53–65.
- 52. Barton NH, Slatkin M (1986) A quasi-equilibrium theory of the distribution of rare alleles in a subdivided population. Heredity 56: 409–415.
- 53. Meirmans PG, Hedrick PW (2010) Assessing population structure: FST and related measures. Mol Ecol Res 11: 5–18.
- 54. Babiker HA, Lines J, Hill WG, Walliker D (1996) Population structure of Plasmodium falciparum in African villages with different malaria endemicities. Ann Trop Med Parasitol 90: 410.
- 55. Anderson TJC, Haubold B, Williams JT, Estrada FJG, Richardson L, et al. (2000) Microsatellite markers reveal a spectrum of population structures in the malaria parasite Plasmodium falciparum. Mol Biol Evol 17: 1467–1482.
- 56. Peyerl-Hoffmann G, Jelinek T, Kilian A, Kabagambe G, Metzger WG, et al. (2001) Genetic diversity of Plasmodium falciparum and its relationship to parasite density in an area with different malaria endemicities in West Uganda. Trop Med Int Health 6: 607–613.
- 57. Annan Z, Durand P, Ayala FJ, Arnathau C, Awono-Ambene P, et al. (2007) Population genetic structure of Plasmodium falciparum in the two main African vectors, Anopheles gambiae and Anopheles funestus. Proc Natl Acad of Sci USA 104: 7987–7992.
- 58. Barnabe C, Brisse S, Tibayrenc M (2000) Population structure and genetic typing of Trypanosoma cruzi, the agent of Chagas disease: A Multilocus Enzyme Electrophoresis approach. Parasitology 120: 513–526.
- 59. Cuervo P, Cupolillo E, Segura I, Saravia N, Fernandes O (2002) Genetic diversity of Colombian sylvatic Trypanosoma cruzi isolates revealed by the ribosomal DNA. Mem I Oswaldo Cruz 97: 877–880.
- 60. Chargui N, Amro A, Haouas N, Schönian B, Babba H, et al. (2008) Population structure of Tunisian Leishmania infantum and evidence for the existence of hybrids and gene flow between genetically different populations. International Journal for Parasitology 39: 801–811.
- 61. Hide G, Welburn SC, Tait A, Maudlin I (1994) Epidemiological relationships of Trypanosoma brucei stocks from South East Uganda: Evidence for different population structures in human infective and non-human infective isolates. Parasitology 109: 95–111.
- 62. Hamilton R, Boots M, Paterson S (2005) The effect of host heterogeneity and parasite intragenomic interactions on parasite population structure. Proc R Soc B 272: 1647–1653.
- 63. Guerbouj S, Victoir K, Guizani I, Seridi N, Nuwayri SN, et al. (2001) Gp63 gene polymorphism and population structure of Leishmania donovani complex: Influence of the host selection pressure? Parasitology 122: 25–35.
- 64. Schmid-Hempel P (2001) On the evolutionary ecology of host-parasite interactions–addressing the questions with bumblebees and their parasites. Naturwissenschaften 88: 147–158.
- 65. Goulson D (2000) Why do pollinators visit proportionally fewer flowers in large patches? Oikos 91: 485–492.
- 66. Cutter AD (2006) Nucleotide Polymorphism and Linkage Disequilibrium in Wild Populations of the Partial Selfer Caenorhabditis elegans. Genetics 172: 171–184.
- 67. Begon M, Hazel SM, Baxby D, Bown K, Cavanagh R, et al. (1999) Transmission dynamics of a zoonotic pathogen within and between wildlife host species. Proc Roy Soc B 266: 1939–1945.
- 68. Thiele EA, Sorensen RE, Gazzinelli A, Minchella DJ (2008) Genetic diversity and population structuring of Schistosoma mansoni in a Brazilian village. Int J Parasitol 38: 389–399.