Skip to main content
Browse Subject Areas

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Phylogeographic Structure of the White-Footed Mouse and the Deer Mouse, Two Lyme Disease Reservoir Hosts in Québec

  • Jessica Fiset,

    Affiliation Département de sciences biologiques, Université de Montréal, Montréal, Québec, Canada

  • Nathalie Tessier,

    Affiliations Département de sciences biologiques, Université de Montréal, Montréal, Québec, Canada, Ministère des Forêts, de la Faune et des Parcs, Longueuil, Québec, Canada

  • Virginie Millien ,

    ‡ These authors are joint senior authors on this work.

    Affiliation Redpath Museum, McGill University, Montréal, Québec, Canada

  • Francois-Joseph Lapointe

    ‡ These authors are joint senior authors on this work.

    Affiliation Département de sciences biologiques, Université de Montréal, Montréal, Québec, Canada


Modification of a species range is one of many consequences of climate change and is driving the emergence of Lyme disease in eastern Canada. The primary reservoir host of the bacteria responsible for Lyme disease, Borrelia burgdorferi, is the white-footed mouse (Peromyscus leucopus), whose range is rapidly shifting north into southern Québec. The deer mouse, P. maniculatus, is occurring over most Québec province and is a less competent host for B. burgdorferi. Here, we compared the phylogeographic structure of both Peromyscus species in Québec. Using a combination of multiple mitochondrial DNA markers and phylogeographic methods, we detected an ongoing and rapid expansion of P. leucopus, while P. maniculatus appears more stable. Haplotype and populations networks indicated that populations of P. maniculatus exhibit more genetic structure than P. leucopus across the study area. Furthermore, significant and consistent genetic divergences between populations of the two species on both sides of the St. Lawrence River suggest that distinct lineages of P. leucopus and P. maniculatus with different ancestral origins colonized Southern Québec following the Last Glacial Maximum. The phylogeographic structure of pathogens is expected to mirror the structure observed in their reservoir hosts. As different strains of Borrelia burgdorferi may be associated with different levels of pathogenicity and immune responses of their hosts, our results are helpful at better understanding the pattern of spread of Lyme disease in a zone of emergence, and associated risk for human populations.


Climate change is occurring at a rate unprecedented in Earth history [1], and in the Northern Hemisphere, we are witnessing increasing global average temperature, shorter winters, and more abundant precipitations [2]. Species may alter their geographical range to track changes in environmental conditions. Most notably, climate change is thought to be the driver of northern range expansion or range shifts in a number of species, which in turn is affecting the composition of local terrestrial and aquatic communities [36]. A number of North American terrestrial species reach the northern limit of their geographic distribution around the Great Lakes and the St. Lawrence River [7]. Southern Québec thus represents an important transition zone and an ideal location for studying the pattern of range shift in response to global change [8,9].

Molecular markers have proven useful to reconstruct the pattern of range shift in response to global warming since the Last Glacial Maximum, but also to the most recent historical global change [10]. Phylogeographic studies allow to trace past geographical patterns of genetic differentiation among populations, and have proven particularly useful to identify historical vicariance events that have shaped current species distributions [1011]. Here, we used such an approach and compared two closely related species of small mammal found across most of the Western United States and up into Southern Québec in Canada.

The deer mouse (Peromyscus maniculatus) and the white-footed mouse (P. leucopus) are closely related rodent species of the family Cricetidae [1213] that have diverged approximately 500,000 years ago [14]. The two species are similar in their ecology and morphology, and occur in sympatry across most of the range of the white-footed mouse. Peromyscus leucopus reaches its northern distribution limit in Southern Québec [8, 15]. The range of P. leucopus has expanded northwards over the last few decades in the Michigan upper and lower Peninsula [9], as well as in Southern Québec [16]. The rate of poleward expansion was estimated at 10 to 15 km/year and in both regions, such a rapid rate of expansion was attributed to global warming [9,16]. Peromyscus maniculatus is a widespread species, ranging from Mexico to Northern Canada. It is a dominant component of small mammal communities and has been established in southern Québec for a much longer time than P. leucopus [17].

Previous studies have shown similar patterns in the phylogeography of P. maniculatus and P. leucopus in the United States. Namely, the pattern of genetic differentiation of the mtDNA control region (D-Loop) revealed three clades of P. leucopus that originated from distinct glacial refugia in North America. One of these clades corresponds to the east coast of the United States, while another clade is associated with the upper midwest [18, 19]. Six clades were identified for P. maniculatus across North America using Cytochrome B sequences [20]. There again, one clade was restricted to the east coast of the United States, while a second one corresponded to the upper Midwest plus Manitoba, Ontario and Québec in Canada.

Here, we propose to apply phylogeographic methods not only to assess past vicariance events, but rather to detect a genetic signature of a recent range expansion in P. leucopus. To do so, we used a combination of multiple mtDNA markers to resolve the phylogeography of P. leucopus and P. maniculatus in Southern Québec.

The use of mitochondrial DNA (mtDNA) has long been advocated in phylogeography for its rapid rate of evolution [10]. However, the use of several markers is strongly recommended, as a specific locus may be subjected to different evolutionary constraints, due to selection on that locus or simply by stochasticity [21]. The mtDNA markers most frequently used in phylogeographic studies of small mammals are the Cytochrome B (Cyt B), Cytochrome Oxidase III (COIII) and the control region (D-Loop) [1920, 2224]. Here, we combined these three highly polymorphic mtDNA markers with two others (ATP synthase 8 and 16S ribosomal RNA) in order to achieve better phylogeographic resolution. We applied a range of methods to evaluate whether expanding populations of white-footed mouse display a different phylogeographic structure compared to that of the deer mouse, a well-established species in Québec. More specifically, for both species we tested whether the phylogeographic patterns for different mtDNA markers were congruent within Québec and if major geographic barriers were limiting the dispersal of P. leucopus, as it has been previously shown [25, 26]. We also compared our results to previously published work to identify the North American clades associated with the populations from Québec and identify potential glacial refugia in both species.

The white-footed mouse is known to be the main reservoir host for the tick vector of Borrelia burgdorferri, a spirochete bacterium responsible for Lyme disease in North America [27, 28]. Lyme disease is an emerging zoonotic disease in North American and the risk for humans is rapidly increasing and expanding northwards into Southern Québec [29]. The white-footed mouse is an excellent reservoir for Lyme disease. Unlike other mammals such as dog, deer and human whose immune system fights the pathogen, P. leucopus remains an asymptomatic carrier throughout its life once infected [28]. The white-footed mouse is also able to efficiently transmit (90%) all strains of Borrelia to ticks [28, 30]. Yet, the deer mouse (P. maniculatus) is also competent to maintain and transmit Borrelia in areas where the white-footed mouse is not occurring [31].

Here, we conducted a comparative analysis of P. leucopus, a rapidly expanding and highly competent host, and P. maniculatus, a more established and less competent host. Our results will allow to better anticipate the geographic patterns of range expansion in the white-footed mouse into Southern Québec and adapt to the associated rapid spread of Lyme disease in this region located at the most northern limit of the distribution of P. leucopus.

Materials and Methods

Field sampling

The sampling took place at a total of 28 sites (Fig 1). A majority of these sites were located within the current known distribution of Lyme disease occurrence in Southern Québec, [27, 29, 32, 33]. The other sites were located in areas were Borrelia has not been detected but within the geographical range of P. leucopus in Québec (Fig 1). A total of 96 individuals of white-footed mice and 53 individuals of deer mice were captured in forest patches during the summer of 2011 using Sherman live traps (Table 1), of which respectively 87 and 43 specimens were successfully sequenced for all five mtDNA markers. All specimens were brought back to the laboratory for tissue sampling and genetic analyses.

Fig 1. Location of the 26 study sites sampled during the summer 2011 in southern Québec.

An additional site (Saguenay) is located further north-east and is not shown on this map.

Table 1. Geographical coordinates of the sampling sites, and sample size for P. leucopus (P. l.) and P. maniculatus (P. m.).

Ethics Statement

All procedures were approved by the Québec Government (SEG permit #2011-05-15-014-00-S-F), the McGill University Animal Care Committee (AUP#5420) and the Comité de déontologie de l’expérimentation sur les animaux de l’Université de Montréal (#11–054).

Species identification and sequencing

For each specimen, we extracted DNA from tail tissues using the standard Quick-Lysis protocol [34], and all specimens were identified to the species using species-specific primers in multiplex polymerase chain reaction (PCR) [25, 35]. We then amplified five mtDNA regions using the primers described in Table 2. The amplifications were carried out in a 50 μl volume comprising 5 μl of 10x reaction buffer, 4 μl of dNTP solution at 2.5 mM, 20pM of each primers, 1.5 u of Taq DNA polymerase and 50 to 300 ng of DNA. The DNA amplification protocol was 94°C for 1 min, 35 x (94°C for 45 sec, 54°C for 1 min 30 sec, 72°C for 45 sec) and 72°C for 7 min. The PCR products were then sequenced at the McGill University Genome Québec Innovation Centre. We visualised the chromatograms with the program 4Peaks 1.7 [36] and aligned the sequences with the program ClustalX 2.1 [37]. We removed ambiguous sections at the beginning and the end of each sequence, as well as the gaps and repetitions susceptible to be polymerase reaction errors. Sequence alignments for all markers combined are available as supplementary material for P. leucopus (S1 Table) and P. maniculatus (S2 Table).

Table 2. Primers used for amplifying the five mtDNA regions used in this study.

Data analysis

We first estimated numbers of haplotypes, private haplotypes, parsimony-informative sites, and segregating sites, as well as nucleotide diversity and haplotype diversity with DnaSP version 5.10.1 [38] and the R package ‘PopGenome’ version 2.1.0 [39]. For phylogeographic analysis, the optimal substitution model was then determined with the program ModelTest [40]. We reconstructed phylogeographic trees for each species and all five mtDNA markers separately with the R package ‘ape’ version 3.1–4 [41] using the neighbor-joining (NJ) algorithm [42] applied to pairwise genetic distances among individuals (function nj) and corrected with the K2P model [43] for unequal base frequencies [44]. In addition, PhyML [45] was also applied to the five mtDNA markers, based on the same substitution model [43], and using the NJ tree as a starting tree. For better visualization, the trees were artificially rooted on the longest branch (midpoint rooting [46]) with the R package ‘phangorn’ version 1.1–1 [47], and a majority-rule consensus was computed (function consensus) to identify the supported clades. The robustness of separate phylogeographical trees was assessed with bootstrap values [48] computed from 100 replicates, and tree resolution was measured by counting the ratio of non-zero length branches over the total number of branches. We performed an incongruence length difference (ILD) test [49] with PAUP* 4.0 [50], and test of congruence among distance matrix (CADM) [51, 52] with the R package ‘ape’ to compare the markers with one another and determine whether they led to congruent phylogeographic patterns (function We then ran a posteriori comparisons using Mantel tests [53] to test for the significance of differences between pairs of markers (function

We built haplotype networks with the program Haplotype Viewer [54]. Whereas a phylogeographic tree links individuals, a haplotype network links genetic sequences, thus merging all unique haplotypes at a single node. For both species, we performed a clanistic analysis to evaluate the clans and slices purity [55], where clans are groups obtained by cutting a single branch (bipartition), and slices are groups obtained by cutting two branches (tripartition) in an unrooted network. We calculated the clan and slice purity by counting the number of haplotypes from each sampling site in a group divided by the size of this group (a value of one indicating that a given clan—or slice—contained only haplotypes from a single locality). We also calculated the clan and slice diameter as the maximum distance (number of mutations) between two haplotypes from the same locality in a group; this value was standardized by the total diameter of the network for comparison purposes. We used a Student t-test on all clanistic variables to test for significant differences in the haplotype networks of the two Peromyscus species.

To further detect differences in the geographical pattern of genetic variability between the two species, we computed population networks among localities (nodes in the graph) by considering only shared haplotypes [56]. Namely, two populations (sites) are connected by an edge in the network only when they have at least one haplotype in common. To account for unequal sampling sizes, we estimated the connection probability with a resampling approach in which a single haplotype per site was randomly selected at each iteration prior to constructing the network. This procedure was repeated 1000 times to derive pairwise connection probabilities among sites, for each species and each marker. We used two indices to compare these population networks: the graph density (the number of connections in the graph relative to the number of possible connections), and the node degree distributions (where the degree of a node is the number of edges connected to that node). These two indices were compared between the two Peromyscus species with Student t-tests and non-parametric Mann Whitney tests, respectively.

We determined possible demographic changes in population size or selective regimes for both species independently as well as for both shores of the St. Lawrence River with three different neutrality test and computed mismatch distributions [57] to detect population expansion. To test for deviations from neutrality, we calculated Tajima’s D [58], Fu’s FS [59], and Ramos-Onsins and Rozas’ R2 [60] statistics separately for the five markers, as well as for all markers combined. We assessed statistical significance of each test with 1000 replicates using either Arlequin version[61], DnaSP version 5.10.1 [38], or the R package ‘Pegas’ version 0.7 [62]. The mismatch distributions were computed using the function MMD with the R package ‘Pegas’ [62]. We also computed Fst-values among all pairs of populations and between shores for each species with all markers combined, and assessed statistical significance with 1000 permutations in Arlequin version [61].

Finally, we estimated divergence times for the split separating the north and south shores of the St. Lawrence River for the two species using a Bayesian analysis and a relaxed-clock dating implemented in BEAST version 1.8.2 [63]. To do so, the 87 D-Loop sequences of Peromyscus leucopus included in the present study were combined with all sequences listed in Shipp-Pennock et al. [19], and the 43 Cyt B sequences of Peromyscus maniculatus were combined with those listed in Dragoo et al. [20] to determine phylogeographic clades corresponding to past vicariance events. In each case, the combined dataset was trimmed to compare sequences of equal lengths. To calibrate the molecular clock for each mtDNA marker, the corresponding sequences of Peromyscus leucopus (respectively Peromyscus maniculatus) were used as outgroup, and the split between both species was estimated at 3.5 Mya [64]. The analyses were run for 10,000,000 generations, sampling model parameters every 1000 generations and consensus trees were produced with TreeAnnotator v. 1.8.2 after elimination of 20% of the trees as burn-in. The resulting trees for both species were visualized in FigTree 1.4.2.


All markers exhibited similar levels of polymorphism and nucleotide diversity, except for the 16S region, which was the least variable (Table 3). All markers displayed different numbers of haplotypes between the two species, except for the ATP8, the only marker with a similar level of variability in both species. For all other four markers, P. maniculatus exhibited more haplotype diversity than P. leucopus. The numbers of parsimony-informative sites and segregating sites were also larger in P. maniculatus for all markers, except for the D-Loop, which exhibited a reverse pattern. This was also true for nucleotide diversity and mean pairwise differences among sites (Table 3).

Table 3. Numbers of sequences, sequence lengths (pb), numbers of haplotypes, private haplotypes, informative sites, segregating sites, nucleotide diversity and haplotype diversites for each mtDNA region used in this study for the species (sp) P. leucopus (P. l) and P. maniculatus (P. m).

Results of NJ trees and PhyML trees were almost identical for all five mtDNA genes, but for some minor topological differences among terminal nodes. For both species, two major groups were well supported (BS > 95%) in all trees, and the split between the north shore (NS) and south shore (SS) of the St. Lawrence River was highly supported (S1 Fig). The population structure was a little more resolved when considering the D-Loop, Cyt B, and COIII sequences, than the 16S and the ATP8 sequences, likely due to the very small length of the later. Overall, the resolution was improved when combining all markers into a single analysis. The majority consensus of all five phylogeographic trees for each species also only contained two groups (Fig 2A), corresponding to the north and south shores. Moreover, haplotypes were more closely related in the white-footed mouse, with phylogeographic trees exhibiting a better resolution in P. maniculatus than in P. leucopus (t = 4.53, p < 0.01, Fig 2B). Results of incongruence length difference tests (ILD) were significant for both species (P. maniculatus: ILD = 184, p < 0.05; P. leucopus: ILD = 134, p < 0.05) Similarly, the test of congruence among distance matrix (CADM) showed that all markers were globally congruent (P. maniculatus: W = 0.633, p < 0.001; P. leucopus: W = 0.656, p < 0.001). Results of pairwise Mantel tests indicated that all markers were significantly correlated for both Peromyscus species (all p < 0.001), except for the markers ATP8 and 16S in P. maniculatus (p = 0.117). Based on these results, all markers were concatenated for further analysis.

Fig 2.

(A) Phylogeographic trees estimated by neighbor-joining from the combined sequences of 5 mtDNA regions; (B) and average topological resolution of the trees obtained for each five marker. The north-shore clades are framed in grey and the south-shore clades in blue. Boostrap support values are shown for the NS/SS split.

The structure observed in haplotype networks indicated a larger number of shared haplotypes in P. leucopus than in P. maniculatus, for all markers combined (Fig 3A). P. maniculatus also appeared much more divergent, for each marker individually (S2 Fig). The clanistic analysis on haplotype networks showed that group diameters corresponding to each site were on average significantly larger in P. leucopus than in P. maniculatus (t = 2.51, p < 0.021, Fig 3B). Likewise, groups including haplotypes for each site were the purest in P. maniculatus, showing that haplotypes were shared among a larger number of sites in P. leucopus. This result was significant for slices (t = 2.53, p < 0.032, Fig 3B), but not for clans (t = 2.15, p = 0.060). Overall, connection probabilities between shared haplotypes were larger in P. leucopus than in P. maniculatus (Fig 4A). In addition, P. leucopus displayed a higher graph density for all markers combined in the population network than P. maniculatus (t = 11.39, p < 0.0001, Fig 4B), as well as for each marker individually (S3 Fig). P. leucopus exhibited more nodes with a high degree (larger than 3), whereas P. maniculatus displayed nodes with very few or no connections (tau = 0.557, p < 0.001, Fig 4B).

Fig 3.

(A) Haplotype networks estimated using the combined sequences of 5 mtDNA regions for P. leucopus and P. maniculatus; (B) Associated statistics. The colors in the networks correspond to different localities where P. maniculatus and P. leucopus were sampled. For both P. leucopus and P. maniculatus, we displayed the average group (slice) diameter and group purity in the haplotype networks of sampling sites for the five markers.

Fig 4.

(A) Population networks using the combined sequences of 5 mtDNA regions for P. leucopus and P. maniculatus; (B) Associated statistics. The colors in the networks correspond to different localities where P. maniculatus and P. leucopus were sampled. Dotted lines correspond to connection probabilities lower than 0.2 while solid lines represent connection probabilities of 0.2 and higher. For both P. leucopus and P. maniculatus, we displayed the average graph density and the average node degree distribution for the five markers in the population networks.

Values of neutrality tests were non-significant for independant mtDNA markers and all markers combined in P. leucopus, whereas several tests were significant in P. maniculatus (Table 4). Namely, all markers except ATP8 exhibited a significant departure from neutrality in deer mice sampled from the south shore of the St. Lawrence River, but a single test was significant for the north shore. Values of Tajima’s D were negative and significant for a single marker (Cyt B), while values of Ramos-Onsins and Rozas’ R2 and Fu’s Fs were respectively significant for four markers (D-Loop, Cyt B, 16S, COIII) and three markers (D-Loop, Cyt B, 16S, COIII). All three neutrality tests were significant for the south shore in P. maniculatus when all markers are combined.

Table 4. Results of neutrality tests for both species (P. m and P. l) and all mtDNA regions, for both shores combined or analyzed separately, with corresponding probability values (in parentheses).

Significant values are in bold.

Analysis of mismatch distributions revealed bimodal or multimodal distributions for both species when all markers were combined (S4 Fig), suggesting that populations have been stationary for a long time. Separating the analysis for distinct shores of the St. Lawrence River produced similar multimodal distributions, except for populations of P. maniculatus located on the south shore, which exhibited a distribution with a single peak indicative of a demographic expansion.

Mean pairwise Fst values were much larger in P. maniculatus (0.619) than in P. leucopus (0.373). Out of 120 comparisons among 16 populations, 47 tests (39%) were significant for P. leucoupus, with all tests between populations located on separate shores being significant (S3 Table). Out of 78 comparisons among 13 populations, 32 tests (41%) were significant for P. maniculatus, mostly for comparisons between populations located on different shores of the St. Lawrence River (S4 Table). By restricting the comparisons only between populations from the south shore, twice as many tests were significant for P. maniculatus (24%) compared with P. leucopus (11%). Combining all populations into two groups representing the south and north shores provided significant results both for P. maniculatus (Fst = 0.776, p < 0.001) and P. leucoupus (Fst = 0.647, p < 0.001).

Bayesian phylogenetic analysis recovered the clades in Dragoo et al. [20] when our 43 Cyt B sequences for P. maniculatus were combined with the original sequences. All individuals sampled from the south shore of the St. Lawrence River fell into the east coast clade, while individuals sampled from the north shore were included in the clades from the Midwest and Eastern Canada in Dragoo et al. [20]. We obtained a similar pattern when the sequences published in Shipp-Penock et al. [19] were combined to our 87 P. leucopus D-Loop sequences. Individuals sampled from the north shore of the St. Lawrence were included in Midwestern clade, whereas those sampled from the south shore fell within the east coast clade in Shipp-Penock et al. [19]. Based on these results, the timing of the split between clades corresponding to the north and south shores of the St. Lawrence River is estimated at either 2.7 Mya (S5 Fig) or 2.1 Mya (S6 Fig), depending on the mtDNA marker.


Congruent phylogeographic patterns

It has been shown that mtDNA markers may be biased, for instance through differential male/female dispersion rate. The use of several mtDNA markers improves the detection of significant phylogeographic signals, and they can display patterns congruent with those found using nuclear markers [65, 66]. Here, we found that the phylogeographic structure was congruent among the five mtDNA markers we used. For both Peromyscus species, samples from the south shore of the St. Lawrence River and samples from the north shore fell in distinct groups, and results of Fst tests between both shores were significant, which confirms our hypothesis that the St. Lawrence River constitutes a major geographical barrier to dispersal for Peromyscus individuals. Besides this strong separation, we found no further clear spatial structure in our data. Individuals from a given site were seldom grouped together, especially in P. leucopus.

Different mtDNA sequences may also exhibit varying degrees of ancestral polymorphism and thus not reflect consistent coalescent histories [66], which supports the need for considering more than a single marker in phylogeographic studies. Here, we found that the variability of the COIII marker was similar to that found in another phylogeographic study on P. leucopus and P. maniculatus in Québec [67]. The variability level observed in P. eremicus in Baja California for the Cyt B and COIII was about ten times larger than for P. leucopus and P. maniculatus in Québec, likely because the former study included samples from a much larger geographical area [22]. Likewise, Shipp-Pennock et al. [19] obtained sequences twice as variable as ours for P. leucopus using the D-Loop in a study that included specimens sampled throughout the eastern United States and Canada. With respect to the numbers of haplotypes, other studies have identified approximately ten times more than we did using the D-Loop in P. leucopus across the United-Sates [18]. However, we found proportionally more haplotypes than Dragoo et al. [20], based on Cyt B sequences of P. maniculatus. These results illustrate the benefits of using multiple markers at a smaller geographical scale, but also show that different mtDNA regions may behave differently in different species.

Range expansion and demographic changes

We detected the signature of a recent expansion of white-footed mouse populations that displayed a significantly distinct genetic pattern than the one observed in the deer mouse, which has long been established in Québec. Haplotype network analyses clearly showed that P. maniculatus was more structured across the sampling area than P. leucopus. Namely, haplotypes from the same site tended to be more clustered with respect to localities in P. maniculatus than in P. leucopus. Most haplotypes defined by the combination of all markers were unique in P. maniculatus, while P. leucopus exhibited a large number of shared haplotypes between sites. Shared haplotypes are generally considered as ancestral retention and provide evidence for continuous migration between populations [10, 68].

Using population network analyses we also found that P. leucopus had widespread haplotypes without any barrier other than the St. Lawrence River, whereas P. maniculatus only shared a few haplotypes between sites. These results are consistent with a recent northern expansion of white-footed mice detected in Michigan [9] and in Southern Québec [16]. Similarly, by comparing the phylogeographic patterns of twelve sympatric species of frogs, the genetic homogeneity observed throughout the sampling area was attributed to a recent range expansion during favorable climatic conditions [66].

The low genetic diversity we observed in P. leucopus in Southern Québec is consistent with a recent colonisation and a rapid range expansion [69]. On the other hand, the high diversity of closely related haplotypes with no clear geographic structure and the lack of apparent gene flow in P. maniculatus confirm that these populations have established in the region for a longer time, an observation corroborated by other field studies [9]. Populations of P. maniculatus appeared to be more structured and more diversified than populations of P. leucopus sampled in the study area. This clear phylogeographic structure suggests that P. maniculatus reproduces mostly within established populations, and that gene flow between these populations is reduced compared with P. leucoupus. Interestingly, neutrality tests revealed a significant departure from neutrality in populations of P. maniculatus sampled from the south shore of the St. Lawrence River. The same populations of P. maniculatus also exhibited a mismatch distribution with a unique mode. This was not observed on the north shore, and not for P. leucopus, however. Significant deviations from neutrality can be explained by selection or demographic fluctuations such as population expansion following a bottleneck [59]. Likewise, populations that have been experiencing a recent demographic expansion [57, 70] or range expansion [71, 72] generate unimodal mismatch distributions. In the present case, we postulate that interspecific competition between the two Peromyscus species [73, 74] could result in a selective sweep of deer mouse populations on the south shore of the St. Lawrence River, following the recent northward range expansion of the white-footed mouse [9, 16]. These results further reinforce the difference in their phylogeographic structures, especially if P. maniculatus were extirpated from territory newly colonized by P. leucopus.

Overall, our results are consistent with the hypothesis of a range expansion in P. leucopus, and have important implications for the spread of Lyme disease in southern Québec, as the risk of occurrence of B. burgdorferi was strongly associated with the presence of P. leucopus in our study area [27, 29].

Distinct lineages and implications for the emergence of Lyme disease

All five markers used in this study supported an ancestral divergence of mouse populations from the north shore of the St. Lawrence River and populations from the south shore, in both P. leucopus and P. maniculatus. When the sequences of the two species sampled in southern Québec were combined with those obtained in previous studies across North America [19, 20], we found that mice from both shores of the St. Lawrence River belonged to different ancestral clades, in the two species. Mice from the north shore of the St. Lawrence River originated from a Midwestern clade, while mice from the South shore originated from the East Coast clade. Previous studies have also shown that these clades represent different glacial refugia that have since then been expanding northwards independently [18]. Based on our results, the split between these clades corresponding to distinct shores of the St. Lawrence River occurred well before the Late Glacial Maximum. We estimated the time of this divergence at around 2.1–2.7 Mya, a value that must be taken with caution since we did not consider a third Peromyscus species as an outgroup for this analysis.

The St. Lawrence River thus appears to be a geographic barrier sufficiently strong to maintain the ancestral divergence between the two distinct lineages of P. leucopus in Québec, and likely distinct associated strains of B. burgdorferi. In line with this hypothesis, a phylogeographic pattern similar to that observed for the mice was detected in B. burgdorferi, with two main lineages of Borrelia (Midwest and North-East) that diverged since the Late Glacial Maximum across North America [75, 76]. These lineages possess different forms of the OspC protein, an antigen against which mammals can develop immunity [77]. Experimental studies have shown that P. leucopus immunity to B. burgdorferi is strain specific [78]. Since distinct lineages of B. burgdorferi geographically match distinct clades of P. leucopus in the United States, the two different clades of P. leucopus found in Québec are likely carrying distinct strains of the bacteria. We expect two strains of B. burgdorferi, that diverged since the Late Glacial Maximum, to prevail in Québec and that these strains will keep on spreading with the steady range expansion of P. leucopus in the province. It has been shown that B. burgdorferi subspecies manifest different clinical symptoms and responses to diagnosis in humans [79]. The existence of different virulence levels associated with different strains of B. burgdorferi and distinct clades of P. leucopus has significant implications for the monitoring of the emergence of Lyme disease in Québec, and the design of efficient strategies to lower the impact of this zoonosis on human populations.

Supporting Information

S1 Fig. Phylogeographic trees constructed by neighbor-joining from 5 mitochondrial genes (D-loop, Cyt B, ATP8, 16S and COIII) for P. leucopus and P. maniculatus.

The north-shore (NS) clade is framed in grey and the south-shore (SS) in blue. Boostrap support values are shown for the NS/SS split.


S2 Fig. Haplotype network estimated from 5 mitochondrial genes, for P. leucopus and P. maniculatus.

The colors in the networks correspond to sampling localities. North shore sites are on a grey-scale while south shore sites are in color.


S3 Fig. Population network estimated from 5 mitochondrial genes, for P. leucopus and P. maniculatus.

The colors in the networks correspond to sampling localities. North shore sites are on a grey-scale while south shore sites are in color. Dotted lines correspond to connection probabilities lower than 0.2 while solid lines represent connection probabilities of 0.2 and higher.


S4 Fig. Mismatch distribution analysis for P. leucopus and P. maniculatus based on all markers combined.

Distributions are presented for populations located (a) on both shores, (b) only the north shore, and (c) only the south shore of the St. Lawrence River. Empirical distributions (in blue) and expected distributions under a stable population model (in red) are shown on each plot.


S5 Fig. Bayesian phylogenetic tree based on the 87 D-Loop sequences of Peromyscus leucopus included in the present study combined with the sequences in Shipp-Pennock et al [20].

To calibrate the molecular clock, the corresponding 43 sequences of Peromyscus maniculatus were used as outgroup, and the split between both species was set at 3.5 Mya. Clades corresponding to the north and south shores of St. Lawrence River are highlighted to estimate their divergence time.


S6 Fig. Bayesian phylogenetic tree based on the 43 Cyt B sequences of Peromyscus maniculatus included in the present study combined with the sequences in Dragoo et al. [20].

To calibrate the molecular clock, the corresponding 87 sequences of Peromyscus leucopus were used as outgroup, and the split between both species was set at 3.5 Mya. Clades corresponding to the north and south shores of St. Lawrence River are highlighted to estimate their divergence time.


S1 Table. Alignments of 87 Peromyscus leucopus sequences for all mtDNA regions combined.


S2 Table. Alignments of 43 Peromyscus maniculatus sequences for all mtDNA regions combined.


S3 Table. Pairwise Fst values (above diagonal) and corresponding probabilities (below diagonal) among the 16 populations of Peromyscus leucopus sampled in this study.

Significant Fst values are in bold.


S4 Table. Pairwise Fst values (above diagonal) and corresponding probabilities (below diagonal) among the 13 populations of Peromyscus maniculatus sampled in this study.

Significant Fst values are in bold.



We thank A. Bruneau, B. Angers, P. Legendre and C. Favret from the Université de Montréal for their advices and critical comment; C. Bouchard, É. Roy Dufresne and D. Suter for field work assistance; M. Le Cam for programming help; the INSPQ and the many landowners who allowed sample collection; two anonymous reviewers for their constructive comments on previous versions of this article. We also acknowledge the staff at McGill University and Génome Québec for technical assistance with sequencing.

Author Contributions

Conceived and designed the experiments: VM FJL NT. Performed the experiments: JF NT. Analyzed the data: JF FJL. Wrote the paper: JF VM FJL NT.


  1. 1. Diffenbaugh NS, Field CB. Changes in ecologically critical terrestrial climate conditions. Science 2013; 341: 486–492. pmid:23908225
  2. 2. Houghton JT, Ding Y, Griggs DJ, Noguer M, van der Linden PJ, Dai X, et al. Climate change 2001, IPCC (Intergovernmental Panel on Climate Change) third assessment report. New York: Cambridge University Press; 2001.
  3. 3. McCarty JP. Ecological consequences of recent climate change. Conservation Biology 2001; 15: 320–331.
  4. 4. Parmesan P, Yohe G. A globally coherent fingerprint of climate change impacts across natural systems. Nature 2003; 421: 37–42. pmid:12511946
  5. 5. Root TL, Price JT, Hall KR, Schnelder SH, Rosenzwelg C, Pounds JA. Fingerprints of global warming on wild animals and plants. Nature 2003; 421: 57–60. pmid:12511952
  6. 6. Parmesan C. Ecological and evolutionary responses to recent climate change. Annual Review of Ecology, Evolution and Systematics 2006; 37: 637–669.
  7. 7. Swenson NG, Howard DJ. Clustering of contact zones, hybrid zones, and phylogeographic breaks in North America. The American Naturalist 2004; 166: 581–591.
  8. 8. Graham WR. FAUNMAP: an electronic database documenting late Quaternary distribution of mammal species. 2004. Available:
  9. 9. Myers P, Lundrigan BL, Hoffman SMG, Haraminac AP, Seto SH. Climate-induced changes in the small mammal communities of the Northern Great Lakes Regions. Global Change Biology 2009; 15: 1434–1454.
  10. 10. Avise JC, Arnold J, Ball RM, Bermingham E, Lamb T, Neigel JE, et al. Intraspecific phylogeography: the mitochondrial DNA bridge between population genetics and systematics. Annual Review of Evolution and Systematics 1987; 18: 489–522.
  11. 11. Avise JC. Phylogeography: the history and formation of species. Cambridge, Massachusetts: Harvard University Press; 2000.
  12. 12. McKenna MC, Bell SK. Classification of mammal above the species level. New York: Columbia University Press; 1997.
  13. 13. Bradley RD, Durish ND, Rogers DS, Miller JR, Engstrom MD, Kilpatrick CW. Toward a molecular phylogeny for Peromyscus: Evidence from mitochondrial Cytochrome-b sequences. Journal of Mammalogy 2007; 88: 1146–1159. pmid:19924266
  14. 14. Hibbard CW. Paleontology. In: King JA, editor. Biology of Peromyscus (Rodentia), Special Publication no. 2, American Society of Mammalogists; 1968. pp. 6–26.
  15. 15. Hooper ET. Classification. In: King JA, editor. Biology of Peromyscus (Rodentia), Special Publication no. 2, American Society of Mammalogists; 1968. pp. 27–74.
  16. 16. Roy-Dufresne E, Logan T, Simon JA, Chmura G, Millien V. Poleward expansion of the white-footed mouse (Peromyscus leucopus) under climate change: implications for the spread of Lyme disease. PloS ONE 2013; 8: e80724. pmid:24260464
  17. 17. Grant PR. An 11-year study of small mammal populations at Mont St. Hilaire, Quebec. Canadian Journal of Zoology 1976; 54: 2156–2173.
  18. 18. Rowe KC, Heske EJ, Paige KN. Comparative phylogeography of eastern chipmunks and white-footed mice in relation to the individualistic nature of species. Molecular Ecology 2006; 15: 4003–4020. pmid:17054499
  19. 19. Shipp-Pennock MA, Webster WD, Freshwater DW. Systematics of the White-footed mouse (Peromyscus leucopus) in the Mid-Atlantic region. Journal of Mammalogy 2005; 86: 803–813.
  20. 20. Dragoo JW, Lackey JA, Moore KE, Lessa EP, Cook JA, Yates TL. Phylogeography of the deer mouse (Peromyscus maniculatus) provides a predictive framework for research on hantavirus. Journal of General Virology 2006; 87: 1997–2003. pmid:16760402
  21. 21. Templeton AR. Nested clade analyses of phylogeographic data: testing hypotheses about gene flow and population history. Molecular Ecology 1998; 7: 381–397. pmid:9627999
  22. 22. Riddle BR, Hafner DJ, Alexander LF. Comparative phylogeography of Baileys’ Pocket mouse (Chaetodipus baileyi) and the Peromyscus ermicus species group: historical vicariances of the Baja California peninsular desert. Molecular Phylogenetics and Evolution 2000; 17: 161–172. pmid:11083931
  23. 23. Sullivan J, Arellano E, Rogers DS. Comparative phylogeography of Mesoamerican highland rodents: concerted versus independent response to past climatic fluctuations. The American Naturalist 2000; 15: 755–768.
  24. 24. Zheng X, Arbogast BS, Kenagy GJ. Historical demography and genetic structure of sister species: deermice (Peromyscus) in the North American temperate rain forest. Molecular Ecology 2003; 12: 711–724. pmid:12675826
  25. 25. Rogic A, Tessier N, Legendre P, Lapointe FJ, Millien V. Genetic structure of the white-footed mouse in the context of the emergence of Lyme disease in southern Québec. Ecology and Evolution 2013; 3: 2075–2088. pmid:23919153
  26. 26. Marrotte R, Gonzalez A, Millien V. Landscape heterogeneity and habitat quality combine to provide an optimal model of connectivity and estimate gene flow at the range margin of a small mammal. Molecular Ecology 2014; 23: 3983–3998. pmid:24975474
  27. 27. Ogden NH, Bouchard C, Kurtenbach K, Margos G, Lindsay LR, Trudel L, et al. Active and passive surveillance and phylogenetic analysis of Borrelia burgdorferi elucidate the process of Lyme disease risk emergence in Canada. Environmental Health Perspectives 2010; 18: 909–914.
  28. 28. Hanincova K, Kurtenbach K, Diuk-Wasser M, Brei B, Fish D. Epidemic spread of Lyme Borreliosis, Northeastern United States. Emerging Infectious Diseases 2006; 12: 604–611. pmid:16704808
  29. 29. Simon JA, Marrotte RR, Desrosiers N, Fiset J, Gaitan J, Gonzalez A, et al. Climate change and habitat fragmentation drive the occurrence of B. burgdorferi, the agent of Lyme disease, at the northern limit of its distribution. Evolutionary Applications 2014; 7: 750–764. pmid:25469157
  30. 30. Ostfeld RS. Lyme disease: The ecology of a complex system. New York: Oxford University Press; 2011.
  31. 31. Rand PW, Lacombe EH, Smith RP Jr, Rich SM, Kilpatrick CW, Dragoni CA, et al. Competence of Peromyscus maniculatus (Rodentia: Cricetidae) as a reservoir host for Borrelia burgdorferi (Spirochaetares: Spirochaetaceae) in the wild. Journal of Medical Entomology 1993; 30: 614–618. pmid:8510121
  32. 32. Ogden NH, St-Onge L, Barken IK, Brazeau S, Bigras-Poulin M, Charron DF, et al. Risk maps for range expansion of the Lyme disease vector, Ixodes scapularis, in Canada now with climate change. International Journal of Health Geographics 2008; 22: 7–24.
  33. 33. Bouchard C, Beauchamps G, Nguon S, Trudel L, Milord F, Lindsay LR, et al. Associations between Ixodes scapularis ticks and small mammal host in a newly endemic zone in southeastern Canada: implications for Borrelia burgdorferi transmission. Ticks and Tick-borne Diseases 2011; 2: 183–190. pmid:22108010
  34. 34. Olsen JB, Wenburg JK, Bentzen P. Semiautomated multilocus genotyping of Pacific salmon (Oncorhnychus spp.) using microsatellites. Molecular Marine Biology and Biotechnology 1996; 5: 259–272. pmid:8983195
  35. 35. Tessier N, Noël S, Lapointe FJ. A new method to discriminate the deer mouse (Peromyscus maniculatus) from the white-footed mouse (Peromyscus leucopus) using species-specific primers in multiplex PCR. Canadian Journal of Zoology 2004; 82: 1832–1835.
  36. 36. Griekspoor A, Groothuis T. 4peaks: a program that helps molecular biologists to visualize and edit their DNA sequence files; 2005. Available:
  37. 37. Thompson JD, Gibson JT, Plewniac F, Jeanmougin F, Higgins DG. The Clustal-X windows interface: flexible strategies for multiple sequences alignement aided by quality analysis tools. Nucleic Acids Research 1997; 25: 4876–4882. pmid:9396791
  38. 38. Librado P, Rozas J. DnaSP v5: a software for comprehensive analysis of DNA polymorphism data. Bioinformatics 2009; 25: 1451–1452. pmid:19346325
  39. 39. Pfeifer B, Wittelsbürger U, Ramos-Onsins SE, Lercher MJ. 2014. PopGenome: an efficient Swiss army knife for population genomic analyses in R. Molecular Biology and Evolution 2014; 31: 1929–1936. pmid:24739305
  40. 40. Posada D, Crandall KA. Modeltest: testing the model of DNA substitution. Bioinformatics 1998; 14: 817–818. pmid:9918953
  41. 41. Paradis E, Claude J, Strimmer K. APE: analyses of phylogenetics and evolution in R language. Bioinformatics 2004; 20: 289–290. pmid:14734327
  42. 42. Saitou N, Nei M. The neighbor-joining method: a new method for reconstructing phylogenetic trees. Molecular Biology and Evolution 1987; 4: 406–425. pmid:3447015
  43. 43. Kimura M. A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences. Journal of Molecular Evolution 1980; 16: 111–120. pmid:7463489
  44. 44. Felsenstein J, Churchill GA. A Hidden Markov model approach to variation among sites in rate of evolution. Molecular Biology and Evolution 1996; 13: 93–104. pmid:8583911
  45. 45. Guindon S, Lethiec F, Duroux P, Gascuel O. PHYML Online: a web server for fast maximum likelihood-based phylogenetic inference. Nucleic Acids Research 2005; 33: W557–559. pmid:15980534
  46. 46. Farris JM. Methods for computing Wagner trees. Systematic Zoology 1970; 19: 83–92.
  47. 47. Schliep KP. Phangorn: phylogenetic analysis in R. Bioinformatics 2011; 27: 592–593. pmid:21169378
  48. 48. Efron B, Tibshirani RJ. An Introduction to the Bootstrap. New York: Chapman & Hall; 1993.
  49. 49. Farris JS, Källersjö M, Kluge AG, Bult C. Testing significance of congruence Cladistics 1994; 10: 315–319.
  50. 50. Swofford D. PAUP*. Phylogenetic Analysis Using Parsimony (* and other methods). Version 4. Sunderland, Massachusetts: Sinauer Associates; 2002.
  51. 51. Legendre P, Lapointe FJ. Assessing congruence among distance matrices: single-malt Scotch whiskies revisited. Australian and New Zealand Journal of Statistics 2004; 46: 615–629.
  52. 52. Campbell V, Legendre P, Lapointe FJ. The performance of the Congruence Among Distance Matrices (CADM) test in phylogenetic analysis. BMC Evolutionary Biology 2011; 11: 64. pmid:21388552
  53. 53. Mantel N. The detection of disease clustering and a generalized regression approach. Cancer Research 1967; 27: 209–220. pmid:6018555
  54. 54. Salzburger W, Ewing GB, Von Haeseler A. The performance of phylogenetic algorithms in estimating haplotype genealogies with migration. Molecular Ecology 2011; 20: 1952–1963. pmid:21457168
  55. 55. Lapointe FJ, Lopez P, Boucher Y, Koenig J, Bapteste E. Clanistics: a multi-level perspective for harvesting unrooted gene trees. Trends in Microbiology 2010; 18: 341–347. pmid:20605718
  56. 56. Dyer RJ, Nason JD. Population graphs: the graph theoretic shape of genetic structure. Molecular Ecology 2004; 13: 1713–1727. pmid:15189198
  57. 57. Rogers AR, Harpending H. Population growth makes waves in the distribution of pairwise genetic differences. Molecular Biology and Evolution 1992; 9: 552–569. pmid:1316531
  58. 58. Tajima F. The effect of change in population-size on DNA polymorphism. Genetics 1989; 123: 597–601. pmid:2599369
  59. 59. Fu YX. Statistical tests of neutrality of mutations against population growth, hitchhiking and background selection. Genetics 1997; 147: 915–925. pmid:9335623
  60. 60. Ramos-Onsins R, Rozas R. Statistical properties of new neutrality tests against population growth. Molecular Biology and Evolution 2002; 19: 2092–2100. pmid:12446801
  61. 61. Excoffier L, Lischer HEL. Arlequin suite ver 3.5: a new series of programs to perform population genetics analyses under Linux and Windows. Molecular Ecology Resources 2010; 10: 564–567. pmid:21565059
  62. 62. Paradis E. Pegas: an R package for population genetics with an integrated–modular approach. Bioinformatics 2010; 26: 419–420. pmid:20080509
  63. 63. Drummond AJ, Suchard MA, Xie D, Rambaut A. 2012. Bayesian phylogenetics with BEAUti and the BEAST 1.7. Molecular Biology and Evolution 2012; 29: 1969–1973. pmid:22367748
  64. 64. Riddle BR, Hafner DJ, Alexander LF. 2000. Phylogeography and systematics of the Peromyscus eremicus species group and the historical biogeography of North American warm regional deserts. Molecular Phylogenetics and Evolution 2000; 17: 145–160. pmid:11083930
  65. 65. Bell RC, MacKenzie JB, Hickerson MJ, Chavarria KL, Cunningham M, Williams S, et al. Comparative multi-locus phylogeography confirms multiple vicariance events in co-distributes rainforest frogs. Proceedings of the Royal Society B: Biological Sciences 2012; 279: 991–999. pmid:21900325
  66. 66. Fouquet A, Noonan BP, Rodrigues MT, Pech N, Gilles A, Gemmell NJ. Multiple quaternary refugia in the eastern Guiana shield revealed by comparative phylogeography of 12 frog species. Systematic Biology 2012; 61: 461–489. pmid:22223446
  67. 67. Noël S, Angers B, Lapointe FJ. Peromyscus populations and their Cuterebra parasites display congruent phylogeographical structure. Parasitology 2005; 131: 237–245. pmid:16145940
  68. 68. Ditchfield AD. The comparative phylogeography of Neotropical mammals: patterns of intraspecific mitochondrial DNA variation among bats contrasted to nonvolant small mammals, Molecular Ecology 2000; 9: 1307–1318. pmid:10972770
  69. 69. Polihronakis M, Caterino MS. Contrasting patterns of phylogeographic relationships in sympatric sister species of ironclad beetles (Zopheridae: Phloeodes spp.) in California's Transverse Ranges. BMC Evolutionary Biology 2010; 10: 195. pmid:20573263
  70. 70. Slatkin M, Hudson RR. Pairwise comparisons of mitochondrial DNA sequences in stable and exponentially growing populations. Genetics 1991; 129: 555–562. pmid:1743491
  71. 71. Ray N, Currat M, Excoffier L. Intra-deme molecular diversity in spatially expanding populations. Molecular Biological and Evolution 2003; 20: 76–86.
  72. 72. Excoffier L. Patterns of DNA sequence diversity and genetic structure after a range expansion: lessons from the infinite-island model. Molecular Ecology 2004; 13: 853–864. pmid:15012760
  73. 73. Dooley JL, Dueser RD. Experimental tests of nest site competition in two Peromyscus species. Oecologia 1996; 105: 81–86.
  74. 74. Wolff JO. Coexistence of white-footed mice and deer mice may be mediated by fluctuating environmental conditions. Oecologia 1996; 108: 529–533.
  75. 75. Humphrey PT, Caporale DA, Brisson D. Uncoordinated phylogeography of Borrelia burgdorferi and its tick vector, Ixodes scapularis. Evolution 2010; 64: 2653–2663. pmid:20394659
  76. 76. Margos G, Tsao JI, Castillo-Ramirez S, Girard YA, Hammer SA, Gatewood Hoen A, et al. Two boundaries separate Borrelia burgdorferi populations in North America. Applied and Environmental Microbiology 2012; 78: 6059–6067. pmid:22729536
  77. 77. Barbour AG, Travinsky B. Evolution and distribution of the ospC Gene, a transferable serotype determinant of Borrelia burgdorferi. 2010; mBio 1: e00153–10. pmid:20877579
  78. 78. Schwan TG, Kime KK, Schrumpf ME, Coe JE, Simpson WJ. Antibody response in white-footed mice (Peromyscus leucopus) experimentally infected with the Lyme disease spirochete (Borrelia burgdorferi). Infection and Immunity 1989; 57: 3445–3451. pmid:2807530
  79. 79. Stanek D, Strle F. Lyme borreliosis: a European perspective on diagnosis and clinical management. Current Opinion in Infectious Diseases 2009; 22: 450–45. pmid:19571749