## Figures

## Abstract

It is well known that parasites are often highly aggregated on their hosts such that relatively few individuals host the large majority of parasites. When the parasites are vectors of infectious disease, a key consequence of this aggregation can be increased disease transmission rates. The cause of this aggregation, however, is much less clear, especially for parasites such as arthropod vectors, which generally spend only a short time on their hosts. Regression-based analyses of ticks on various hosts have focused almost exclusively on identifying the intrinsic host characteristics associated with large burdens, but these efforts have had mixed results; most host traits examined have some small influence, but none are key. An alternative approach, the Poisson-gamma mixture distribution, has often been used to describe aggregated parasite distributions in a range of host/macroparasite systems, but lacks a clear mechanistic basis. Here, we extend this framework by linking it to a general model of parasite accumulation. Then, focusing on blacklegged ticks (*Ixodes scapularis*) on mice (*Peromyscus leucopus*), we fit the extended model to the best currently available larval tick burden datasets via hierarchical Bayesian methods, and use it to explore the relative contributions of intrinsic and extrinsic factors on observed tick burdens. Our results suggest that simple bad luck—inhabiting a home range with high vector density—may play a much larger role in determining parasite burdens than is currently appreciated.

**Citation: **Calabrese JM, Brunner JL, Ostfeld RS (2011) Partitioning the Aggregation of Parasites on Hosts into Intrinsic and Extrinsic Components via an Extended Poisson-Gamma Mixture Model. PLoS ONE 6(12):
e29215.
https://doi.org/10.1371/journal.pone.0029215

**Editor: **Ulrike Gertrud Munderloh, University of Minnesota, United States of America

**Received: **June 4, 2011; **Accepted: **November 22, 2011; **Published: ** December 22, 2011

This is an open-access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 public domain dedication.

**Funding: **The data collection for this research was supported by NSF grants DEB 0075277 and DEB 0444585, and NIH grant R01 AI40076 (www.nsf.gov, www.nih.gov). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

**Competing interests: ** The authors have declared that no competing interests exist.

## Introduction

Parasites, from nematodes and trematodes to lice and ticks, are typically highly aggregated on their hosts with relatively few individuals hosting the large majority parasites [1]–[3]. Indeed, parasite burdens among hosts are usually described by a negative binomial distribution (NBD) with its characteristic long right tail representing those few highly infected hosts [1], [3]. While many explanations for macroparasite (e.g., helminthes, cestodes, nematodes) aggregation exist, most involve small differences among host in terms of behavior, innate susceptibility, or acquired immune responses being magnified throughout the infection and/or lifetime of the host [1], [4]–[8]. Life-long infections and parasite replication on or in the host tend to increase aggregation, while density-dependent parasite mortality and parasite-induced host mortality work to reduce aggregation [4]. Most arthropod vectors, however, spend only a short time on their hosts and reproduce elsewhere, so these feedbacks have little time to manifest. While variation in extrinsic factors has historically been discussed as a potential cause of aggregation [1], [9], [10], recent studies have focused mainly on identifying the intrinsic host characteristics (e.g., sex, age, activity rates) presumably associated with large parasite burdens [11]–[14].

Understanding the cause(s) vector aggregation on hosts is important because this aggregation can inflate the potential rate of spread of an infection [15], [16]. One widely-cited example is tick-borne encephalitis (TBE), which is caused by a virus transmitted between *Ixodes ricinus* ticks when they co-feed on hosts such as yellow-necked mice, *Apodemus flavicollis* [17], [18]. Most TBE transmission occurs on the hosts with the greatest tick burdens [11]. If public health interventions could target the most infested 20% of hosts, transmission of TBE to humans could be effectively reduced by 75% [11], but similar interventions targeted at random hosts could be expected to have only negligible impact [15], [16]. Thus identifying those hosts responsible for feeding and infecting the most vectors has become a priority and has clear implications for disease management.

Currently two classes of models are applied to study parasite aggregation, neither of which allows direct quantification of the relative contributions of intrinsic and extrinsic factors. Regression-based approaches assuming negative binomial error structure and treating , the overdisperison parameter of the NBD, as a nuisance parameter [11], [12], [14] focus on identifying covariates that account for variation in mean burdens among groups of hosts. There is typically no link in these studies between biological processes and degrees of overdispersion (though it is possible to model as function of other variables [19], [20]). The second approach assumes that hosts randomly sample parasites from their environment, which would result in a Poisson distribution of burdens, but that the sampling rate (i.e., the expected burden) varies among hosts [1], [16], [21]. When the sampling rate of the Poisson is gamma distributed, the marginal distribution of burdens is negative binomial [1], [16], [22]. Variation in the sampling rate among hosts therefore causes overdispersion. In contrast to the regression approach, the Poisson-gamma mixture directly results in a NBD of burdens and the aggregation parameter, , can be expressed in terms of the parameters of the sampling rate distribution [23]. Unfortunately, the variation in sampling rates that drives aggregation is typically unobserved, and thus the causes of this variation are not identified.

Empirical studies of tick aggregation have focused heavily on identifying intrinsic host characteristics that explain observed tick burdens via the regression approach. The rationale is that *a priori* identification of hosts likely to have large burdens could lead to targeted and highly effective control efforts. Unfortunately, these efforts have, so far, produced equivocal results, with few consistent factors emerging across different studies, systems, sites, and years. Males often have greater burdens than females (e.g., on *A. flavicollis*, [11]), but two recent studies have shown that sex is just one of myriad host and environmental variables that each explain a small portion of the variability in burdens on several rodent species [12], [14]. Even a study that explicitly linked activity/exploration phenotypes of Siberian chipmunks (*Tamias sibiricus*) to tick burdens found that many variables, including their interactions, were significantly associated with burdens [13]. It is still not clear from these studies how much of the observed aggregation of tick burdens is due to variation in susceptibility among hosts, and how much is due to extrinsic factors such as variation in questing tick densities among host home ranges. If tick burdens are driven primarily by random, extrinsic factors, control efforts focusing on identifying the most susceptible hosts via host characteristics may be doomed to fail.

We have two main goals in this paper. First, we introduce a general framework for understanding the distribution of parasites on hosts. The framework consists of a simple and flexible mechanistic model of parasite accumulation that could be easily tailored to a wide range of host-parasite systems, and an explicit consideration of how variability among hosts enters into the parasite accumulation process. Our framework can be understood as an extension of the Poisson-gamma mixture model widely invoked to explain negative binomial parasite distributions. Second, we use a hierarchical Bayesian approach to couple our model to the best available data on blacklegged ticks and white-footed mice to quantify the degree to which random variation in tick density among mouse home ranges affects the overdispersion of tick burdens on mice.

## Methods

As mentioned above, the key weakness of using the Poisson-gamma mixture to model parasite burden distributions is that sampling rate variation is usually unobserved. Because of this, it is often not possible to identify the causes of aggregation with burden data alone. We seek here to extend the Poisson-gamma mixture framework by linking variation in the sampling rate among hosts to an underlying model of parasite accumulation. In other words, we aim to write the NBD and its aggregation parameter, , in terms of an accumulation model and associated sources of variability among individuals. It will then be possible to use burden data in combination with other types of data that can speak directly to the accumulation process. In our empirical example, we focus on a key extrinsic factor, spatial variation in larval blacklegged tick (*I. scapularis*) density, to quantify its contribution to aggregation relative to that of intrinsic differences in susceptibility among white-footed mice (*Peromyscus leucopus*).

We first derive a simple model relating host movement and tick density to the expected burden, or sampling rate, , on an individual host. We assume that the realized tick burden on a host, , is Poisson distributed with rate parameter . We then consider how variation in among hosts gives rise to an approximately negative binomial distribution of burdens. Table 1 summarizes the symbols and notation we use throughout the paper.

### A simple tick accumulation model

We assume that each host occupies a home range characterized by its area, , and parasite density, . Hosts move within their home ranges and encounter ticks as they do so. The the per day rate at which a host encounters and picks up parasites should then be proportional to the product of the distance it moves per day, , and the density of questing larvae, , in its home range.

We further assume that each home range can be characterized by the average parasite density the host experiences in that home range, and that this density remains constant during the study period. This later assumption implies that the removal and feeding of parasites by hosts and by parasite mortality are insignificant compared to the number of questing parasites available in a home range, at least over a short period of time. This assumption is reasonable for our blacklegged tick/white-footed mouse example, given that densities of questing nymphs and larvae, as well as burdens on mice, remain high for several weeks during the peak of the season [12], [24].

Lastly, we assume that the parasites feeding on a host drop off after successfully feeding or are removed (e.g., by grooming or host immune responses) at a constant rate, , independent of the density of ticks on the host (although it is possible to modify this assumption). The expected tick burden, , on a host at time is therefore determined by the rates at which parasites are picked up and lost, or(1)Notice that the accumulation constant, , could be broken into a number of individual constants including the width of area “sampled” by the host, the probability that given an encounter, the parasite attaches to the host, etc. These constants all enter as a product whose individual components are not separately estimable from the data at hand, and so we lump them together. Lumping these factors into a single constant is standard practice in models that have an encounter or accumulation term, such as predator-prey models (e.g., see derivation of the predator functional response curves in [25]). If more detailed data were available, the components of could be kept separate.

Assuming that parasite burdens at the time of sampling have reached their equilibrium (again, for our example system, the seasonal “peaks” in tick burdens last for several weeks) [12], we focus on the stationary solution of equation (1):(2)

For notational convenience, we will refer hereafter to the equilibrium sampling rate simply as , dropping the superscript. Again, if data on host movement or the components of are available, these factors could be kept separate, and the steps below could then be performed on this expanded model. Focusing on our empirical example, we assume that , , and are intrinsic to the individual, while variation in is extrinsic to the individual. Defining , collects all intrinsic components of the model into a single “susceptibility” factor. While it could be argued that movement rates are determined by extrinsic factors such as resource densities, it is equally true that the ability of an individual to occupy and maintain a home range with a given level of resources depends on intrinsic characteristics, such as sex, age class, and the animal's condition. In any case, by focusing on parasite densities, , which is clearly extrinsic to the host, and letting these other factors be included in the variable , we are being conservative about the importance of extrinsic factors.

### The approximate distribution of tick burdens

Both and will vary among hosts and will thus be considered random variables. As both are continuous, must be non-negative, and could conceivably assume a range of different distributional shapes, we assume they are gamma distributed. We denote shape parameters of gamma distributions by and scale parameters by . Thus, the gamma distribution of is characterized by and , while that of is parameterized by and .

When derived as a Poisson-gamma mixture, the probability mass function of the NBD can be written in terms of the parameters of the gamma distribution of sampling rates [23], yielding(3)where is the Euler gamma function, and and are the shape and scale parameters, respectively, of the sampling rate distribution.

Equation (2) shows that is the product of and . In appendix S1, we show via simulation that a gamma distribution provides a good approximation of the distribution of the product . We then use this fact to derive an approximation that allows us to write the parameters of the approximate gamma distribution of in terms of , , , and . The resulting approximate expressions for the shape and scale parameters of the rate distribution are (appendix S1)(4)and(5)

Substituting equations (4) and (5) into equation (3), we obtain an approximation for the distribution of parasite burdens over hosts in terms of the accumulation model and associated sources of variability. The mean of the burden distribution is(6)and the aggregation parameter of the burden distribution is . In other words, the degree of aggregation in tick burdens is determined by the shape parameter of the gamma distribution that describes how accumulation rates vary among individual hosts. The rate distribution shape parameter is, in turn, a function of and . Thus, this approximation links lower-level processes governing vector accumulation, which are potentially measurable, to the degree of aggregation in vector burdens on hosts.

Focusing on our blacklegged tick example, we can now develop an index that quantifies the contribution of variation in questing larval density among host home ranges, , to the observed value of . The limit of the expression for as (i.e., as the effect of variation in larval density goes away) is simply . In this limit, equations (3), (4), (5), and (6) are exact (i.e., the rate parameter distribution is a gamma) and variation among individuals in sampling rate is driven entirely by differential susceptibility. The ratio will then be as long as , and the quantity is a measure of the degree to which the estimated value of reduces the value of conditional on the value of . In other words, when the aggregation of vectors on hosts is dominated by differences in vector densities among home ranges and is zero when the aggregation is due entirely to differences in individual susceptibility. Writing in terms of and , we obtain(7)Checking the limit behavior of , we see that when , and when , as expected.

### Empirical data

We now show how the above-described extended Poisson-gamma mixture can be combined with empirical data to tease apart the contribution of extrinsic and intrinsic factors on the larval tick burdens of white-footed mice. We used two years of data (1999 and 2004) from two of the six permanent small mammal trapping grids (GC and TX) in the oak and maple dominated forests tracts of the Cary Institute of Ecosystem Studies (CIES) that have been trapped for by R.S. Ostfeld and colleagues. We chose these years and grids because they offered the most observations of larval burdens and mouse home range sizes and densities of questing larvae, used to estimate . A more detailed description of the trapping methods can be found elsewhere [26].

Questing activities and larval burdens are highly seasonal [24], showing fairly distinct, but broad peaks in the late summer/early fall (mid- to late-August into early September). We therefore restricted our analyses to the data collected during these peaks, as visually identified. In addition, individual mice were often captured several times (this being a mark-recapture study). In order to avoid multiple non-independent measurements of tick burdens, we selected at random only one observation per individual mouse. *Ixodes scapularis* were counted on each mouses' head and ears, and these counts are highly correlated with whole-body larval burdens () [26].

Densities of host-seeking, or “questing”, larvae at a site were estimated using standard drag cloth methods [27] along transects, so the grain of our tick density data is . Dragging was done several times during the expected peaks of larval activity, but the actual dates of dragging were inconsistent between years and trapping grids. We therefore restricted our analyses to the three or four transects that coincided with and straddled the peaks of questing larvae densities and of larval burdens.

Ideally, we would have data on larval tick densities across the home ranges of individual mice, or at least at the scale of mouse home ranges. As with every other study we are aware of, our tick density data are not paired with individual mice and estimated mouse home range areas are generally much larger than the tick drags (see appendix S2). To deal with this issue, we upscaled the density data to the home range sizes using two assumptions about the spatial correlation among samples, which bracket the range of possibilities. The upscaling proceeds by selecting a random home range size from the home range area distribution (appendix S2). This area is then “filled” at a time from the distribution of larval drags (appendix S2). The larval drag transects are widely spaced within the trapping grids. Filling each home range with random samples from the tick density distribution corresponds to one extreme where there is no short distance correlation in tick densities (hereafter Rnd). Thus the filled home ranges all tend towards the overall mean tick density and among home range differences are at their minimum. The other extreme, perfect short distance correlation in tick densities (hereafter Cor), can be obtained by taking a single random sample from the larval drag distribution and multiplying it by , where is the area of the focal home range. In this case, the large degree of heterogeneity observed among drags is preserved at the scale of entire home ranges. For each grid/year combination, this procedure was repeated 10000 times for each of the Rnd and Cor assumptions. Finally, random samples of 15 areas (matching the smallest actual sample size involved, that of larval drags for each grid year combination) were drawn from the Rnd and Cor distributions for each grid/year combination. This resulted in 8 datasets: 2 grids×2 years×2 density upscaling assumptions.

### Hierarchical Bayesian parameterization of the accumulation model

We employed a hierarchical Bayesian (HB) framework to fit our accumulation model to the larval burden and the upscaled larval density datasets (Fig. 1) [28], [29]. The framework includes two latent variables–“true susceptibility” and “true tick density”–to account for the facts that: 1) susceptibility is not directly observed, and 2) “observations” of upscaled larval tick densities cannot be directly paired with observations of tick burdens. The overall likelihood is thus a product of the two conditionally independent likelihoods of the data sources (burdens and upscaled densities), conditioned on the values of the latent variables (Fig. 1).

Gray boxes identify the levels in the hierarchy, white boxes represent data, and white ovals represent low-level model elements. Arrows show the relationships among model elements.

We used noninformative (uniform) priors for all four model parameters (, , , and ) on the four Rnd datasets. For the four Cor datasets, a weakly informative half-Cauchy prior [29], [30] was used on to achieve convergence (see appendix S3 for explanation), while uniform priors were used for the other three parameters. Though this prior introduces a slight bias in the results in favor of increasing the apparent contribution of differential susceptibility, its effect on our qualitative results is negligible: Upscaled tick densities account for most of the aggregation in burdens in the Cor datasets (table 2).

We implemented this approach via MCMC sampling in WinBugs 1.4 [31]. The WinBugs code including the priors is listed in appendix S3. All analyses except for TX 1999 Cor employed a 70,000 iteration burn-in period followed by 30,000 iterations of which 5000 were kept as samples from the posterior distribution. A longer burn-in period of 150,000 iterations was used for TX 1999 Cor. For each dataset, we ran three chains started from widely spaced initial conditions. We used , the Gelman-Rubin statistic [32], to verify convergence was achieved ( for all model parameters). Finally, we used posterior predictive simulations to check the fit of the models to the burden data and to propagate uncertainty in model parameters through to summary quantities that are functions of model parameters (, and ).

### Comparison of the accumulation model to the classical NBD

The NBD as commonly used in parasitology and ecology is parameterized by its mean, , and aggregation parameter, , which are estimated from count (e.g., burden) data via maximum likelihood [1], [22], [23]. Our accumulation model is parameterized in terms of the distributions of ( and ) and ( and ), but equations (4) and (6) allow us to calculate the aggregation parameter, , and the mean, , respectively, of the burden distribution that results from our Bayesian fit of the accumulation model to each dataset. Thus, it is possible to compare our accumulation model, which is fit using upscaled tick density data in addition burden data, to the classical NBD, which is fit using only burden data. This comparison serves two purposes. The first is as a consistency check of the new accumulation model in that the model should provide similar values of and as those obtained by fitting the classical NBD via maximum likelihood. The second purpose is to examine how the two upscaling assumptions affect the degree of aggregation, as measured by , relative to that obtained by fitting the empirical NBD. We use the subscript Cls to refer to the classical NBD, and the subscripts Rnd and Cor to refer to the accumulation model fitted to the Rnd and Cor upscaled density datasets, respectively.

## Results

The parameterized HB models successfully described the observed distributions of larval blacklegged ticks on white-footed mice (Figs. 2 and 3). The expected burden distribution under the fitted model and the 95% credible regions in these figures are obtained by posterior predictive simulation. The model fits well in most cases, with some disagreement in the upper quantiles for GC 1999 Rnd, and to a lesser extent, for GC 1999 Cor (Fig. 2). Table 2 presents Bayesian posterior means and 95% credible intervals for accumulation model parameters.

The “expected” distribution (solid lines) under the fitted accumulation model, as well as the 95% credible regions (dashed lines) around the predicted line, were generated via posterior predictive simulations.

The “expected” distribution (solid lines) under the fitted accumulation model, as well as the 95% credible regions (dashed lines) around the predicted line, were generated via posterior predictive simulations.

A key strength of our analytical approximation of the burden distribution is that it allows us to directly examine the factors that drive aggregation. Equation (4) shows that depends only on the shape parameters of the distributions of and . The effect of each variable on goes away as its shape parameter becomes large and its distribution becomes symmetrical. Thus skewness in each component distribution, as indicated by a small shape parameter value, translates into aggregation in the overall distribution of burdens. This can be seen by examining how the and point estimates change between the Rnd and Cor versions of each site/year dataset (Table 2). In all but the TX 1999 Rnd case, where and are essentially equal, for the Rnd datasets, and for the Cor datasets.

Point estimates of , the degree to which the aggregation in tick burdens is driven by variation in questing larval density, are higher in the Cor datasets (those in which tick distributions are spatially autocorrelated) than in the Rnd datasets (those without spatial autocorrelation) (Table 2). This indicates that strongly skewed tick density distributions ( small) can account for most of the aggregation observed in the burden data. Even in the Rnd datasets, where tick densities in different hosts' home ranges are more similar, point estimates for can be as high as 0.54 (TX 1999 Rnd), and are never less than , indicating that variability in the tick density experienced by different mice can still play a substantial role in explaining observed burdens.

Table 3 compares the posterior predictive mean values of the burden distribution mean and aggregation parameter under the fitted accumulation model for both the Rnd ( and ) and Cor ( and ) datasets to their empirical counterparts ( and ). The point and interval estimates obtained by the different methods are generally similar, further demonstrating the consistency between the accumulation model and observed data. Though there is substantial overlap between the Rnd and Cor datasets, the Cor datasets produce somewhat lower values of than those estimated directly from the burden data, suggesting that such strong spatial correlation in tick density introduces too much aggregation in burdens. This tendency suggests that, at least in extreme scenarios, variation in the tick density experienced by different individuals can be more than enough to account for the observed aggregation in parasite burdens.

## Discussion

We have developed an extension of the classical Poisson-gamma mixture model of overdispersed parasite burdens by linking among-host variation in parasite sampling rate to a mechanistic model of parasite accumulation. The key idea that the distribution of parasite sampling rates among hosts can be related to lower-level parasite encounter and accumulation processes is very general and should apply to many host/macroparasite systems. While certain details of the accumulation process will likely be system specific, the formulation of the accumulation model is flexible and can be tailored to such details when necessary. When embedded in a hierarchical Bayesian statistical framework, our model allows multiple sources of information, acting on different hierarchal levels, to be coherently integrated. The parameters of the distribution of parasite burdens can be written in terms of the components of the accumulation model and thus be linked to lower-level processes, and uncertainty in model parameters can be propagated through to quantities that are functions of model parameters (, , and ). Our framework differs from other models of macroparasite burdens in that it describes the shape of the distribution of burdens as a function of a biologically relevant parameters rather than simply treating the overdispersion parameter, , as a nuisance parameter, as in regression-based approaches, or leaving the variation in sampling rates among hosts unexplained, as in traditional applications of the Poisson-gamma mixture.

With only four free parameters, our model is able to reproduce the observed distribution of blacklegged tick burdens on white-footed mice in several places and times. Moreover, it provides a novel way to separate the contribution of intrinsic factors affecting parasite aggregation from that of extrinsic factors such as spatial variation in parasite density. The burden data provide strong information about , while the upscaled tick density data provide information about , both directly and indirectly through the accumulation model (Fig. 1). Overdispersion in burdens that cannot be accounted for by is absorbed by the latent variable and is thus attributed to differential susceptibility. Additional information, such as data on individual rates of host movement or grooming could easily be accommodated within our framework to provide more precise estimates of the parameters governing .

We have identified patchiness in the spatial distribution of questing tick density as a key factor in explaining observed burden distributions. Highly patchy distributions imply strong short-distance correlation in tick density, meaning adjacent areas will likely have similar tick densities on small spatial scales. Though we could not directly quantify this correlation with available data, we based our analyses on two extreme assumptions (no correlation and perfect correlation) that bracket the range of possibilities. In the Rnd extreme, where tick densities among home ranges tended toward the site-level mean, variation in tick densities was still important in some locations and years (e.g., TX 1999 Rnd), while it was less of a factor in others (e.g., TX 2004 Rnd). Importantly, the influence of this extrinsic factor did not completely disappear in any of the cases considered under the Rnd assumption ( values ranged from 0.1 to 0.54). Examining the other extreme (Cor), where the differences in tick densities among home ranges were most pronounced, we see that there is a tendency toward slightly too much aggregation, as demonstrated by the posterior predictive mean values in table 3. This implies that highly patchy tick spatial distributions can account for most or all of the aggregation observed in tick burden distributions. Furthermore, questing tick density had a strong effect on observed values at all grid/year combinations examined under the Cor assumption. As questing larvae distributions are known to be highly patchy [33], [34], we argue that the degree of correlation will likely fall closer to the Cor extreme than to the Rnd extreme.

Our results suggest that the often extreme differences in individual tick burdens we observe are not solely or, in many cases, even mostly caused by intrinsic differences in individual susceptibility due, for instance, to sex or life history stage. These differences in burdens can instead be explained primarily by random differences in the densities of questing ticks experienced by different hosts. As questing tick densities become more variable among home ranges, so ticks become more aggregated on a relatively small proportion of hosts. In other words, our results imply that some mice may have extremely large tick burdens simply because of bad luck; their home ranges happen to overlap with areas of high tick density.

Spatial variation in questing larval density is very clearly a product of random processes. Gravid *I. scapularis* drop to the ground after feeding to repletion on their blood meal host, usually a deer or other larger bodied mammal, wherever that may be, and lay eggs close to where they fall [24]. The adult females and resulting larvae move no more than a meter or two while questing for a host [35]. There is no evidence, to the best of our knowledge, that gravid females choose where to drop to the ground. If there is a deterministic aspect to local questing larval densities, it is that some locations may be more favorable for larval hatching and survival [36]. Our results highlight the importance of quantifying questing tick density within each host home range. The availability of such data would improve our ability to pin down the mechanisms driving the aggregation of vectors on hosts. We are currently attempting to directly quantify the relationship between home-range-scale larval tick density and host body burden in the blacklegged tick/white-footed mouse system. Such a dataset will facilitate a direct test of our main empirical conclusion here. Quantifying variability in movement rates among hosts would further refine our understanding of the mechanisms governing parasite accumulation.

Our main empirical result, that variation in tick densities among home ranges can strongly affect tick burdens, is, one the one hand, not surprising. It is well known that tick spatial distributions are patchy on relatively small scales [33], [34], and it is logical to expect that this variability will affect tick burdens on hosts. On the other hand, the focus in the literature has very clearly been on trying to identify *a priori* biological characteristics that reliably predict parasite burdens [11], [12], [14]. This search assumes that host-related factors account for the majority of the variation in observed burdens. Our results cast substantial doubt on this assumption, and suggest that more effort should be spent on testing it and on quantifying the contribution of random, extrinsic factors. As much of the variation in tick burdens could potentially be explained by largely unpredictable, small-scale variation in the density of questing ticks, our results imply that it may be impossible to predict *a priori* the type(s) of individuals that will accumulate the largest burdens, and thus make the greatest contribution to disease transmission. Management strategies that assume such an *a priori* determination of heavily burdened individuals is possible may therefore prove ineffective and may waste limited management resources. Instead, management strategies that focus on finding and mitigating concentrations of questing larval ticks might inhibit heavy larval burdens on mice and the resulting production of numerous infected nymphs.

## Supporting Information

### Appendix S1.

Approximation of the sampling rate distribution.

https://doi.org/10.1371/journal.pone.0029215.s001

(PDF)

### Appendix S2.

Home range sizes and upscaled larval density.

https://doi.org/10.1371/journal.pone.0029215.s002

(PDF)

### Appendix S3.

WinBUGS code and information about priors.

https://doi.org/10.1371/journal.pone.0029215.s003

(PDF)

## Acknowledgments

We thank C. Dormann, S. LaDeau, P. Leimgruber, K. Terrell, and J. Thompson for helpful comments on earlier drafts of the manuscript. We are also grateful to K. Oggenfuss and the many field assistants who collected these data at the Cary Institute.

## Author Contributions

Conceived and designed the experiments: JLB RSO. Performed the experiments: JLB RSO. Analyzed the data: JMC JLB. Wrote the paper: JMC JLB. Designed models and statistical methods: JMC.

## References

- 1. Crofton HD (1971) A quantitative approach to parasitism. Parasitology 62: 179–193.
- 2. Shaw DJ, Dobson AP (1995) Patterns of macroparasite abundance and aggregation in wildlife populations: a quantitative review. Parasitology 111: S111–S133.
- 3. Shaw DJ, Grenfell BT, Dobson AP (1998) Patterns of macroparasite aggregation in wildlife host populations. Parasitology 117: 597–610.
- 4. Anderson RM, Gordon DM (1982) Processes influencing the distribution of parasite numbers within host populations with special emphasis on parasite-induced host mortalities. Parasitology 85: 373–398.
- 5. Schad GA, Anderson RM (1985) Predisposition to hookworm infection in humans. Science 228: 1537.
- 6. Janovy J, Kutish GW (1988) A model of encounters between host and parasite populations. Journal of Theoretical Biology 134: 391–401.
- 7. Bush AO, Heard RW, Overstreet RM (1993) Intermediate hosts as source communities. Canadian Journal of Zoology 71: 1358–1363.
- 8.
Wilson K, Bjornstad ON, Dobson AP, Merler S, Poglayen G, et al. (2002) Heterogeneities in macroparasite infections: patterns and processes. In: Hudson PJ, Rizzoli A, Grenfell BT, Heesterbeek H, Dobson AP, editors. The Ecology of Wildlife Diseases. New York, New York, USA: Oxford University Press. 197 p.
- 9.
Keymer AE, Anderson RM (1979) The dynamics of infection of
*Tribolium confusum*by*Hymenolepis diminuta*: the influence of infective-stage density and spatial distribution. Parasitology 79: 195–207. - 10. Mouchet J, Faye O, Julvez J, Manguin S (1996) Drought and malaria retreat in the sahel, west africa. Lancet 348: 1735–1736.
- 11. Perkins SE, Cattadori IM, Tagliapietra V, Rizzoli AP, Hudson PJ (2003) Empirical evidence for key hosts in persistence of a tick-borne disease. International Journal for Parasitology 33: 909.
- 12. Brunner JL, Ostfeld RS (2008) Multiple causes of variable tick burdens on small-mammal hosts. Ecology 89: 2259–2272.
- 13.
Boyer N, Reale D, Marmet J, Pisanu B, Chapuis JL (2010) Personality, space use and tick load in an introduced population of siberian chipmunks
*Tamias sibiricus*. Journal of Animal Ecology 79: 538–547. - 14. Kiffner C, Vor T, Hagedorn P, Niedrig M, Rühe F (2011) Factors affecting patterns of tick parasitism on forest rodents in tick-borne encephalitis risk areas, germany. Parasitology Research 108: 323–335.
- 15. Woolhouse MEJ, Dye C, Etard JF, Smith T, Charlwood JD, et al. (1997) Heterogeneities in the transmission of infectious agents: implications for the design of control programs. Proceedings of the National Academy of Sciences of the United States of America 94: 338–342.
- 16. Lloyd-Smith JO, Schreiber SJ, Kopp PE, Getz WM (2005) Superspreading and the effect of individual variation on disease emergence. Nature 438: 355–359.
- 17. Jones LD, Davies CR, Steele GM, Nuttall PA (1987) A novel mode of arbovirus transmission involving a nonviraemic host. Science 37: 775–777.
- 18. Labuda M, Jones LD, Williams T, Danielova V, Nuttall PA (1993) Efficient transmission of tickborne encephalitis virus between co-feeding ticks. Journal of Medical Entomology 30: 295–299.
- 19. Paterson S, Lello J (2003) Mixed models: getting the best use of parasitological data. Trends in Parasitology 119: 370–375.
- 20.
Kiffner C, Lödige C, Alings M, Vor T, Rühe F (2011) Body-mass or sex-biased tick parasitism in roe deer (
*Capreolus capreolus*)? a gamlss approach. Medical and Veterinary Entomology 25: 39–45. - 21.
Hubbard A, Liang S, Maszle D, Qiu D, Gu X, et al. (2002) Estimating the distribution of worm burden and egg excretion of
*Schistosoma japonicum*by risk group in sichuan province, china. Parasitology 125: 221–231. - 22.
Boswell MT, Patil GP (1970) Chance mechanisms generating negative binomial distributions. In: Patil GP, editor. Random Counts in Scientific Work, Vol. 1. Pennsylvania State University Press. pp. 3–22.
- 23.
Hilborn R, Mangel M (1997) The ecological detective: Confronting models with data. Princeton Univ. Press.
- 24.
Fish D (1993) Population ecology of
*Ixodes dammini*. In: Ginsberg H, editor. Ecology and Environmental Management of Lyme Disease. New BrunswickNew Jersey, USA: Rutgers University Press. pp. 25–42. - 25.
Case T (2000) An illustrated guide to theoretical ecology, volume 44. Oxford University Press New York, New York, USA.
- 26.
Schmidt KA, Ostfeld RS, Schauber EM (1999) Infestation of
*Peromyscus leucopus*and*Tamias striatus*by*Ixodes scapularis*(acari: Ixodidae) in relation to the abundance of hosts and parasites. Journal of Medical Entomology 36: 749–757. - 27.
Falco RC, Fish D (1992) A comparison of methods for sampling the deer tick,
*Ixodes dammini*, in a lyme disease endemic area. Experimental and Applied Acarology 14: 165–173. - 28.
Clark J (2007) Models for ecological data: an introduction. Princeton university press Princeton, New Jersey, USA.
- 29.
Gelman A, Hill J (2007) Data analysis using regression and multilevel/hierarchical models. Cambridge University Press New York, New York, USA.
- 30. Gelman A (2006) Prior distributions for variance parameters in hierarchical models. Bayesian Analysis 1: 515–533.
- 31. Lunn DJ, Thomas A, Best N, Spiegelhalter D (2000) Winbugs: a bayesian modelling framework: concepts, structure, and extensibility. Statistics and Computing 10: 325–337.
- 32. Gelman A, Rubin DB (1992) Inference from iterative simulation using multiple sequences. Statistical Science 7: 457–472.
- 33. Petney TN, Van Ark H, Spickett AM (1990) On sampling tick populations: the problem of overdispersion. The Onderstepoort Journal of Veterinary Research 57: 123–127.
- 34. Markowski D, Hyland KE, Ginsberg HS, Hu R (1997) Spatial distribution of larval ixodes scapularis (acari: Ixodidae) on peromyscus leucopus and microtus pennsylvanicus at two island sites. The Journal of Parasitology 83: 207–211.
- 35.
Falco RC, Fish D (1991) Horizontal movement of adult
*Ixodes dammini*(acari: Ixodidae) attracted to co2-baited traps. Journal of Medical Entomology 28: 726–729. - 36.
Lindsay LR, Barker IK, Surgeoner GA, McEwen SA, Gillespie TJ, et al. (1998) Survival and development of the different life stages of
*Ixodes scapularis*(acari: Ixodidae) held within four habitats on long point, ontario, canada. Journal of Medical Entomology 35: 189–199.