Browse Subject Areas

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Influences of Host Community Characteristics on Borrelia burgdorferi Infection Prevalence in Blacklegged Ticks

  • Holly B. Vuong ,

    Current address: Center for Vaccine and Immunology, University of Georgia Athens, GA, United States of America

    Affiliations Rutgers University, Department of Ecology, Evolution, and Natural Resources, New Brunswick, NJ, United States of America, Cary Institute of Ecosystem Studies, 2801 Sharon Turnpike, Millbrook, NY, United States of America


  • Grace S. Chiu,

    Affiliation Research School of Finance, Actuarial Studies and Statistics, College of Business and Economics, Building 26C, Australian National University, Canberra, ACT, Australia

  • Peter E. Smouse,

    Affiliation Rutgers University, Department of Ecology, Evolution, and Natural Resources, New Brunswick, NJ, United States of America

  • Dina M. Fonseca,

    Affiliations Rutgers University, Department of Ecology, Evolution, and Natural Resources, New Brunswick, NJ, United States of America, Rutgers University, Department of Entomology, 180 Jones Ave., New Brunswick, NJ, United States of America

  • Dustin Brisson,

    Affiliation University of Pennsylvania, Department of Biology, 209 Leidy Laboratories, Philadelphia, PA, United States of America

  • Peter J. Morin,

    Affiliation Rutgers University, Department of Ecology, Evolution, and Natural Resources, New Brunswick, NJ, United States of America

  • Richard S. Ostfeld

    Affiliation Cary Institute of Ecosystem Studies, 2801 Sharon Turnpike, Millbrook, NY, United States of America

Influences of Host Community Characteristics on Borrelia burgdorferi Infection Prevalence in Blacklegged Ticks

  • Holly B. Vuong, 
  • Grace S. Chiu, 
  • Peter E. Smouse, 
  • Dina M. Fonseca, 
  • Dustin Brisson, 
  • Peter J. Morin, 
  • Richard S. Ostfeld


Lyme disease is a major vector-borne bacterial disease in the USA. The disease is caused by Borrelia burgdorferi, and transmitted among hosts and humans, primarily by blacklegged ticks (Ixodes scapularis). The ~25 B. burgdorferi genotypes, based on genotypic variation of their outer surface protein C (ospC), can be phenotypically separated as strains that primarily cause human diseases—human invasive strains (HIS)—or those that rarely do. Additionally, the genotypes are non-randomly associated with host species. The goal of this study was to examine the extent to which phenotypic outcomes of B. burgdorferi could be explained by the host communities fed upon by blacklegged ticks. In 2006 and 2009, we determined the host community composition based on abundance estimates of the vertebrate hosts, and collected host-seeking nymphal ticks in 2007 and 2010 to determine the ospC genotypes within infected ticks. We regressed instances of B. burgdorferi phenotypes on site-specific characteristics of host communities by constructing Bayesian hierarchical models that properly handled missing data. The models provided quantitative support for the relevance of host composition on Lyme disease risk pertaining to B. burgdorferi prevalence (i.e. overall nymphal infection prevalence, or NIPAll) and HIS prevalence among the infected ticks (NIPHIS). In each year, NIPAll and NIPHIS was found to be associated with host relative abundances and diversity. For mice and chipmunks, the association with NIPAll was positive, but tended to be negative with NIPHIS in both years. However, the direction of association between shrew relative abundance with NIPAll or NIPHIS differed across the two years. And, diversity (H') had a negative association with NIPAll, but positive association with NIPHIS in both years. Our analyses highlight that the relationships between the relative abundances of three primary hosts and the community diversity with NIPAll, and NIPHIS, are variable in time and space, and that disease risk inference, based on the role of host community, changes when we examine risk overall or at the phenotypic level. Our discussion focuses on the observed relationships between prevalence and host community characteristics and how they substantiate the ecological understanding of phenotypic Lyme disease risk.


Investigating the ecological factors that influence pathogen genetic variation in a host community may be critical to predicting disease risk. This partly reflects the fact that genetic variants of the pathogen can differ in virulence, transmissibility, and infectivity [13]. Unfortunately, our understanding of the ecological drivers influencing pathogen genetic diversity is limited, especially for multi-host zoonotic pathogens.

Interactions of pathogen genotypes with species in the host community may affect the temporal and spatial patterns of genotype prevalence, and could potentially influence the risk of disease [48]. For example, human disease severity associated with Borrelia burgdorferi [9], Mycobacterium tuberculosis [10], Toxoplasma gondii [11], and Helicobacter pylori [12] varies with different pathogen genotypes. Hence, understanding the ecological interactions between hosts and pathogen genotypic variability could provide insights on ways to reduce disease risk and protect human health.

Here, we examine the Lyme disease system, a disease caused by the bacterium Borrelia bugdorferi [13], to advance our understanding of how differences in the host community can influence risk of human exposure. This pathogen replicates within a variety of mammal and bird species and is transmitted between wildlife hosts, and from wildlife to humans, by ticks in the Ixodes ricinus complex (I. scapularis) in eastern North America. Over the past three decades, studies of Lyme disease ecology in the northeastern USA have revealed the importance of small mammals, including the white-footed mouse (Peromyscus leucopus), eastern chipmunk (Tamius striatus), short-tailed shrew (Blarina brevicauda), and masked shrew (Sorex cinereus) for general Lyme disease risk [1418]. These small mammal species are among the most efficient vertebrates at transmitting B. burgdorferi infection to feeding ticks (i.e. the small mammals are competent reservoirs), and prevalence of tick infection is correlated with absolute or relative abundances of small-mammal hosts [1619]. The ability of mice, chipmunks, and some shrews, to dominate depauperate faunal communities, by virtue of their ability to respond quickly to environmental degradation [20], appears to contribute to the negative relationship previously detected between host diversity and disease risk [16,18,19,2124]. Differences in host community composition and potential host-tick feeding interactions might also influence the nymphal infection prevalence and density of infected nymphs.

Lyme disease risk in humans is variable, due to infection by different B. burgdorferi strains transmitted from nymphal (and adult) ticks, which had previously fed on wildlife hosts that support dissimilar strains of the bacterium [2528]. The bacterial strains (or genotypes) can be characterized on the basis of their highly polymorphic outer surface protein C (ospC). Approximately 25 distinguishable strains of B. burgdorferi are currently known in the USA, with 17 strains occurring in the northeastern USA alone [2931]. Of these 17 strains, five (ospC types A, B, I, K and N) exhibit significantly elevated occurrence rates among Lyme disease patients [25,32,33]. We collectively termed this subset of five as human invasive strains (HIS). The occurrence of these HIS types in Lyme disease patients warrants exploration of the ecological contributors to variable genotypic frequencies in tick populations associated with various wildlife populations, given that HIS and non-HIS types show divergent frequencies among wildlife hosts [2628,34].

Our study, administered in 2006 and 2009, was intended to elucidate the determinants of Lyme disease risk, both in terms of overall nymphal infection prevalence (NIP) of any of the strains of B. burgdorferi (NIPAll), as well as the prevalence of tick infection with HIS strains (NIPHIS) only, across multiple host communities of an endemic county in New York State. We use hierarchical/multilevel Bayesian models to examine NIPAll, and NIPHIS simultaneously, the latter conditional on infection, across these host communities. This novel approach captures the variation at the individual tick level, irrespective of whether the ticks were tested positive, negative, or inconclusive for certain HIS strains. The approach also uses site-specific parameters for estimating NIPAll and NIPHIS. Although we identified strains by their genotypes, our model focused on the phenotypic disease risk (i.e., the prevalence of the HIS category among infected individuals) within the host community as this phenotypic risk is of greater impact than individual strains alone.

Materials and Methods

Field Collections

We sampled the small mammal communities throughout Dutchess County NY in 2006 (30 sites) and 2009 (19 sites), with seven of the sites sampled in both years (see horizontal axes in Figs 1 and 2). We obtained permission from private land owners to set our grids for the duration of the sampling season on conditions of anonymity. Our trapping dates were 30 May– 19 September 2006 and 2 June– 2 October 2009 [35]. In 2006, we conducted small mammal trapping every other week, whereas in 2009, we trapped at all sites weekly. In both years, each time the site was sampled, traps were deployed for two-consecutive nights (= 1 trapping session). We used an 8 x 8 live trapping grid system, placing one Sherman trap (22.9cm x 7.6cm x 7.6cm) every 15m, and Tomahawks (48.3cm x 15.2cm x 15.2cm) every 30m, for a maximum of 16 Tomahawks and 64 Sherman traps on a full grid (see Supporting Information S1 File for more details). Traps were set between ~15:30 and 17:30, and checked the next morning between 08:30 and 12:00 to avoid any potential heat or cold weather related issues for the animals.

Fig 1.

Graphical displays of 2006 raw data and modeled results of overall nymphal infection prevalence (NIPAll; panel A), NIPHIS (panel B), and overall density of infected nymphs (DINAll; panel C). All bar charts are ordered by descending values on the x-axis. A: Shown as gray bars are the thirty site-specific naïve NIPALL estimates (= y/n where n = # of test ticks; y = # of ticks which tested positive for B. burgdorferi) and each corresponding naïve 95% confidence interval (= y/n ± 1.96 SEnaïve(y/n) based on sample proportions, also in gray) for the true NIPAll (= pB) at that site. In contrast, each black interval is a 95% credible interval (Bayesian confidence interval) using the posterior inference from our Bayesian model. Superimposed on each credible interval is the posterior median (a Bayesian estimate of the site’s true NIPALL). B: Same as panel A but for conditional NIPHIS estimates (= h/y where h = # of ticks whose RLB procedure indicated HIS+; RLB failure on any of the y positive ticks would lead to an indeterminate h/y). Only three sites yielded complete RLB results; their naïve confidence intervals were not computed due to small ys (hence, an invalid SE formula). In contrast, our Bayesian model provides valid estimates and 95% credible intervals for the true NIPHIS (= pC) for all 30 sites (shown in black). C: Similar to panels A–B but for naïve DINAll estimates (= [m/a]x[y/n] where m = # of nymphs dragged over a distance of a) and model-based estimates (= [m/a] x [posterior inference for pB]). Due to the uncertainty in m (unreplicated and hence, unmodeled), our model-based inference for DINAll here should be interpreted with care (see S4 File).

Fig 2.

Graphical displays of 2009 raw data and modeled results of overall nymphal infection prevalence (NIPAll; panel A), NIPHIS (panel B), and overall density of infected nymphs (DINAll; panel C). All bar charts are ordered by descending values on the x-axis. Same information as for Fig 1 (year 2006), but for the eighteen sites in 2009.

Animals used in this study were approved under the Cary Institute of Ecosystem Studies IACUC numbers 06–03 and 09–01 for field sampling. During the warmer periods of the season, Sherman traps were provided a mix of oats and sunflower seeds for the small animals (e.g. mice, chipmunks). In colder nights, these traps were provided with sunflower seeds and cotton gauze for the animals to create a warm bedding material within the trap. Havahart traps had two raw, unpeeled walnuts for squirrels, and if rain was forecasted, wooden boards were placed over the traps to provide covers for the animals.

Each mammal was identified to species, ear tagged with a unique code, sexed, weighed, evaluated for reproductive status, and then released at the point of capture. Shrews were microchipped with a unique PIT tag rather than an ear tag. Although the trapping occurred throughout the summer, our small mammal diversity measures are based on data from August through early October, coinciding with peak larval tick abundances [36]. Thus, our trapping efforts included four trapping sessions in 2006 and eight trapping sessions in 2009. For the three most common host species captured at each site (white-footed mouse, eastern chipmunk and short-tailed shrew), we calculated the minimum number alive (MNA) using program MARK v.6.0 [37]. MNA is based on mark-recapture data, where individuals are marked upon initial capture and recorded as present or absent in subsequent trapping sessions. We averaged the MNA values across these trapping sessions within each year and used those average values to estimate population densities, based on grid size.

We calculated host Shannon diversity estimates (H’) based on the MNA values of the three small mammal host species (white-footed mouse, eastern chipmunk, and short-tailed shrew), avian point counts, and the ‘activity density’ of larger mammalian hosts captured by camera traps. Avian counts were conducted between 05:00 to 10:00 AM to maximize avian detection during early morning activity. These avian counts were conducted two or three times at each site, and birds within a 100 m radius of the observer were identified by sight and sound. We included the American Robin (Turdus migratorius), Veery (Catharus fuscescens), Ovenbird (Seiurus aurocapilla), and Woodthrush (Hylocichla mustelina) in our host community estimates for diversity, as these four host species are relevant to tick feeding and B. burgdorferi infections [18,27,38].

To obtain quasi-quantitative estimates of densities for medium and larger sized mammals, we placed motion-detecting wildlife cameras (DeerCam and CritterGetter) at the sites, with scent lures or raw chicken and corn-cob as bait for two weeks, starting in early October 2006 and mid-October 2009 [16,23]. The number of identifiable individuals in each picture and the number of pictures provide an index of ‘activity level’ for those animals at the site. Briefly, the site with the highest quartile of ‘activity level’ was assigned the ‘most common’ density values, while lower quartile values were scored as ‘present’, and if the animal was absent or rare, the density was recorded as either ‘0’ or some low value, depending on the species. The observed density estimates for each category (most common, present, rare/absent) were based on published values for similar habitats (S2 File). The quasi-Shannon diversity values were based on the most commonly detected species of the host community, following LoGiudice et al. [23] density estimates. The Shannon (H’) calculations incorporated values based on ‘activity density’ estimates, averaged weekly minimum number of live densities of mice, chipmunks, and short-tailed shrew, and density estimates of avian hosts.

We collected questing nymphs during the nymphal peak period (June/July) in 2007 and 2010. These questing nymphs represent the previous summer’s larvae that fed on the host community in 2006 and 2009, respectively. At each site, we randomly dragged four 30m transects across our trapping grid to obtain a density estimate of the tick population, followed by a second round of density drags at least two weeks later [39]. To ensure sufficient nymphal sample sizes for estimation of B. burgdorferi infection prevalence, we conducted additional tick drags on many of the sites, following the second density drags. These supplemental drags were not used for calculations of tick density, and not all ticks collected from the supplemental drags were tested for B. burgdorferi. Note that one of the 19 sites in 2009 yielded a single nymph despite supplemental drags. Therefore, we omitted this site from consideration, reducing the number of 2009 sites to 18. See S1 File for other details about field collections procedures.

Lab Analyses

We tested questing nymphal ticks for the ospC gene of B. burgdorferi with a polymerase chain reaction (PCR) procedure, followed by a reverse line blot (RLB) to differentiate the ospC genotypes detected [34,40]. For 2006 samples, we used outer primers OC6F/OC623R, followed by inner primers OC6+F/OC602R for a semi-nested PCR. For 2009 samples, we used new outer primers OC-368F/OC693R and new inner primers OC4+F/OC643 for the semi-nested PCR [41]. The primer set used in 2006 had lower binding efficiencies to the probes in the RLB procedure, resulting in only 68.4% of the B. burgdorferi positive ticks with conclusive ospC genotyping results. In contrast, there was 100% efficiency with the 2009 probes, leading to conclusive ospC genotyping results for all B. burgdorferi positive ticks. Genotype ospC-C is a hybrid of ospC-E and ospC-I, making double and triple co-infections with these genotypes difficult to distinguish. We scored ospC-C as present when both ospC-E and ospC-I were present, but we ultimately ignored ospC-C for statistical analyses. Genotype ospC-J was found once in one year and was absent the other year, so it was also removed from the analyses, resulting in a total of 15 ospC genotypes used in the statistical analyses.

Infection Prevalence Data for Bayesian Analyses

For each site in each year, the lab analysis data were used to calculate the naïve estimates (i.e., simple empirical proportions) of NIPAll, NIPHIS, and DINAll (gray bar charts in Figs 1 and 2). DIN stands for “density of infected nymphs,” defined as the product of NIP and DON (density of nymphs), the latter computed using primary drags only. A naïve NIPAll estimate was y/n, where y was the number of ticks testing positive for any ospC type and n was the number of ticks subjected to PCR (Figs 1A and 2A). Thus, a naïve DINAll estimate was [m/a] x [y/n] where m was the number of nymphs dragged over a distance of a (Figs 1C and 2C). A naïve NIPHIS estimate was h/y, where h was the number of ticks testing positive for one or more HIS types, although such estimates were missing for most sites in 2006 for which the RLB procedure was inconclusive on one or more ospC positive ticks (Figs 1B and 2B).

These naïve estimates of NIPAll and DINAll ignored covariate information, which were missing when RLB results were inconclusive, and the naïve confidence interval formula was invalid when the central limit theorem should not be applied, such as when n was small. In contrast, our Bayesian models described below integrate auxiliary information (covariates, as well as missing data due to inconclusive RLB), leading to more reliable inference (including credible intervals) that does not require large values of n.

Bayesian Models for NIPAll and NIPHIS

We analyzed the data separately for 2006 and 2009 because most of our sites and the small mammal trapping frequencies differed between years. To examine how host covariates such as H’ and the relative abundances of white-footed mouse, eastern chipmunk, and short-tailed shrew might influence NIPAll and conditional NIPHIS (among infected ticks), we constructed a Bayesian generalized linear model (GLM), first for 2006, then for 2009 (Figs 3 and 4, respectively). Relative abundances for the three small mammal host species were calculated, based on density estimates for the inclusive host community, rather than for just these three specific host species. NIPAll and NIPHIS were modeled as site-specific parameters, for which statistical inference was based on binary response variables, using individual ticks as the experimental units; that yielded a total of ni units (of ticks) at site i, along with the associated site-specific covariate values. The dataset was therefore partitioned as such by site, and the ith site’s multivariate response was represented by three response vectors (each of length ni): zi (containing 1’s and 0’s, denoting observed presence/absence of B. burgdorferi), vi (denoting the success ‘1’ or failure ‘0’ of the RLB test), and ti (denoting observed presence ‘1’ or absence ‘0’ of one or more HIS types). We did not distinguish among the five HIS ospC-types as we were comparing disease risk phenotypically (HIS vs non-HIS). The reverse line blot (RLB) inefficiencies associated with 2006 samples were taken into account by our models for NIPAll and NIPHIS. See S3 File for an example of the data associated with one of the sites in 2006. Data from 2009 consisted only of zi and ti vectors, because ospC-detection was complete for the 2009 samples.

Fig 3. Tree diagram with all possible states and associated probabilities for a test tick.

The probabilities are: pB (nymphal infection rate (NIPAll) of B. burgdorferi), pS (conditional probability of a successful RLB test, given infection), pSH (conditional probability that the test tick is HIS+, given RLB success), and pFH (conditional probability that the test tick is HIS+, given RLB failure). Note that pc (conditional NIPHIS, given infection) is equal to pSpSH + (1 − pS)pFH. Observable states are in boxes, and unobservable states are in ovals. Red nodes do not apply to 2009 because pS = 1.

Fig 4. Visual representation of the integrative Bayesian hierarchical approach, upon which our GLM is constructed.

All quantities depicted are site-specific except for regression coefficient vectors (α,γ) and variance parameters (τ2,ω2) which are study- (year-) specific. Both pB and pC depend on the same covariates. These two sets of dependencies are integrated through (1) the direct collective influence of pc,pSH, and pFH, and z (vector of 1’s and 0’s denoting the state of B. burgdorferi infection for test ticks), on v (vector of 1’s and 0’s denoting success/failure of RLB tests), and (2) the direct collective influence of pSH,pFH, z, and v on t (vector of 1’s and 0’s denoting HIS presence/absence on test ticks). Model parameters are in ovals and data are in boxes. Red nodes are not modeled for the 2009 data because v1 (non-stochastic). Model statements and details of the statistical analyses appear in S4 File.

Our integrative Bayesian framework fully accounted for the non-stochastic nature of the HIS presence/absence data (tij) when B. burgdorferi was absent (zij = 0), and for the unobservability of tij when the RLB procedure failed (vij = 0). (See boldfaces in S3 File.) This is achieved by defining Bernoulli distribution parameters that correspond to Fig 3, namely, pB (nymphal infection prevalence (NIPAll) of B. burgdorferi), pc (conditional NIPHIS, given infection), pS (conditional probability of a successful RLB test, given infection), pSH (conditional probability that the test tick is HIS+, given RLB success), and pFH (conditional probability that the test tick is HIS+, given RLB failure). The relationships depicted by Fig 3 induce the constraint that , which justified our choice of prior distributions (see S4 File). The regression equations of the GLM are where xki denotes the kth covariate (i.e., the relative abundance of mouse, shrew, or chipmunk species, or H’), and where the residual terms ηi and ξi have zero-mean Gaussian (normal) distributions with respective variance parameters τ2 and ω2. The α,γ regression coefficients vectors and the residual variance parameters (τ2 and ω2) were modeled to follow vague Gaussian and vague inverse-Gamma prior distributions, respectively, reflecting our lack of a priori knowledge concerning the behavior of these parameters. Additionally, there was no a priori indication that RLB test failure could be associated with a tick's underlying infection state, so by assuming a common RLB failure rate, irrespective of infection status, our modeling strategy allowed us to impute missing values of tij by treating them as unobserved model parameters. The inference for in turn accounted for these imputed values (Fig 4). Note that the covariates were log-transformed to reduce skewness (S5 File), then subsequently centered to improve computational efficiency ([4244] and S5 File). See S4 File for the roles of model parameters, observed data, and prior and posterior distributions in Bayesian inference, and for detailed model statements for our studies.

The integrative models simultaneously accounted for variation in the number of tested and infected ticks, over- or under-dispersion (as most ticks were not infected with B. burgdorferi, there was an elevated number of ‘0’ counts, leading to a distribution that was not a true binomial), as well as influential site-specific data points. This framework offers a flexible analytical approach to identifying relevant covariates of NIPAll and NIPHIS, as it utilizes all information available from the dataset. Standard logistic regression techniques would handle the two types of prevalence in separate analyses, while ignoring information on RLB efficiency, but our hierarchical Bayesian models utilize the RLB efficiency as a linkage between the two types of prevalence data, improving our overall statistical inference.

We also assessed our Bayesian model’s goodness-of-fit in various ways, one of which was posterior predictive checks (e.g. [45]) which we describe as follows (see S4 File for more detail). We considered the naïve NIPAll, NIPHIS, and DINAll estimates and compared them to what our Bayesian models predicted. Specifically, we used the posterior inference from our models to make predictions of PCR results, (hence, posterior predictions of y), and computed the predictive versions of the naïve NIPAll and DINAll estimates by using the model predictions. Goodness-of-fit of our models would be deemed high if each of the NIP and DIN estimates were consistent between raw data and posterior predictive results.


ospC Infection

In 2006, 167 of 245 (68.2%) tick samples hybridized efficiently with specific probes in the reverse line blots, whereas all 103 samples amplified from 2009 hybridized with the probes efficiently. Thus, ospC analyses for 2006 were based only on samples for which hybridization was successful. Because we used different primer sets for the 2006 and 2009 tick samples, we also tested whether primer changes could account for the changes in relative proportions of the ospC types detected each year. The proportions of each ospC genotype in the two years were marginally correlated (r = 0.49, df = 13, p = 0.06). While year to year variation is likely, it is also possible that there may have been a small potential bias in primer binding to B. burgdorferi, or that the PCR products bound differentially in the reverse line blot. We note that the mean number of genotypes per tick was smaller in 2006 (2.02, SE = 0.12) than in 2009 (2.41, SE = 0.18) but not significantly so (Wilcoxon W = 7704, p = 0.07). On balance, we concluded that separate analyses of NIPAll and NIPHIS prevalence between years were justified.

Exploratory Analyses of Prevalence Data from Bayesian Modeling

Our Bayesian model-based inferences for NIPs (pB,pc) appear in black in Fig 1A and 1B for 2006 and Fig 2A and 2B for 2009. Bayesian inference is based on posterior distributions, which can be summarized by posterior medians (shown as circles) and 95% credible intervals (shown as black intervals). The intervals are interpreted as follows: given the data and model, there is (a) a 0.5 probability that the true NIP is larger than its posterior median, and (b) a 0.95 probability that the true NIP lies inside the shown credible interval. The model-based credible intervals are not only tighter (hence, the inference has more power) than the naïve confidence intervals (shown as gray intervals), but are also valid even when very few or no ticks were tested (e.g. sites 612 in 2006 and 910 in 2009) or when RLB data were missing (e.g. all but 3 sites in 2006). In Figs 1C and 2C, we took the product between the posterior median of NIPAll and the naïve DON (density of nymphs) to produce the posterior medians for DINAll (density of infected nymphs); they can be regarded as model-based estimates of DINAll, but they and the accompanying predictive intervals are interpreted differently than the model-based medians and intervals for NIPs (see S4 File).

Referring to Figs 1 and 2, we see that naïve NIPAll estimates ranged from ~0.05 to 0.55 in 2006, whereas they ranged from 0 to ~0.35 in 2009. Their corresponding naïve 95% confidence intervals (in gray) are superimposed on the bar charts (and will be discussed further below). For NIPHIS, of the three sites with conclusive RLB results in 2006, two had naïve estimates near 0.8. Conclusive RLB testing in 2009 allowed calculation of naïve NIPHIS estimates for all sites except one (site 910), due to the absence of positive ticks in the sample. In most 2009 cases, the naïve NIPHIS estimate was quite high, being ~ ≥ 0.7 at 16 of the 18 observed sites. For DINAll, naïve estimates were noticeably higher in 2006 than in 2009, preliminarily suggesting higher Lyme disease risk for the 2006 sites than for the 2009 sites.

Bayesian Inference for Host Influence on Disease Prevalence

Naïve NIP and DIN estimates were consistent between using raw data and using model predictions. Because of this cross-validation and other model diagnostics (S4 File), the goodness-of-fit of our Bayesian models was deemed high, thus we proceed to summarize the modeling results.

With respect to the relationship between NIPs and host-community characteristics, our Bayesian modeling results show that for 2006, integrating RLB methodological failure with detection of infection and HIS improved the inference for model parameters associated with HIS status, namely (the conditional probability that a tick from the ith site would test positive for HIS, given infection), γ and ω2 (the vector of regression coefficients and residual variance, respectively, when predicting HIS status from covariates). Improvement amounts to a reduction in estimation uncertainty (reduced standard deviation of the posterior distribution), relative to models that ignore such failure. Not modeling RLB failure would amount to collapsing the tree diagram in Fig 3 by removing boxes in red and combining the red ovals with their respective box-shaped counterparts in black, which also removes the red nodes in Fig 4. Moreover, modeling RLB failure alongside NIPAll and NIPHIS provided valuable inference on (the conditional probability of RLB success for a tick from the ith site, given infection), (the conditional probability that a tick from the ith site would test positive for HIS, given RLB success), and (the conditional probability that a tick from the ith site, given RLB failure, would have tested positive for HIS, had RLB been successful), none of which would have been possible with the collapsed model (S4 File).

Based on the integrated model, the posterior probability for a regression slope parameter (α or γ) to take on a positive/negative value can be interpreted as evidence that the corresponding host community abundance is positively/negatively related to disease prevalence. For example, the regression slope estimate of +0.52 and a posterior probability of 0.96 that α2 > 0 in Table 1 (second row, year 2006), can be interpreted as near certainty that the relative abundance of mice is positively associated with NIPAll. Unlike classical hypothesis testing, a Bayesian posterior probability is literally the ‘probability of the scenario in question’ as informed by the observed data.

From our modeling, we found that each of the mouse, chipmunk, and shrew relative abundances showed good evidence of a positive relationship with NIPAll in 2006 (slope estimates = 0.52, 0.10, and 0.13, respectively; posterior probabilities = 96%, 89%, and 90%, respectively) and H’ showed mild evidence of a negative relationship with NIPAll (slope estimate = -0.04; post. prob. = 57%) (Table 1). For 2009, only the collapsed model is required, due to fully observed zi- and ti-vectors. Our results showed very strong evidence that mouse relative abundance was again positively associated with NIPAll (slope estimate = 0.66; post. prob. > 99%), whereas H’ and shrew relative abundance showed some evidence of negative association with NIPAll (slope estimates = -0.41 and -0.10, respectively; post. probs. = 70% and 71%, respectively) and the influence of chipmunk relative abundance was negligible.

For 2006, conditional NIPHIS was negatively associated with mouse and chipmunk relative abundances, although the strength of evidence differed between host species (Table 1; slope estimates = -0.73 and -0.06, respectively; post. probs. = 96% and 70%, respectively). However, H’, and shrew relative abundance, were positively associated with NIPHIS (slope estimates = 0.20 and 0.26, respectively; post. probs. = 74% and 95%, respectively). In contrast, the relative abundances of mice, chipmunks, and shrews in 2009 each showed strong evidence of negative association (slope estimates = -1.93, -0.81, and -0.57, respectively; post. probs. = 90%, 98%, and 89%, respectively). As in 2006, H’ was positively associated with NIPHIS, with a similar level of evidence (slope estimate = 1.67; post. prob. = 71%). (See S6 File for more detail on model fit and an in-depth discussion of nuances of our statistical analyses with respect to disease detection practices.)


Our study examined how host community covariates can affect overall B. burgdorferi nymphal (tick) infection prevalence (NIPAll), and infection prevalence of human-invasive strains (NIPHIS), the latter conditioned on ticks being infected. We were interested in understanding how the relative abundances of white-footed mouse, eastern chipmunk, and short-tailed shrew, and overall host community diversity (H’), might affect Lyme disease risk. This study assessed the influence–if any–of host community composition (i.e. relative abundances of three primary hosts), and of H’ (which is a combination of species richness and evenness) on NIPAll and NIPHIS. Our model provides, as quantitative evidence, the probability that NIPAll and NIPHIS were indeed associated with the relative abundances of particular hosts and/or host diversity. Overall, our analyses highlight that the relationships between the relative abundances of three primary hosts and the community diversity with NIPAll, and NIPHIS, are variable in time and space, and that disease risk inference, based on the role of host community, changes when we examine risk overall or at the phenotypic level.

The results of our models support the contribution of mice to NIPAll, but there was varying empirical support for the role of chipmunk and shrew relative abundances on NIPAll between the two years. With respect to NIPHIS, there was reasonably strong evidence present for mouse and chipmunk relative abundances being negatively associated with NIPHIS in both years, and for varying patterns of associations between shrew relative abundance and NIPHIS across the years. And in both years, there was moderate evidence of a negative association between H’ and NIPAll, but a positive association between H’ and NIPHIS. Lastly, NIPAll and DINAll were generally higher in 2006 than in 2009, with the model-based estimates providing more powerful inference than naïve estimates or inference based on discarding missing data.

The variation in NIPAll and DINAll across our sites in both years highlights the spatial and temporal variability of overall Lyme disease risk within Dutchess County, New York. However, when we examined Lyme disease risk in the context of the HIS phenotype (NIPHIS), we found that disease risk was generally and consistently high across these sites, even considering estimation uncertainty, irrespective of their NIPAll and DINAll. Because NIPHIS is conditional on ospC positive ticks, this does not translate directly into HIS risk. Nevertheless, the high HIS prevalence underscores how commonly the HIS phenotype can occur within the tick population, even after accounting for site and year variability.

It is not surprising that the relative abundance of the white-footed mouse would be positively associated with NIPAll (year 2006 and 2009, respectively), given that the white-footed mouse is a reservoir-competent host that is efficient at transmitting the bacterium to tick vectors [14,16,18,46]. White-footed mice are also abundant in the community, allowing for potentially higher host-tick feeding opportunities, and thus higher B. burgdorferi infection prevalence among ticks. Previous studies have demonstrated positive effects of chipmunks on Lyme disease risk [4749], but our detection of a positive association with NIPAll in only one of two study years suggests inconsistent effects of chipmunks on risk. With shrews, the models showed moderate evidence for relative abundances having a positive association with NIPAll in 2006 but weaker evidence of a negative association with NIPAll in 2009. Shrews are known to have relatively high competency, feed high proportions of ticks, and have high population densities that may increase their contact rates with tick vectors [16,17]. We would expect that shrews, like mice, would have a positive effect on NIPAll across both years, but the change in the direction of association highlights the need for further exploration of whether (and how) this host species ultimately influences disease risk.

The moderate, negative association we detected between H’ and NIPAll in both years was consistent with prior results from other sites in the northeastern US [23] and Ontario, Canada [19]. As H’ (host diversity) increases, the frequency with which black-legged ticks feed and transmit the pathogen from competent hosts decreases [22]. But this observation cannot explain the moderate, positive association between H’ and NIPHIS. Because white-footed mice, eastern chipmunks, short-tailed shrews, and masked shrews are competent reservoirs for B. burgdorferi [16,17], and because both mice and chipmunks (in particular) commonly carry HIS types [34], we had expected to find increased prevalence of HIS strains in depauperate (low H’) communities dominated by mice, chipmunks, and shrews. However, some of our recent work highlights the fact that a wide array of host species are competent at supporting HIS types, and that transmission efficiencies of these strains are relatively high [27]. Therefore, we might expect a positive association between H’ and NIPHIS, as seen in this study. On the other hand, the negative relationship between H’ and NIPAll may reflect the fact that these small rodents are clearly more competent reservoirs overall, so reductions in their relative abundances would help reduce the overall infection prevalence in the tick populations. This is in contrast to the findings by States et al. [50] who noted that overall NIP was similar in tick populations co-occurring with the host communities of an island and the adjacent mainland, where island sites had lower host species richness. Ultimately, understanding and reducing transmission risk would require better epidemiological data linking NIPHIS and NIPAll to risk or incidence of Lyme disease in local human populations.

The contrasting results between years for shrew relative abundances with NIPHIS also underscore the need for further investigation. Earlier research by Brisson and Dykhuizen [34] showed that HIS/non-HIS proportions in xenodiagnostic ticks that have fed on shrews are higher, compared to the average HIS/non-HIS proportions of the white-footed mouse, eastern chipmunk, and gray squirrel. Vuong et al. [27] detected similar results for shrews compared with rodents and birds. These positive associations between shrew relative abundance and NIPHIS suggests that shrews can be important components of Lyme disease risk [17], but we still do not understand when and where they are important. Challenges in understanding the ecology of the shrews, and in estimating population densities, can make it difficult to assess when they are important hosts, but these results underscore the importance of shrews in influencing Lyme disease risk.

Our novel Bayesian approach offered a comprehensive way of examining human Lyme disease risk by making site-specific inferences associated with each individual tick tested. Using this analytical method, we were able to draw collective conclusions on the role of host diversity and the relative abundances of mice, chipmunks, and shrews on Lyme disease risk for years 2006 and 2009, and for the sites within those years. Our method was especially effective at capturing the noise and variation associated with each tick individual, hence providing more insights into the relevance of each parameter we examined in the study. As in the case for any regression analysis, definitive conclusions on the more general roles of host diversity and the relative abundances of the three common host species on Lyme disease risk are somewhat conditional on the sampling limitations of the study, e.g. a low number of positive PCR or RLB tests could have resulted from a lack of the disease in the wild and/or the inherent imperfection of the PCR or procedure itself (S6 File). Indeed, we found using the improved primers [41] for the 2009 samples provide consistent RLB outcomes, and using these new primers should be applied in the future. Additionally, our results only provide temporal snapshots of the relationship between risk and hosts. In order to obtain a robust comprehension of how host community influences B. burgdorferi prevalence in ticks, we need longer-term studies of spatial and temporal trends associated with these important ecological predictors. Knowledge of the long-term population dynamics of important host species and the frequency of different strains of B. burgdorferi, with different phenotypes affecting human disease risk can help improve our overall understanding of Lyme disease risk.

Supporting Information

S1 File. Field Collections.


S2 File. Host Activity Density Categories.

Activity densities assigned to each host species in the community. Densities for the white-footed mouse, eastern chipmunks, short-tailed shrew, and birds were measured directly.


S3 File. Example of a Site-Specific Partition of the Full Dataset for the Bayesian Analyses.

Data from all 14 tested ticks sampled from site 602 (i = 2) in 2006 are presented. Rows are sorted according to the observed presence/absence of infection (zij), observed success/failure of RLB test (vij) if zij = 1 (otherwise "NA" for "not applicable"), and observed presence/absence of HIS strains (tij) if zij = vij = 1 (otherwise missing data). Boldfaced entries are determined automatically by the value(s) in the preceding column(s).


S4 File. Integrative Bayesian GLMs.


S5 File. Covariate Transformation.


S6 File. Limitations.


S7 File. Data (workbook of multiple spreadsheets).


S8 File. Computer Implementation for Analysis of 2006 Data (R dynamic document).


S9 File. Computer Implementation for Analysis of 2009 Data (R dynamic document).



We would like to thank the Academic Editor and referees for their helpful comments and suggestions. We thank the field crews of 2006, 2007, 2009, and 2010 for their help in animal trapping and tick dragging. Special thanks to Shannon Duerr and Deanna Sloniker for assistance in the field and managing the field crews. Without the permission of land-owners and land managers, we would not be able to conduct these field experiments so we are particularly grateful to them for seeing the importance of our Lyme disease research on their properties. GSC thanks K. Rice and A.H. Westveld for their helpful discussions, and the CSIRO Transformational Biology Platform for its support of GSC’s early involvement in this work. We appreciate an early draft review by Peter Thrall.

Author Contributions

  1. Conceptualization: HBV RSO GSC.
  2. Data curation: HBV GSC.
  3. Formal analysis: GSC.
  4. Funding acquisition: RSO DMF DB HBV.
  5. Investigation: HBV RSO GSC.
  6. Methodology: GSC PES DB.
  7. Project administration: HBV RSO.
  8. Resources: RSO DMF DB PJM GSC.
  9. Software: GSC.
  10. Supervision: RSO PJM.
  11. Validation: GSC.
  12. Visualization: GSC HBV.
  13. Writing – original draft: HBV GSC PES RSO DMF PJM DB.
  14. Writing – review & editing: HBV GSC PES RSO DMF PJM DB.


  1. 1. Smith I, Broos A, de Jong C, Zeddeman A, Smith C, Smith G, et al. Identifying Hendra virus diversity in Pteropid bats. PLoS One. Public Library of Science; 2011;6: e25275. pmid:21980413
  2. 2. Kyle JL, Harris E. Global spread and persistence of dengue. Annu Rev Microbiol. Annual Reviews; 2008;62: 71–92. pmid:18429680
  3. 3. Streicker DG, Turmelle AS, Vonhof MJ, Kuzmin I V, McCracken GF, Rupprecht CE. Host phylogeny constrains cross-species emergence and establishment of rabies virus in bats. Science. 2010;329: 676–9. pmid:20689015
  4. 4. Power AG, Borer ET, Hosseini P, Mitchell CE, Seabloom EW. The community ecology of barley/cereal yellow dwarf viruses in Western US grasslands. Virus Res. Elsevier B.V.; 2011;159: 95–100. pmid:21641945
  5. 5. Suzán G, Marcé E, Giermakowski JT, Mills JN, Ceballos G, Ostfeld RS, et al. Experimental evidence for reduced rodent diversity causing increased Hantavirus prevalence. Wilby A, editor. PLoS One. Public Library of Science; 2009;4: e5461. pmid:19421313
  6. 6. Wolinska J, King KC. Environment can alter selection in host-parasite interactions. Trends Parasitol. 2009;25: 236–44. pmid:19356982
  7. 7. Altizer S, Dobson A, Hosseini P, Hudson P, Pascual M, Rohani P. Seasonality and the dynamics of infectious diseases. Ecol Lett. 2006;9: 467–84. pmid:16623732
  8. 8. Thrall PH, Burdon JJ. Evolution of gene-for-gene systems in metapopulations: the effect of spatial scale of host and pathogen dispersal. Plant Pathol. 2002;51: 169–184.
  9. 9. Strle K, Jones KL, Drouin EE, Li X, Steere AC. Borrelia burgdorferi RST1 (OspC type A) genotype is associated with greater inflammation and more severe Lyme disease. Am J Pathol. 2011;178: 2726–39. pmid:21641395
  10. 10. Caws M, Thwaites G, Dunstan S, Hawn TR, Lan NTN, Thuong NTT, et al. The influence of host and bacterial genotype on the development of disseminated disease with Mycobacterium tuberculosis. PLoS Pathog. 2008;4: e1000034. pmid:18369480
  11. 11. Howe DK, Sibley LD. Toxoplasma gondii comprises three clonal lineages: correlation of parasite genotype with human disease. J Infect Dis. Oxford University Press; 1995;172: 1561–6. pmid:7594717
  12. 12. Blaser MJ, Berg DE. Helicobacter pylori genetic diversity and risk of human disease. J Clin Invest. American Society for Clinical Investigation; 2001;107: 767–73. pmid:11285290
  13. 13. Burgdorfer W, Barbour A, Hayes S, Benach J, Grunwaldt E, Davis J. Lyme disease-a tick-borne spirochetosis? Science (80-). 1982;216: 1317–1319. pmid:7043737
  14. 14. Levine JF, Wilson ML, Spielman A. Mice as reservoirs of the Lyme disease spirochete. Am J Trop Med Hyg. 1985;34: 355–60. Available: pmid:3985277
  15. 15. Anderson JF. Epizootiology of Borrelia in Ixodes tick vectors and reservoir hosts. Clin Infect Dis. 1989;11: S1451–S1459.
  16. 16. LoGiudice K, Ostfeld RS, Schmidt KA, Keesing F. The ecology of infectious disease: Effects of host diversity and community composition on Lyme disease risk. Proc Natl Acad Sci. 2003;100: 567–571. pmid:12525705
  17. 17. Brisson D, Dykhuizen DE, Ostfeld RS. Conspicuous impacts of inconspicuous hosts on the Lyme disease epidemic. Proc R Soc B-Biological Sci. 2008;275: 227–235.
  18. 18. Keesing F, Brunner J, Duerr S, Killilea M, Logiudice K, Schmidt K, et al. Hosts as ecological traps for the vector of Lyme disease. Proc Biol Sci. 2009;276: 3911–9. pmid:19692412
  19. 19. Werden L, Barker IK, Bowman J, Gonzales EK, Leighton PA, Lindsay LR, et al. Geography, deer, and host biodiversity shape the pattern of Lyme disease emergence in the Thousand Islands Archipelago of Ontario, Canada. PLoS One. 2014;9: e85640. pmid:24416435
  20. 20. Nupp TE, Swihart RK. Landscape-level correlates of small-mammal assemblages in forest fragments of farmland. J Mammal. American Society of Mammalogists; 2000;81: 512–526.
  21. 21. Ostfeld RS, LoGiudice K. Community diassembly, biodiversity loss, and the erosion of an ecosystem service. Ecology. 2003;84: 1421–1427.
  22. 22. Ostfeld RS, Keesing F. Biodiversity and disease risk: the case of Lyme disease. Conserv Biol. 2000;14: 722–728.
  23. 23. LoGiudice K, Duerr STK, Newhouse MJ, Schmidt K a, Killilea ME, Ostfeld RS. Impact of host community composition on Lyme disease risk. Ecology. Ecological Society of America; 2008;89: 2841–2849. pmid:18959321
  24. 24. Allan BF, Keesing F, Ostfeld RS. Effect of forest fragmentation on Lyme disease risk. Conserv Biol. 2003;17: 267–272.
  25. 25. Dykhuizen DE, Brisson D, Sandigursky S, Wormser GP, Nowakowski J, Nadelman RB, et al. The propensity of different Borrelia burgdorferi sensu stricto genotypes to cause disseminated infections in humans. Am J Trop Med Hyg. 2008;78: 806–810. Available: pmid:18458317
  26. 26. Hanincová K, Kurtenbach K, Diuk-Wasser M, Brei B, Fish D. Epidemic spread of Lyme borreliosis, northeastern United States. Emerg Infect Dis. 2006;12: 604–11. pmid:16704808
  27. 27. Vuong HB, Canham CD, Fonseca DM, Brisson D, Morin PJ, Smouse PE, et al. Occurrence and transmission efficiencies of Borrelia burgdorferi ospC types in avian and mammalian wildlife. Infect Genet Evol. 2014;27: 594–600. pmid:24382473
  28. 28. Mechai S, Margos G, Feil EJ, Barairo N, Lindsay LR, Michel P, et al. Evidence for host-genotype associations of Borrelia burgdorferi sensu stricto. PLoS One. 2016;11: e0149345. pmid:26901761
  29. 29. Barbour AG, Travinsky B. Evolution and distribution of the ospC gene, a transferable serotype determinant of Borrelia burgdorferi. MBio. 2010;1: e00153–10. pmid:20877579
  30. 30. Wang I-N, Dykhuizen DE, Qiu W, Dunn JJ, Bosler EM, Luft BJ. Genetic diversity of ospC in a local population of Borrelia burgdorferi sensu stricto. Genetics. 1999;151: 15–30. Available: pmid:9872945
  31. 31. Brisson D, Vandermause MF, Meece JK, Reed KD, Dykhuizen DE. Evolution of northeastern and midwestern Borrelia burgdorferi, United States. Emerg Infect Dis. 2010;16: 911–7. pmid:20507740
  32. 32. Wormser GP, Brisson D, Liveris D, Hanincová K, Sandigursky S, Nowakowski J, et al. Borrelia burgdorferi genotype predicts the capacity for hematogenous dissemination during early Lyme disease. J Infect Dis. 2008;198: 1358–1364. pmid:18781866
  33. 33. Seinost G, Dykhuizen DE, Dattwyler RJ, Golde WT, Dunn JJ, Wang I-N, et al. Four clones of Borrelia burgdorferi sensu stricto cause invasive infection in humans. Infect Immun. 1999;67: 3518–3524. Available: pmid:10377134
  34. 34. Brisson D, Dykhuizen DE. ospC diversity in Borrelia burgdorferi: different hosts are different niches. Genetics. 2004;168: 713–22. pmid:15514047
  35. 35. Brunner JL, Duerr S, Keesing F, Killilea ME, Vuong H, Ostfeld RS. An experimental test of competition among mice, chipmunks, and squirrels in deciduous forest fragments. PLoS One. 2013;8: 1–9.
  36. 36. Ostfeld RS. The ecology of Lyme-disease risk. Am Sci. 1997;85: 338–346.
  37. 37. White M. Program MARK v 6.0. 2012.
  38. 38. Giardina AR, Schmidt KA, Schauber EM, Ostfeld RS. Modeling the role of songbirds and rodents in the ecology of Lyme disease. Can J Zool Can Zool. 2000;78: 2184–2197.
  39. 39. Falco RC, Fish D. A comparison of methods for sampling the deer tick, Ixodes dammini, in a Lyme disease endemic area. Exp Appl Acarol. 1992;14: 165–73. pmid:1638929
  40. 40. Qiu W-G, Dykhuizen DE, Acosta MS, Luft BJ. Geographic uniformity of the Lyme disease spirochete (Borrelia burgdorferi) and its shared history with tick vector (Ixodes scapularis) in the Northeastern United States. Genetics. 2002;160: 833–49. Available: pmid:11901105
  41. 41. Devevey G, Dang T, Graves CJ, Murray S, Brisson D. First arrived takes all: inhibitory priority effects dominate competition between co-infecting Borrelia burgdorferi strains. BMC Microbiol. 2015;15: 1–9.
  42. 42. Chiu GS, Wu MA, Lu L. Model-based assessment of estuary ecosystem health using the latent health factor index, with application to the Richibucto Estuary. Paranhos R, editor. PLoS One. Public Library of Science; 2013;9: e65697.
  43. 43. Chiu GS, Westveld AH. A statistical social network model for consumption data in trophic food webs. Stat Methodol. 2014;17: 139–160.
  44. 44. Chiu GS, Westveld AH. A unifying approach for food webs, phylogeny, social networks, and statistics. Proc Natl Acad Sci U S A. 2011;108: 15881–6. pmid:21896716
  45. 45. Chambert T, Rotella JJ, Higgs MD. Use of posterior predictive checks as an inferential tool for investigating individual heterogeneity in animal population vital rates. Ecol Evol. 2014;4: 1389–97. pmid:24834335
  46. 46. Baum E, Hue F, Barbour AG. Experimental infections of the reservoir species Peromyscus leucopus with diverse strains of Borrelia burgdorferi, a Lyme disease agent. MBio. 2012;3.
  47. 47. Ostfeld RS, Canham CD, Oggenfuss K, Winchcombe RJ, Keesing F. Climate, deer, rodents, and acorns as determinants of variation in lyme-disease risk. PLoS Biol. 2006;4: e145. pmid:16669698
  48. 48. Slajchert T, Kitron UD, Jones CJ, Mannelli A. Role of the eastern chipmunk (Tamias striatus) in the epizootiology of Lyme borreliosis in northwestern Illinois, USA. J Wildl Dis. Wildlife Disease Association; 1997;33: 40–6. pmid:9027689
  49. 49. Mather TN, Wilson ML, Moore SI, Ribeiro JMC, Spielman A. Comparing the relative potential of rodents as reservoirs of the Lyme disease spirochete (Borrelia burgdorferi). Am J Epidemiol. 1989;130: 143–150. Available: pmid:2787105
  50. 50. States SL, Brinkerhoff RJ, Carpi G, Steeves TK, Folsom-O’Keefe C, Deveaux M, et al. Lyme disease risk not amplified in a species-poor vertebrate community: Similar Borrelia burgdorferi tick infection prevalence and OspC genotype frequencies. Infect Genet Evol. Elsevier B.V.; 2014;27: 566–575. pmid:24787999