Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

How sure are you? A web-based application to confront imperfect detection of respiratory pathogens in bighorn sheep

  • J. Terrill Paterson ,

    Contributed equally to this work with: J. Terrill Paterson, Carson Butler, Robert Garrott

    Roles Conceptualization, Formal analysis, Investigation, Methodology, Visualization, Writing – original draft, Writing – review & editing

    terrillpaterson@gmail.com

    Affiliation Department of Ecology, Montana State University, Bozeman, MT, United States of America

  • Carson Butler ,

    Contributed equally to this work with: J. Terrill Paterson, Carson Butler, Robert Garrott

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Visualization, Writing – review & editing

    Affiliation Fish and Wildlife Branch, Grand Teton National Park, Moose, WY, United States of America

  • Robert Garrott ,

    Contributed equally to this work with: J. Terrill Paterson, Carson Butler, Robert Garrott

    Roles Conceptualization, Funding acquisition, Investigation, Methodology, Project administration, Resources, Supervision, Writing – review & editing

    Affiliation Department of Ecology, Montana State University, Bozeman, MT, United States of America

  • Kelly Proffitt

    Roles Funding acquisition, Project administration, Resources, Supervision, Writing – review & editing

    ‡ KP also contributed equally to this work.

    Affiliation Montana Fish Wildlife and Parks, Bozeman, MT, United States of America

Abstract

The relationships between host-pathogen population dynamics in wildlife are poorly understood. An impediment to progress in understanding these relationships is imperfect detection of diagnostic tests used to detect pathogens. If ignored, imperfect detection precludes accurate assessment of pathogen presence and prevalence, foundational parameters for deciphering host-pathogen dynamics and disease etiology. Respiratory disease in bighorn sheep (Ovis canadensis) is a significant impediment to their conservation and restoration, and effective management requires a better understanding of the structure of the pathogen communities. Our primary objective was to develop an easy-to-use and accessible web-based Shiny application that estimates the probability (with associated uncertainty) that a respiratory pathogen is present in a herd and its prevalence given imperfect detection. Our application combines the best-available information on the probabilities of detection for various respiratory pathogen diagnostic protocols with a hierarchical Bayesian model of pathogen prevalence. We demonstrated this application using four examples of diagnostic tests from three herds of bighorn sheep in Montana. For instance, one population with no detections of Mycoplasma ovipneumoniae (PCR assay) still had an 6% probability of the pathogen being present in the herd. Similarly, the apparent prevalence (0.32) of M. ovipneumoniae in another herd was a substantial underestimate of estimated true prevalence (0.46: 95% CI = [0.25, 0.71]). The negative bias of naïve prevalence increased as the probability of detection of testing protocols worsened such that the apparent prevalence of Mannheimia haemolytica (culture assay) in a herd (0.24) was less than one third that of estimated true prevalence (0.78: 95% CI = [0.43, 0.99]). We found a small difference in the estimates of the probability that Mannheimia spp. (culture assay) was present in one herd between the binomial sampling approach (0.24) and the hypergeometric approach (0.22). Ignoring the implications of imperfect detection and sampling variation for assessing pathogen communities in bighorn sheep can result in spurious inference on pathogen presence and prevalence, and potentially poorly informed management decisions. Our Shiny application makes the rigorous assessment of pathogen presence, prevalence and uncertainty straightforward, and we suggest it should be incorporated into a new paradigm of disease monitoring.

Introduction

Understanding the ecology of infectious wildlife diseases is critical for the informed management of animal populations. There is heightened interest in the role of infectious disease in conservation biology (e.g., associations between pathogen communities and individual vital rates of the host population such as survival and fecundity), which requires a better understanding of the relationships between pathogen dynamics, individual host susceptibility, disease events and host population dynamics [14]. The few studies that have addressed the interplay between pathogen dynamics and host population dynamics have primarily focused on pathogens associated with high mortality, although less virulent pathogens clearly have the potential to impact host population demography [2,57]. As a result, our understanding of pathogens causing chronic endemic infections and the resulting host population dynamics are particularly poor [2,8]. Moreover, considering the evidence that a large and increasing fraction of emerging infectious diseases have an origin in wildlife populations which can serve as reservoirs for zoonotic pathogens, an improved understanding of the ecology of infectious diseases in wildlife populations is of paramount importance [912].

Infectious disease is the result of the interaction(s) between pathogens, hosts and the environment (the classic “epidemiological triangle”) [3,13], and requires accurately monitoring changes in pathogen population dynamics over long enough time scales to account for temporal variation in potential environmental drivers, pathogen communities and host population dynamics [8]. Compounding the complexity of understanding how the different components of the epidemiological triangle are related to the etiology of a disease is the problem that pathogen communities are imperfectly observed. Decades of work on the implications of the imperfect detection of individual animals in a population on estimating abundance, occupancy, or vital rates has resulted in a diverse and rigorous set of tools to account for the bias induced by imperfect detection [14]. At the core of these methods is the simple principle that the failure to detect an individual does not mean it is not present. These principles have a long history in the study of human and livestock diseases, where efforts to account for uncertain and imperfect detection (or, test sensitivity) have profound implications for understanding disease processes and pathogen prevalences [1518]. To borrow from this wealth of work, it is clear that an assessment of pathogen communities in wildlife populations that does not explicitly account for the sampling process is fundamentally incomplete [1921].

Such inadequacy in assessments of pathogen communities may handicap our understanding of disease dynamics. Test results interpreted without accounting for imperfect detection can obscure or weaken inferences on disease etiology by conflating variation in pathogen population dynamics with variation in detection, thereby potentially yielding spurious conclusions on disease etiology [22,23]. This problem is further compounded when a disease has a polymicrobial origin, and the multiple pathogen/testing protocol combinations have different probabilities of detection such that the results of testing cannot be interpreted without accounting for imperfect detection [19,24]. Although inference on disease dynamics at the population level is improved with higher probabilities of detection, even tests with high sensitivities can be misleading when applied at the individual level [25], e.g., in the context of a test and cull program to remove infected individuals [26,27]. This unclear understanding can cascade into investing resources into ineffective management strategies or a failure to consider alternative management strategies. For populations of wild hosts for which there are few gold-standard reference tests and in which the true disease state of individuals are not known, estimating the sensitivities and specificities of diagnostic tests can be challenging [28]. However, test sensitivity for wild hosts can be approximated using occupancy models, which is a flexible and straightforward method of evaluating test performance while incorporating multiple layers of uncertainty in a hierarchical modeling framework [20,29]. Although this is only an approximation to the true, unknown, test sensitivity, it is still an improvement to correct prevalence estimates using this approximated sensitivity. The relative ease with which detection probability of a diagnostic testing protocol can be approximated, coupled to the consequences to inference for failing to do so, suggests that explicitly accounting for imperfect detection should be the paradigm for wildlife disease monitoring programs [19,23].

Here, we present a web-based application that allows users to assess the consequences of imperfect detection for estimating the presence and prevalence of a pathogen in a population. As a motivating example, we demonstrate the utility of the approach for assessing the presence and prevalence of respiratory pathogens in bighorn sheep (Ovis canadensis) in the western United States. Respiratory disease in bighorn sheep can be devastating to populations due to high mortality during epizootics [24] and subsequent years of high summer lamb mortality [30]. Disease outbreak events are thought to be a factor limiting the growth of some bighorn sheep populations and the restoration of the species [31]. However, despite the significant consequences of disease outbreaks that have occurred repeatedly for decades, the etiological understanding of bighorn sheep respiratory disease is surprisingly incomplete. Multiple bacterial pathogens (or combinations of pathogens) have been linked to the disease, including Mycoplasma ovipneumoniae and the Pasteurellaceae family of pathogens [24,3234]. There is emerging evidence that M. ovipneumoniae is consistently present when population-limiting disease events occur, but M. ovipneumoniae is also present in herds without evidence of disease and relatively less is known about the role Pasteurellaceae play [24,32,35]. The conditions that enable these pathogens to cause epizootic or enzootic disease are unresolved. Compounding this lack of clarity regarding the relationships between pathogens and disease outbreak events has been a consistent failure to account for imperfect detection of the pathogens proposed as causal agents [19]. This lack of rigorous treatment of pathogen sampling data likely has meaningful consequences for understanding the role of pathogens in respiratory disease outbreaks. Nearly all of the hypotheses and tentative explanations that appear in the literature related to pathogens responsible for respiratory disease in bighorn sheep and the disease process can be traced back to interpretations of results of pathogen sampling data [32].

To confront this problem, our primary objectives were to: 1) synthesize information on detection probabilities of testing protocols for pathogens thought to be causal agents of respiratory disease in bighorn sheep into a hierarchical Bayesian model to estimate prevalence, and 2) to provide a simple interface to the model that can be routinely used by managers and researchers to estimate respiratory pathogen presence and prevalence while properly accounting for imperfect detection. Our goal is to provide a coherent framework for assessing pathogen communities by estimating both prevalence and presence (i.e., the probability of freedom from infection) in bighorn sheep, while accounting for imperfect detection and variation in sampling intensity and design. We developed a web-based Shiny application to make this framework easily accessible in order to aid studies focused on pathogen community assessments, and to ensure interpretations of pathogen sampling data and management decisions can be made with a proper assessment of the uncertainty associated with pathogen monitoring.

Methods

Ethics statement

Our work is based on already-published data and simulations only, and presents a computational tool for biologists. For this particular work, no animals were involved, nor where public/private lands accessed. Our work depends on two studies involving animals that have previously been published in PLOS ONE, and where the Ethics statements indicate the appropriate handling [19,35].

Definitions

We define pathogen prevalence as the probability that any individual animal in a herd is infected with a pathogen, and differentiate this usage from the probability that an animal is infectious or the probability that an animal is diseased [36]. Apparent prevalence (hereafter AP) is the probability that an animal will test positive for the pathogen, which is the product of the actual underlying prevalence (hereafter “true prevalence”, TP) and the probability of detection (or, the sensitivity of the testing protocol), also referred to as naïve prevalence [20,29]. We note that we are not including the probability of false positives (related to the specificity of a test) in our work due to the lack of rigorous quantitative information on the specificity of diagnostic protocols. Importantly, apparent prevalence is only equal to true prevalence when the detection of pathogens is perfect [36,37]. The goal for a rigorous testing program is to combine information on detection with information on apparent prevalence reported in testing results to estimate the underlying true prevalence of a pathogen. Estimated quantities are denoted by a ^ accent.

Bayesian model for disease prevalence

We adopted a Bayesian paradigm as a modeling approach. This has important advantages when detection probabilities are low and sample sizes are small, resulting in a failure to detect pathogens, and is a natural paradigm in which to account for imperfect detection (i.e., the probability of detection is less than 1 and is not known with certainty) [36,37]. Our study relied on previous work that utilized an occupancy-modeling framework to estimate the probabilities of detection for different pathogen/protocol combinations [19]. Our approach here was to use those estimated probabilities of detection in a simple model for the true prevalence. Under binomial sampling (i.e., assuming that sampling is from an infinite population), the probability distribution for the number of animals that test positive for a pathogen out of a sample is given by: (1) where TP is the true prevalence, detection corresponds to the probability of detecting a pathogen in an infected animal using a single protocol, and N is the sample size. Alternatively, this distribution is also expressed using the apparent prevalence: (2)

We used a mixture prior to allow the true prevalence to be equal to zero [36]. This approach decomposed true prevalence into biologically meaningful probabilities of the probability that a pathogen is in the herd, (3) and, conditional on the presence of a pathogen, the prevalence of the pathogen, (4) such that the true prevalence (TP) is given by, (5)

This decomposition into the probabilities of pathogen presence and prevalence allows the estimation of the probability that a pathogen is in the herd even when sampling results in no positive tests. We then used a beta distribution as the prior for the probability of detection given an animal was randomly sampled and was infected with the pathogen: (6)

This formulation naturally extended to the incorporation of multiple tests per animal. Where multiple tests using the same protocol were used on the same animal, the probability of detecting that pathogen at least once was re-written as: (7) where n was the number of tests.

Finally, the complete model required the specification of the hyperparameters related to the probability of pathogen presence (d), true prevalence (aP, bP) and detection (aD, bD). We set d equal to 0.5 to provide a vague prior on the probability of pathogen presence (it can be shown that this is equivalent to setting a hyperprior such that d~Beta(1,1)). Similarly, we set aP = bP = 1 to provide a vague prior on true prevalence. Critical for prevalence estimation is the use of informed priors for the probability of detection, and we relied on previous work that estimated pathogen- and protocol-specific detection probabilities for pathogens in bighorn sheep using an occupancy framework [19,35]. We took advantage of prior work that used moment matching to parameterize a beta distribution for each pathogen and protocol (i.e., set values of aD and bD) using the mean and variance of the estimated detection probabilities from this work [35].

The use of the binomial model is predicated on sampling from an infinite (or approximated by a very large) population [36,37]. However, when sampling from a finite population without replacement the correct distribution is the hypergeometric distribution, although the binomial model is generally agreed to be a good approximation when population sizes are large enough (e.g., when the sample is less than 10% of the population size) [16]. The hypergeometric approach to the Bayesian estimation of disease prevalence would modify the above approach such that the number of positive tests is given by: (8) where K is the number of “successes” in the population, n is the sample size, and N the size of the herd. The number of successes is the number of animals with the pathogen that test positive in the sample from the herd, and is given by the product of true prevalence, detection and the population size: (9)

The remainder of the model specification, e.g., priors for detection and true prevalence, remained the same between the two approaches. We demonstrate the differences between the binomial and hypergeometric approaches below, and our Shiny app allows the user to choose which method is most appropriate to the situation.

We note two simplifying assumptions of our model. First, there is strong evidence that the probability of detection for pathogens can be related to the intensity of infection [3841]. Our model assumes that all individuals subject to the same testing protocols have the same probability of a positive test (i.e., independent of the intensity of infection). Second, the prevalence of pathogens can be strongly affected by the spatial/temporal dynamics of the host population such that a single estimate of prevalence may not reflect relevant, hidden pathogen population dynamics [42,43]. In both cases, the assumptions were made to match how the probabilities of detection were originally estimated, and to reflect the lack of any information on variation in prevalence or pathogen detection from individual, spatial, or temporal sources [19].

Estimation of Bayesian models

We used an MCMC approach to approximate the posterior distributions of the model parameters. We implemented our approach via the runjags package [44] as an interface to JAGS 4.3.0 [45] in the R programming environment [46]. Models were run for 50,000 iterations with the first 10,000 iterations discarded as burnin. Chain convergence was graphically assessed using traceplots and the Gelman-Rubin statistic, where convergence was assumed for values less than 1.05 [47].

Case study of bighorn sheep in Montana

Here, we use four examples of pathogen testing from recently published research on bighorn sheep respiratory pathogens [19,35] to illustrate the consequences of imperfect detection for estimating true prevalence (Table 1) (although note that additional Pasteurellaceae/protocol combinations are available in the online application). We chose three herds from Montana (Petty Creek, Taylor-Hilgard herd, and Highlands) (Fig 1) that were tested for a diverse group of pathogens using multiple testing protocols that can best illustrate the consequences of imperfect detection (details available in [19,35]). The primary testing protocol in this study for M. ovipneumoniae in bighorn sheep utilized tryptic soy broth (TSB) as a transport media with a PCR assay (named the “TSB-PCR Protocol”). Based on nasal swab samples tested by PCR and analyzed in an occupancy-modeling framework, this protocol has a modestly high probability of detection () [19,35]. Using this protocol with two nasal swabs per animal, the apparent prevalence of M. ovipneumoniae in the Petty Creek herd in the winter of 2015–2016 was 0, and 0.32 for the Taylor-Hilgard herd using a single nasal swab in the winter of 2016–2017 (Table 1). In contrast to the TSB-PCR protocol for M. ovipneumoniae, the primary testing protocol for Mannheimia haemolytica (a Pasteurellaceae) (tryptic soy broth with a culture assay, named “TSB-culture”) based on nasal swab samples has a substantially lower probability of detection () [19,35]. Using this protocol, the apparent prevalence of Mannheimia haemolytica in the Taylor-Hilgard herd using a single nasal swab in the winter of 2013–2014 was 0.24. Finally, the apparent prevalence of Mannheimia spp. (a Pasteurellaceae) in the Highlands herd in the winter of 2015–2016 using the TSB-culture testing protocol with two nasal swabs per animal was 0 () [19,35].

thumbnail
Fig 1. Study area map.

The locations of bighorn sheep herds used as examples in this analysis.

https://doi.org/10.1371/journal.pone.0237309.g001

thumbnail
Table 1. Sampling summary for the examples used from bighorn sheep in Montana.

These results are taken from a comprehensive assessment of pathogen communities in bighorn sheep in Montana [35].

https://doi.org/10.1371/journal.pone.0237309.t001

Application goals

Our goal was to apply a Bayesian approach to the estimation of true prevalence (or, freedom from infection in the case of no positive tests) into an easy-to-use web-based application. Although code to run models similar to ours is available from other sources, the specifics of Bayesian model estimation and the use of informed priors renders the routine use of such models impractical for many users and for many disease surveillance programs. Therefore, we combined prior work on the probabilities of detection and the specifics of our modeling approach into a web-based Shiny application [48] that integrates R code into a convenient user-interface (https://quantitativebiology.shinyapps.io/pathogens/) where the selection of a particular pathogen/protocol combination automatically uses the relevant informed prior to estimate true prevalence.

Results

Shiny application

The interface to the application has three main components: 1) a tab that contains information on the Pathogens and protocols available, 2) a tab that contains the inputs required and outputs generated for Estimating prevalence in the herd, and 3) a tab that allows a general exploration of the consequences of Imperfect detection (Fig 2).

thumbnail
Fig 2. Interface to the Bayesian estimation of respiratory pathogens in bighorn sheep.

The Shiny application has three main components: 1) a tab containing information on pathogens and protocols available to the used (“Pathogens and protocols”), 2) a tab estimating the probability of the herd being free from the pathogen and the prevalence of a pathogen (“Estimating prevalence”), and 3) a tab that allows a more general exploration of the consequences of imperfect detection (“Imperfect detection”). The third tab (“Imperfect detection”) allows for a more general exploration of the consequences of imperfect detection to freedom from pathogen or prevalence estimation.

https://doi.org/10.1371/journal.pone.0237309.g002

Our application only requires a user to input sampling specifics: number of positive tests, sample size, herd size (for the use of hypergeometric sampling), pathogen, and testing protocol (Fig 3). The application then generates estimates of the true prevalence (or, probability of freedom from infection and true prevalence in the case of no positive tests) and associated uncertainty. The output from the application is divided into three tabs: a Summary tab containing estimated probabilities of freedom from the pathogen and pathogen prevalence and associated uncertainty, a Figures tab showing graphs of the approximate posterior distributions for the estimated probabilities, and a Prior for detection probability tab that illustrates the informed prior used for the specific pathogen-protocol combination [19,35].

thumbnail
Fig 3. Inputs for the estimation of freedom from infection and prevalence.

The user first selects a pathogen(1)–protocol(2) combination for which prior work has estimated a probability of detection [19,35]. Second, the user sets the parameters of the sampling design: (3) whether the sampling is from a finite or infinite population, (4) the number of animals in the sample, (5) the number of swabs per individual, (6) the number of positive tests, and (7) the herd size (used for finite population sampling only).

https://doi.org/10.1371/journal.pone.0237309.g003

Consider a hypothetical example in which the prevalence of a particular Pasteurellaceae species, Bibersteinia trehalosi, was assessed using the MSU testing protocol (details on the Pathogens and protocols tab) by testing a random sample of 20 bighorn sheep using a single swab, and no animals tested positive. The Summary tab indicates that the probability of the hypothetical herd being free from the pathogen was 0.70; conversely, the probability that the herd hosted the pathogen and the lack of detection was due to imperfect testing was 0.30 (Fig 4). The apparent prevalence in this case is 0%; however, the very low probability of detection for this pathogen/protocol combination implies a non-zero probability that the pathogen was in the herd and simply missed in the sampled individuals, with significant resulting uncertainty in what the true prevalence was [95% CI = 0.0–0.81] (Fig 4). The graphs of the approximate posterior distributions on the Figures tab illustrate the skewed distributions associated with estimates of prevalence (Fig 5). The mismatch between apparent prevalence (0 out of 20 animals) and the estimated true prevalence is a result of low probabilities of detection for this pathogen-protocol combination (as visible in the Prior for detection probability tab) and modest number of animals sampled (Fig 6).

thumbnail
Fig 4. Summary output.

These are the estimates of the probability the herd is free from a pathogen and prevalence assuming sampling from an infinite population. In this hypothetical example, 0 out of 20 animals tested positive for Bibersteinia trehalosi using the MSU testing protocol. Imperfect detection of the pathogen resulted in a substantial probability that the pathogen was present and simply missed (approximately 0.30).

https://doi.org/10.1371/journal.pone.0237309.g004

thumbnail
Fig 5. Graphs of the approximate posteriors for the probability of a herd being free from a pathogen and pathogen prevalence for a hypothetical example.

In this hypothetical example, 0 out of 20 animals tested positive for Bibersteinia trehalosi using the MSU testing protocol.

https://doi.org/10.1371/journal.pone.0237309.g005

thumbnail
Fig 6. Graph of the informed prior distribution.

This is the separately estimated probability of detecting Bibersteinia trehalosi with the MSU protocol. Priors for detection probability were extracted from previous work [35].

https://doi.org/10.1371/journal.pone.0237309.g006

Pathogen prevalence in bighorn sheep

Our first example highlights that even when the detection probability for a given pathogen is relatively high, failure to account for imperfect detection and sampling may lead to the misleading inference that a pathogen is not present when there remains some probability the pathogen may be present. The apparent prevalence of M. ovipneumoniae in the Petty Creek herd in 2015–2016 was 0 (0 positive tests out of 16 tested animals). However, accounting for the imperfect detection probability of the testing protocol suggests that there was a non-trivial probability that M. ovipneumoniae was present in the herd and simply not detected in the sample of individuals tested (infinite population (binomial sampling): ) (S1 Fig). The estimated true prevalence of the pathogen in the herd was (infinite population: ([95% CI = 0.0–0.06]). These results are in close agreement with the finite sampling model () due to the small sample size (n = 16) relative to the herd size (N = 160) (S2 Fig).

Second, imperfect detection results in an apparent prevalence of the pathogen that can be a significant underestimate of true prevalence. Despite the comparatively high probability of detection for M. ovipneumoniae using the testing protocol the apparent prevalence in the Taylor-Hilgard herd in 2016–2017 (10 out of 31 animals with a positive test, 0.32) was a significant underestimate of the true prevalence (S3 Fig). After accounting for imperfect detection, the estimated true prevalence was (infinite population: [95% CI = 0.25–0.71]). The uncertainty associated with this prevalence estimate (and all such estimates based on small samples with imperfect and uncertain detection) is substantial and may render this estimate of little value for understanding disease etiology.

Third, this mismatch between apparent prevalence (0.32) and true prevalence (0.46) worsens as the detection probability for the pathogen declines. For example, for Mannheimia haemolytica in the Taylor-Hilgard herd in 2013–2014, results from the testing protocol suggested an apparent prevalence of 0.24 (7 out of 29 animals with a positive test). However, the estimated true prevalence of this pathogen was (infinite population: [95% CI = 0.43–0.99]), over three times the value of the apparent prevalence (S4 Fig). Thus, even when a testing protocol indicates pathogen presence, the estimated prevalence of the pathogen may be underestimated if TP is not properly estimated.

In each of the above examples, the results from the infinite population approach (binomial sampling) agree quite closely to the finite population approach (hypergeometric sampling) due to the small sample sizes relative to the herd size. For our final example, in 2015–2016 the apparent prevalence of Mannheimia spp. using the testing protocol in the Highlands herd was 0 (0 out of 16 animals tested positive), which was misleading given that the probability that the herd hosted the pathogen and it was missed due to imperfect detection was high () (S5 Fig). However, the sample size (n = 16) relative to the population size (N = 75) was large enough to violate the traditional assumption of sampling from an infinite population. Results from the finite population approach suggested the probability that the pathogen was actually in the herd was slightly lower () (S6 Fig).

Discussion

The ability to assess pathogen communities in wildlife populations when detection is imperfect is a pre-requisite for informed disease management and a better understanding of disease etiology [19,20,22,23,35]. However, proper pathogen community assessments require information on the probabilities of detection associated with each testing protocol coupled to a Bayesian framework in order to properly account for uncertainty. This combination can render the approach impractical or unrealistic for otherwise interested users that desire to translate diagnostic results into estimates of true prevalence and associated uncertainty. Our work integrates this unified framework with a user-friendly, web-based Shiny application in order to illustrate the consequences of testing protocols, sample sizes and sampling designs to the understanding of the relationship between apparent and estimated true prevalence. Moreover, our application is easily scalable to accommodate new pathogens and testing protocols, or updates to current combinations.

The primary utility of our application is to help inform stakeholders, biologists, managers and disease ecologists about the integrated consequences of imperfect testing, sampling designs, and sampling error. Furthermore, where retrospective data are available without repeated sampling, this application can provide estimates of prevalence and the probability that a pathogen is in a herd along with related estimates of uncertainty so as to assist informed management. We demonstrated the application using four examples of testing for respiratory pathogens in bighorn sheep, which we use to illustrate three main points. First, even when the detection probability is high failure to account for imperfect detection and sampling leads to misleading inference; a problem that is compounded when sample sizes are small. Despite the TSB-PCR testing protocol having a comparatively high probability of detection for M. ovipneumoniae, the apparent prevalence of the pathogen in the Taylor-Hilgard herd was substantially less than the estimated true prevalence and the probability that the pathogen was actually in the Petty Creek herd despite no positive tests was non-zero. Second, the misleading inference that results from failing to account for imperfect detection grows substantially worse as the probability of detection decreases until there is a gross mismatch between apparent and true prevalence (the apparent prevalence of Mannheimia haemolytica in the Taylor-Hilgard herd example was approximately one-third that of estimated true prevalence). Moreover, in the case of low and uncertain detection probabilities and small sample sizes, the resulting uncertainty in prevalence estimates may render the results useless for understanding disease etiology and management actions that rely on prevalence such as non-selective culling programs to reduce prevalence [49]. Third, sampling assumptions should be checked for each application. Although in our final example the difference in inference for pathogen prevalence between assuming sampling from an infinite population (binomial sampling) or sampling from a finite population (hypergeometric sampling) was small, we expect this difference to magnify with smaller herds such as those involved in management interventions for population restoration. Given that binomial sampling is only an approximation to the truth and that inference based on hypergeometric sampling is easily handled in our application, we strongly recommend inference on pathogen community dynamics be based on the latter where possible. However, the benefit of using a hypergeometric sampling distribution must be weighed against the cost of estimating the size of a population; where it is impractical or cost-prohibitive, the binomial sampling distribution is a practical alternative.

In addition to providing more accurate estimates of true prevalence and associated uncertainty, our approach and application could be used to inform the design of disease monitoring programs prior to data collection [50]. Previous work on the power of these testing protocols to detect pathogens and estimate their prevalence provided critical insight into sampling design, including recommendations for minimum sample sizes and the number of swabs per animal [19,51,52]. Guided by this prior work, our application allows a user to prospectively assess the power of a specific sampling design for pathogen assessment. If there is an a priori threshold for the probability that a pathogen is free from a herd prior to translocation, or a required level of precision on estimates of true prevalence, the user can assess the efficacy of different combinations of sample sizes, protocols, and swabs per animal to achieve that goal. Where logistical limitations render the goal unachievable, that failure forces a user to adopt an alternative sampling design to improve power or accept the risks of management actions informed by imperfect sampling of pathogen communities. For example, when sample sizes are small, even the comparatively high probabilities of detection for the protocols used to assess the presence of M. ovipneumoniae may fail to yield the required precision. In that case, a sampling design can evolve to combine multiple testing protocols (a future direction for this Shiny application) and samples in multiple years to improve inference [19,35], the risks associated with more uncertainty can be accepted, or alternately the management action or associated sampling plan be deemed unattainable and abandoned. In the case of M. ovipneumoniae, serology testing is another option to assess its presence in a population at some point in recent times. We recommend, however, that researchers and managers strive to detect the agent via PCR because this allows characterization of the infection status at the moment of sampling, which provides unequivocal proof of the pathogen’s presence and can help to infer the cause of outbreaks, although we acknowledge other diagnostic tools are promising alternatives [43,53,54]. A recent attempt to create a small domestic sheep flock free from M. ovipneumoniae based on intensive testing was unsuccessful at least in part due to imperfect detection of the target pathogen, illustrating the challenges to successfully implementing such management intervention strategies [55]. Use of our Shiny application would improve planning for such management experiments and help provide realistic expectations for the level of effort that might be required and practicality of meeting intervention goals. However, we acknowledge that these results are based off of a single study evaluating testing protocols for respiratory pathogens in bighorn sheep and, although they represent the best available information for this study system, care should be taken when using this application for other systems that deviate from the assumptions of either the model underlying the estimation of true prevalence in the Shiny application, or the occupancy model used for the estimation of detection probabilities. Moreover, these results apply only at the population level. Future work is required to address how they may translate to the individual level if prevalence is related to individual characteristics, and a promising direction takes advantage of a multiple-testing protocols using a longitudinal design in a probabilistic framework [43].

Respiratory disease in bighorn sheep is a critical factor limiting population growth rates and the restoration of populations to historic ranges [3032,56]. More work is needed to understand the linkages between pathogen and host population dynamics and, thus, to develop effective management strategies to recover bighorn sheep in the face of disease. We suggest that the first step towards improved understanding of the role of pathogens in bighorn sheep disease outbreaks is the accurate characterization of pathogen communities by accounting for imperfect probabilities of detection. Failure to properly account for imperfect probabilities of detection has two important consequences. First, an erroneous understanding of which pathogens are present can result in limited resources being poorly used, e.g., the translocation of sheep from herds that might not test positive for pathogens but nonetheless may have a non-trivial probability of hosting the pathogen. Second, biased estimates of prevalence that result from failing to account for imperfect detection make evaluating associations between pathogen and host population dynamics intractable, particularly where a polymicrobial origin of the disease is possible. To avoid these consequences and ensure management decisions are well-informed, accounting for imperfect detection in diagnostic tests should become the paradigm for wildlife disease monitoring programs.

Supporting information

S1 Fig. Summary output from the model with estimates of the probability the herd is free from a pathogen and prevalence assuming sampling from an infinite population.

In this specific example, 0 out of 16 animals in the Petty Creek herd (2015–2016) tested positive for Mycoplasma ovipneumoniae using the TSB-PCR protocol with 2 swabs per animal.

https://doi.org/10.1371/journal.pone.0237309.s001

(TIF)

S2 Fig. Summary output from the model with estimates of the probability the herd is free from a pathogen and prevalence assuming sampling from a finite population.

In this specific example, 0 out of 16 animals in the Petty Creek herd (2015–2016) tested positive for Mycoplasma ovipneumoniae using the TSB-PCR protocol with 2 swabs per animal. These results agree closely with those that assumed a finite population (S1 Fig).

https://doi.org/10.1371/journal.pone.0237309.s002

(TIF)

S3 Fig. Summary output from the model with estimates of the probability the herd is free from a pathogen and prevalence assuming sampling from a finite population.

In this specific example, 10 out of 31 animals in the Taylor-Hilgard herd (2016–2017) tested positive for Mycoplasma ovipneumoniae using the TSB-PCR protocol with 1 swab per animal.

https://doi.org/10.1371/journal.pone.0237309.s003

(TIF)

S4 Fig. Summary output from the model with estimates of the probability the herd is free from a pathogen and prevalence assuming sampling from a finite population.

In this specific example, 7 out of 29 animals in the Taylor-Hilgard herd (2013–2014) tested positive for Mannheimia haemolytica using the TSB-culture protocol with 1 swab per animal.

https://doi.org/10.1371/journal.pone.0237309.s004

(TIF)

S5 Fig. Summary output from the model with estimates of the probability the herd is free from a pathogen and prevalence assuming sampling from an infinite population.

In this specific example, 0 out of 16 animals in the Highlands herd (2015–2016) tested positive for Mannheimia spp. using the TSB-culture protocol with 1 swab per animal.

https://doi.org/10.1371/journal.pone.0237309.s005

(TIF)

S6 Fig. Summary output from the model with estimates of the probability the herd is free from a pathogen and prevalence assuming sampling from a finite population.

In this specific example, 0 out of 16 animals in the Highlands herd (2015–2016) tested positive for Mannheimia spp. using the TSB-culture protocol with 1 swab per animal.

https://doi.org/10.1371/journal.pone.0237309.s006

(TIF)

S1 Table. Description of diagnostic protocols used for examples from bighorn sheep.

https://doi.org/10.1371/journal.pone.0237309.s007

(DOCX)

References

  1. 1. Smith KF, Sax DF, Lafferty KD. Evidence for the role of infectious disease in species extinction and endangerment. Conserv Biol. 2006;20: 1349–1357. pmid:17002752
  2. 2. Cleaveland S, R. Hess G, Dobson A, K Laurenson M, Mccallum H, Roberts MG, et al. The role of pathogens in biological conservation. Ecol Wildl Dis. 2002.
  3. 3. Tompkins DM, Dunn AM, Smith MJ, Telfer S. Wildlife diseases: from individuals to ecosystems. J Anim Ecol. 2011;80: 19–38. pmid:20735792
  4. 4. Lloyd-Smith JO, Schreiber SJ, Kopp PE, Getz WM. Superspreading and the effect of individual variation on disease emergence. Nature. 2005;438: 355–359. pmid:16292310
  5. 5. Perez‐Heydrich C, Oli MK, Brown MB. Population-level influence of a recurring disease on a long-lived wildlife host. Oikos. 2012;121: 377–388.
  6. 6. Cotterill GG, Cross PC, Middleton AD, Rogerson JD, Scurlock BM, Toit JT du. Hidden cost of disease in a free-ranging ungulate: brucellosis reduces mid-winter pregnancy in elk. Ecol Evol. 2018;8: 10733–10742. pmid:30519402
  7. 7. McCallum H. Quantifying the impact of disease on threatened species. Pac Conserv Biol. 1994;1: 107–117.
  8. 8. Cross PC, Heisey DM, Bowers JA, Hay CT, Wolhuter J, Buss P, et al. Disease, predation and demography: assessing the impacts of bovine tuberculosis on African buffalo by monitoring at individual and population levels. J Appl Ecol. 2009;46: 467–475.
  9. 9. Jones KE, Patel NG, Levy MA, Storeygard A, Balk D, Gittleman JL, et al. Global trends in emerging infectious diseases. Nature. 2008;451: 990–993. pmid:18288193
  10. 10. Rhyan JC, Spraker TR. Emergence of diseases from wildlife reservoirs. Vet Pathol. 2010;47: 34–39. pmid:20080482
  11. 11. Daszak P, Cunningham AA, Hyatt AD. Emerging infectious diseases of wildlife–threats to biodiversity and human health. science. 2000;287: 443–449. pmid:10642539
  12. 12. Wiethoelter AK, Beltrán-Alcrudo D, Kock R, Mor SM. Global trends in infectious diseases at the wildlife–livestock interface. Proc Natl Acad Sci U S A. 2015;112: 9662–9667. pmid:26195733
  13. 13. Scholthof K-BG. The disease triangle: pathogens, the environment and society. Nat Rev Microbiol. 2007;5: 152–156. pmid:17191075
  14. 14. Williams BK, Nichols JD, Conroy MJ. Analysis and management of animal populations. Academic Press; 2002.
  15. 15. Christensen J, Gardner IA. Herd-level interpretation of test results for epidemiologic studies of animal diseases. Prev Vet Med. 2000;45: 83–106. pmid:10802335
  16. 16. Cameron AR, Baldock FC. A new probability formula for surveys to substantiate freedom from disease. Prev Vet Med. 1998;34: 1–17. pmid:9541947
  17. 17. Nikolay B, Brooker SJ, Pullan RL. Sensitivity of diagnostic tests for human soil-transmitted helminth infections: a meta-analysis in the absence of a true gold standard. Int J Parasitol. 2014;44: 765–774. pmid:24992655
  18. 18. Solomon AW, Engels D, Bailey RL, Blake IM, Brooker S, Chen J-X, et al. A diagnostics platform for the integrated mapping, monitoring, and surveillance of neglected tropical diseases: rationale and target product profiles. PLoS Negl Trop Dis. 2012;6: e1746. pmid:22860146
  19. 19. Butler CJ, Edwards WH, Jennings-Gaines JE, Killion HJ, Wood ME, McWhirter DE, et al. Assessing respiratory pathogen communities in bighorn sheep populations: Sampling realities, challenges, and improvements. PloS One. 2017;12: e0180689. pmid:28708832
  20. 20. McClintock BT, Nichols JD, Bailey LL, MacKenzie DI, Kendall WL, Franklin AB. Seeking a second opinion: uncertainty in disease ecology. Ecol Lett. 2010;13: 659–674. pmid:20426794
  21. 21. Conn PB, Cooch EG. Multistate capture–recapture analysis under imperfect state observation: an application to disease models. J Appl Ecol. 2009;46: 486–492.
  22. 22. Miller DA, Talley BL, Lips KR, Grant EHC. Estimating patterns and drivers of infection prevalence and intensity when detection is imperfect and sampling error occurs. Methods Ecol Evol. 2012;3: 850–859.
  23. 23. Lachish S, Murray KA. The certainty of uncertainty: potential sources of bias and imprecision in disease ecology studies. Front Vet Sci. 2018;5. pmid:29872662
  24. 24. Besser TE, Frances Cassirer E, Highland MA, Wolff P, Justice-Allen A, Mansfield K, et al. Bighorn sheep pneumonia: Sorting out the cause of a polymicrobial disease. Prev Vet Med. 2013;108: 85–93. pmid:23253148
  25. 25. Trevethan R. Sensitivity, specificity, and predictive values: foundations, pliabilities, and pitfalls in research and practice. Front Public Health. 2017;5. pmid:29209603
  26. 26. Woodroffe R, Frost SDW, Clifton‐Hadley RS. Attempts to control tuberculosis in cattle by removing infected badgers: constraints imposed by live test sensitivity. J Appl Ecol. 1999;36: 494–501.
  27. 27. van Dijk J. Towards risk-based test protocols: estimating the contribution of intensive testing to the UK bovine tuberculosis problem. PloS One. 2013;8: e63961. pmid:23717517
  28. 28. Enøe C, Georgiadis MP, Johnson WO. Estimation of sensitivity and specificity of diagnostic tests and disease prevalence when the true disease state is unknown. Prev Vet Med. 2000;45: 61–81. pmid:10802334
  29. 29. Lachish S, Gopalaswamy AM, Knowles SC, Sheldon BC. Site-occupancy modelling as a novel framework for assessing test sensitivity and estimating wildlife disease prevalence from imperfect diagnostic tests. Methods Ecol Evol. 2012;3: 339–348.
  30. 30. Manlove K, Cassirer EF, Cross PC, Plowright RK, Hudson PJ. Disease introduction is associated with a phase transition in bighorn sheep demographics. Ecology. 2016;97: 2593–2602. pmid:27859120
  31. 31. Cassirer EF, Sinclair ARE. Dynamics of pneumonia in a bighorn sheep metapopulation. J Wildl Manag. 2007;71: 1080–1088.
  32. 32. Besser TE, Highland MA, Baker K, Cassirer EF, Anderson NJ, Ramsey JM, et al. Causes of pneumonia epizootics among bighorn sheep, western United States, 2008–2010. Emerg Infect Dis. 2012;18: 406–414. pmid:22377321
  33. 33. Besser TE, Cassirer EF, Potter KA, Lahmers K, Oaks JL, Shanthalingam S, et al. Epizootic pneumonia of bighorn sheep following experimental exposure to Mycoplasma ovipneumoniae. PLOS ONE. 2014;9: e110039. pmid:25302992
  34. 34. Wood ME, Fox KA, Jennings-Gaines J, Killion HJ, Amundson S, Miller MW, et al. How respiratory pathogens contribute to lamb mortality in a poorly performing bighorn sheep (Ovis canadensis) herd. J Wildl Dis. 2017;53: 126–130. pmid:27690193
  35. 35. Butler CJ, Edwards WH, Paterson JT, Proffitt KM, Jennings-Gaines JE, Killion HJ, et al. Respiratory pathogens and their association with population performance in Montana and Wyoming bighorn sheep populations. PloS One. 2018;13: e0207780. pmid:30475861
  36. 36. Messam LLM, Branscum AJ, Collins MT, Gardner IA. Frequentist and Bayesian approaches to prevalence estimation using examples from Johne’s disease. Anim Health Res Rev. 2008;9: 1–23. pmid:18346298
  37. 37. Branscum AJ, Gardner IA, Johnson WO. Bayesian modeling of animal-and herd-level prevalences. Prev Vet Med. 2004;66: 101–112. pmid:15579338
  38. 38. Drewe JA, Dean GS, Michel AL, Pearce GP. Accuracy of three diagnostic tests for determining Mycobacterium bovis infection status in live-sampled wild meerkats (Suricata suricatta). J Vet Diagn Invest. 2009;21: 31–39. pmid:19139498
  39. 39. Gómez-Díaz E, Doherty PF Jr, Duneau D, McCoy KD. Cryptic vector divergence masks vector-specific patterns of infection: an example from the marine cycle of Lyme borreliosis. Evol Appl. 2010;3: 391–401. pmid:25567933
  40. 40. DiRenzo GV, Campbell Grant EH, Longo AV, Che-Castaldo C, Zamudio KR, Lips KR. Imperfect pathogen detection from non-invasive skin swabs biases disease inference. Methods Ecol Evol. 2018;9: 380–389.
  41. 41. Knowles SC, Wood MJ, Alves R, Wilkin TA, Bensch S, Sheldon BC. Molecular epidemiology of malaria prevalence and parasitaemia in a wild bird population. Mol Ecol. 2011;20: 1062–1076. pmid:21073677
  42. 42. Marchand P, Freycon P, Herbaux J-P, Game Y, Toïgo C, Gilot-Fromont E, et al. Sociospatial structure explains marked variation in brucellosis seroprevalence in an Alpine ibex population. Sci Rep. 2017;7: 1–12. pmid:28127051
  43. 43. Buzdugan SN, Vergne T, Grosbois V, Delahay RJ, Drewe JA. Inference of the infection status of individuals using longitudinal testing data from cryptic populations: Towards a probabilistic approach to diagnosis. Sci Rep. 2017;7: 1–11. pmid:28127051
  44. 44. Denwood MJ. runjags: an R Package providing interface utilities, model templates, parallel computing methods and additional distributions for MCMC models in JAGS. J Stat Softw. 2016;71: 1–25.
  45. 45. Plummer M. JAGS version 4.3. 0 user manual [Software manual]. 2017.
  46. 46. R Core Team. R: a language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing; 2018. Available: https://www.R-project.org/
  47. 47. Gelman A, Rubin DB. Inference from iterative simulation using multiple sequences. Stat Sci. 1992;7: 457–472.
  48. 48. Chang W, Cheng J, Allaire J, Xie Y, McPherson J. shiny: web application framework for R. 2019. Available: https://CRAN.R-project.org/package = shiny
  49. 49. Alex Potapov, Evelyn Merrill, Lewis Mark A. Wildlife disease elimination and density dependence. Proc R Soc B Biol Sci. 2012;279: 3139–3145. pmid:22593103
  50. 50. Restif O, Hayman DT, Pulliam JR, Plowright RK, George DB, Luis AD, et al. Model-guided fieldwork: practical guidelines for multidisciplinary research on wildlife ecological and epidemiological dynamics. Ecol Lett. 2012;15: 1083–1094. pmid:22809422
  51. 51. MacKenzie DI, Royle JA. Designing occupancy studies: general advice and allocating survey effort. J Appl Ecol. 2005;42: 1105–1114.
  52. 52. MacKenzie DI, Nichols JD, Lachman GB, Droege S, Andrew Royle J, Langtimm CA. Estimating site occupancy rates when detection probabilities are less than one. Ecology. 2002;83: 2248–2255.
  53. 53. Jennelle CS, Cooch EG, Conroy MJ, Senar JC. State-specific detection probabilities and disease prevalence. Ecol Appl. 2007;17: 154–167. pmid:17479842
  54. 54. Cassirer EF, Manlove KR, Plowright RK, Besser TE. Evidence for strain-specific immunity to pneumonia in bighorn sheep. J Wildl Manag. 2017;81: 133–143.
  55. 55. Harris R, Besser T, Bush K, Glenn G, Kjack J, Oyster J, et al. A pilot program to create a source of domestic sheep free of M. ovipneumoniae for cooperating private owners [abstract]. Proceedings of the 21st biennial symposium of the northern wild sheep and goat council. Whitefish, MT; 2018.
  56. 56. Cassirer EF, Manlove KR, Almberg ES, Kamath PL, Cox M, Wolff P, et al. Pneumonia in bighorn sheep: Risk and resilience. J Wildl Manag. 2018;82: 32–45.