Skip to main content
Advertisement
  • Loading metrics

Hidden variable models reveal the effects of infection from changes in host survival

  • Jake M. Ferguson ,

    Contributed equally to this work with: Jake M. Ferguson, Andrea González-González

    Roles Data curation, Formal analysis, Software, Validation, Visualization, Writing – original draft

    Affiliations Department of Biology, University of Hawaiʻi at Mānoa, Honolulu, Hawaii, United States of America, Institute for Modeling Collaboration and Innovation, University of Idaho, Moscow, Idaho, United States of America

  • Andrea González-González ,

    Contributed equally to this work with: Jake M. Ferguson, Andrea González-González

    Roles Investigation, Validation, Visualization, Writing – original draft

    andreagg@gmail.com

    Affiliations Institute for Modeling Collaboration and Innovation, University of Idaho, Moscow, Idaho, United States of America, Department of Biology, University of Florida, Gainesville, Florida, United States of America, Department of Biological Sciences, University of Idaho, Moscow, Idaho, United States of America

  • Johnathan A. Kaiser,

    Roles Data curation, Investigation, Methodology, Writing – review & editing

    Affiliation Department of Biological Sciences, University of Idaho, Moscow, Idaho, United States of America

  • Sara M. Winzer,

    Roles Data curation, Investigation, Methodology, Writing – review & editing

    Affiliation Department of Biological Sciences, University of Idaho, Moscow, Idaho, United States of America

  • Justin M. Anast,

    Roles Investigation, Methodology, Writing – review & editing

    Affiliation Department of Biological Sciences, University of Idaho, Moscow, Idaho, United States of America

  • Ben Ridenhour,

    Roles Formal analysis, Writing – review & editing

    Affiliation Department of Mathematics, University of Idaho, Moscow, Idaho, United States of America

  • Tanya A. Miura,

    Roles Resources, Software, Writing – review & editing

    Affiliations Institute for Modeling Collaboration and Innovation, University of Idaho, Moscow, Idaho, United States of America, Department of Biological Sciences, University of Idaho, Moscow, Idaho, United States of America

  • Christine E. Parent

    Roles Conceptualization, Funding acquisition, Methodology, Project administration, Software, Writing – review & editing

    Affiliations Institute for Modeling Collaboration and Innovation, University of Idaho, Moscow, Idaho, United States of America, Department of Biological Sciences, University of Idaho, Moscow, Idaho, United States of America, Institute for Interdisciplinary Data Sciences, University of Idaho, Moscow, Idaho, United States of America

Abstract

The impacts of disease on host vital rates can be demonstrated using longitudinal studies, but these studies can be expensive and logistically challenging. We examined the utility of hidden variable models to infer the individual effects of infectious disease from population-level measurements of survival when longitudinal studies are not possible. Our approach seeks to explain temporal deviations in population-level survival after introducing a disease causative agent when disease prevalence cannot be directly measured by coupling survival and epidemiological models. We tested this approach using an experimental host system (Drosophila melanogaster) with multiple distinct pathogens to validate the ability of the hidden variable model to infer per-capita disease rates. We then applied the approach to a disease outbreak in harbor seals (Phoca vituline) that had data on observed strandings but no epidemiological data. We found that our hidden variable modeling approach could successfully detect the per-capita effects of disease from monitored survival rates in both the experimental and wild populations. Our approach may prove useful for detecting epidemics from public health data in regions where standard surveillance techniques are not available and in the study of epidemics in wildlife populations, where longitudinal studies can be especially difficult to implement.

Author summary

Effectively monitoring disease outbreaks requires determining the proportion of diseased individuals and the consequences of disease through time. The ecological fallacy is the statistical assumption that groups are homogenous when they are not. The fallacy can arise in epidemiology when studying populations composed of both diseased and healthy individuals. The average mortality in the population will not accurately describe either a diseased or a healthy individual. This is a challenge for monitoring disease outbreaks since many forms of monitoring data are collected at the population level. Here, we apply a modeling approach that accounts for disease transmission at the individual-level when data are collected at the population-level. The approach works by comparing the pattern of mortality through time in a population with a disease to a fully healthy one. The observed rate of change in mortality through time informs the model of the infection process. We validated our approach using an experimental system (Drosophila melanogaster infected with different viral pathogens). We then applied the method to study an outbreak of phocine distemper in harbor seals. Our results show that the individual-level effects of disease can be determined from population-level data.

Introduction

Longitudinal studies that track individuals through time are the benchmark for detecting the effects of disease on survival and reproduction because they can directly link a change in an individual’s infection status to a change in their vital rates. Despite their power, longitudinal studies tend to be relatively rare, especially those concerning wildlife, due to the prohibitive costs and logistics of tracking individuals. Alternatives to longitudinal studies, such as tracking groups instead of individuals, can illustrate differences among populations; however, these comparisons cannot explicitly determine the individual-level impacts of the disease. The inability for inferences at the group level to be valid at the individual level is commonly known as the ecological fallacy.

Hidden variable models, also called latent-variable models, explicitly link processes that are not directly observed to the measured random variables, providing an effective pathway to model complex biological processes using aggregate data [1]. Commonly used hidden variable modeling approaches in biology include latent variable regression [2], hidden Markov [3], state-space models [4], and structural equation models [5]. In epidemiology, state-space approaches have been successfully used to link the observed number of deaths due to disease to the underlying disease dynamics [6] and distinguish seasonal and epidemic dynamics in public-health surveillance data [7,8]. Coupling hidden variable models to inexpensive and commonly collected forms of data, such as population-level measurements of vital rates, could offer a powerful new approach for retroactively determining the individual-level effects of disease on populations.

Monitoring vital rates, such as survival, is a common approach for managing wildlife populations. These measurements have been used to provide insights into the past causes of change in population growth [9] and to forecast the effects of potential future environmental conditions on population growth [10]. In the context of disease studies, coupling vital rates to disease prevalence has been used to determine how bovine tuberculosis, a mild infection at the individual-level, can lead to large reductions in the population growth of African buffalo [11]. More recent work has extended this approach to use vital rate and prevalence measurements to identify the impacts of interactions between multiple diseases [12]. Thus, while monitoring vital rates is a potential resource for detecting the effects of disease, the current usefulness of these data for answering epidemiological questions is contingent on having detailed information on individuals infection status.

Here, we determined whether the effect of infection on a host’s survival rates could be inferred without direct measurements of infection prevalence. Our approach coupled the observed survival rates to a model that predicted the infected status of individuals, the demographic rates of both infected and uninfected individuals and the probability of becoming infected. Conceptually, we are able to infer the survival rates of infected and uninfected individuals by comparing the temporal change between the observed pre- and post-infection survival rates.

We first demonstrate proof-of-concept of our modeling approach on a tractable experimental host-virus system of Drosophila melanogaster and the associated Drosophila C virus (DCV) and Drosophila X virus (DXV) where longitudinal surveys were impossible to perform due to the destructive nature of specimen sampling. We then apply the modeling technique to a phocine distemper outbreak in the Netherlands where weekly mortality data are collected but individuals are not tested for disease or virus presence. In both scenarios, disease incidence could not be directly measured but systematic changes in population-level vital rates were observed post-infection.

Materials and methods

Survival experiment

Fly line and husbandry.

We used the Oregon R wild-type strain of Drosophila melanogaster from Carolina Biology Supply. The stock and experimental flies were kept in an incubator on a 12-hour day/night cycle at 25°C. We housed the stock flies in 25 x 95 mm polystyrene vials (Genesee Scientific) with 10 ml of Jazz-Mix Drosophila Food referred here as fly food (Fisher Scientific) prepared following the manufacturer’s instructions. As described in [13], we bleached freshly laid eggs to clear any potential chronic infections before performing the viral challenges.

Infecting pathogens.

We used Drosophila C (DCV) and Drosophila X (DXV) viruses as infecting pathogens. DCV is a +ssRNA virus that belongs to the Dicistroviridae family. It has been found in natural populations of at least eight different Drosophila species [14,15] and has been extensively used as a model system to study the antiviral response in D. melanogaster [1619] as well as the ecology and evolution of viral infections in Drosophila [2022]. While it is possible for oral inoculation by DCV to result in systemic infection, it is more often associated with a local immune response and results in low mortality rates compared to intra-thoracic injections [17,23]. DXV is a dsRNA virus from the Birnaviridae family and is considered to be a cell culture contaminant [24]. It is genetically similar to the dsRNA Eridge virus isolated from natural populations of fruit flies in France and the UK [15]. Although the effects and pathology of DXV are not as well understood as DCV in Drosophila, it has been shown that infections by microinjection induce anoxia sensitivity, exhibited by increased rates of mortality when exposed to CO2 [2426].

We used the Charolles strain of DCV isolated from a laboratory population in 1972 [27] and obtained from Dr. Marta L Wayne (University of Florida). We obtained the DXV stock from Dr. Louisa Wu (University of Maryland). Both DCV and DXV were cultured in Schneider’s Drosophila Line 2 (S2 cells) and titrated to a tissue culture 50% infectious dose of 2x109 TCID50/ml as previously described [28]. Viral stocks were kept in 250 μl aliquots at -80°C.

Oral infection and survival monitoring.

We infected flies through an oral inoculation protocol meant to resemble a natural route of infection [2931]. We collected newly emerged flies in vials containing clean fly food and left them to age for three days in an incubator on a 12-hour day/night cycle at 25°C to reach maturity. We next starved the mature flies for four hours by placing them in empty polystyrene vials and then transferred them into new vials supplemented with a 1.9 cm diameter circular piece of Whatman filter paper (Cat No 1001 150) placed in the bottom. The filter paper was loaded with 100 μl of viral or negative control mixture. We prepared the viral mixture for infections by combining 50 μl sucrose 25%, 225 μl DCV or DXV virus stock, 225 μl Schneider’s Drosophila Medium (Genesee Scientific), and 10 μl of edible red dye. For the negative control (i.e., mock) trial, 450 μl of Schneider’s Drosophila Medium was used in addition to the 50 μl sucrose 25% plus 10 μl of edible red dye. After six hours of letting the flies feed on one of the mixtures, only the flies showing a red belly (indicating recent ingestion of viral or mock mixture) were transferred into vials containing clean fly food.

Each infection treatment (DCV or DXV) and negative control (Mock) consisted of 10 replicate vials, each vial housing 5 red belly females and 5 red belly males for a total of 100 flies per treatment or negative control. We placed the experimental vials in the incubator and counted the number of dead flies in each tube daily over 35 days. Alive flies were transferred into new vials containing clean food every seven days to avoid the emergence of offspring. Deceased flies were not collected immediately after a mortality event to avoid potential stress from repeated anesthetization of the experimental populations and we were not able to collect deceased flies from the 7-days old vial because the corpses were either decomposed or buried deep in the fly food. We illustrate these experimental steps in Fig 1.

thumbnail
Fig 1. Diagram of the oral DCV or DXV infection protocol in Drosophila melanogaster.

First, 15 males and 15 females were placed in a clean vial containing fly food for 3 days to reach complete maturation. Flies were next transferred into an empty vial to starve for 4 hours. Flies were then placed into new vials for 6 hours, where flies fed from either a virus or mock mixture that contained red dye. After this inoculation period, 10 flies (5 males and 5 females) showing a red belly, were transferred into a new vial containing food (without any pathogen). Finally, the number of dead and alive flies was scored on a daily basis over a period of 35 days and alive flies were transferred into clean vials containing fly food every 7 days.

https://doi.org/10.1371/journal.pcbi.1010910.g001

Fly survival models.

We analyzed the effects of DCV and DXV on the mortality rates of flies by treating the experimental data as mixtures of individuals with three different possible states (i.e., susceptible, infected, or dead). We started our experimental populations with all flies ingesting the viral mixture, however, since determining the state of infection in these flies required destructive sampling we could not determine which of these exposed flies became diseased. However, experimental evidence suggests that individual flies orally infected with DCV start clearing the virus as soon as two days post-infection [17,23]. Because we did not observe the dynamics typical of the recovery phase, such as an increase in average survival near the end of the experiment, we assumed that our experiment data are composed of a mixture of an unknown number susceptible and infected individuals. To resolve this unknown mixture, we used an epidemiological model to predict the numbers of susceptible and infected flies throughout the 35-day experiment. Flies selected for the experimental treatments were known to be directly exposed to DCV or DXV through the consumed medium. Subsequent infections likely follow a fecal- oral route of transmission [14,16]. Thus, we modeled the disease and population dynamics of the flies using a susceptible-infected with death (hereafter, SID) model. We modeled transmission with two components. The first was constant daily infection probabilities arising from environmental transmission (parameter denoted as π0,C/X). The second component was a density-dependent component that depends on the numbers of susceptible and infected each day (parameter denoted as πC/X). Together, these terms combined to give the total probability of infection of a susceptible individual as . The population size in each vial also changed through time through death, accounted for in our model by age-dependent survival probabilities. All transitions used in infection experiments are illustrated in Fig 2A. Our model assumed that all individuals in a vial began in the susceptible class (S(t = 0) = 10) with no individuals in any infected class (IC/X(t = 0) = 0). Susceptible individuals could become infected with DCV or DXV with the daily probability of infection (πC/X), or they could die with the time-dependent daily probability of death (Ω(t)).

thumbnail
Fig 2. State transitions for the SID model (panel A) and the SIRD model (panel B).

Blue boxes denote the susceptible state (denoted as S). Red boxes in the SID model denote the infected state including either DCV (IC) or DXV (IX). The red box in the SIR model is the state for individuals infected with phocine distemper (I), while the green box is the recovered state (R). The probability of death is denoted as Ω(t), with appropriate subscript, and the probability of transitions from one state to another are denoted with Greek letters. Lines that begin and end in the same state denote the proportion of individuals that stay in the same state the following time step. Thus, in the SID model the probability of remaining in state S from time t−1 to time t is 1−ΩS(t)−ΠC/X(t). Since D is an absorbing state, all individuals remain in this state.

https://doi.org/10.1371/journal.pcbi.1010910.g002

The daily probability of death, Ω(t), was the probability of dying between day t−1 and day t, given that the individual survived up to day t−1. We denote the probability of an individual in state Z dying on day t in vial j as ΩZ,j(t), modeled as . The time-independent mortality term, αZ, was assumed to be state-dependent denoted as αS for susceptibles, αC for DCV-infecteds, and αX for DXV-infecteds. The regression parameters βage and βtransfer were assumed to be independent of infection status; βage accounted for the age of the fly and t denotes the days since the beginning of the experiment and βtransfer is the effect of the number of days after the last transfer (Transfer is the number of days, from 0 to 6, since the last vial transfer), as described in the experimental methods. These transfers occurred each week and we observed increases in fly mortality with the amount of time between transfers. This mortality pattern likely was due to the medium becoming sticky over time and trapping flies. Finally, ψT,j, was a treatment- and vial-specific random-effect, assumed to follow a normal distribution with unknown variance, where the treatment was either the mock, DCV-exposed, or DXV-exposed flies. In addition, we fitted a model with no random effect and compared the model fits using the deviance information criterion (DIC) [32].

The total number of infected individuals, IC/X(t), where the subscript C denotes DCV and X denotes DXV, on day t was modeled as the number of susceptibles on the previous day that survived and became infected plus the existing infecteds from the previous day that survived. The total number of deaths on day t, D(t), was then the sum of the deaths from both susceptible and infected individuals. Thus, the expected numbers of susceptibles, infecteds, and deaths on day t in a population exposed to a virus given the previous day’s population are:

We modeled the number of flies that survived daily in the mock experiment as a binomial distribution with probability of death following ΩS,j(t) and size parameter Nj(t−1), the observed number of individuals alive at the end of the previous day in vial j. For the infection model, the observed probability of death depended on both the probabilities of death of infected and susceptible individuals, weighted by their relative proportions. Thus, the probability of death in vial j followed a binomial distribution with probability Dj(t)/(Sj(t−1)+IC/X,j(t−1)) and size parameter Nj(t−1). Specifications for all priors in this model are given in S1 Table.

In order to determine whether our SID model was a reasonable model of disease dynamics, we tested it against a model with no SID dynamics. This treatment-specific model assumed that all individuals in an experiment had the same chance of dying each day, thus, unlike the SID model there was no differentiation between infected and noninfected individuals in the vial. In this treatment-specific model the daily mortality probability was modeled following the survival models for ΩZ,j(t) described above, including the effects of day, days since the last vial transfer, and a vial-specific random effect; however, this model did not include any differences between susceptible and diseased individuals. Thus, all individuals in a vial were assumed to have the same daily mortality probability. We also compared our SID model with both environmental and density dependent transmission to a model with just environmental transmission (i.e., πC/X = 0), since a priori we believed this to be the dominant mode of transmission. We compared the fits of these models using the deviance information criterion (DIC).

We fit all fly survival models in JAGS [33] using an adaptive phase of 103 draws, a burn-in of 104 draws, followed by 106 draws from the posterior, thinning these draws by every 100 for a total of 104 draws from the posterior distribution. We examined the posteriors graphically for convergence and checked the Gelman and Rubin convergence diagnostic [34]. We reported the posterior mean and the standard deviation of the posteriors. In order to determine whether flies in the experimental treatments differed from the mock flies, we performed pairwise comparisons between posterior distributions of the time-independent mortality terms (αS, αC, and αX). In order to assess parameter identifiability, we generated 10,000 datasets using the mean posterior estimates from the SID model. We then re-estimated the model for each of these simulated datasets and recorded the mean parameter estimate for each parameter.

Harbor seal study

Stranding surveys.

Phocine distemper virus is closely related to the canine distemper virus in the Paramyxoviridae family; both pathogens are in the same genus as measles virus [35]. An outbreak of phocine distemper in 2002 was detected in the North Sea population of harbor seals. First detected on Anhholt Island in Denmark in May, the virus spread into the Netherlands with the first detection on June 16, presumably by the highly mobile gray seals who share haul-outs with harbor seals [36]. The 2002 outbreak cost an estimated 30,000 harbor seals their lives, making this the largest recorded mass mortality event ever in marine mammals [36].

Data on seal stranding events were obtained from [37] and [38], who used a public database, waarneming.nl, that contained weekly stranding data collected by seal rescue centers in the region. Stranding events may or may not be lethal, though rescued individuals are removed from the population and brought to local seal rescue centers. We used WebPlotDigitizer software [39] to acquire data from published figures from [37] and [38], rounding all digitized values to the nearest integer value. The Dutch outbreak ended by December the same year as indicated by the return of stranding rates to pre-epidemic levels [37], which resulted in 25 weeks of stranding data. We also obtained annual harbor seal strandings from [38] for the three years before the 2002 outbreak (1999–2001) and the three years following the outbreak (2003–2005) to determine the average weekly stranding rates in the absence of the phocine distemper outbreak. We limited our study to these years because the harbor seal population was relatively constant over that period of time [38].

Phocine distemper model.

As in the fly experiments, we could not directly observe disease incidence in seals, only the change in strandings due to disease. It is the change in rate of strandings from the background rate that provides information on epidemic parameters. Because we were only able to observe strandings, deaths that did not lead to a stranding were not incorporated into this analysis. We model all stranding events as deaths, since seals were removed from the wild population.

We modeled seal disease dynamics using a susceptible-infected-recovered with death (SIRD) model (Fig 2B) that linked the latent infectious state of individuals in the population to changes in stranding rates through time. This model also incorporated the possibility of recovery from disease. In contrast to the fly survival model described above, which was parameterized using probabilities, we parameterized the SIRD model in terms of time scales to facilitate comparison with previously published results.

The probability of infection of a susceptible individual, πI, was modeled using a discrete frequency-dependent model [40], where β determines the transmission period, determining how long it takes for susceptibles to become infected each week. The probability of recovery was modeled as , where γ determines the time to recovery of an infected individual, while the probability of stranding of an infected (I) individual was , where ω determines the time it takes for an infected individual to become stranded. We calculated the expected time for these transition events to occur by assuming a geometric distribution for the transition event with a probability of success given by estimated transition probability (either πR or ΩI). We assumed the population size at the beginning of the outbreak was N(0) = 5400 individuals, following estimates developed from aerial surveys [38]. We treated the initial number of infecteds in the population, I0, as an estimated parameter since it has been hypothesized that the disease may have been introduced by gray seals from Denmark and subsequently passed to harbor seals throughout the North Sea [36].

We estimated the stranding probability of susceptible and recovered individuals (denoted as ΩS) using the annual number of strandings from the three years prior to the 2002 outbreak (1999–2001) and the three years following the outbreak (2003–2005). Because population size estimates were not available each year, we assumed that the population size was always 5400. We further assumed that stranding rates were constant through the year so that the estimated weekly number of strandings was the annual count divided by 52. The probability of stranding was then the average weekly number of strandings divided by 5400.

Combining these rates as in Fig 2B, our SIRD model structure was given by: with initial conditions given by S(0) = 5400−I0, I(0) = I0, and R(0) = 0. We assumed that the number of strandings on day t followed a negative binomial distribution with mean D(t) and size parameter τ. Specifications for all priors in this model are given in S2 Table. We used the same specifications and posterior checks as used in the fly SID models. For all parameter estimates, we report the posterior mean and the standard deviation. In order to assess parameter identifiability, we simulated 10,000 datasets from SIRD model using the posterior mean estimates. We then re-estimated the model for each of these simulated datasets and recorded the mean parameter estimates of each parameter.

Results

Fly survival

We detected similar mortality rates by the end of the survival experiment (day 35 post-infection) in all treatments. Total mortality was 60% in the mock treatment, 64% in the DCV treatment, and 70% in the DXV treatment. Observed treatment-level mortality rates throughout the experiment are presented in Fig 3 and indicate that the temporal pattern in mortality differed between treatments.

thumbnail
Fig 3. Points indicate the proportion of surviving individuals in each vial.

Transparency is used to indicate overlapping values. Colored lines are the mean estimated survival probabilities that is composed entirely of uninfected flies (panel A), infected with DCV (panel B), or infected with DXV (panel C). Shaded regions denote the 95% credible interval of the mean.

https://doi.org/10.1371/journal.pcbi.1010910.g003

In our comparison of a model with SID dynamics to the treatment-specific model, we found strong evidence that the SID model fit the data better than the model with treatment-specific (i.e., no SID dynamics) mortality rates (ΔDIC = 12.4). We also found the SID model with both environmental and density dependent transmission fit the data better than a model with environmental transmission only (ΔDIC = 3.3). Comparing SID models with and without random effects had nearly identical fits (ΔDIC = 0.9), indicating very little variation among vials within the same experimental treatment. Our inspection of the point and interval estimates of survival and infection probability were nearly identical between these models. Here, we present the results for the full model with random effects but these results hold for the model without random effects as well.

We found that the time-independent of the logit daily survival parameters (reported as mean (standard deviation)) for an uninfected individual () differed from DCV () or DXV () infected individuals (Fig 4). These estimates corresponded to odds ratios of death for the infected versus the susceptible classes was for DCV and for DXV. In both cases the credible interval of the odds ratio did not cover one, indicating a significantly higher odds of death for flies that are infected by DCV or DXV. The estimated time-dependent probability of mortalities are illustrated in Fig 3. In pairwise comparisons between the posteriors of the time-independent component of survival, we found that the 95% high density intervals (hereafter, HDI) of the difference between infected and uninfected estimates did not overlap zero while in the comparison between the infected classes the 95% HDI’s did overlap zero (Fig 4). Thus, while we could statistically distinguish between a susceptible individual and an individual infected with DCV or DXV, we could not distinguish between the effects of DCV and DXV infections.

thumbnail
Fig 4. Posterior distributions for the time-independent survival of uninfected individuals (αS), DCV infected individuals (αC), and DXV infected individuals (αX).

https://doi.org/10.1371/journal.pcbi.1010910.g004

Our estimates also indicated that survival declined as flies age ( per day) and the time between transfers increased ( per day). Additionally, our estimates of the probability of infection in the SID model found that the daily probability of becoming infected with DCV (environmental transmission probability, density dependent transmission probability, ) or DXV (environmental transmission probability, , density dependent transmission probability, ) were statistically similar, as the 95% HDI overlapped 0). All other parameter estimates are reported in S1 Table, while the simulation results (S1 Fig) indicated low bias in all model estimates.

Phocine distemper outbreak

After validating our approach in the fly system, we applied the same protocol to the SIRD model. Our estimated weekly stranding probability of an uninfected individual (reported as mean (standard deviation)) was , while the weekly probability that an infected individual became stranded was estimated as , giving an odds ratio of and the credible interval of this ratio did not cover 1. Thus, infected seals were nearly one-thousand times more likely to get stranded than uninfected seals. Our model adequately explained the location of the peak of infection, estimated to occur in week 13, and the subsequent decline of the number of strandings in the harbor seal population (Fig 5A). Over the 25 weeks of observations, our model simulated that over 80% of the population became infected with phocine distemper at some point, for those infected there was a 54% chance of becoming stranded (Fig 5B). This is consistent with mortality estimates of 60% in the 1988 outbreak of phocine distemper in harbor seals in the North Sea [41,42]. Finally, we estimated that the number of strandings at the end of the epidemic was 2198 individuals, about 40% of the total population.

thumbnail
Fig 5. Panel A illustrates the observed (grey points) and expected number of strandings from the SIRD model (black line).

All deaths are assumed to lead to strandings. Panel B illustrates the expected number of susceptible (blue), infected (red), recovered (green), and dead (stranded) seals (black) over the course of the outbreak.

https://doi.org/10.1371/journal.pcbi.1010910.g005

We estimated that the mean time to recovery was between 2.6 as 3.1 weeks (with reported as mean (standard deviation)), consistent with the mean length of the infected period of 15 days reported in [42]. The mean time to death once infected was estimated as 3.6.2 weeks ( weeks), similar to previous reports of 3 weeks until death in dogs infected with canine distemper as reported in [43], though this has not been directly measured in pinnipeds. We estimated the transmission period as weeks and the initial number of infecteds as individuals. The posterior distribution of the period of transmission, , was strongly correlated both with the estimated time until death, , (r = 0.80) and estimated the initial number of infecteds, (r = 0.82). Posterior estimates of all parameters are reported in S1 Table. Results from simulating datasets from the SIRD model using the estimated parameters indicated that parameter bias was low for all parameters other than the mean time to recovery. Here we found a bias in this parameter of 0.5. Applying this bias-correction yielded an estimate of

Discussion

Our results show that hidden variable modeling approaches can be useful for estimating the per-capita effects of disease outbreaks from commonly collected forms of monitoring data such as population abundances. Differences in survival in our experiment were subtle but strong enough to jointly estimate per-capita vital rates and disease prevalence. The increasing discrepancy between experimental treatments and the mock treatment over the course of the experiment allowed the SID model to determine how disease prevalence changes over time due to transmission of the pathogen. This in turn allowed us to accurately determine the per-capita effects of disease. However, in this case we could only estimate the proportion of individuals that became sick enough to affect their mortality because we did not observe a recovery phase of the outbreak that would provide information on the recovery rates of infected individuals despite the relatively short time to clear the virus that has been reported previously [17,30]. We may have not observed a recovery phase in our data because the length of the experiment may not have been sufficient to observe the decline in the number of infecteds that occurs after peak infection. This decline depends on not just the recovery rate but also the rate of infection. Alternatively, if infected individuals developed chronic health issues that affected their survival post-recovery, we would not expect to observe a recovery phase.

In contrast to the fly experiment where recovery was not observed, we were able to reconstruct both the 2002 phocine distemper epidemic and the population’s subsequent recovery. This recovery phase allowed us to determine the rate of recovery providing additional information about how many individuals became infected but did not die. Thus, while we were able to estimate the contribution of nonlethal infections in the phocine distemper outbreak we were unable to detect their effects in the DCV and DXV experiments. In both examples, we found epidemiological parameter estimates consistent with past studies. Thus, this modeling approach can yield accurate per-capita estimates of the effects of disease on vital rates such as survival from observational data. We also note that while we focused on survival in this study, it is also possible to model other monitored vital rates such as fecundity, to determine impacts of disease.

Another approach developed to determine the influence of disease on survival is the method of excess mortality [44]. This uses individual data to model the survival of patients relative to a background population. The method has found extensive use for determining the impact of cancer [45,46], infectious disease [47], impacts of surgery [48], and even natural disasters [49] on an individual’s survival. In contrast with our approach, the method of excess mortality requires that each individual’s infection status at the time of death is known while in the latent variable approach this status is unknown.

Our models were able to successfully infer the individual-level effects of infection from population-level survival data; however, two conditions must be met for hidden variable models to successfully estimate the impacts of disease on vital rates. First, the overall effect of infection on the population’s vital rate must be strong enough to detect a statistical difference from an uninfected population. Second, the underlying epidemiological dynamics must generate systematic variation in the degree of prevalence linked to the per-capita change in prevalence predicted by the epidemiological models. When both conditions are met, the individual-level effects of disease can be determined. In cases where these conditions are not met or when additional precision is needed, additional information on disease prevalence can be used to infer individual-level effects of disease. For example, tools such as serology, histopathology, immunohistochemistry and molecular diagnostic techniques such as qPCR and RT-qPCR have been commonly used to confirm the presence of phocine distemper virus in infected marine mammals [50,51]. These data, if properly collected, could be used to independently estimate disease prevalence. A limitation of the approach is that we found the estimated parameters tended to have significant uncertainty. One way to improve estimates would be to use past studies to develop informative priors to constrain estimates of epidemiological parameters [52].

While hidden-variable models show significant promise, there are also several important limitations when using this indirect approach. One important tradeoff is that inferences made using these models may be quite sensitive to the underlying modeling assumptions. Another potential factor to consider when applying these models is knowledge about disease introduction. We knew both the time of disease introduction and the number of initial infecteds in the fly experiment. In the harbor seal example, we assumed knowledge of the time of introduction based on previous reports but not the initial number of infecteds. Estimating the initial number of infecteds is approximately equivalent to estimating the time of the disease introduction when the number of initial infected is known. However, if both pieces of information are unknown, it is unclear whether the parameters could be uniquely estimated.

The nature of the disease needs to be considered when developing models to estimate different epidemiological parameters and to properly contextualize the results of these models. For example, the route of infection can modulate the severity of the infection in fruit flies. Intrathoracic injection elicits a systemic infection that results in the death of all flies by day 9 post-infection. In contrast, oral infections have found mortality rates of less than 10% by day 15 post-infection [23]. In line with previous studies [17,30], our DCV survival results suggest low mortality rates estimated by day 15 post-infection. Like DCV, intrathoracic injection with DXV results in 100% mortality by day 10 post-infection [26]. However, we found similar patterns of low mortality rates when the flies are orally infected with DXV. To our knowledge, our study is the first to determine the effects of oral inoculation with DXV on fly survival. These differences between the mortality rates elicited by route of infection are likely to be due to the type of interaction established between pathogen and host and to the immune response mounted by the host. For example, it has been demonstrated that the amount of DCV virus ingested by the flies is significantly higher than the viral dose injected to them, however, orally infected flies are able to start clearing DCV without using the classic antiviral mechanisms such as RNAi used by the fly immune system when infected by injection. Interestingly, orally infected flies start clearing the infection despite clear evidence of viral replication [23].

We expect that our hidden variable modeling approach will be valuable for wildlife health surveillance. While several methods exist to detect wildlife epidemics early, many rely on explicitly surveying infection levels [53,54] and may require substantial resources to implement. Our modeling approach could detect emerging epidemics from existing vital rate data that are often collected for management purposes. Proposed monitoring databases, such as the Marine Mammal Health Monitoring and Analysis Platform [55] have emerged with the goal of collecting monitoring data of mortality events at daily to weekly time intervals. Coupling latent variable models to these data will allow us to potentially explain anomalous mortality events due to disease and forecast the effects of disease outbreaks in real-time, providing powerful tools for interventions in sensitive populations.

Supporting information

S1 Table. Parameter definitions and estimates for the fly SID models.

https://doi.org/10.1371/journal.pcbi.1010910.s001

(DOCX)

S2 Table. Parameter definitions and estimates for the seal SIRD models.

https://doi.org/10.1371/journal.pcbi.1010910.s002

(DOCX)

S1 Fig. Histograms of 10,000 estimated values under a simulation experiment which generated data using from the mean of posterior estimates of the SID model described in the main text.

Red lines denote the original estimates from the dataset, used to conduct the simulations.

https://doi.org/10.1371/journal.pcbi.1010910.s003

(DOCX)

S2 Fig. Histograms of 10,000 estimated values under a simulation experiment which generated data using from the mean of posterior estimates of the SIRD model described in the main text.

Red lines denote the original estimates from the dataset, used to conduct the simulations.

https://doi.org/10.1371/journal.pcbi.1010910.s004

(DOCX)

Acknowledgments

We thank Ashley DeAguero and LuAnn Scott for their technical assistance. We thank Holly Wichman, and Craig Miller for their helpful feedback on experimental design and data analysis.

References

  1. 1. Lancaster G, Green M. Deprivation, ill-health and the ecological fallacy. J R Statist Soc A. 2002;165(2):263–278.
  2. 2. Vitense K, Hanson MA, Herwig BR, Zimmer KD, Fieberg J. Uncovering state-dependent relationships in shallow lakes using Bayesian latent variable regression. Ecol Appl. 2018;28(2):309–322. pmid:29083517
  3. 3. Ferguson JM, Hopkins JB, Witteveen BH. Integrating abundance and diet data to improve inferences of food web dynamics. Methods Ecol Evol. 2018;9(6):1581–1591.
  4. 4. Brooks ME, Clements C, Pemberton J, Ozgul A. Estimation of individual growth trajectories when repeated measures are missing. Am Nat. 2017;190(3):377–388. pmid:28829634
  5. 5. Grace JB, Pugesek BH. A structural equation model of plant species richness and its application to a coastal wetland. Am Nat. 1997;149(3):436–460.
  6. 6. Ionides EL, Bretó C, King AA. Inference for nonlinear dynamical systems. Proc Natl Acad Sci USA. 2006;103(49):18438–18443. pmid:17121996
  7. 7. Le Strat Y, Carrat F. Monitoring epidemiologic surveillance data using hidden Markov models. Stat Med. 1999;18(24):3463–3478. pmid:10611619
  8. 8. Watkins RE, Eagleson S, Veenendaal B, Wright G, Plant AJ. Disease surveillance using a hidden Markov model. BMC Med Inform Decis Mak. 2009;9:39. pmid:19664256
  9. 9. Caughley G, Gunn A. Conservation biology in theory and practice. 1st ed. Wiley-Blackwell; 1995.
  10. 10. Crain BJ, Tremblay RL, Ferguson JM. Sheltered from the storm? Population viability analysis of a rare endemic under periodic catastrophe regimes. Popul Ecol. 2019;61(1):74–92.
  11. 11. Jolles AE, Cooper DV, Levin SA. Hidden effects of chronic tuberculosis in african buffalo. Ecology. 2005;86(9):2358–2364.
  12. 12. Gorsich EE, Etienne RS, Medlock J, Beechler BR, Spaan JM, Spaan JR, et al. Opposite outcomes of coinfection at individual and population scales. Proc Natl Acad Sci U S A. 2018;115(29):7545–7550. pmid:29967175
  13. 13. Merkling SH, Bronkhorst AW, Kramer JM, Overheul GJ, Schenck A, Van Rij RP. The epigenetic regulator G9a mediates tolerance to RNA virus infection in Drosophila. PLoS Pathog. 2015;11(4):e1004692. pmid:25880195
  14. 14. Kapun M, Nolte V, Flatt T, Schlötterer C. Host range and specificity of the Drosophila C virus. PLoS One. 2010;5(8):e12421. pmid:20865043
  15. 15. Webster CL, Longdon B, Lewis SH, Obbard DJ. Twenty-five new viruses associated with the Drosophilidae (Diptera). Evol Bioinform Online. 2016;12:13–25. pmid:27375356
  16. 16. Chtarbanova S, Lamiable O, Lee ZK, Galiana D, Troxler L, Meignin C, et al. Drosophila C virus systemic infection leads to intestinal obstruction. J Virol. 2014;88(24):14057–14069. pmid:25253354
  17. 17. Ferreira ÁG, Naylor H, Esteves SS, Pais IS, Martins NE, Teixeira L. The Toll-dorsal pathway is required for resistance to viral oral infection in Drosophila. PLoS Pathog. 2014;10(12):e1004507. pmid:25473839
  18. 18. Merkling SH, van Rij RP. Analysis of resistance and tolerance to virus infection in Drosophila. Nat Protoc. 2015;10(7):1084–1097. pmid:26110714
  19. 19. Merkling SH, Overheul GJ, van Mierlo JT, Arends D, Gilissen C, van Rij RP. The heat shock response restricts virus infection in Drosophila. Sci Rep. 2015;5:12758. pmid:26234525
  20. 20. Gupta V, Stewart CO, Rund SSC, Monteith K, Vale FP. Costs and benefits of sublethal Drosophila C virus infection. J Evol Biol. 2017;30(7):1325–1335. pmid:28425174
  21. 21. Longdon B, Day JP, Alves JM, Smith SCL, Houslay TM, McGonigle JE, et al. Host shifts result in parallel genetic changes when viruses evolve in closely related species. PLoS Pathog. 2018;14(4):e1006951. pmid:29649296
  22. 22. Roberts KE, Hadfield JD, Sharma MD, Longdon B. Changes in temperature alter the potential outcomes of virus host shifts. PLoS Pathogen. 2018;14(10):e1007185. pmid:30339695
  23. 23. Mondotte JA, Gausson V, Frangeul L, Blanc H, Lambrechts L, Saleh MC. Immune priming and clearance of orally acquired RNA viruses in Drosophila. Nat Microbiol. 2018;3(12):1394–1403. pmid:30374170
  24. 24. Teninges D, Ohanessian A, Richard-Molard C, Contamine D. Isolation and biological properties of drosophila X virus. J Gen Virol. 1979;42(2):241–254.
  25. 25. Xu J, Cherry S. Viruses and antiviral immunity in Drosophila. Dev Comp Immunol. 2014;42(1):67–84. pmid:23680639
  26. 26. Zambon RA, Nandakumar M, Vakharia VN, Wu LP. The Toll pathway is important for an antiviral response in Drosophila. Proc Natl Acad Sci U S A. 2005;102(20):7257–7262. pmid:15878994
  27. 27. Jousset FX, Plus N, Croizier G, Thomas M. Existence chez Drosophila de deux groupes de Picornavirus de propriétés sérologiques et biologiques différentes. CR Acad Sci (Paris), Série D. 1972;275:3043–3046
  28. 28. Hedges LM, Johnson KN. Induction of host defence responses by Drosophila C virus. J Gen Virol. 2008;89:1497–1501. pmid:18474566
  29. 29. Bonning BC, Miller WA. Dicistroviruses. Annu Rev Entomol. 2010;55:129–150.
  30. 30. Mondotte JA, Saleh MC. Antiviral immune response and the route of infection in Drosophila melanogaster. Adv Virus Res. 2018;100:247–278. pmid:29551139
  31. 31. Roxström-Lindquist K, Terenius O, Faye I. Parasite-specific immune response in adult Drosophila melanogaster: a genomic study. EMBO Rep. 2004;5(2):207–212. pmid:14749722
  32. 32. Spiegelhalter DJ, Best NG, Carlin BP, van der Linde A. Bayesian measures of model complexity and fit. J R Stat Soc Series B Stat Methodol. 2002;64(4):583–639.
  33. 33. Plummer M. JAGS: A program for analysis of Bayesian graphical models using Gibbs sampling. Proceedings of the 3rd International Workshop on Distributed Statistical Computing (DSC 2003). 2003.
  34. 34. Gelman A, Rubin DB. Inference from Iterative Simulation Using Multiple Sequences. Stat Sci. 1992;7:457–472.
  35. 35. Kennedy JM, Earle JAP, Omar S, Abdullah H, Nielsen O, Roelke-Parker ME, et al. Canine and phocine distemper viruses: global spread and genetic basis of jumping species barriers. Viruses 2019;11(10):944. pmid:31615092
  36. 36. Härkönen T, Dietz R, Reijnders P, Teilmann J, Harding K, Hall A, et al. The 1988 and 2002 phocine distemper virus epidemics in European harbour seals. Dis of Aquat Organ. 2006;68(2):115–130. pmid:16532603
  37. 37. Rijks JM, Van de Bildt MW, Jensen T, Philippa JD, Osterhaus AD, Kuiken T. Phocine distemper outbreak, The Netherlands, 2002. Emerg Infect Dis. 2005;11(12):1945–1948. pmid:16485486
  38. 38. Brasseur S. and rehabilitation in numbers: population development and stranding data on the Dutch coasts 1990–2016; analysis of new data from a public database. Wageningen Marine Research; 2018. pp.36
  39. 39. Rohatgi A. Webplotdigitizer: Version 4.3. https://github.com/ankitrohatgi/WebPlotDigitizer 2020
  40. 40. Mills NJ, Getz WM. Modelling the biological control of insect pests: a review of host-parasitoid models. Ecol Modell. 1996;92(2–3):121–143.
  41. 41. Bergman A, Järplid B, Svensson BM. Pathological findings indicative of distemper in European seals. Vet Microbiol. 1990;23(1–4):331–341. pmid:2402876
  42. 42. Heide-Jorgensen MP, Harkonen T. Epizootiology of the seal disease in the eastern North Sea. J Appl Ecol. 1992;29(1):99–107.
  43. 43. Vandevelde M, Kristensen F, Kristensen B, Steck AJ, Kihm U. Immunological and pathological findings in demyelinating encephalitis associated with canine distemper virus infection. Acta Neuropathol. 1982;56:1–8. pmid:6175160
  44. 44. Hakulinen T, Tenkanen L. Regression analysis of relative survival rates. J R Stat Soc Ser C Appl Stat. 1987;36(3),309–317.
  45. 45. Belot A, Remontet L, Launoy G, Jooste V, Giorgi R. Competing risk models to estimate the excess mortality and the first recurrent-event hazards. BMC Med Res Methodol. 2011;11:78. pmid:21612632
  46. 46. Rubio FJ, Rachet B, Giorgi R, Maringe C, Belot A. On models for the estimation of the excess mortality hazard in case of insufficiently stratified life tables. Biostatistics. 2021;22(1):51–67. pmid:31135884
  47. 47. Wu A, Peng Y, Du X, Shu Y, Jiang T. Correlation of influenza virus excess mortality with antigenic variation: application to rapid estimation of influenza mortality burden. PLoS Comput Biol. 2010;6(8):e1000882. pmid:20711361
  48. 48. Kanis JA, Oden A, Johnell O, de Laet C, Jonsson B, Oglesby AK. The components of excess mortality after hip fracture. Bone. 2003;32(5):468–473. pmid:12753862
  49. 49. Rivera R, Rolke W. Modeling excess deaths after a natural disaster with application to Hurricane Maria. Stat Med. 2019;38(23):4545–4554. pmid:31321799
  50. 50. Burek Huntington KA, Gill VA, Berrian AM, Goldstein T, Tuomi P, Byrne BA, et al. Causes of mortality of northern sea otters (Enhydra lutris kenyoni) in Alaska from 2002 to 2012. Front Mar Sci. 2021;8:630582.
  51. 51. VanWormer E, Mazet JAK, Hall A, Gill VA, Boveng PL, London JM, et al. Viral emergence in marine mammals in the North Pacific may be linked to Arctic sea ice reduction. Sci Rep. 2019;9(1):15569. pmid:31700005
  52. 52. Osthus D, Hickmann KS, Caragea PC, Higdon D, Del Valle SY. Forecasting seasonal influenza with a state-space SIR model. Ann Appl Stat. 2017;11(1):202–224. pmid:28979611
  53. 53. Parnell S, Gottwald TR. Estimating the incidence of an epidemic when it is first discovered and the design of early detection monitoring. J Theor Biol. 2012;305:30–36. pmid:22480434
  54. 54. Ferguson JM, Langebrake JB, Cannataro VL, Garcia AJ, Hamman EA, Martcheva M, et al. Optimal sampling strategies for detecting zoonotic disease epidemics. PLoS Comput Biol. 2014;10(6):e1003668. pmid:24968100
  55. 55. Chan DKP, Tsui HCL, Kot BCW. Database documentation of marine mammal stranding and mortality: current status review and future prospects. Dis Aquat Organ. 2017;126(3):247–256. pmid:29160222
  56. 56. Ferguson J, Gonzalez-Gonzalez A, Kaiser J, Winzer S, Anast J, Ridenhour B, Miura T, Parent C. Data from: Hidden variable models reveal the effects of infection from changes in host survival. https://doi.org/10.5061/dryad.jm63xsjf9.