Improved estimation of time-varying reproduction numbers at low case incidence and between epidemic waves

doi:10.1371/journal.pcbi.1009347

Fig 1.

EpiFilter algorithm and relationship to other methods.

In the left panels we consider three ways of inferring the instantaneous or effective reproduction number at time s, R_s, from the incidence curve, (blue dots). The filtering solution produces the posterior distribution p_s from all data prior to time s. EpiEstim approximates this solution by using the subset of data in a window of size k into the past. Reverse-filtering considers the complementary part of the incidence curve, leading to r_s, which utilises data beyond s. The WT method, with future window k, approximates this type of solution. Smoothing uses all information from to generate q_s, which is precisely computed by EpiFilter. Blue windows show the portions of that inform on R_s for each of p_s, r_s and q_s while red windows highlight the subsets used by EpiEstim and the WT method. Double arrows indicate data used for constructing various posterior distributions, while square arrows pinpoint instances of those distributions at the edges of . In the right panels we summarise the construction of EpiFilter. We outline the main assumptions (the model box) and computations (the algorithm box) necessary for realising EpiFilter, which allow us to obtain the most informative and minimum mean squared error (MSE) smoothing posterior distribution q_s. See the main text for the specific equations employed in our implementation [14].

More »

Expand

Fig 2.

Small or waning epidemics.

We compare reproduction number estimates ( or ) and one-step-ahead incidence predictions ( or ) from APEestim with optimal window k*, EpiEstim with window k and EpiFilter with state noise η. We simulate 200 epidemics with low daily case numbers or long tails (long sequences of zero cases) using the standard renewal model (Eq (1)) for three scenarios, representative examples of which are given in A-C. The true R_s and I_s are in black. All mean estimates or predictions are in red and blue with 95% credible intervals. APEestim and EpiEstim use a Gam(1, 2) prior distribution and EpiFilter a grid with m = 2000, R_min = 0.01 and R_max = 10. In D we provide statistics of the MSE of these estimates (relative to R_s) and the PMSE of these predictions (relative to I_s) for all 200 runs. We find that EpiFilter is more robust to small incidence (better uncertainty), whereas the other approaches can quickly decay to their prior distribution. It achieves significantly smaller MSE (2–10 fold reductions) and comparable PMSE to APEestim (which is optimised for prediction).

More »

Expand

Fig 3.

Temporal statistics of small or waning epidemics.

We expand on the results from Fig 2D by decomposing the MSE and one-step-ahead PMSE statistics across the 200 simulated trajectories for every scenario in Fig 2. We do not consider the k = 31 EpiEstim example given its poor performance. We present the k = 7 case, which is the generally recommended EpiEstim setting. We observe that EpiFilter significantly improves on MSE throughout the epidemic trajectory (and not only in periods of low incidence) while maintaining comparable prediction accuracies. Coverage statistics for these scenarios, which are given in Fig C of the S1 Appendix, confirm that EpiFilter also consistently contains the true R_s and I_s values within its credible intervals.

More »

Expand

Fig 4.

Epidemics with multiple waves.

We compare reproduction number estimates ( or ) and one-step-ahead incidence predictions ( or ) from APEestim with optimal window k*, EpiEstim with window k and EpiFilter with state noise η. We simulate 200 epidemics with multiple waves of infection using the standard renewal model (Eq (1)) for three scenarios, representative examples of which are given in A-C. The true R_s and I_s are in black. All mean estimates or predictions are in red and blue with 95% equal tailed credible intervals. APEestim and EpiEstim use a Gam(1, 2) prior distribution and EpiFilter a grid with m = 2000, R_min = 0.01 and R_max = 10. In D we provide statistics of the MSE of these estimates (relative to R_s) and the PMSE of these predictions (relative to I_s) for all 200 runs. We find EpiFilter is best able to negotiate troughs between epidemic peaks and hence infer resurging infectious dynamics, achieving significantly smaller MSE (2–10 fold reductions) and comparable PMSE to APEestim (which is optimised for prediction).

More »

Expand

Fig 5.

Temporal statistics of epidemics with multiple waves.

We expand on the results from Fig 4D by decomposing the MSE and one-step-ahead PMSE statistics across the 200 simulated trajectories for every scenario in Fig 4. We do not consider the k = 31 EpiEstim example given its poor performance. We present the k = 7 case, which is the generally recommended EpiEstim setting. We observe that EpiFilter significantly improves on MSE throughout the resurgent epidemic trajectory (whether incidence is small or large) while maintaining comparable prediction accuracies. Coverage statistics for these scenarios, which are given in Fig D of the S1 Appendix, confirm that EpiFilter consistently contains the true R_s and I_s values within its credible intervals.

More »

Expand

Fig 6.

H1N1 influenza transmission in Baltimore (1918).

We compare APEestim (top), EpiEstim with recommended weekly window (middle) (both with Gam(1, 2) prior distribution) and EpiFilter (with m = 2000, η = 0.1, R_min = 0.01 and R_max = 10) on the H1N1 influenza dataset from [42]. We use a 5-day moving average filter, as in [42], to ameliorate known sampling biases. Estimates of reproduction numbers, R_s, and corresponding 95% equal tailed credible intervals are in red. One-step-ahead predictions of incidence, I_s, (with 95% credible intervals) are in blue with the actual incidence in black. We find that EpiFilter combines the benefits of APEestim and EpiEstim, achieving both good estimates and predictions.

More »

Expand

Fig 7.

COVID-19 transmission in New Zealand.

We compute smoothed and filtered reproduction number estimates, (red) and (blue) respectively, from the COVID-19 incidence curve for New Zealand (available at [43]) in the left panels. We use EpiFilter with m = 2000, η = 0.1, R_min = 0.01 and R_max = 10 with a uniform prior distribution over the grid . The top of 7A shows conditional mean estimates and 95% credible intervals for (red) and (blue). Vertical lines indicate the start and end of lockdown, a major intervention that was employed to halt transmission. The additional ‘future’ information used in smoothing has a notable effect. The bottom of 7A provides smoothed one-step-ahead predictions (blue, with 95% credible intervals) of the actual reported cases I_s (black). The inset gives the estimated probability of R_s ≤ 1. We observe a clear trend of subcritical transmission that eventually seeds a second wave by August. In 7B we compare EpiFilter with EpiEstim (using weekly windows) and APEestim (both with Gam(1, 2) priors) with all left subfigures presenting R_s estimates and right ones providing filtered I_s predictions. We observe that both APEestim and EpiEstim lead to largely unusable estimates that mask transmission trends, in sharp contrast to EpiFilter.

More »

Expand