Recalibrating probabilistic forecasts of epidemics
Fig 4
Improvement in mean log score, for the different recalibration methods.
Log score is averaged over all 27 forecasters in the FluSight, 9 seasons, 11 locations, and 29 weeks (higher log score is better). The ensemble recalibration method improves accuracy for every target.