Figures
Abstract
Our study conducts a comprehensive analysis of the Covid-19 pandemic in Brazil, spanning five waves over three years. We employed a novel Susceptible-Infected-Recovered-Dead-Susceptible (SIRDS) model with a fuzzy transition between epidemic periods to estimate time-varying parameters and evaluate case underreporting. The initial basic reproduction number (R0) is identified at 2.44 (95% Confidence Interval (CI): 2.42–2.46), decreasing to 1.00 (95% CI: 0.99–1.01) during the first wave. The model estimates an underreporting factor of 12.9 (95% CI: 12.5–13.2) more infections than officially reported by Brazilian health authorities, with an increasing factor of 5.8 (95% CI: 5.2–6.4), 12.9 (95% CI: 12.5–13.3), and 16.8 (95% CI: 15.8–17.5) in 2020, 2021, and 2022 respectively. Additionally, the Infection Fatality Rate (IFR) is initially 0.88% (95% CI: 0.81%–0.94%) during the initial phase but consistently reduces across subsequent outbreaks, reaching its lowest value of 0.018% (95% CI: 0.011–0.033) in the last outbreak. Regarding the immunity period, the observed uncertainty and low sensitivity indicate that inferring this parameter is particularly challenging. Brazil successfully reduced R0 during the first wave, coinciding with decreased human mobility. Ineffective public health measures during the second wave resulted in the highest mortality rates within the studied period. We attribute lower mortality rates in 2022 to increased vaccination coverage and the lower lethality of the Omicron variant. We demonstrate the model generalization by its application to other countries. Comparative analyses with serological research further validate the accuracy of the model. In forecasting analysis, our model provides reasonable outbreak predictions. In conclusion, our study provides a nuanced understanding of the Covid-19 pandemic in Brazil, employing a novel epidemiological model. The findings contribute to the broader discourse on pandemic dynamics, underreporting, and the effectiveness of health interventions.
Citation: Lima HS, Tupinambás U, Guimarães FG (2024) Estimating time-varying epidemiological parameters and underreporting of Covid-19 cases in Brazil using a mathematical model with fuzzy transitions between epidemic periods. PLoS ONE 19(6): e0305522. https://doi.org/10.1371/journal.pone.0305522
Editor: Marwan Al-Raeei, Damascus University, SYRIAN ARAB REPUBLIC
Received: August 10, 2023; Accepted: June 1, 2024; Published: June 17, 2024
Copyright: © 2024 Lima et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: The datasets produced in this study and the codes employed to generate the results, figures, and tables presented in this paper are accessible on the Zenodo repository https://doi.org/10.5281/zenodo.10499012.
Funding: H.S.L. was supported by the Scholarships for Employee Qualification Program of the Federal Institute of Northern Minas Gerais (https://ifnmg.edu.br/) [notice number 142/2022 PBQS-IFNMG], and F.G.G. was supported by National Council for Scientific and Technological Development (CNPq) (https://www.gov.br/cnpq/) [grant number 312991/2020-7]. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
1 Introduction
The impact of the Covid-19 pandemic on Brazil has been profound, with the country ranking second worldwide in reported deaths [1]. By December 2022, the national health authority had reported almost 36 million cases [2] and 703 thousand deaths [3], indicating the severity of the situation. Brazil experienced five distinct waves of the pandemic over the first three years, as presented in Fig 1, with 2021 emerging as the most lethal, presenting a peak of over 3,000 deaths per day and contributing to 60% of the total deaths during the period studied [3]. Notably, 2022 marked the highest peak in reported cases, with a drastic reduction in the Case Fatality Rate (CFR) compared to earlier stages. The emergence of the Omicron variant in early 2022, characterized by high transmissibility and lower severity, also shaped the dynamics of the pandemic [4–9]. We also highlight that at the beginning of January 2022, Brazil had 78% and 67% of the population vaccinated against Covid-19 with at least one dose and fully vaccinated, respectively [4].
(a) new reported cases and (b) new reported deaths. Data from DATASUS [2].
In this study, we aim to comprehensively reproduce the Covid-19 pandemic in Brazil over the initial three years, providing insights into the actual scale, including estimates for underreported cases. In this work, we refer to underreported cases as the number of infections estimated to have occurred in a population not captured by the surveillance system over a given time period. To achieve this, we implemented a modified Susceptible-Infected-Recovered-Dead-Susceptible (SIRDS) model, allowing us to infer changes in key epidemiological parameters: basic reproduction number (R0), Infection Fatality Rate (IFR), protected period (also named immunity period), and underreporting of cases. Our model introduces innovation by incorporating time-varying parameters through fuzzy transitions between epidemic periods. Furthermore, we explore the broader pandemic context in Brazil, examining the interplay between social isolation, vaccination coverage, and the emergence of coronavirus variants.
We conducted forecasting and sensitivity analyses to validate our model and applied it to data from countries beyond Brazil, ensuring its generalization for Spain, the United Kingdom, and the United States. During the first wave, we contextualized our findings with studies that estimated prevalence [10–16] and IFR [17–19] based in serological research. However, serological surveys have limitations in analyzing epidemics with multiple outbreaks [20], making epidemiological modeling crucial to estimating the true magnitude of prolonged epidemics. In this regard, our work relates to other studies that have presented time-varying models to reproduce changes in Covid-19 epidemiological parameters across outbreaks, considering factors like human mobility patterns, emerging variants, and vaccination [21–29]. Our work represents the first comprehensive investigation of the first three Covid-19 years in Brazil, contributing with novel insights to the existing literature. We present a comprehensive discussion of our results and a thorough review of related studies in Section 5.
Our study offers several contributions and findings, including:
- A comprehensive overview of the Covid-19 pandemic in Brazil spanning three years.
- Developing a modified SIRDS model for analyzing multi-outbreak epidemics, incorporating fuzzy transitions between epidemic periods.
- Estimation of time-varying epidemiological parameters such as R0, IFR, and immunity period.
- Generalization of our model with data from Spain, the United Kingdom, and the United States.
- Validation against robust evidence from serological surveys [10–16], demonstrating accuracy in reproducing observed trends and enhancing the reliability of our findings.
- Application of the model for forecasting outbreaks.
- Sensitivity analysis providing insights into the uncertainty of epidemiological parameters.
- Comparison of model simulations with officially reported data, revealing significant disparities and potential underreporting factors.
- Contribution to understanding the effectiveness of public health measures, such as reducing human mobility and mass vaccination, in controlling Covid-19 outbreaks. It provides evidence-based insights for formulating effective public policies and strategies to combat future outbreaks.
2 Background
During the early stages of the Covid-19 pandemic, the World Health Organization (WHO) had advised the adoption of non-pharmacological interventions to prevent the spread of the virus [30]. However, the federal government in Brazil failed to comply with these recommendations and neglected to coordinate a national response to the pandemic [31–35]. The scientific community heavily criticized President Jair Bolsonaro and his administration for promoting denialism [31–35], opposing containment measures [31, 32, 34], advocating for unproven Covid-19 treatments [31–35], discouraging vaccination [31], and downplaying the importance of wearing masks [31, 33].
Although the Brazilian federal government did not implement a national lockdown to control the spread of Covid-19, changes in the population’s mobility patterns during the pandemic were observed, either as a voluntary initiative or in response to measures taken by state and municipal governments, since the Federal Supreme Court allowed states and municipalities to take measures in favor of social isolation that oppose the federal government decision [32]. Fig 2 illustrates how the Covid-19 pandemic affected the Brazilian population’s mobility patterns. To track human mobility, we utilized data from the Covid-19 Community Mobility Report [36] produced by Google, which provided a measure of the percent change in time spent in residential places by its users in comparison to a baseline before the pandemic. We referred to this measure as the stay-at-home index (ΔH).
The dashed horizontal line highlights the baseline (ΔH = 0%). Data from Google Covid-19 Mobility Report [36].
Fig 2 shows that the first notable change occurred on March 17, 2020, when the country reported its first Covid-19 death. The first outbreak in 2020 was characterized by the highest peak in ΔH, even though the death peak occurred during 2021. It is important to note that although ΔH tends to converge to the baseline, there were periods when this trend was interrupted. For instance, during the January months in 2021 and 2022, corresponding to the school vacation period, and in mid-2021, coinciding with the peak of Covid-19 deaths, ΔH remained high.
The Auxílio Emergencial socioeconomic program was crucial in promoting social isolation and mitigating the spread of the coronavirus throughout 2020 [37, 38]. This program reached approximately 22% of the Brazilian population, paying a monthly mean of USD 154.86 from April to August 2020, continuing with reduced amounts after that period [37]. In parallel, Brazil also responded to the Covid-19 challenge by progressively increasing the number of Intensive Care Unit (ICU) beds dedicated to Covid-19 patients during the pandemic, as depicted in Fig 3 [39]. Although it does not directly impact disease transmission dynamics, this measure is a significant intervention that can potentially alleviate the IFR.
Data from DATASUS [39].
In January 2021, Brazil initiated its Covid-19 vaccination campaign. However, only in October 2021 did half of the population receive the complete dosage required for full vaccination against Covid-19 [4], as shown in Fig 4. This situation means that during the death peak in 2021, the worst moment of the pandemic in the country, the percentage of fully vaccinated individuals was still relatively low.
Data from Our World in Data [4].
Another relevant topic for understanding Covid-19 in Brazil is the occurrence of coronavirus variants in the country. We analyzed the prevalent coronavirus variants using data from GISAID, a global science initiative that provides open access to genomic data of SARS-CoV-2 [40]. Fig 5 shows that the emergence of the Gamma variant coincides with the second wave of the virus in the country. This variant was initially identified in the Brazilian state of Amazonas and has been associated with increased transmissibility, higher mortality, and immune evasion, resulting in reinfections and potentially reduced efficacy of vaccines and neutralizing antibodies [41]. During the middle of the second Covid-19 wave in Brazil, the Delta variant became the dominant variant. Like the Gamma variant, the Delta variant exhibits increased transmissibility, mortality, and immune evasion [42]. Subsequently, simultaneously with the emergence of the third wave in Brazil, the Omicron variant swiftly became the predominant strain. Characterized by many mutations compared to its predecessors, Omicron exhibits elevated transmissibility and the ability to evade the immune response [6, 7]. However, works indicate a lower severity regarding disease outcomes [5–9].
Relative frequency (%) of different Covid-19 variants over time, based on data from the GISAID database [40].
3 Material and methods
3.1 Data source
The initial source of Covid-19 data in Brazil was the Monitoring Panel [2], which provided timely updates throughout the pandemic, incorporating daily information on new cases and deaths. However, it is essential to acknowledge that this data reflects the date of reporting by health authorities rather than the onset of symptoms or death. This characteristic has resulted in delayed information and time series marked by artificial weekly seasonality, as depicted in Fig 1.
To address these limitations, we utilized the Mortality Information System [3] database from the Brazilian Health Ministry, which offers comprehensive details on all reported deaths in the country. Focusing on Covid-19 deaths, we filtered the dataset using the International Classification of Diseases (ICD) code B34.2 as the primary cause. Unlike the Monitoring Panel, this database reports the actual date of death occurrence, leading to a time series devoid of artificial seasonality, as illustrated in Fig 6b.
(a) New Covid-19 cases reported by onset symptoms date of Severe Acute Respiratory Syndrome (SARS) patients, sourced from DATASUS [43]. (b) New deaths of SARS patients reported by death date [43], new Covid-19 deaths reported in Mortality Information System by death date [3], and a 7-day rolling average of new Covid-19 deaths reported in Monitoring Panel by reporting date [2], all sourced from DATASUS.
Furthermore, the Brazilian Health Ministry provides the Severe Acute Respiratory Syndrome (SARS) database [43], containing severe cases of Covid-19. In Fig 6b, we present Covid-19 deaths reported for SARS patients, with this dataset recording the event date and avoiding artificial seasonality in death data. Regarding severe Covid-19 cases, Fig 6a shows that only a tiny fraction of Covid-19 cases escalated in severity compared to the reported cases in Fig 1a. The SARS database reports the onset date of symptoms, a crucial contribution to analyzing disease spread dynamics. Lastly, we observe a weekly seasonality in the onset date of symptoms, although this is lower than the seasonality observed for cases reported in the Monitoring Panel [2].
Our comprehensive analysis of the Covid-19 pandemic in Brazil spans the first three years, from 2020 to 2022. Table 1 summarizes the data obtained from the investigated databases. For estimating Rt, we utilized case data from the SARS database [43], as outlined in Section 3.2. Death data from the Mortality Information System [3] were used to fit the model proposed in this work. Additionally, case data from the Monitoring Panel [2] were employed to estimate underreporting, as detailed in Section 3.5.5. To determine the population size of Brazil, we used the 2022 Demographic Census [44] provided by the Brazilian Institute of Geography and Statistics (IBGE).
For broader applicability, we utilized Covid-19 data from Our World in Data [4] for Spain, the United Kingdom, and the United States. This dataset was employed to illustrate the generalization of our model, as presented in Section 3.5.2. S1 Appendix provides charts with time series for these countries.
3.2 Estimated reproduction number
The effective reproduction number (Rt) metric is essential for monitoring epidemics, providing insights into whether the disease is spreading or declining. Values above one indicate ongoing transmission, while values below one suggest a reduction in transmission. In our analysis of Covid-19 in Brazil, we calculated Rt using the time series of new cases from SARS patients [43]. This estimation involved the use of the epyestim library (https://github.com/lo-hfk/epyestim), a Python toolkit implementing the methodology proposed by Cori et al. [45].
To apply the epyestim method, we initially specified the distribution for the Covid-19 generation time, approximating it by the serial interval reported by Bi et al. [46] as ∼Gamma(2.29, 0.36), with an average of 6.36 days. Additionally, we defined the delay between infection and the onset of symptoms, representing the incubation period, as ∼Lognormal(1.57, 0.42), with an average time of 5.93 days [46].
Other parameters set in epyestim included a window size of 28 days to smooth the cases time series, a window size of 14 days for the final rolling average, one hundred bootstrap samples for estimating Rt, and a prior Rt ∼Gamma(9.90, 9.28). The epyestim estimates Rt after cumulative cases have reached at least 12, at least one mean generation has passed since the index case, and at least one window for the final rolling average has passed since the index case.
Fig 7 illustrates the time-varying Rt of Covid-19 in Brazil, showing that Rt exceeds one in different moments of the study period. The highest Rt values occurred in the initial pandemic phase. The period from late 2020 to mid-2021, often identified as the second wave in Brazil, is marked by recurrent Rt peaks, closely spaced and with lower amplitude than other periods of this pandemic. In contrast, 2022 exhibits three prominent Rt peaks with substantial amplitude and spaced widely in time.
The dashed horizontal line represents the reference value (Rt = 1) used to monitor epidemics. The Rt time series alternates between outbreak periods (solid line) and non-outbreak periods (dotted line). Nine outbreak periods, labeled from #0 to #8, were identified.
We inferred the basic reproduction number (R0) by considering the Rt peak during the early pandemic phase. Our analysis estimated R0 for Brazil at 2.52 (95% Confidence Intervals (CI): 2.34–2.71). Our estimated R0 aligns with other studies estimating R0 for Brazil, such as R0 = 3.10 (95% CI: 2.40–5.50) [47], R0 = 2.13 (95% CI: 0.81–3.04) [48], and R0 = 2.78 (95% CI: 1.9–3.81) for a serial interval mean of 4.8 days [27]. Additionally, there are approximations, such as R0 = 3.81 (95% CI: 2.73–4.81) for a serial interval mean of 6.48 days [27].
3.3 Covid-19 outbreaks
Successive waves of cases and deaths commonly characterize the dynamic of the Covid-19 pandemic. Usually, we describe that the pandemic begins in Brazil with an initial wave in 2020, followed by a second wave spanning the end of 2020 through 2021, and by three distinct waves throughout 2022. However, for enhanced modeling in this study, we opt for a more fine approach by segmenting the epidemic periods based on the emergence of outbreaks. Here, we define an outbreak as when the disease spreads actively, signifying a phase when the epidemic is out of control.
To identify these outbreaks, we analyze periods where the Rt remains above one for a minimum duration of seven consecutive days, allowing for a maximum of seven days below this threshold during the identified outbreak period.
Consequently, our analysis identified nine Covid-19 outbreaks in Brazil over the study period, as illustrated in Fig 7. There are two outbreaks during the initial wave, four during the second wave, and three in 2022.
3.4 SIRDS model with fuzzy epidemic period transitions
In this study, we introduce an epidemiological model that incorporates time-varying parameters and leverages fuzzy theory to enhance the smoothness of transitions between epidemic periods. We detail the critical components of the epidemic model in Section 3.4.1. Section 3.4.2 outlines the underlying assumptions that guided our decision to propose a model with time-varying parameters and fuzzy transitions. We present the implementation of the model in Section 3.4.3. Furthermore, Section 3.4.4 delves into the presentation of the objective function and the optimization method employed in our study.
3.4.1 SIRDS model core.
The SIRDS model is commonly used in the literature to model disease epidemics where immunity after infection is temporary and a recovered individual becomes susceptible again after some time [49], as illustrated in Fig 8.
Each compartment is denoted by a corresponding letter: S for Susceptible, I for Infected, R for Recovered, and D for Deceased. The model parameters include contact rate (β), recovery rate (γ), infection fatality probability (f), and immunity loss rate (ω).
The SIRDS model, with S representing susceptible individuals, I representing infected individuals, R representing recovered individuals, and D representing deceased individuals, follows this dynamics:
- The contact rate (β) at each time unit (t) determines the rate at which the I infect the S.
- The I become either R or D at each t according to the recovery rate (γ) and the infection fatality probability (f).
- The R become S at each t due to the immunity loss rate (ω).
The R0 is a key parameter in epidemic models, representing the average number of new infections generated by each infected individual introduced into a population with no prior immunity [50]. The others SIRDS parameters γ, β, f, and ω are specified in the equations below.
(1)
(2)
(3)
(4)
Being N = S(0) + I(0) + R(0), the compartments change over time following the four equations presented below [51, 52].
(5)
(6)
(7)
(8)
3.4.2 Assumptions.
In this study, we propose that epidemiological parameters change during a pandemic due to modifications in mobility patterns, mutations in virus properties (such as the emergence of new variants), and fluctuations in vaccination coverage.
We conducted an extensive set of 76,650 epidemic simulations spanning three years to validate these assumptions. These simulations covered a range of values for R0 (1 to 8). They included parameters such as an eight-day infectious period, a 1% IFR, a one-year protective immunity period, and an initial count of 0.14 infected individuals in a population of 100,000. Fig 9 shows the simulation results and sheds light on the dynamic patterns of Rt.
Each row corresponds to a specific R0 value. The charts on the left depict simulation outputs for the Susceptible (S), Infected (I), Recovered (R), and Deceased (D) compartments. On the right side, the charts display the observed effective reproduction number (Rt) over time, with a dashed horizontal line at the reference value (Rt = 1) used for epidemic monitoring.
In our empirical analysis of simulations with a static R0, we observed a pattern where each outbreak exhibited a lower Rt peak than its predecessor. This sequential decrease in Rt peaks continued until reaching a stable endemic equilibrium, as illustrated in Fig 9b and explained by Bjørnstad et al. [51].
While empirical analysis of simulations with static R0 revealed a consistent decrease in Rt peaks, aligning with a stable endemic equilibrium, real-world Covid-19 outbreaks displayed a different pattern. The dynamics of actual outbreaks did not conform to the consistent decrease seen in simulations, suggesting that factors like changes in mobility patterns [27, 48, 53] and the emergence of new variants [54, 55] contribute to variations in virus transmissibility, impacting the dynamic of R0.
We also observed a pattern in the initial outbreak, where Rt starts at the peak and then gradually declines following Eq 9 [53, 56, 57]. However, the initial outbreak coincided with a significant reduction in population mobility during the Covid-19 pandemic. Therefore, we hypothesize that the initial outbreak started with a transition pattern that changed even during the first outbreak for many locations. We conducted 18,625,950 simulations to investigate, reducing R0 by 10% to 50% at different points during the first outbreak.
(9)
We conducted a comparative analysis of the initial outbreak simulations, comparing those with variable R0 and their counterparts in simulations with constant R0. To assess the similarities between these scenarios, we employed the FastDTW algorithm [58]. FastDTW is a heuristic algorithm with a complexity of O(n), designed as an efficient alternative to the Dynamic Time Warping (DTW) algorithm. This approach produces a distance measurement and is widely utilized for evaluating the similarity between two-time series [58]. The assessed similarities are visualized in Fig 10a, with the lowest DTW similarity recorded at 0.215.
(a) Dynamic Time Warping (DTW) similarity for the first outbreak, contrasting synthetic samples with a change in basic reproduction number (R0) to their counterparts in synthetic samples without changing the R0. (b) Similarity between left and right sides for subsequent outbreaks in synthetic samples without changing the R0.
Moving beyond the initial outbreak, we observed that the Rt time series of the subsequent outbreaks present a curve similar to a bell-shaped one as presented in Fig 9. To quantify the similarity between the left and right sides of these curves, we utilized the following equation:
(10)
where b denotes the beginning point of an outbreak, p represents the Rt peak of an outbreak, and e signifies the end point of an outbreak. In this equation, when similarity > 0, the right side is higher than the left side; when similarity < 0, the left side is higher than the right side; and when similarity = 0, the left and right sides there is the same sum of Rt. The assessed similarities for simulations without R0 changes are presented in Fig 10b, with the similarity between sides ranging between -0.39 and 0.22.
Additionally, we noted fluctuations in the lethality of Covid-19 across the pandemic. Fig 11a and S3 Appendix illustrate a CFR reduction from the early stages to the end of the study period for Brazil and other countries. Recognizing the impact of underreporting on this measurement, especially during pandemic peaks, we also evaluated the CFR for SARS patients in Brazil (Fig 11b). Despite the impact of underreporting, our observation suggests a gradual decrease in the lethality of Covid-19 throughout the pandemic.
(a) General CFR calculated from cases reported in the Monitoring Panel [2] and deaths reported in the Mortality Information System [3]. (b) CFR calculated for Severe Acute Respiratory Syndrome (SARS) patients [43].
Our study assumes that epidemiological parameters change during different epidemic periods. Specifically, the transition between epidemic periods for the parameter β tends to be rapid, while that for the parameter f is more gradual. Although evidence regarding the transition between epidemic periods for immunity loss is lacking, our hypothesis suggests a slow transition similar to f.
3.4.3 Model implementation.
In this section, we describe the implementation details of our SIRDS model with fuzzy epidemic period transitions, building upon the epidemiological concepts introduced in Section 3.4.1 and the assumptions outlined in Section 3.4.2.
Our model incorporates the following parameters:
- Initial infected population (I(0))
- Recovery rate (γ)
- List representing contact rates for different epidemic periods (
)
- List representing infection fatality probabilities for different epidemic periods (
)
- List representing immunity loss rates for different epidemic periods (
)
- List denoting breakpoints for fast transition between epidemic periods (
)
- List representing transition days for smoothing fast transitions (
)
- List denoting breakpoints for slow transition between epidemic periods (
)
It is essential that the minimum size of the parameters ,
, and
be one. The parameters
and
are of the same length, both having one item less than
. Similarly,
and
have the same size, both having one item more than
.
Our model instantiates a fuzzy variable to represent a fast transition (μfast) between epidemic periods. The universe is the number of days in simulation. A fuzzy partition in this variable represents each epidemic period. These partitions are trapezoidal shapes whose peak, i.e. μfast = 1, extends from the correspondent breakpoint to the adjacent one. For each partition, the beginning is advanced by the correspondent transition days considering its peak beginning, and the end is delayed by the adjacent transition days considering its peak end. Thus, the total number of fuzzy partitions is .
Our model also instantiates a fuzzy variable to represent a slow transition (μslow) between epidemic periods. The universe also is the number of days in simulation. A fuzzy partition in this variable represents each epidemic period. These partitions are triangular shapes, beginning at the previous breakpoint, reaching the correspondent breakpoint at the top, i.e. μslow = 1, and ending at the next breakpoint. The exception is the last epidemic period, where there is a trapezoidal function with a shape starting at the previous breakpoint and reaching its peak from the correspondent breakpoint to the boundaries of the universe. Thus, the total number of fuzzy partitions is .
In our model, the fuzzy variables operate with parameters β, f, and ω to reproduce different epidemic periods. Eq (11) defines the inference mechanism for defuzzification at day t for a time-varying epidemic parameter (θ′) using a pair of a fuzzy variable (μ) with n partitions and a list of epidemic parameters (θ) also with n items.
(11)
The algorithm for the SIRDS model with fuzzy epidemic period transitions is summarized in Algorithm 1. The simulation function takes various parameters and computes the rates of change for susceptible (S), infected (I), recovered (R), and deceased (D) populations at a given time step t.
Algorithm 1 SIRDS model with fuzzy epidemic period transitions
1: procedure Simulation(t, S, I, R, D, γ, ,
,
, μfast, μslow
2: N ← S + I + R + D
3:
4:
5:
6: dS/dt = −βIS/N + ωR
7: dI/dt = βIS/N − γI
8: dR/dt = (1 − f)γI − ωR
9: dD/dt = γfI
10: return dS/dt, dI/dt, dR/dt, dD/dt
This algorithm provides a comprehensive view of the model dynamics, capturing the influence of fuzzy variables on epidemic parameters and their impact on the SIRDS compartmental model.
3.4.4 Parameter optimization.
We employed the stochastic differential evolution algorithm [59] from the Python Scipy library for effective model parameter fitting. The objective was to minimize the discrepancies between original data and simulations for both the death rate per 100,000 inhabitants in the 7-day moving average (M) from DATASUS [3] and the effective reproduction number (Rt) calculated in Section 3.2. To achieve this, we formulated a composite objective function using Eq (12):
(12)
where MAE denotes the mean absolute error measure, as defined by Eq (13). Here,
and
are the estimated mortality and effective reproduction numbers from our model, respectively.
and
represent the mean values of M and Rt, respectively.
(13)
We set the stochastic differential evolution algorithm with a maximum of 10,000 generations, a multiplier factor of five (so, the total population size is five times the quantity of model parameters), an update strategy for the best solution vector once per generation, the mutation strategy ‘best1bin’, a relative tolerance parameter of 0.01, an absolute tolerance parameter of 0 (a stringent convergence criterion), a mutation parameter range of (0.5, 1), a recombination parameter of 0.7, the Latin hypercube initialization strategy, and an additional local optimization step using the L-BFGS-B method after the global optimization process.
The input parameters for our model vary with the number of outbreaks. We first identify the outbreaks following Section 3.3. For each outbreak after the initial one, we define a breakpoint bfast to represent fast transitions between epidemic periods. We estimate R0 and assess the similarity between the first outbreak Rt and that estimated from its corresponding R0. If the similarity is higher than 0.22, we add an adjusted bfast to account for changes in disease transmissibility across the outbreak, as explained in Section 3.4.2. We also assess the similarity between sides of outbreak curves for the subsequent outbreaks. If not within the range (-0.39, 0.22), we add an adjusted bfast to capture changes in disease transmissibility across the outbreak, again, as presented in Section 3.4.2. We define an initial β and one β for each bfast, along with one τ for each bfast.
For representing a slow transition between epidemic periods, we define one breakpoint bslow for each outbreak after the initial one, considering that the interval between outbreaks is at least 180 days. Therefore, in practical terms, is bslow a subset of bfast. We set an initial f and ω, along with one f and ω for each bslow.
We optimize our model considering parameter bounds presented in Table 2, a population of 100,000 individuals, and the simulation period beginning at the start of the case time series. Empirically, we observed that our model performs better when γ is static, so we did not optimize this parameter using the stochastic differential evolution method, as detailed in Section 3.5.1. Additionally, it is crucial to highlight that bslow ⊂ bfast, and optimization of these parameters is unnecessary, as bfast has already been optimized.
3.5 Experiments
3.5.1 Fitting the recovery period.
By the insights from Voinsky et al. [62], indicating a Covid-19 infectious period spanning from 8 to 20 days, we systematically assessed these infectious periods within our model. To estimate the recovery rate (γ), we considered a range of values: . We conducted 20 model executions for each γ using Brazilian data. The maximum bound for the parameter initial quantity of infected population (I(0)) was set as the sum of case rates per 100,000 inhabitants until the 14th day in the first outbreak (i.e., 0.043).
The outcomes, illustrated in Fig 12a, underscore the optimal performance achieved by our proposed model when the infectious period is configured to eight days, corresponding to γ = 1/8, which we observed error of 0.165 (95% CI: 0.162–0.170). Furthermore, Fig 12b and 12c suggest that adopting an infectious period of 8 days incurs a slightly higher optimization cost. Importantly, this incremental cost is well-justified, contributing to refining and enhancing the model outcome. Therefore, all subsequent experiments in this study were conducted with γ = 1/8.
The boxplots depict: (a) the error in the objective function, (b) the number of iterations performed by the optimization algorithm, and (c) the number of objective function evaluations conducted by the optimization algorithm. In each boxplot, the lower and upper bounds represent the first and third quartiles, respectively. The horizontal line within the box indicates the median, while the whiskers extend to the minimum and maximum values within 1.5 times the interquartile range.
In Section 4, we succinctly present the outcomes for the optimal infection period. We evaluate the effectiveness of our model using two key metrics: the Sum of Squared Errors (SSE) and the coefficient of determination (R²), defined by Eqs (14) and (15), respectively.
(14)
(15)
3.5.2 Model application to other countries.
To apply our model to data from Spain, the United Kingdom, and the United States, we first estimated the Rt time series using the methodology outlined in Section 3.2. However, adjustments were necessary for the input data of these countries. Specifically, we estimated the onset of symptoms from the new death time series, considering a delay of 19 days between onset and death [60]. The S2 Appendix includes charts displaying Rt time series for these countries.
We conducted 20 pandemic simulations for each country, assuming an infectious period of eight days (γ = 1/8). In the cases of Spain and the United States, we set the maximum bound for the parameter I(0) as the sum of case rates per 100,000 inhabitants until the beginning of the first outbreak time series, resulting in 0.0084 and 0.0068, respectively. On the other hand, we set the maximum bound for the United Kingdom for the parameter I(0) to the same value as the minimum bound for this parameter, as presented in Table 2, and set the simulation period began at the peak date of Rt within the initial outbreak.
3.5.3 Forecasting outbreaks using the model.
We utilized our model to project the dynamics of Covid-19 outbreaks in Brazil. We conducted 20 simulations for each outbreak, fitting the model from the early pandemic moment until the 21st day and forecasting the following three months. We executed all simulations assuming an eight-day infectious period, and the remaining parameters were those employed in fitting the recovery period, as detailed in Section 3.5.1.
We compared the simulation outcomes with actual data by analyzing the objective function, Eq (12), for both fitting and forecasting samples. Also, we understand that the main contribution of a medium-term epidemic prediction is to provide insight into the total of deaths. So, we evaluated the Percentual Absolute Error (PAE), Eq (16), for the death prediction.
(16)
To showcase the benefits of extending the model fitting period further into within an outbreak, we experimented with fitting the model until the 28th, 35th, 42th, 49th, 56th, 63rd, and 70th days for the initial outbreak.
3.5.4 Parameter sensitivity assessment.
To evaluate the local sensitivity of our model parameters, we employed the One-Parameter-at-a-Time (OAT) method [63] using the simulations for Brazil data. The mean values estimated for the parameters in Section 3.5.1, corresponding to an infectious period of eight days (γ = 1/8), served as the baseline. For each parameter, we perturbed its value by 1%, 10%, and 50%, keeping other parameters fixed.
As defined in Eq (17), the absolute elasticity measurement was utilized to assess parameter sensitivity, considering the objective function as the output. An absolute elasticity greater than one indicates that the parameter is elastic, meaning higher sensitivity. In other words, a change in the parameter leads to a proportionally more significant change in the output. Conversely, an absolute elasticity of less than one suggests that the parameter is not elastic, indicating lower sensitivity, as a change in the parameter results in a proportionally smaller change in the output.
(17)
3.5.5 Estimating epidemiological measurements.
This section introduces measurements crucial for assessing the scale of a pandemic, as employed in this study. The Case Fatality Rate (CFR), defined by Eq (18), serves as a percentual lethality metric based on epidemiological data officially reported by health authorities. In contrast, the Infection Fatality Rate (IFR), expressed by Eq (19), serves as a percentual lethality metric derived from model-based estimations.
(18)
(19)
The underreporting factor, as estimated using Eq (20), represents the ratio between estimated infections by a model and reported cases by health authorities. It is essential to clarify that estimated infections encompass not only the reported cases but also the unreported cases and potentially any cases overreported by the health authorities.
(20)
4 Results
4.1 Retrospective analysis
In this retrospective analysis, we focus on the results obtained using our model with an eight-day recovery period (γ = 1/8) applied to Covid-19 data from Brazil. Fig 13 provides a comprehensive overview of our model outcomes, including comparisons with the original data for various aspects such as the Rt, new cases, new deaths, total cases, and total deaths.
(a) Model outcomes for an eight-day recovery period detailing the population compartments: Susceptible, Infected, Recovered, and Deceased. (b) Time series comparison between the effective reproduction number (Rt) estimated directly from reported Severe Acute Respiratory Syndrome (SARS) cases and Rt calculated by model simulations. (c) Time series comparison between new cases reported by health authorities and new infections in model simulations. (d) Time series comparison between new deaths reported by health authorities and new deaths in model simulations. (e) Time series comparison between cumulative cases reported by health authorities and cumulative infections in model simulations. (f) Time series comparison between cumulative deaths reported by health authorities and cumulative deaths in model simulations. Shaded regions depict the 95% Confidence Interval (CI).
Table 3 presents the quantitative evaluation of the model performance, demonstrating a low SSE and high R² for both Rt and new deaths, indicating a well-fitted model.
Fig 14a presents the model estimation of the R0 for Brazil, indicating an initial value of 2.44 (95% CI: 2.42–2.46). Notably, the model depicts a subsequent decrease in R0 to 1.00 (95% CI: 0.99–1.01) on May 18, 2020. Following this initial decline, the model suggests a successive increase in R0 with each outbreak, culminating in its peak value of 5.20 (95% CI: 4.83–5.41) observed in the last outbreak.
(a) Basic reproduction number (R0) varying with time (t). (b) Infection Fatality Rate (IFR) varying with t. (c) Days to loss of immunity (Ω) varying with t. Shaded regions depict the 95% Confidence Interval (CI).
Regarding the IFR, Fig 14b suggests an initial Covid-19 lethality in Brazil of 0.88% (95% CI: 0.81%–0.94%). The model further indicates a continuous reduction in IFR throughout the outbreaks, reaching its lowest value of 0.018% (95% CI: 0.011–0.033) in the last outbreak.
The parameter about the number of days to loss of immunity, Fig 14c, presented the highest uncertainty in our model. The initial immunity period was estimated to be 234 days (95% CI: 206–262), exhibiting an increasing trend, albeit with a reduction observed in mid-2021. The last studied outbreak revealed an immunity period of 341 days (95% CI: 327–352).
Our model also estimates that 63 people (95% CI: 58–68) were infected in Brazil when the health authorities reported the first Covid-19 case. For further insights, the S4 Appendix details the fuzzy variables used in this work to facilitate smooth transitions between epidemic periods.
In conclusion, our model reveals a substantial disparity between reported cases in Monitoring Panel [2] and simulated infections, estimating a factor of 12.9 (95% CI: 12.5–13.2) more infections than the officially notified cases by Brazilian health authorities until the end of 2022. When analyzing the data for each year, we found that the simulated infections were 5.8 (95% CI: 5.2–6.4), 12.9 (95% CI: 12.5–13.3), and 16.8 (95% CI: 15.8–17.5) times higher than the reported cases in 2020, 2021, and 2022 respectively.
4.2 Model generalization
Table 3 illustrates the broad applicability of our model across other countries. We observe reduced errors and well-fitted coefficients of determination for both reproduction number and new deaths. Notably, the simulations for the United States exhibit the highest level of fitting. Overall, the simulations align more closely with the rate of new deaths than the reproduction number. An overview of the simulations for Spain, the United Kingdom, and the United States is available in S6 Appendix.
4.3 Comparisons with serological research
Fig 15 illustrates that our simulations for the early pandemic moments align with national serological research for Brazil, the United Kingdom, and the United States. Notably, Spain is the only country where our simulation deviates significantly.
(a) Brazil: reported cases from DATASUS [2] and serological prevalence from Hallal et al. [10]. (b) Spain: reported cases from Our World in Data [4] and serological prevalence from Perez-Gómez et al. [11]. (c) United Kingdom: reported cases from Our World in Data [4] and serological prevalence from Public Health England [12–14]. (d) United States: reported cases from Our World in Data [4] and serological prevalence from Walker et al. [15] and Anand et al. [16]. Dashed lines are the simulated cumulative infections, and shaded regions depict the 95% Confidence Interval (CI).
Hallal et al. [10] conducted two seroprevalence surveys in 133 larger cities across all Brazilian states, utilizing random household and individual selection while excluding children under one year. The first survey, conducted from May 14–21, 2020, included 25,025 individuals, estimating a seroprevalence of 1.9% (95% CI: 1.7–2.1), and the second survey, conducted from June 4–7, 2020, included 31,165 individuals, estimating a seroprevalence of 3.1% (95% CI: 2.8–3.4). In comparison, our model estimated cumulative infections for Brazil as 2.36% (95% CI: 2.17–2.56) on May 14, 2020, and 4.06% (95%CI: 3.72–4.44) on June 04, 2020.
Perez-Gómez et al. [11] conducted a representative cohort study of the noninstitutionalized Spanish population, with 68,287 participants between April 27, 2020, and June 22, 2020. They estimated a seroprevalence in Spain of 6% (95% CI: 5.7–6.4) for this period. In contrast, our model estimated a prevalence of 15.5% (95% CI: 12.4–18.8) on April 27, 2020.
Public Health England conducted four serological surveys in the first wave in England [12–14], which serve as a reference for our simulations in the United Kingdom. The first two surveys [12] were based on healthy blood donors aged 17–69 years, aligning well with our simulations. However, the last two surveys [13, 14] included healthy blood donors aged 17 years and older, reducing seroprevalence and causing our model to estimate the prevalence higher than these surveys. Public Health England attributes this decrease in prevalence to demographic variations in the donor pool, such as the inclusion of donors aged 70 years and older, who were previously restricted during lockdown, and also considers waning antibodies as a potential contributing factor [13, 14].
Our prevalence simulations for the United States align with two serological surveys based on dialysis patients. Walker et al. [15] estimated a prevalence of 5.8% (95% CI: 5.4–6.2) among 12,932 dialysis patients from May 27, 2020, to July 1, 2020. Additionally, Anand et al. [16] estimated a prevalence of 9.3% (95% CI: 8.8–9.9) based on a sample of 28,503 randomly selected adult patients receiving dialysis during July 2020.
4.4 Forecasting analysis
Fig 16 provides a comprehensive summary of the three-month ahead forecasts generated by our model for the nine observed Covid-19 outbreaks in Brazil. While Table 4 indicates a generally good fit for the outbreaks, it also highlights an increasing error trend for predictions over more extended periods. The initial outbreak (Outbreak 0) had the highest error, suggesting insufficient data for accurate prediction. Accurately estimating new outbreaks during the forecasting horizon, as observed in Outbreaks 2, 3, and 4, poses a significant challenge.
Each row represents a distinct outbreak. Column (a) displays SIRDS simulation outputs for Susceptible (S), Infected (I), Recovered (R), and Deceased (D) compartments. Column (b) compares the effective reproduction number (Rt) from Section 3.2 with model simulations, including a dashed line at the reference value (Rt = 1). Column (c) contrasts the rate of new deaths per 100,000 inhabitants in a 7-day moving average from the Mortality Information System [3] with new deaths estimated by model simulations. Vertical dotted lines mark the 21st day inside the outbreak, the maximum fit date for forecasting the next 90 days. Shaded regions indicate the 95% Confidence Interval (CI).
Table 5 focuses on presenting the death errors. Notably, in the first month, our model exhibits a maximum PAE of 35% for six of the nine outbreaks in Brazil. However, in the second and third months, our model presents a maximum PAE of 50% for only four outbreaks. This reduction in performance highlights the challenge of forecasting epidemics for long-term periods.
Fig 17 illustrates that the predictive accuracy of our model improves when the fitting period encompasses more days within the outbreak. By fitting the initial outbreak until its 70th day, we achieved reduced death PAE, measuring 1.9% (95% CI: 1.2–2.6), 10.3% (95% CI: 7.4–13.3), and 12.7% (95% CI: 9.7–15.6) for the first, second, and third months, respectively.
Each line represents a day period within the first outbreak to mark the maximum adjustment date for forecasting the next 90 days, and the vertical dotted lines also highlight this date. Column (a) displays SIRDS simulation outputs for Susceptible (S), Infected (I), Recovered (R), and Deceased (D) compartments. Column (b) compares the effective reproduction number (Rt) from Section 3.2 with model simulations, including a dashed line at the reference value (Rt = 1). Column (c) contrasts the rate of new deaths per 100,000 inhabitants in a 7-day moving average from the Mortality Information System [3] with new deaths estimated by model simulations. Shaded regions indicate the 95% Confidence Interval (CI).
4.5 Sensitivity analysis
In Fig 18, the impact of perturbations in the first breakpoint parameter (b0) on the Covid-19 simulation for Brazil is evident, showcasing its significant sensitivity. Generally, breakpoint parameters demonstrate considerable sensitivity. Additionally, parameters related to transmissibility, denoted by β, exhibit sensitivity, with the last three showing relatively lower impact. Notably, the initial population infected (I(0)) is another parameter demonstrating sensitivity among the model variables.
Each row corresponds to a specific parameter θk, where k denotes the parameter associated with a particular Covid-19 outbreak. When is mentioned, it represents an adjustment parameter for atypical outbreak k. The numerical values in each cell represent the elasticity measured for θ under a specific perturbation. The parameters include the initial quantity of infected population I(0), the breakpoint indicating the start of an outbreak (b), transition days between epidemic periods for fast transitions (τ), contact rate (β), infection fatality rate probability (f), and immunity loss rate (ω). Empty cells indicate simulations with errors due to invalid parameter values.
Conversely, our model exhibits low sensitivity in two crucial epidemiological parameters, namely lethality (f) and loss of immunity (ω). About lethality, we observed some sensitivity for and f4 when perturbed by 50%. An additional parameter introduced in our model to facilitate smooth transitions between epidemic periods (τ) demonstrated low sensitivity, with
being the only instance where we observed some elasticity.
5 Discussion
In this study, we developed a novel mathematical epidemiological model with time-varying parameters driven by fuzzy transitions between epidemic periods. The formulation of this model originated from the considerations outlined in Section 3.4.2. We applied and validated our model using Covid-19 data from Brazil, Spain, the United Kingdom, and the United States, demonstrating its robustness and generalization capabilities (Section 4.2). Comparative analyses with seroprevalence research in Section 4.3 illustrated good model fit, aligning well with the available evidence. Furthermore, our model exhibited utility in forecasting Covid-19 outbreaks, as presented in Section 4.4. The sensitivity analysis in Section 4.5 emphasized the crucial role of the time-varying property in capturing the dynamics of the Covid-19 pandemic over three years, particularly concerning changes in transmissibility and breakpoints between epidemic periods. We use this model to present a retrospective analysis of the pandemic in Brazil in Section 4.1.
While several studies have leveraged fuzzy theory for Covid-19 pandemic modeling to account for complexity and uncertainties [64–68], our approach differs from others to employ fuzzy theory in the transitions between epidemic periods, turning the modeling of epidemic period transitions more aligned with our observations of the Covid-19 pandemic. This feature provides a comprehensive view of the epidemiological parameter dynamics, as evidenced in Fig 14, which depicts the time-varying trends for R0, IFR, and the protection period in the Brazilian context.
We used the proposed model in this work to reproduce the first three Covid-19 pandemic years in Brazil with great reasonability. The estimated R0 of 2.44 (95% CI: 2.42–2.46) aligns with other works [27, 48]. Notably, the time-varying R0 during the first wave is in line with the adjusted R0 of 0.91 (95% CI: 0.83–1.01) identified by Nouvellet et al. [27], reflecting the initial reduction in human mobility.
Marra and Quartin [17], using the same database of Hallal et al. [10], estimated the IFR for Brazil as 1.03% (95% CI: 0.88%–1.22%) for serological tests collected between May and June 2020, while our model estimated an IFR of 0.62% (95% CI: 0.57–0.68) for the period until June 24, 2020. Our estimate is consistent with other studies that investigated the IFR in the first wave, such as the study conducted by Picon et al. [18], which estimated an IFR of 0.60% (95% CI, 0.49%–0.74%) considering antibody prevalence in a Brazilian city, Silveira et al. [19], which estimated an IFR of 0.38% (95% CI: 0.21%–0.80%) considering antibody prevalence in a Brazilian state, and Verity et al. [60], which estimated an IFR of 0.66% (95% CI: 0.39%–1.33%) using early PCR tests in China.
The duration of protection against Covid-19 is still uncertain, as research suggests a fast decay of coronavirus antibodies [10]. While some studies indicate that antibody protection could last between five and seven months [69], others suggest that T and B cells may extend protection [70]. Furthermore, there is a risk of reinfection within 90 days of the previous infection [61, 71], and also the emergence of new variants that may escape the protection given by a previous infection or vaccine [7, 61].
Ferrante et al. [72] proposed an epidemic model that estimated a protection period of 240 days for Covid-19 based on data from the first and second waves in Manaus/AM. Similar to our model, that estimated a protection period of 234 days (95% CI: 206–262) for the early pandemic moments. Morris et al. [71] used SARS-CoV-2 genomic surveillance data from Johns Hopkins and found a median interval of 377 days between the first infection and reinfection. However, another empirical study showed that reinfection peaks with intervals of six months in South Africa [61].
The protection period is, without a doubt, the epidemiological parameter of Covid-19 with the most significant uncertainty, as can be seen in our simulation with data from Brazil in Fig 14 and for the other countries in S5 Appendix. Our sensitivity analysis reinforces this uncertainty, which shows that the protection period is a parameter with low sensitivity.
Estimating the underreporting of cases is a critical factor in comprehending the actual magnitude of an epidemic. In the early stages of the Covid-19 pandemic in Brazil, several studies focused on addressing this issue. For instance, Reis et al. [73] utilized a mathematical model and estimated that only 10% of Covid-19 infections in Brazil were reported until April 6, 2020. Our model suggests an even more significant underreporting, indicating that Brazilian health authorities reported approximately 4.3% (95% CI: 4.0–4.6) of infections during the same period. Bastos et al. [74], also employing a mathematical model, estimated a range of 8–16 times more infections than reported cases until May 31, 2020, which aligns with our model that estimated a factor of 15 (95% CI: 13–16) for the same period. Our findings are consistent with a machine learning model proposed by Noh and Danuser [75], which estimated that around 20% of infections were reported in Brazil until September 3, 2020.
We recognize that serological surveys are considered the most reliable method for assessing the underreporting of Covid-19 cases. While the comprehensive Covid-19 serological survey in Brazil conducted by Hallal et al. [10] did not explicitly calculate the underreporting ratio, we estimated an underreporting factor of 9.1 (95% CI: 8.2–10.0), based on their prevalence study on June 4–7, 2020. Our model approximated the factor derived from Hallal et al. [10], estimating a factor of 12.6 (95% CI: 11.6–13.9) until June 7, 2020.
It is important to note that serological surveys also have limitations as they cannot differentiate between historical and current infections or distinguish antibodies resulting from natural exposure and vaccination [20]. Conducting a serological survey after around three years (1,050 days) becomes increasingly challenging, and we have not found recent studies employing this method in the Brazilian context. Therefore, epidemiological modeling is crucial in analyzing a prolonged epidemic such as Covid-19 in Brazil. Our proposed model competes well with solid epidemiological evidence noted in the first wave and successfully captures the mortality rate trends and Rt in other outbreaks. To our knowledge, our model is the first comprehensive investigation of the first three years of Covid-19 in Brazil.
Still, regarding the underreporting of Covid-19 cases in Brazil, our model suggests an increase in underreporting factors after the first pandemic year. We conjecture that the model caught an under-ascertainment phenomenon, where infected individuals do not seek healthcare [20]. Several factors likely contributed to this population behavior, including a sense of reduced risk among the population due to vaccination [1, 76], the predominance of mild Omicron infections [7], and the availability of self-tests [77].
The official data about Brazil revealed a 79% reduction in CFR during the 2022 year compared to the first pandemic year. Our model estimated an even more substantial 94% (95% CI: 93–95) reduction in IFR for the same period, likely associated with increased underreporting during the Omicron phase, as previously discussed. Likewise, for the Omicron phase, other mathematical models by Xavier et al. [28] estimated a 41% reduction in IFR for Brazil, Liu et al. [21] reported a 78.7% reduction (95% CI: 66.9%–85.0%) in South Africa, and Yan and Shaman [22] observed IFR reductions in nine South African provinces.
5.1 Limitations
While our work effectively captures the dynamic changes in epidemiological parameters across outbreaks, we must recognize certain limitations inherent in our modeling approach.
Firstly, we maintained a fixed recovery period of 8 days (γ = 1/8), as discussed in Section 3.5.1. Our empirical observation drove this decision that introducing time variability to γ significantly increased computational costs for parameter optimization without yielding substantial improvements in model outcomes.
Moreover, our model does not explicitly include an Exposed (E) compartment, despite the common suggestion in the literature to incorporate an incubation period for modeling Covid-19 [78–80]. We addressed this limitation by assuming that our compartmentalization of the Infected (I) compartment implicitly considers aspects of asymptomatic and pre-symptomatic infections. Our empirical observation indicates that the model aligns well with the observed series of deaths and Rt even with this simplification.
It is crucial to note that we relied on notification dates reported by health authorities rather than the actual event dates for data from Spain, the United Kingdom, and the United States. This divergence in data sources introduces uncertainties and may impact the precision of our generalization analyses.
6 Conclusion
We conclude that Brazil effectively controlled the initial wave of Covid-19 by reducing the Rt. However, during the second wave, the country encountered challenges in rapidly reducing Rt as it had in the first wave, resulting in the highest proportion of Covid-19 deaths occurring during this period. Our model estimates that, during the second wave, the recovered population surpassed the susceptible population for the first time, indicating significant population exposure to the virus across in this wave. The combination of it with a 77% vaccination rate against Covid-19 [1] and evidence of the Omicron variant being less lethal [7], likely contributed to the substantial reduction in the IFR observed in 2022, despite the high number of infections.
To summarize, public health measures such as reducing human mobility and mass vaccination have been effective against Covid-19 outbreaks, as evidenced by the data in Brazil. We observed a significant reduction in human mobility in the early moment, and in 2022, the country already had a significant proportion of the population vaccinated. However, adequate public health measures were absent in the second wave, leading to elevated mortality rates.
Furthermore, this study introduces a novel mathematical epidemiological model with time-varying parameters driven by fuzzy transitions between epidemic periods, demonstrating its ability to capture the pandemic dynamics over three years. The methodology and insights presented here can inform policymakers in shaping effective strategies to combat future epidemic outbreaks.
Our future research efforts include extending the model application to estimate epidemic parameters individually at the municipal level, enabling more accurate quantification of intervention effectiveness, such as social isolation and vaccination. Additionally, we aim to conduct a deeper forecasting analysis using our model by leveraging larger datasets with more location samples and conducting benchmark comparisons with other machine-learning models across multiple Covid-19 outbreaks.
Supporting information
S1 Appendix. Epidemiological time series of Covid-19 for Spain, the United Kingdom, and the United States.
https://doi.org/10.1371/journal.pone.0305522.s001
(ODT)
S2 Appendix. Effective reproduction number (Rt) for Spain, the United Kingdom, and the United States.
https://doi.org/10.1371/journal.pone.0305522.s002
(ODT)
S3 Appendix. Time series of Covid-19 Case Fatality Rate (CFR) for Spain, the United Kingdom, and the United States.
https://doi.org/10.1371/journal.pone.0305522.s003
(ODT)
S4 Appendix. Fuzzy variables fitted for smoothing transitions between epidemic periods.
https://doi.org/10.1371/journal.pone.0305522.s004
(ODT)
S5 Appendix. Time-varying model parameters fitted for Covid-19 in Spain, United Kingdom, and United States.
https://doi.org/10.1371/journal.pone.0305522.s005
(ODT)
S6 Appendix. Comprehensive analysis of simulation results for Covid-19 in Spain, United Kingdom, and United States.
https://doi.org/10.1371/journal.pone.0305522.s006
(ODT)
Acknowledgments
This work was carried out with the support of the Coordination for the Improvement of Higher Education Personnel—Brazil (CAPES) through the Academic Excellence Program (PROEX), the National Council for Scientific and Technological Development (CNPq), the Foundation for Research of the State of Minas Gerais (FAPEMIG), and the Qualification Program of the Federal Institute of Northern Minas Gerais (PBQS-IFNMG). We also acknowledge the MINDS Laboratory (https://minds.eng.ufmg.br/), the {ci∂ic} Laboratory (http://cidic.ifnmg.edu.br/), and the Future Lab from UFMG for their valuable support and resources.
References
- 1. Dong E, Du H, Gardner L. An interactive web-based dashboard to track COVID-19 in real time. The Lancet infectious diseases. 2020;20(5):533–534. pmid:32087114
- 2.
DATASUS. Painel de casos de doença pelo coronavírus 2019 (COVID-19) no Brasil pelo Ministério da Saúde; 2020 [cited 2023 November 23]. Database: Gov.BR [Internet]. Available from: https://covid.saude.gov.br/.
- 3.
DATASUS. Sistema de Informação sobre Mortalidade—SIM; 2022 [cited 2023 January 23]. Database: Gov.BR [Internet]. Available from: https://opendatasus.saude.gov.br/fa_IR/dataset/sim.
- 4.
Ritchie H, Mathieu E, Rodés-Guirao L, Appel C, Giattino C, Ortiz-Ospina E, et al. Coronavirus Pandemic (COVID-19); 2020 [cited 2023 July 14]. In: Our World in Data [Internet]. Available from: https://ourworldindata.org/coronavirus.
- 5. Beppu H, Fukuda T, Otsubo N, Akihisa T, Kawanishi T, Ogawa T, et al. Comparative outcomes of hemodialysis patients facing pre-Omicron and Omicron COVID-19 epidemics. Therapeutic Apheresis and Dialysis. 2024;28(1):51–60. pmid:37724487
- 6. Rana R, Kant R, Huirem RS, Bohra D, Ganguly NK. Omicron variant: Current insights and future directions. Microbiological Research. 2022;265:127204. pmid:36152612
- 7. Nyberg T, Ferguson NM, Nash SG, Webster HH, Flaxman S, Andrews N, et al. Comparative analysis of the risks of hospitalisation and death associated with SARS-CoV-2 omicron (B. 1.1. 529) and delta (B. 1.617. 2) variants in England: a cohort study. The Lancet. 2022;399(10332):1303–1312. pmid:35305296
- 8. Ward IL, Bermingham C, Ayoubkhani D, Gethings OJ, Pouwels KB, Yates T, et al. Risk of covid-19 related deaths for SARS-CoV-2 omicron (B.1.1.529) compared with delta (B.1.617.2): retrospective cohort study. BMJ. 2022;378. pmid:35918098
- 9. Lorenzo-Redondo R, Ozer EA, Hultquist JF. Covid-19: is omicron less lethal than delta? BMJ. 2022;378. pmid:35918084
- 10. Hallal PC, Hartwig FP, Horta BL, Silveira MF, Struchiner CJ, Vidaletti LP, et al. SARS-CoV-2 antibody prevalence in Brazil: results from two successive nationwide serological household surveys. The Lancet Global Health. 2020;8(11):e1390–e1398. pmid:32979314
- 11. Pérez-Gómez B, Pastor-Barriuso R, Fernández-de Larrea N, Hernán MA, Pérez-Olmeda M, Oteo-Iglesias J, et al. SARS-CoV-2 Infection During the First and Second Pandemic Waves in Spain: the ENE–COVID Study. American Journal of Public Health. 2023;113(5):533–544. pmid:36893370
- 12.
Public Health England. National COVID-19 surveillance report: 9 July 2020 (week 28); 2020 [cited 6 December 2023]. In: Gov.UK [Internet]. Available from: https://www.gov.uk/government/publications/national-covid-19-surveillance-reports.
- 13.
Public Health England. National COVID-19 surveillance report: 11 September 2020 (week 37); 2020 [cited 6 December 2023]. In: Gov.UK [Internet]. Available from: https://www.gov.uk/government/publications/national-covid-19-surveillance-reports.
- 14.
Public Health England. National COVID-19 surveillance report: 2 October 2020 (week 40); 2020 [cited 6 December 2023]. In: Gov.UK [Internet]. Available from: https://www.gov.uk/government/publications/national-covid-19-surveillance-reports.
- 15. Walker AG, Sibbel S, Wade C, Moulton N, Marlowe G, Young A, et al. SARS-CoV-2 antibody seroprevalence among maintenance dialysis patients in the United States. Kidney Medicine. 2021;3(2):216–222. pmid:33564746
- 16. Anand S, Montez-Rath M, Han J, Bozeman J, Kerschmann R, Beyer P, et al. Prevalence of SARS-CoV-2 antibodies in a large nationwide sample of patients on dialysis in the USA: a cross-sectional study. The Lancet. 2020;396(10259):1335–1344.
- 17. Marra V, Quartin M. A Bayesian estimate of the early COVID-19 infection fatality ratio in Brazil based on a random seroprevalence survey. International Journal of Infectious Diseases. 2021;111:190–195. pmid:34390858
- 18. Picon RV, Carreno I, da Silva AA, Mossmann M, Laste G, de Campos Domingues G, et al. Coronavirus disease 2019 population-based prevalence, risk factors, hospitalization, and fatality rates in southern Brazil. International Journal of Infectious Diseases. 2020;100:402–410. pmid:32949778
- 19. Silveira MF, Barros AJ, Horta BL, Pellanda LC, Victora GD, Dellagostin OA, et al. Population-based surveys of antibodies against SARS-CoV-2 in Southern Brazil. Nature Medicine. 2020;26(8):1196–1199. pmid:32641783
- 20. Gibbons CL, Mangen MJJ, Plass D, Havelaar AH, Brooke RJ, Kramarz P, et al. Measuring underreporting and under-ascertainment in infectious disease datasets: a comparison of methods. BMC public health. 2014;14(1):1–17. pmid:24517715
- 21. Liu Y, Yu Y, Zhao Y, He D. Reduction in the infection fatality rate of Omicron variant compared with previous variants in South Africa. International Journal of Infectious Diseases. 2022;120:146–149. pmid:35462038
- 22. Yang W, Shaman JL. COVID-19 pandemic dynamics in South Africa and epidemiological characteristics of three variants of concern (Beta, Delta, and Omicron). eLife. 2022;11:e78933. pmid:35943138
- 23. Ferrante L, Duczmal LH, Capanema E, Steinmetz WAC, Almeida ACL, Leão J, et al. Dynamics of COVID-19 in Amazonia: a history of government denialism and the risk of a third wave. Preventive medicine reports. 2022;26:101752. pmid:35242505
- 24. Ghosh K, Ghosh AK. Study of COVID-19 epidemiological evolution in India with a multi-wave SIR model. Nonlinear Dynamics. 2022;109(1):47–55. pmid:35502431
- 25. Xu J, Tang Y. Bayesian framework for multi-wave COVID-19 epidemic analysis using empirical vaccination data. Mathematics. 2021;10(1):21.
- 26. Romanescu R, Hu S, Nanton D, Torabi M, Tremblay-Savard O, Haque MA. The effective reproductive number: modeling and prediction with application to the multi-wave Covid-19 pandemic. Epidemics. 2023; p. 100708. pmid:37499586
- 27. Nouvellet P, Bhatia S, Cori A, Ainslie KE, Baguelin M, Bhatt S, et al. Reduction in mobility and COVID-19 transmission. Nature communications. 2021;12(1):1090. pmid:33597546
- 28. Ribeiro Xavier C, Sachetto Oliveira R, da Fonseca Vieira V, Lobosco M, Weber dos Santos R. Characterisation of omicron variant during COVID-19 pandemic and the impact of vaccination, transmission rate, mortality, and reinfection in South Africa, Germany, and Brazil. BioTech. 2022;11(2):12. pmid:35822785
- 29. Abolpour R, Siamak S, Mohammadi M, Moradi P, Dehghani M. Linear parameter varying model of COVID-19 pandemic exploiting basis functions. Biomedical Signal Processing and Control. 2021;70:102999. pmid:34306169
- 30.
World Health Organization. Covid-19 strategy update, 14 April 2020; 2020 [cited 2023 July 14]. In: World Health Organization [Internet]. Available from: https://www.who.int/publications/m/item/covid-19-strategy-update.
- 31. Ferrante L, Duczmal L, Steinmetz WA, Almeida ACL, Leão J, Vassão RC, et al. How Brazil’s President turned the country into a global epicenter of COVID-19. Journal of Public Health Policy. 2021;42(3):439–451. pmid:34453095
- 32. Barberia LG, Gómez EJ. Political and institutional perils of Brazil’s COVID-19 crisis. The Lancet. 2020;396(10248):367. pmid:32738938
- 33. Lasco G. Medical populism and the COVID-19 pandemic. Global Public Health. 2020;15(10):1417–1429. pmid:32780635
- 34. Ricard J, Medeiros J. Using misinformation as a political weapon: COVID-19 and Bolsonaro in Brazil. Harvard Kennedy School Misinformation Review. 2020;1(3).
- 35. Lotta G, Wenham C, Nunes J, Pimenta DN. Community health workers reveal COVID-19 disaster in Brazil. The Lancet. 2020;396(10248):365–366. pmid:32659212
- 36.
Google. Google COVID-19 Community Mobility Reports; 2020 [cited 2023 May 5]. In Google [Internet]. Available from: https://www.google.com/covid19/mobility/.
- 37. Albani VV, Albani RA, Bobko N, Massad E, Zubelli JP. On the role of financial support programs in mitigating the SARS-CoV-2 spread in Brazil. BMC Public Health. 2022;22(1):1781. pmid:36127657
- 38. de Leon FLL, Malde B, McQuillin B. The effects of emergency government cash transfers on beliefs and behaviours during the COVID pandemic: Evidence from Brazil. Journal of Economic Behavior & Organization. 2023;208:140–155. pmid:36874911
- 39.
DATASUS. Tabnet; 2020 [cited 2024 January 11]. Database: Gov.BR [Internet]. Available from: https://datasus.saude.gov.br/informacoes-de-saude-tabnet/.
- 40. Khare S, Gurry C, Freitas L, Schultz MB, Bach G, Diallo A, et al. GISAID’s role in pandemic response. China CDC weekly. 2021;3(49):1049. pmid:34934514
- 41. Zimerman RA, Ferrareze PAG, Cadegiani FA, Wambier CG, Fonseca DdN, De Souza AR, et al. Comparative genomics and characterization of SARS-CoV-2 P. 1 (gamma) variant of concern from Amazonas, Brazil. Frontiers in medicine. 2022;9:141. pmid:35242782
- 42. Tian D, Sun Y, Zhou J, Ye Q. The global epidemic of the SARS-CoV-2 delta variant, key spike mutations and immune escape. Frontiers in immunology. 2021; p. 5001. pmid:34917076
- 43.
DATASUS. Banco de Dados de Síndrome Respiratória Aguda Grave—incluindo dados da COVID-19; 2022 [cited 2023 November 23]. Database: Gov.BR [Internet]. Available from: https://opendatasus.saude.gov.br.
- 44.
Instituto Brasileiro de Geografia e Estatística. Censo 2022; 2022 [cited 29 March 2023]. Database: Gov.BR [Internet]. Available from: https://censo2022.ibge.gov.br/.
- 45. Cori A, Ferguson NM, Fraser C, Cauchemez S. A new framework and software to estimate time-varying reproduction numbers during epidemics. American Journal of Epidemiology. 2013;178(9):1505–1512. pmid:24043437
- 46. Bi Q, Wu Y, Mei S, Ye C, Zou X, Zhang Z, et al. Epidemiology and transmission of COVID-19 in 391 cases and 1286 of their close contacts in Shenzhen, China: a retrospective cohort study. The Lancet Infectious Diseases. 2020;20(8):911–919. pmid:32353347
- 47. de Souza WM, Buss LF, Candido DdS, Carrera JP, Li S, Zarebski AE, et al. Epidemiological and clinical characteristics of the COVID-19 epidemic in Brazil. Nature Human Behaviour. 2020;4(8):856–865. pmid:32737472
- 48. Arroyo-Marioli F, Bullano F, Kucinskas S, Rondón-Moreno C. Tracking R of COVID-19: A new real-time estimation using the Kalman filter. PLoS ONE. 2021;16(1):e0244474. pmid:33439880
- 49. Gallos LK, Fefferman NH. The effect of disease-induced mortality on structural network properties. PLoS ONE. 2015;10(8):e0136704. pmid:26313926
- 50. Bjørnstad ON, Shea K, Krzywinski M, Altman N. Modeling infectious epidemics. Nature Methods. 2020;17(5):455–456. pmid:32313223
- 51. Bjørnstad ON, Shea K, Krzywinski M, Altman N. The SEIRS model for infectious disease dynamics. Nature methods. 2020;17(6):557–559. pmid:32499633
- 52. Bastos SB, Cajueiro DO. Modeling and forecasting the early evolution of the Covid-19 pandemic in Brazil. Scientific Reports. 2020;10(1):19457. pmid:33173127
- 53. Nishiura H, Chowell G. The effective reproduction number as a prelude to statistical estimation of time-dependent epidemic trends. Mathematical and statistical estimation approaches in epidemiology. 2009; p. 103–121.
- 54. Liu Y, Rocklöv J. The reproductive number of the Delta variant of SARS-CoV-2 is far higher compared to the ancestral SARS-CoV-2 virus. Journal of travel medicine. 2021;28(7):taab124. pmid:34369565
- 55. Liu Y, Rocklöv J. The effective reproductive number of the Omicron variant of SARS-CoV-2 is several times relative to Delta. Journal of Travel Medicine. 2022;29(3):taac037. pmid:35262737
- 56. Gostic KM, McGough L, Baskerville EB, Abbott S, Joshi K, Tedijanto C, et al. Practical considerations for measuring the effective reproductive number, R t. PLoS computational biology. 2020;16(12):e1008409. pmid:33301457
- 57.
Cintrón-Arias A, Castillo-Chávez C, Betencourt L, Lloyd AL, Banks HT. The estimation of the effective reproductive number from disease outbreak data; 2008 [cited 12 January 2024]. In: North Carolina State University [Internet]. Available from: https://repository.lib.ncsu.edu/bitstream/handle/1840.4/4036/crsc-tr08-08.pdf?sequence=1.
- 58. Salvador S, Chan P. Toward accurate dynamic time warping in linear time and space. Intelligent Data Analysis. 2007;11(5):561–580.
- 59. Storn R, Price K. Differential evolution-a simple and efficient heuristic for global optimization over continuous spaces. Journal of Global Optimization. 1997;11(4):341.
- 60. Verity R, Okell LC, Dorigatti I, Winskill P, Whittaker C, Imai N, et al. Estimates of the severity of coronavirus disease 2019: a model-based analysis. The Lancet Infectious Diseases. 2020;20(6):669–677. pmid:32240634
- 61. Pulliam JR, van Schalkwyk C, Govender N, von Gottberg A, Cohen C, Groome MJ, et al. Increased risk of SARS-CoV-2 reinfection associated with emergence of Omicron in South Africa. Science. 2022;376(6593):eabn4947. pmid:35289632
- 62. Voinsky I, Baristaite G, Gurwitz D. Effects of age and sex on recovery from COVID-19: Analysis of 5769 Israeli patients. Journal of Infection. 2020;81(2):e102–e103. pmid:32425274
- 63. Hamby DM. A review of techniques for parameter sensitivity analysis of environmental models. Environmental monitoring and assessment. 1994;32:135–154. pmid:24214086
- 64. Melin P, Castillo O. Spatial and Temporal Spread of the COVID-19 Pandemic Using Self Organizing Neural Networks and a Fuzzy Fractal Approach. Sustainability. 2021;13(15):8295.
- 65. Castillo O, Melin P. A new fuzzy fractal control approach of non-linear dynamic systems: The case of controlling the COVID-19 pandemics. Chaos, Solitons & Fractals. 2021;151:111250. pmid:36568906
- 66. Castillo O, Castro JR, Melin P. Forecasting the COVID-19 with interval type-3 fuzzy logic and the fractal dimension. International Journal of Fuzzy Systems. 2023;25(1):182–197.
- 67. Sharma MK, Dhiman N, Mishra VN, et al. Mediative fuzzy logic mathematical model: A contradictory management prediction in COVID-19 pandemic. Applied Soft Computing. 2021;105:107285. pmid:33723486
- 68. Abdy M, Side S, Annas S, Nur W, Sanusi W. An SIR epidemic model for COVID-19 spread with fuzzy parameter: the case of Indonesia. Advances in difference equations. 2021;2021:1–17. pmid:33613667
- 69. Ripperger TJ, Uhrlaub JL, Watanabe M, Wong R, Castaneda Y, Pizzato HA, et al. Orthogonal SARS-CoV-2 serological assays enable surveillance of low-prevalence communities and reveal durable humoral immunity. Immunity. 2020;53(5):925–933. pmid:33129373
- 70. Cohen KW, Linderman SL, Moodie Z, Czartoski J, Lai L, Mantus G, et al. Longitudinal analysis shows durable and broad immune memory after SARS-CoV-2 infection with persisting antibody responses and memory B and T cells. Cell Reports Medicine. 2021;2(7):100354. pmid:34250512
- 71. Morris CP, Eldesouki RE, Fall A, Gaston DC, Norton JM, Gallagher ND, et al. SARS-CoV-2 reinfections during the Delta and Omicron waves. JCI Insight. 2022;7(20). pmid:36048527
- 72. Ferrante L, Duczmal LH, Steinmetz WA, Almeida ACL, Leão J, Vassão RC, et al. Brazil’s COVID-19 epicenter in manaus: how much of the population has already been exposed and are vulnerable to SARS-COV-2? Journal of Racial and Ethnic Health Disparities. 2021; p. 1–7. pmid:34590244
- 73. Reis RF, de Melo Quintela B, de Oliveira Campos J, Gomes JM, Rocha BM, Lobosco M, et al. Characterization of the COVID-19 pandemic and the impact of uncertainties, mitigation strategies, and underreporting of cases in South Korea, Italy, and Brazil. Chaos, Solitons & Fractals. 2020;136:109888. pmid:32412556
- 74. Bastos SB, Morato MM, Cajueiro DO, Normey-Rico JE. The COVID-19 (SARS-CoV-2) uncertainty tripod in Brazil: Assessments on model-based predictions with large under-reporting. Alexandria Engineering Journal. 2021;60(5):4363–4380.
- 75. Noh J, Danuser G. Estimation of the fraction of COVID-19 infected people in US states and countries worldwide. PLoS ONE. 2021;16(2):e0246772. pmid:33556142
- 76. Juyal D, Pal S, Thaledi S, Pandey HC. COVID-19: The vaccination drive in India and the Peltzman effect. Journal of Family Medicine and Primary Care. 2021;10(11):3945. pmid:35136749
- 77.
Salles S. Falta de comunicação de resultados de autotestes gera subnotificação, apontam specialists. CNN Brasil; 2022 June 6 [cited 2023 August 9]. Available from: https://www.cnnbrasil.com.br/saude/falta-de-comunicacao-de-resultados-de-autotestes-gera-subnotificacao-apontam-especialistas.
- 78. Delli Compagni R, Cheng Z, Russo S, Van Boeckel TP. A hybrid Neural Network-SEIR model for forecasting intensive care occupancy in Switzerland during COVID-19 epidemics. PLoS ONE. 2022;17(3):e0263789. pmid:35239662
- 79. Ala’raj M, Majdalawieh M, Nizamuddin N. Modeling and forecasting of COVID-19 using a hybrid dynamic model based on SEIRD with ARIMA corrections. Infectious Disease Modelling. 2021;6:98–111. pmid:33294749
- 80. Silva PC, Batista PV, Lima HS, Alves MA, Guimarães FG, Silva RC. COVID-ABS: An agent-based model of COVID-19 epidemic to simulate health and economic effects of social distancing interventions. Chaos, Solitons & Fractals. 2020;139:110088. pmid:32834624