Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Neural parameter calibration for dengue outbreak forecasting

  • Hoang Viet Pham,

    Roles Data curation, Formal analysis, Investigation, Methodology, Software, Validation, Visualization, Writing – original draft

    Current address: Faculty of Engineering, Vietnamese-German University, Ho Chi Minh City, Vietnam

    Affiliation Faculty of Engineering, Vietnamese-German University, Ho Chi Minh City, Vietnam

  • Khuong Trung Dang Nguyen,

    Roles Data curation, Formal analysis, Software, Visualization

    Current address: Faculty of Engineering, Vietnamese-German University, Ho Chi Minh City, Vietnam

    Affiliation Faculty of Engineering, Vietnamese-German University, Ho Chi Minh City, Vietnam

  • Thirumalaisamy P. Velavan ,

    Roles Conceptualization, Funding acquisition, Writing – review & editing

    t.velavan@uni-tuebingen.de (TPV); khanh.td@vgu.edu.vn (KDT)

    Affiliations Institute of Tropical Medicine, University of Tuebingen, Tuebingen, Germany, Vietnamese German Center for Medical Research (VG-CARE), Hanoi, Vietnam, Faculty of Medicine, Duy Tan University, Da Nang, Vietnam

  • Khanh Duc Tran

    Roles Conceptualization, Funding acquisition, Investigation, Methodology, Project administration, Resources, Supervision, Writing – review & editing

    t.velavan@uni-tuebingen.de (TPV); khanh.td@vgu.edu.vn (KDT)

    Current address: Faculty of Engineering, Vietnamese-German University, Ho Chi Minh City, Vietnam

    Affiliation Faculty of Engineering, Vietnamese-German University, Ho Chi Minh City, Vietnam

Abstract

Dengue fever poses a growing public health challenge in tropical and subtropical regions, with transmission driven by complex interactions among viral and host. Computational models, often expressed as ordinary differential equations (ODEs), are widely used to understand complex systems such as dengue fever transmission dynamics. However, traditional parameter estimation methods such as Markov chain Monte Carlo (MCMC) often require complex setups and are computationally expensive. In this study, we choose a compartmental model extended to human and mosquito populations, estimate its parameters using neural parameter calibration (NPC), and validate the approach using datasets collected from South America and Southeast Asia. The extended compartment model (ECM) is expressed using seven ODEs, describing dengue transmission dynamics between humans and mosquitoes. NPC involves using a neural network to learn the posterior distribution of parameters and initial conditions of the model in consideration. We analyzed six surveillance datasets on cumulative dengue cases, comprising data from three cities (Bello, Iquitos, and San Juan) and three Southeast Asian countries (Vietnam, the Philippines, and Cambodia). NPC achieved significantly faster run times than MCMC: 408 seconds on average versus 2616.01 seconds for city-level analyses and 368 seconds on average versus 2998.83 seconds for country-level analyses. Meanwhile, it delivers comparable accuracy: mean squared error (MSE) 0.00678 versus 0.01638 for the city datasets; and 0.00605 versus 0.01897 for the country datasets. The experimental results demonstrate that combining ECM with NPC enables accurate dengue outbreak forecasts at substantially lower computational cost, offering a practical tool that supports timely response, especially in low-resource environments such as Southeast Asia.

1 Introduction

Dengue fever, an expanding mosquito-borne disease in tropical and subtropical regions, threatens half of the world’s population across 129 countries, causing an estimated 100-400 million infections annually, with mortality up to 20% in severe cases without prompt treatment [14]. Recurrent large outbreaks are expected to result in a cumulative economic burden of approximately $306 billion by 2050 [5]. Dengue transmission dynamics arise from complex interactions among viral replication cycles, environmental conditions (e.g. temperature, rainfall, humidity) [68], and human factors (e.g. population movement, immunity profiles) [9,10]. These complexities highlight the need for computational models capable of forecasting outbreaks accurately and efficiently to support early warning and timely intervention.

Computational models, often expressed through systems of ordinary differential equations (ODEs), are widely used to describe the evolution of multiple dependent variables using derivatives with respect to a single independent variable. Specifically, compartmental models, ODE systems that track the rates of change in the susceptible, exposed, infectious, and recovered compartments (SEIR), form the backbone of mathematical epidemiology. They integrate demographic data and are calibrated against parameters to simulate and forecast outbreak dynamics [11,12]. Although traditional compartmental models provide flexibility in adapting to diverse epidemiological contexts and retain interpretability through clear mechanistic representation, they still have notable limitations. They are mostly limited to a single component, human only, and often require considerable time and resources for parameter estimation [13].

Recent research on data-driven parameterization of computational models for uncertainty quantification has explored various directions, including hierarchical, nonparametric, ensemble, and Bayesian methods [14,15]. Among these, Bayesian techniques, such as Markov chain Monte Carlo (MCMC), Hamiltonian Monte Carlo, and Langevin dynamics-based samplers, like the Metropolis-adjusted Langevin algorithm and its variants [16], are increasingly used. These techniques have been used to capture both the dynamics of viral transmission and the uncertainties arising from parameter variability by running multiple simulations and analyzing the resulting statistical distributions [17,18]. However, Bayesian parameter estimation and uncertainty quantification generally rely on posterior sampling, which faces three key computational challenges. First, the complex a priori setup of initial conditions and parameter ranges creates substantial implementation barriers. Second, when likelihood evaluations require numerical integration of ODEs, high-dimensional estimation becomes expensive, this leads gradient-driven MCMC methods to exhibit random-walk behavior, resulting in excessive computational time and resource, that makes inference impractical during an outbreak. Third, burn-in periods and sample rejections further waste computational resources. This persistent trade-off between computational efficiency and sampling accuracy in Bayesian ODE parameter estimation underscores the need for alternative approaches such as neural parameter calibration (NPC).

Neural parameter calibration (NPC) is an approach that uses a neural network to estimate model parameters and initial conditions so the model outputs closely match real-world observations. NPC overcomes the computational limitations of traditional Bayesian methods by using a neural network to estimate posterior distributions over parameters for large-scale multi-agent models, achieving two- to three-orders-of-magnitude speedups with improved accuracy compared to classical Bayesian techniques [19]. In COVID-19 applications, a neural network method was used to calibrate the parameters and initial conditions of an ODE model, which simulated the spread of the virus in Berlin in 2020 [20]. Despite its promise for epidemic modeling, NPC has not yet been adapted for dengue fever modeling, suggesting an interesting line of research: applying fast NPC to dengue outbreak prediction models.

This study addresses the aforementioned key challenges by pursuing several objectives. We employ an extended compartmental model (ECM), which comprises human and mosquito components and is formulated using seven ODEs to describe dengue transmission dynamics. The model explicitly represents the SEIR states for humans alongside the Susceptible, Exposed, and Infectious states for mosquitoes. We then apply NPC, which uses a neural network to estimate posterior distributions of parameters and initial conditions of the ECM. We conduct parameter uncertainty quantification and basic reproductive number sensitivity analysis. We run NPC on six surveillance datasets of cumulative dengue cases from three cities (Bello, Iquitos, San Juan) and three Southeast Asian countries (Vietnam, the Philippines, Cambodia). Finally, we perform an analysis between NPC and MCMC to show the viability of NPC against state-of-the-art Bayesian methods.

The article is organized as follows. Sect 2 details the ECM, which includes the equations governing the transmission dynamics of dengue between the mosquito and the human population. It also introduces the NPC and MCMC parameter estimation frameworks for the ECM, and presents the calibration and projection datasets. Sect 3 describes the experimental setup for comparing the NPC and MCMC frameworks. Sect 4 evaluates the performance of the neural calibration method against MCMC benchmarks, comparing computational efficiency, predictive accuracy, uncertainty quantification, and sensitivity analysis for the basic reproductive number (R0) in the analysis for both city and country datasets. Finally, Sect 5 discusses the results, including key findings, limitations, and future directions.

2 Methodologies

2.1 Modeling dengue transmission dynamics

Fig 1 illustrates the mosquito-borne virus transmission cycle between Aedes aegypti mosquitoes and human hosts. The cycle begins when an infectious female mosquito bites a healthy human for blood meal, which is essential for egg production and maturation. During this feeding process, the mosquito transfers the saliva-containing virus to the human host, initiating infection. When sufficient viremia is established, known as the intrinsic incubation period, human-to-mosquito transmission becomes possible. Viremic individuals may recover within 1-2 weeks of symptom onset, depending on illness severity. When a susceptible mosquito subsequently feeds on this viremic human, it ingests the virus along with the blood meal. Within the mosquito, the virus must overcome several anatomical barriers, first infecting and replicating in cells before spreading through the salivary glands. The process, known as the extrinsic incubation period, typically takes 8-12 days to complete, after which mosquito-to-human transmission becomes possible. Once an infectious adult mosquito emerges, it seeks another blood meal, transmitting the virus to a new human host and continuing the transmission cycle [21]. The mosquito progresses through its entire life cycle, developing from eggs laid in water containers to larvae (within 48 hours), then pupae (within 5 days) and finally emerging as adult mosquitoes. The entire mosquito development cycle from egg to adult takes approximately 8-10 days under optimal conditions.

thumbnail
Fig 1. Human-mosquito transmission dynamics of dengue fever.

https://doi.org/10.1371/journal.pone.0339398.g001

To capture human-mosquito transmission dynamics, we adapt the model formulation of [22], following a compartmental structure that separates mosquitoes (vector) and humans (host) components. This framework converts the biological processes, described above, into measurable parameters that control how the population moves between different states, as visually summarized in the flow diagram Fig 2.

thumbnail
Fig 2. Compartmental model structure for dengue transmission dynamics.

https://doi.org/10.1371/journal.pone.0339398.g002

Dengue transmission begins when a susceptible female mosquito Ms(t)(t denotes the dependence on time) bites an infectious human Hi(t), turning the mosquito into an exposed carrier Me(t). The rate at which the turning process occurs, denoted , depends on two factors: (a) the product of the transmission rate from humans to mosquitoes and the number of susceptible mosquitoes Ms(t); and (b) the proportion of infectious humans in the population . Reflecting the biological extrinsic incubation period described earlier, exposed mosquitoes become infectious Mi(t) at a rate . Susceptible humans Hs(t) become exposed He(t) through infectious mosquito bites at a rate . Here, quantifies the transmission rate from the mosquito to the human, while represents the proportion of infectious mosquitoes in the population. Following the intrinsic incubation period described earlier, an exposed human Me(t) becomes an infectious state Mi(t) at a rate and subsequently recovers Hr(t) at a rate . The model also includes natural mosquito mortality rates and a constant human birth and mortality rate .

The mathematical representation of the human-mosquito transmission dynamics is given by the following ODEs system Eq (1):

(1)

where time t is measured in weeks; M(t) = Ms(t)  +  Me(t)  +  Mi(t) represents the mosquito population; and H = Hs(t)  +  He(t)  +  Hi(t)  +  Hr(t) represents the human population. We add a new state variable He(t) to represent the cumulative dengue cases without accounting for recoveries. This is because the available data only includes newly exposed dengue cases, and the average period recovery for this population was unknown a priori, therefore, it is not possible to determine the number of infectious individuals for any given time. Table 1 represents the epidemiological description of each parameter in the compartmental model.

thumbnail
Table 1. Description of epidemiological parameters in the compartmental model.

https://doi.org/10.1371/journal.pone.0339398.t001

2.2 Basic reproductive number (R0)

The basic reproductive number in Eq (2), known as R0 (pronounced “R zero" or “R naught"), is a key indicator in the study of epidemics. It serves as a critical threshold parameter where values that exceed 1 indicate a potential for epidemic spread, while values below 1 suggest that the infection will eventually die out [23]. Researchers use the next generation matrix approach (NGM) [24] to determine R0, under the assumption that the population is initially free of infection (Hi(0) = 0). The square root reflects the biological reality that vector-borne diseases require two generations (one vector generation and one host generation) to maintain ongoing transmission. The numerator represents transmission potential rates through mosquito-human and human-mosquito transmission rates (, and incubation rates , ). The denominator accounts for the loss process, natural mortality, and recovery, which reduce effective transmission.

(2)

2.3 Parameter estimation

We consider two parameter estimation frameworks, namely MCMC and NPC.

2.3.1 Markov Chain Monte Carlo (MCMC).

The MCMC framework defines parameter bounds for the ODEs, explores the parameter spaces via Latin-hypercube Monte Carlo sampling, and then runs multiple local least-square fits from those starting points, following [25]. Specifically, for each proposed parameter vector α, the underlying ODE system is numerically integrated to produce the discrete state variables x(ti) at the observation time ti. Here, denotes the ODE function that defines as in Eq (3)

(3)

The component selects the cumulative infectious case variable from the state vector x(ti). The difference between the output of and the observed yi is quantified using the least squares loss function Eq (4).

(4)

MCMC proposes new parameter vectors and accepts them with a Metropolis-Hastings algorithm based on the posterior ratio, so increases in posterior density are always accepted while decreases are sometimes accepted. After discarding burn-in and checking convergence across multiple chains, the accepted proposals approximate the posterior, from which summaries such as the mean, median, and credible intervals provide parameter estimates with uncertainty. Although MCMC is a rigorous method, it often involves long burn-in periods, frequent sampling rejections that can lower efficiency, and reliance on stochastic sampling techniques such as random walks, which increase computational cost.

2.3.2 Neural parameter calibration (NPC).

In NPC framework from [19] (Algorithm 1), we train a NN to directly predict both initial conditions and epidemiological parameters for the ECM, achieving orders of magnitude faster inference than traditional MCMC. The network, which functions as , has three hidden layers with 20 neurons each, and uses only the logarithmic normalized cumulative dengue cases at t = 0 as input. It then outputs 14 estimated values with an additional constant specified a priori. The chosen NN architecture effectively balances learning capacity and computational efficiency, providing accurate prediction with low run times. Its outputs are split into the initial conditions and the parameter set δ that serve as input to a high–order numerical solver (specifically a Dormand–Prince Shampine Runge–Kutta method of order 5) to integrate the ECM over L time steps. At each step, the solver yields output variables , whose first component is compared to the normalized observed cumulative dengue cases Hi,total(t) via MSE loss as in Eq (5),

(5)

Algorithm 1 Single training epoch for NPC in ODE system.

Inputs:

{Observable Cumulative Infectious Cases}

B = L {Batch Size}

Partition α into state variables and parameter set δ, set

for t = 1 to L do

{Runge-Kutta order 5}

end for

Extract from

Compute as MSE loss

Compute gradient

Update θ via backpropagation (automatic differentiation)

Gradients are calculated by backpropagation through both the output from the numerical solver, enabling updates to the network weights θ, which are then used to produce the output α via feedforward. This forward-backward training loop iterates until convergence, after which the output is transformed back to the original scale by − 1 for direct comparison with the observed data. The NPC workflow is summarized in Fig 3.

thumbnail
Fig 3. NPC workflow.

In S1 Fig, we provide an illustration of the complete neural network model, including the input, hidden, and output layers.

https://doi.org/10.1371/journal.pone.0339398.g003

In traditional methods like MCMC, the posterior density is estimated by counting how often each point is sampled. However, in neural networks, the posterior is determined directly using the loss value J at each point, without relying on the sampling frequency. This means that we do not need to use rejection sampling or wait for a burn-in period because each calculation gives us the true likelihood immediately. Therefore, sampling the same point with the same information more than once is avoided, which makes the NN process much faster compared to the traditional MCMC. Furthermore, since the network movement is guided by the gradient , it quickly focuses on the regions with the highest probability. This leads to a dense collection of samples around the most likely values, achieving faster and more accurate results compared to traditional sampling methods.

2.3.3 Marginal posterior densities.

A marginal density distribution represents the probability distribution of a single parameter obtained by integrating the joint posterior distribution over all other parameters. This statistical technique enables visualization of the uncertainty associated with the estimation of each parameter individually, providing information on parameter identifiability and correlation structures within the model [20,25]. For the parameter vector , the subscript i signifies all components of α except the component ith during the integration process. The marginal posterior density is mathematically expressed as Eq (6).

(6)

Where the integration is performed over the entire feasible domain of the remaining parameters . The joint posterior distribution that appears in the integral is derived using the Bayes theorem as in Eq (7)

(7)

Here, represents the likelihood function that captures the probability distribution of the parameters given the observational data. denotes the joint prior distribution over all parameters. The only information we have a priori about the parameter values is that they are positive; hence, in the following, we always assume uniform priors on .

2.4 City datasets

Time series of infectious dengue case were collected from laboratory-confirmed and clinically suspected case notifications: Iquitos (Peru) and San Juan (Puerto Rico) from National Oceanic and Atmospheric Administration (NOAA) [26]; and Bello (Colombia) from [25], with each dataset covering a 48-week epidemic window. Calibration periods were as follows: Bello, week 8 of 2014 to week 2 of 2015 (616 cases); Iquitos, week 6 of 2005 to week 1 of 2006 (426 cases); San Juan, week 51 of 2000 to week 46 of 2001 (1,492 cases). Projection periods were as follows: Bello, week 32 of 2006 to week 27 of 2007 (512 cases); Iquitos, week 47 of 2001 to week 42 of 2002 (515 cases); San Juan, week 52 of 1998 to week 47 of 1999 (1,158 cases). Fig 4 summarizes outbreak dynamics in city regions during calibration and projection.

thumbnail
Fig 4. Selected calibration and projection dengue outbreaks from city data.

https://doi.org/10.1371/journal.pone.0339398.g004

2.5 Country datasets

Time series of dengue cases for Southeast Asia were drawn from standardized public sources as described in [27]. Three high-incidence countries were analyzed over 48-week windows for calibration and projection. Calibration periods were as follows: Vietnam (week 9 of 2017 to week 4 of 2018, 142,854 cases), the Philippines (week 9 of 2013 to week 5 of 2014, 201,056 cases), Cambodia (week 5 of 2013 to week 48 of 2013, 16,889 cases). Projection periods were as follows: Vietnam (week 16 of 2018 to week 12 of 2019, 134,533 cases), the Philippines (week 13 of 2015 to week 9 of 2016, 199,811 cases), Cambodia (week 9 of 2011 to week 5 of 2012, 15,635 cases). Fig 5 summarizes the dynamics over these periods.

thumbnail
Fig 5. Selected calibration and projection dengue outbreaks from country data.

https://doi.org/10.1371/journal.pone.0339398.g005

3 Experimental setup

We ran experiments on a 13th Gen Intel core i7-13620H CPU with 32GB RAM. Training used a batch size B = L (the outbreak-window length); the optimizer called Adam [28] with MSE between observed Hi,total(t) and predicted ; a learning-rate scheduler with an initial value 5e − 4, linear warm-up over the first 20% of epochs followed by linear decay; and gradient clipping at 0.1. Weights were initialized following [29]; hidden layers use Sigmoid activations, and the output applied an absolute-value transform to enforce non-negative parameter estimates without functional output constraints. The model was trained for 2,000 epochs and run with 10 random seeds. During training, we tracked and aggregated the parameter ranges to define constrained priors for subsequent MCMC sampling.

Following [25], we implemented MCMC and ECM in MATLAB using the Symbolic Math Toolbox [30] and the GSUA-CSB Toolbox [31]. The ODE system was integrated with ODE45 (Dormand-Prince-Shampine Runge-Kutta method of order 5), to ensure comparability with the NPC solver. Parameters were estimated by minimizing MSE (Eq 4) via MATLAB’s (trust region reflective). Latin hypercube sampling (LHS) [32] generated 1,000 initializations, uniformly covering the parameter space. For each initialization, ran up to 4,000 model evaluations with default stopping tolerances FunctionTolerance = 1e − 6 and StepTolerance = 1e − 6. LHS provided diverse initializations, and local optimization refined them, outputting plausible parameter sets.

4 Results

4.1 Experiments on city data

Across the three cities, NPC achieves significantly higher computational efficiency and accuracy comparable to MCMC. As shown in Table 2, NPC completes calibration approximately three orders of magnitude faster (seconds versus thousands of seconds; see Fig 6). For accuracy, NPC provides roughly three to four times reduction in MSE during calibration and similar MSE to MCMC during projection at all locations. The reported NPC MSE is averaged over multiple runs with different random seeds. These results indicate that NPC substantially reduces computational cost while maintaining predictive accuracy across diverse demographic contexts.

thumbnail
Fig 6. NPC vs MCMC accuracy and processing time on city data.

https://doi.org/10.1371/journal.pone.0339398.g006

thumbnail
Table 2. Comparison of NPC and MCMC: MSE and processing time on city data.

https://doi.org/10.1371/journal.pone.0339398.t002

Fig 7 compares NPC and MCMC across dengue outbreaks in three cities. The blue and orange shaded bands denote standard deviation around the mean trajectory, computed across NPC networks with different random seeds and MCMC starting points, respectively. In the cumulative-incidence panels, NPC closely tracks the observations and performs similarly to MCMC across all cities, with small deviations during mid-epidemic growth, while MCMC appears to capture the total burden slightly better. In the weekly-incidence panels, NPC tracks key epidemic features, including peak timing and overall outbreak shape, despite data variability. Overall, NPC achieves accuracy comparable to that of MCMC in modeling the temporal dynamics of dengue transmission. Convergence of NPC training for city datasets, quantified by MSE over epochs, across random seeds is shown in S2 Fig.

thumbnail
Fig 7. Comparison of NPC and MCMC accuracy for infectious (Hi) and cumulative infectious () compartments on city data.

https://doi.org/10.1371/journal.pone.0339398.g007

Fig 8 compares marginal posterior distributions for parameters estimated using NPC (blue) and MCMC (orange) across the cities, with black dots and gray diamonds marking the posterior mean and mode, respectively. Across most parameters, both methods yield largely overlapping, unimodal posteriors with noticeable right-skew in several panels. Transmission and recovery rates, such as , and are highly concentrated and unimodal, whereas some vector-related parameters exhibit broader uncertainty. Notably, in the Bello datasets, λ and exhibit clear bimodality with two well-separated modes and minimal overlap between NPC and MCMC posteriors. Overall, central tendencies (mean and mode) are similar between methods, but uncertainty differs, with NPC typically producing narrower spreads while MCMC occasionally exhibits boundary-peaked densities.

thumbnail
Fig 8. Comparison of NPC and MCMC for marginal posterior densities of ECM parameter set on city data.

The plot shows marginal posterior distributions for parameters in α after calibration in Bello, Iquitos, and San Juan. The x-axis gives parameter values and the y-axis gives posterior density. Narrow, sharp peaks indicate higher certainty, while broader shapes indicate greater uncertainty. Comparing these distributions across NPC and MCMC highlights differences in parameter uncertainty and estimation values.

https://doi.org/10.1371/journal.pone.0339398.g008

Fig 9 summarizes variability in R0 distribution and parameter sensitivities in Bello, Iquitos, and San Juan. The upper panels show broader R0 distribution in Bello (a median of 2.5 and outliers greater than 15), moderate spread in Iquitos (a median of approximately 2.0 and outliers of approximately 17.5), and the most constrained distribution in San Juan (a median of approximately 1.5 and outliers of approximately 12), indicating that most parameter sets produce moderate R0 distribution while some combinations provide extreme values. The lower panel shows consistent importance rankings: mosquito mortality is most influential (sensitivity about 0.86 in Bello, about 1.76 in Iquitos, about 0.86 in San Juan), while , , , , contribute modestly (approximately 0.00-0.39). These patterns suggest prioritizing vector control that reduces mosquito survival, with recovery-focused clinical interventions as a secondary level.

thumbnail
Fig 9. Sensitivity and uncertainty analysis for basic reproductive number on city data.

https://doi.org/10.1371/journal.pone.0339398.g009

4.2 Experiments on country data

Building on the city-level results, the same framework is applied to national datasets to assess whether NPC’s advantages persist when scaling up dengue modeling from cities to countries.

Across Vietnam, the Philippines, and Cambodia, NPC attains roughly 4 times lower calibration MSE and a similar projection MSE than MCMC, while calibrating orders of magnitude faster (seconds versus thousands of seconds), as summarized in Table 3 and visualized in Fig 10. These findings indicate that NPC scales from city- to country-level with substantial computational gains and comparable accuracy across diverse settings.

thumbnail
Fig 10. NPC vs MCMC accuracy and processing time on country data.

https://doi.org/10.1371/journal.pone.0339398.g010

thumbnail
Table 3. Comparison of NPC and MCMC: MSE and Processing Time on Country Data

https://doi.org/10.1371/journal.pone.0339398.t003

Fig 11 compares NPC and MCMC across Vietnam, the Philippines, and Cambodia. In the cumulative-incidence panels, NPC aligns closely with MCMC and the observations over the a 48-week window, with only minor deviations during mid-epidemic growth. In the weekly-incidence panels, NPC captures overall outbreak dynamics, while MCMC appears to better capture peak timing and magnitude. These country-level results are consistent with the city-level findings. Convergence of NPC training for country datasets, quantified by MSE over epochs, across random seeds is shown in S3 Fig.

thumbnail
Fig 11. Comparison of NPC and MCMC accuracy for infectious (Hi) and cumulative infectious () compartments on country data.

https://doi.org/10.1371/journal.pone.0339398.g011

Fig 12 presents marginal posterior densities for each parameter from the national datasets (Vietnam, the Philippines, and Cambodia), with NPC in blue and MCMC in orange; black dots and gray diamonds mark the posterior mean and mode, respectively. Across parameters, both methods yield largely unimodal, substantially overlapping densities, with noticeable right-skew in several panels. For the transmission and recovery parameters , and , the MCMC densities are typically more concentrated than NPC, particularly in Vietnam and Cambodia, while NPC shows broader spreads. The rate parameters λ and are unimodal across countries; MCMC is generally tighter and slightly right-shifted relative to NPC, indicating smaller posterior variance under MCMC. For , the mass often concentrates near the lower boundary, with the sharper boundary peak alternating between methods across countries (sharper under MCMC in Vietnam, and under NPC in the Philippines and Cambodia). Overall, the national-scale results indicate that MCMC provides tighter uncertainty intervals than NPC.

thumbnail
Fig 12. Comparison of NPC and MCMC for marginal posterior densities of ECM parameter set on country data.

The plot shows marginal posterior distributions for parameters in α after calibration in Vietnam, the Philippines, and Cambodia. The x-axis gives parameter values and the y-axis gives posterior density. Narrow, sharp peaks indicate higher certainty, while broader shapes indicate greater uncertainty. Comparing these distributions across NPC and MCMC highlights differences in parameter uncertainty and estimation values.

https://doi.org/10.1371/journal.pone.0339398.g012

Fig 13 presents the uncertainty (box plots) and sensitivity results for R0 in Vietnam, the Philippines, and Cambodia. The R0 distributions are centered near or just above 1, with most values between 0.5 and 2, and with high outliers up to 10 (Vietnam), 8 (Philippines), and 12 (Cambodia), indicating potential for extreme transmission under certain parameter combinations. Sensitivity results identify mosquito mortality as most influential in Vietnam and the Philippines (indices 0.75 and 0.56), with human recovery next; in Cambodia, and are co-dominant (0.87 and 0.81). Other parameters, including , , and , show moderate to low influence across countries. These patterns support prioritizing vector control while tailoring interventions to local drivers, including recovery dynamics.

thumbnail
Fig 13. Sensitivity and uncertainty analysis for basic reproductive number on country data.

https://doi.org/10.1371/journal.pone.0339398.g013

5 Discussion

We used the ECM, consisting of human and mosquito components, formulated as seven equations, to model dengue transmission dynamics. We trained 10 independently initialized networks, with 3 hidden layers, 20 neurons each, for 2,000 epochs to estimate posterior distributions for parameters. We used MCMC to sample the posterior distributions of the same model parameters by defining priors and bounds using LHS, numerically integrating the ODE system for each proposal, and applying the Metropolis–Hastings algorithm across multiple chains. We quantified parameter uncertainty of NPC and MCMC’s and analyzed the sensitivity of the basic reproductive number. We validated the performance and generalizability of both frameworks on six heterogeneous dengue datasets: urban cases from Bello, Iquitos, and San Juan, and national cases from Vietnam, the Philippines, and Cambodia.

The analysis showed that, compared to MCMC, NPC achieves substantial speedups of approximately 7 times on city-scale datasets and 8 times on country-scale datasets over traditional MCMC, while being equally accurate with approximately 10% reduction in MSE on average for city data, and comparable accuracy for country data. Moreover, NPC yields consistently narrower posterior distributions and tighter uncertainty bounds than the broader MCMC posteriors. Sensitivity analysis of the basic reproductive number R0 reveals that mosquito mortality (), human recovery (), mosquito-to-human transmission (), and human-to-mosquito transmission () are the most influential parameters, with traditional MCMC also classifying these parameters , , , and as most influential [25]. The experimental results, consistent across diverse epidemiological locations in the Americas and Southeast Asia, demonstrate the generalizability of NPC, in particular when combined with ECM for dengue fever modeling and analysis.

However, our work still has limitations. The ECM only has human and mosquito components and did not consider climate and weather factors such as temperature, humidity, and precipitation, which are important to mosquito breeding [6]. Besides, NPC may suffer from overfitting or bias, necessitating careful design of network architectures and hyperparameter tuning. Future work should prioritize the integration of temperature, humidity, and precipitation into ECM and the use of NPC to jointly calibrate epidemiological, biological, and meteorological parameters and initial conditions.

Supporting information

S1 Fig. Illustration of the neural network

visualizes a deep neural network with three hidden layers. Here, the inputs (Hi,total(0) shown in yellow) and the bias term (x0 shown in blue) pass through interconnected hidden layers, where edges visualize the weight matrices and bias. The hidden units (green) process the activations to produce the output of the initial conditions and epidemiological parameters.

https://doi.org/10.1371/journal.pone.0339398.s001

(TIFF)

S2 Fig. MSE loss convergence during NPC training with different random seeds on city data.

https://doi.org/10.1371/journal.pone.0339398.s002

(TIFF)

S3 Fig. MSE loss convergence during NPC training with different random seeds on country data.

https://doi.org/10.1371/journal.pone.0339398.s003

(TIFF)

References

  1. 1. Brady OJ, Gething PW, Bhatt S, Messina JP, Brownstein JS, Hoen AG, et al. Refining the global spatial limits of dengue virus transmission by evidence-based consensus. PLoS Negl Trop Dis. 2012;6(8):e1760. pmid:22880140
  2. 2. Bhatt S, Gething PW, Brady OJ, Messina JP, Farlow AW, Moyes CL, et al. The global distribution and burden of dengue. Nature. 2013;496(7446):504–7. pmid:23563266
  3. 3. Gubler DJ. Dengue and dengue hemorrhagic fever. Clin Microbiol Rev. 1998;11(3):480–96. pmid:9665979
  4. 4. Chen H-L, Hsiao W-H, Lee H-C, Wu S-C, Cheng J-W. Selection and characterization of DNA aptamers targeting all four serotypes of dengue viruses. PLoS One. 2015;10(6):e0131240. pmid:26110785
  5. 5. Chen S, Cao Z, Jiao L, Chen W, Prettner K, Kuhn M, et al. The Global Economic Burden of Dengue in 2020 –2050: Estimates and Projections for 141 Countries and Territories. Elsevier BV; 2024. https://doi.org/10.2139/ssrn.4691773
  6. 6. da Silva ST, Gabrick EC, Protachevicz PR, Iarosz KC, Caldas IL, Batista AM, et al. When climate variables improve the dengue forecasting: a machine learning approach. Eur Phys J Spec Top. 2024;234(3):555–69.
  7. 7. Morin CW, Monaghan AJ, Hayden MH, Barrera R, Ernst K. Meteorologically driven simulations of dengue epidemics in San Juan, PR. PLoS Negl Trop Dis. 2015;9(8):e0004002. pmid:26275146
  8. 8. Mordecai EA, Cohen JM, Evans MV, Gudapati P, Johnson LR, Lippi CA, et al. Detecting the impact of temperature on transmission of Zika, dengue, and chikungunya using mechanistic models. PLoS Negl Trop Dis. 2017;11(4):e0005568. pmid:28448507
  9. 9. Zeng Q, Yu X, Ni H, Xiao L, Xu T, Wu H, et al. Dengue transmission dynamics prediction by combining metapopulation networks and Kalman filter algorithm. PLoS Negl Trop Dis. 2023;17(6):e0011418. pmid:37285385
  10. 10. Kiang MV, Santillana M, Chen JT, Onnela J-P, Krieger N, Engø-Monsen K, et al. Incorporating human mobility data improves forecasts of Dengue fever in Thailand. Sci Rep. 2021;11(1):923. pmid:33441598
  11. 11. Weng YT. Statistical methods for fitting dengue disease models, and related issues [M.S. in Applied Statistics]. University of Pittsburgh; 2010.
  12. 12. Esteva L, Vargas C. Analysis of a dengue disease transmission model. Math Biosci. 1998;150(2):131–51. pmid:9656647
  13. 13. Yang X, Quam MBM, Zhang T, Sang S. Global burden for dengue and the evolving pattern in the past 30 years. Journal of Travel Medicine. 2021;28(8):taab146.
  14. 14. Roberts GO, Rosenthal JS. type. arXiv; 2007. https://arxiv.org/pdf/math/0404033
  15. 15. Roberts GO, Tweedie RL. Exponential convergence of langevin distributions and their discrete approximations. Bernoulli. 1996;2(4):341.
  16. 16. Girolami M, Calderhead B. Riemann manifold Langevin and Hamiltonian Monte Carlo Methods. Journal of the Royal Statistical Society Series B: Statistical Methodology. 2011;73(2):123–214.
  17. 17. Leach CB, Hoeting JA, Pepin KM, Eiras AE, Hooten MB, Webb CT. Linking mosquito surveillance to dengue fever through Bayesian mechanistic modeling. PLoS Negl Trop Dis. 2020;14(11):e0008868. pmid:33226987
  18. 18. Hridoy MB, Mustaquim SM. type. arXiv; 2024. https://arxiv.org/abs/2410.00947v1.
  19. 19. Gaskin T, Pavliotis GA, Girolami M. Neural parameter calibration for large-scale multiagent models. Proc Natl Acad Sci U S A. 2023;120(7):e2216415120. pmid:36763529
  20. 20. Gaskin T, Conrad T, Pavliotis GA, Schütte C. Neural parameter calibration and uncertainty quantification for epidemic forecasting. PLoS One. 2024;19(10):e0306704. pmid:39418246
  21. 21. Organization WH. type; 2025. https://www.who.int/news-room/fact-sheets/detail/dengue-and-severe-dengue
  22. 22. Lizarralde-Bejarano DP, Arboleda-Sánchez S, Puerta-Yepes ME. Understanding epidemics from mathematical models: Details of the 2010 dengue epidemic in Bello (Antioquia, Colombia). Applied Mathematical Modelling. 2017;43:566–78.
  23. 23. Heesterbeek JAP. A brief history of R0 and a recipe for its calculation. Acta Biotheor. 2002;50(3):189–204. pmid:12211331
  24. 24. Diekmann O, Heesterbeek JAP, Roberts MG. The construction of next-generation matrices for compartmental epidemic models. J R Soc Interface. 2010;7(47):873–85. pmid:19892718
  25. 25. Lizarralde-Bejarano DP, Rojas-Díaz D, Arboleda-Sánchez S, Puerta-Yepes ME. Sensitivity, uncertainty and identifiability analyses to define a dengue transmission model with real data of an endemic municipality of Colombia. PLoS One. 2020;15(3):e0229668. pmid:32160217
  26. 26. Service NNW. Dengue forecasting;.
  27. 27. Clarke J, Lim A, Gupte P, Pigott DM, van Panhuis WG, Brady OJ. A global dataset of publicly available dengue case count data. Sci Data. 2024;11(1).
  28. 28. Kingma DP, Ba J. type. arXiv:1412.6980; 2014. https://arxiv.org/abs/1412.6980.
  29. 29. Kumar SK. type. arXiv; 2017. https://arxiv.org/abs/1704.08863.
  30. 30. MathWorks. type; 2025. https://de.mathworks.com/products/symbolic.html.
  31. 31. GSUA-CSB. type; 2025. https://de.mathworks.com/matlabcentral/fileexchange/72637-gsua-csb.
  32. 32. Olsson A, Sandberg G, Dahlblom O. On Latin hypercube sampling for structural reliability analysis. Structural Safety. 2003;25(1):47–68.