Exploring epidemic control policies using nonlinear programming and mathematical models

Sandra Montes-Olivas; Adam J. Kucharski; Michael B. Gravenor; Simon D.W. Frost

doi:10.1371/journal.pcbi.1014238

Abstract

Optimal control theory in epidemiology has been used to establish the most effective intervention strategies for managing and mitigating the spread of infectious diseases while considering constraints and costs. Using Pontryagin’s Maximum Principle, indirect methods provide necessary optimality conditions by transforming the control problem into a two-point boundary value problem. However, these approaches are often sensitive to initial guesses and can be computationally challenging, especially when dealing with complex constraints. In contrast, direct methods, which discretise the optimal control problem into a nonlinear programming (NLP) formulation, hold potential for automation and could offer suitable, adaptable solutions for real-time decision-making. However, despite their potential, the widespread adoption of these techniques has been limited. Several factors may contribute to this challenge, including limited access to specialised software, a perception of high computational costs, or a general unfamiliarity with these methods. This study investigates the feasibility, robustness, and potential of direct optimal control methods using nonlinear programming solvers on compartmental models described by ordinary differential equations to determine the best application of various interventions, including non-pharmaceutical interventions (NPIs) and vaccination strategies. Through case studies, we demonstrate the use of NLP solvers to determine the optimal application of interventions based on single objectives, such as minimising total infections, “flattening the curve”, or reducing peak infection levels, as well as multi-objective optimisation to achieve the best combination of interventions. While indirect methods provide useful theoretical insights, direct approaches may be a better fit for the fast-evolving challenges of real-world epidemiology. By integrating newly available data more quickly, direct methods can enhance the ability to make informed and timely decisions for managing outbreaks effectively.

Author summary

This study demonstrates the practical advantages of direct optimisation methods in epidemiological modelling when there is a need to identify effective strategies for disease control while balancing constraints. Through case studies, it examines the effort required to adapt compartmental models for optimisation, the time needed to obtain an optimal solution, and the performance of both open-source and licensed tools. The study begins by contrasting indirect and direct methods using a simple infection model. It then illustrates the application of an accessible mathematical programming framework, JuMP, to optimise control strategies aimed at reducing infections, minimising intervention costs under constraints, and managing multiple interventions. Finally, the study compares the efficiency of different optimisation algorithms. The results show that direct methods, aided by readily available tools like JuMP and IPOPT, enable efficient, flexible, and interpretable modelling with minimal additional implementation effort. This work demonstrates how these techniques can support informed and timely decision-making in the early stages of an epidemic.

Citation: Montes-Olivas S, Kucharski AJ, Gravenor MB, Frost SD (2026) Exploring epidemic control policies using nonlinear programming and mathematical models. PLoS Comput Biol 22(5): e1014238. https://doi.org/10.1371/journal.pcbi.1014238

Editor: Joseph T. Wu, University of Hong Kong, HONG KONG

Received: October 6, 2025; Accepted: April 14, 2026; Published: May 6, 2026

Copyright: © 2026 Montes-Olivas et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: All code illustrating the optimisation scenarios described in this study is publicly available at http://github.com/EpiRecipes/EpiPolicies.

Funding: This work was supported by the Medical Research Council (MRC) under the grant “Building an epidemiological modelling toolkit for epidemic preparedness” (Grant Ref: MR/Z503939/1 to SMO, MBG, SDWF). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests: The authors have declared that no competing interests exist.

Introduction

The study of infectious disease dynamics is crucial for the development and assessment of effective control strategies. Mathematical modelling has become an indispensable tool in this field, providing insights into transmission mechanisms and the potential impact of various interventions. Optimal control theory in epidemiology has been used to establish the most effective intervention strategies for managing and mitigating the spread of infectious diseases while considering constraints and costs [1]. Optimal control can provide a framework to balance common control strategies, such as vaccination and isolation, according to an objective function which can be aimed at minimising infections or intervention costs, enabling decision-makers to weigh the benefits of disease control against potential economic and societal impacts.

The literature on infectious disease modelling using optimal control has grown considerably, particularly in response to the recent coronavirus disease 2019 (COVID-19) pandemic [2]. For instance, Godara et al. [3] applied control theory to a Susceptible-Infected-Recovered (SIR) model to minimise the total number of infected individuals while considering the costs of mitigation efforts. Their findings underscored the importance of strict initial interventions, followed by a gradual relaxation as the epidemic progresses and herd immunity increases. Similarly, Britton and Leskela [4] explored optimal non-pharmaceutical interventions (NPIs) within a SIR framework, emphasising that a single, intense lockdown of short duration, implemented at an optimal time, was the most effective strategy for reducing infections while managing cumulative intervention costs. Other studies have highlighted the practical aspects of different control strategies. For example, Miclo et al. [5] focused on suppression policies to prevent healthcare systems from becoming overwhelmed. They proposed a “filling the box” strategy that involves intensifying suppression measures as infections approach ICU capacity and relaxing them as pressure on healthcare systems decreases. Meanwhile, Asamoah et al. [6] developed an optimal control model and performed a cost-effectiveness analysis of different combinations of various NPIs, including social distancing, personal hygiene, and disinfection of public spaces.

Beyond single-strain models, Arruda et al. [7] addressed reinfection and multiple viral strains using a multi-strain SEIR model, with a case study of the COVID-19 outbreak. Their study highlighted the need to consider the population’s waning immunity and the emergence of new variants when implementing control strategies. Xia et al. [8] proposed a geographically tailored optimal control strategy that underscored the importance of spatial control measures, such as border closures, to contain infectious disease outbreaks effectively. Finally, integrating medical and non-medical interventions, Smirnova [9] suggested the need for control strategies that involve behavioural changes during the early stages of an outbreak and the importance of developing improved control tools, such as vaccines and therapeutics, to apply them as the efficacy of early interventions diminishes over time.

Beyond the recent focus prompted by COVID-19, optimal control theory has been broadly applied in the analysis and management of diverse infectious diseases. For vector-borne diseases, optimal control has been applied to malaria [10,11], dengue fever [12], and other arboviruses like Zika and chikungunya, where strategies extend to innovative biological controls such as the Sterile Insect Technique (SIT) and Wolbachia-infected releases [13]. For example, Asamoah et al. [12], applied optimal control to a dengue fever transmission model incorporating asymptomatic carriers and partial immunity to evaluate the cost-effectiveness of treated bednets, vaccination, treatment, and insecticides. Similarly, optimal control approaches have been explored for rabies management. For instance, Charles et al., [14] integrated optimal control theory into a rabies transmission model involving humans, domestic and free-ranging dogs to evaluate the optimal application of control strategies such as domestic dog vaccination, health promotion and surveillance, public awareness, and post-exposure prophylaxis (PEP). For foot-and-mouth disease (FMD), studies such as that by Wang et al., [15] demonstrate how optimal control can be used to assess control interventions within a bi-seasonal model to minimise infections and costs, with their findings suggesting that reducing transmission among children and isolating older infected individuals are critical for limiting the epidemic.

Together, these studies showcase the versatility and impact of optimal control theory in guiding infectious disease management. They also point out the ongoing challenges in effectively applying these methods, particularly the need for approaches that are both adaptable to real-world complexities and practical for rapid implementation. Addressing these challenges requires a closer look at the methodologies used in optimal control.

Optimal control methods can be classified into two categories: indirect and direct [16]. Indirect methods rely on deriving first-order necessary conditions for optimality, typically using the Pontryagin’s Maximum Principle (PMP). The PMP states that if the control and state trajectories are optimal, then there exists a piecewise differentiable adjoint variable that satisfies conditions defined by the Hamiltonian of the problem [17]. Indirect methods are known for their high numerical accuracy, but have significant challenges. Their reliance on precise initialisation makes convergence highly sensitive to the initial guess, and they often require detailed structural knowledge of the optimal solution, such as a priori identifying bang-bang or singular arcs. In bang-bang controls, the optimal strategy switches instantaneously between its minimum and maximum allowable values. This results in a control function that is piecewise constant and characterised by sharp transitions, and potentially difficult to identify. The challenge from a singular arc occurs when the control does not operate at either extreme. Instead, it is determined by higher-order conditions, which require solving additional equations to characterise the control trajectory within that specific interval. These complexities introduce extra analytical and computational challenges. As a result, indirect methods can become intricate and time-consuming, particularly for complex or poorly understood models where the solution structure is not immediately apparent.

In contrast, direct methods adopt a more intuitive approach by discretising the model in time, transforming the continuous optimal control problem into a nonlinear optimisation problem (NLP) [18]. A NLP is an optimisation problem in which at least one component of the formulation, that is, the objective function or the constraints, is a nonlinear function of the decision variables. Formally, an NLP seeks to minimise or maximise a scalar objective function subject to equality and/or inequality constraints, where these functions are not restricted to being linear. The decision variables include the state variables (e.g., S, I, R) and control variables (e.g., intervention intensity) at each discretised time point. The NLP solver then determines values for these variables that minimise (or maximise) the objective while satisfying all constraints. If the discretised NLP is differentiable, it can be addressed using gradient-based optimisation solvers such as the Interior Point Optimizer (IPOPT) solver [19] or sequential quadratic programming (SQP) solvers [20]. Non-differentiable problems can alternatively be solved by non-gradient-based methods, including metaheuristic algorithms such as Genetic Algorithms [21] and Simulated Annealing [22,23].

Recent advances have expanded the optimisation landscape for mechanistic epidemiological models through the integration of deep learning techniques. Colas et al. [24] proposed a toolbox in Python aimed at bridging epidemiological models and reinforcement learning, specifically leveraging Q-Learning combined with deep neural networks (DQN) [25] alongside evolutionary optimisation methods such as NSGA-II [26]. Furthermore, Yin et al. [27] proposed a hybrid optimisation approach integrating traditional optimisation frameworks with deep-learning algorithms, employing both first- and second-order optimisers such as Adam [28,29] and L-BFGS [20,30]. Although these methods show potential, their computational demands can create challenges and may limit their applicability in time-critical scenarios.

In contrast, established NLP methods are often characterised by rapid convergence to locally optimal solutions thus minimising computational demands. These methods can handle complex system dynamics, multiple constraints, and continuously evolving models, making them well suited for real-time intervention planning and decision-making. However, despite their practical advantages and the fact that such methods and solvers are well established within the optimisation community [18,19,31], they have yet to be widely adopted in the context of epidemiological modelling. Factors such as limited access to specialised software, perceived high computational costs, and unfamiliarity with these techniques may hinder their widespread use. Additionally, practical challenges, like adapting complex epidemiological models to fit into an optimal control framework, can create barriers to broader adoption.

This study aims to demonstrate the practical value and accessibility of direct optimal control methods in epidemiological modelling. While the underlying optimisation techniques are well established, their application in epidemiology has been limited by practical challenges such as computational complexity, model adaptability, and accessibility of suitable tools. Through case studies, we demonstrate that direct optimisation methods provide a practical and accessible alternative to indirect approaches, such as forward–backward sweep algorithms, for computing optimal control strategies in epidemiological models. In particular, we show that widely used compartmental models from the literature can be readily adapted to a direct optimisation framework with minimal structural modification. Initial explorations have been conducted to assess the resources required for implementing these methods and to evaluate their reliability. By highlighting both the strengths and limitations of these approaches, this work seeks to lower the barrier to adoption and encourage their use in real-time intervention planning and decision-making for infectious disease management.

Materials and methods

Optimisation algorithms

Epidemiological control problems frequently present nonlinear dynamics, and are constrained by the available resources [32]. Based on the existing literature, many epidemiological models that employ optimal control theory rely on the PMP to derive necessary conditions, which are then solved using numerical methods. Among these, the forward-backward sweep method [17] is one of the most commonly employed approaches for simulations. In contrast, direct optimisation methods are widely used in several engineering disciplines to find optimal solutions to complex problems [18]. In these fields, a direct optimisation approach formulates the entire optimisation problem (states, controls, and constraints) in a single framework, as a NLP for instance, which is then solved numerically. The NLP solver methods can be grouped as sequential or simultaneous. Sequential methods, like SQP, involve breaking down the problem into a sequence of subproblems that are updated from previous iterations. On the other hand, simultaneous methods solve the optimisation problem as a whole by discretising both the state and control profiles in time, which may be better suited for large scale problems [33]. IPOPT is a commonly used open-source solver that is based on an interior-point approach to solve large-scale problems by iteratively improving a candidate solution from within the feasible region [19]. Here, IPOPT was chosen for its proven ability to efficiently solve large-scale nonlinear programming problems, and its wide availability across multiple programming environments.

JuMP (Julia Mathematical Programming) is an open-source algebraic modelling language embedded in Julia [31,34]. Similar to other commercially available optimisation software such as AMPL (A Mathematical Programming Language) [35] and GAMS (General Algebraic Modeling System) [36], JuMP enables users to express optimisation problems in a form similar to their original mathematical representation, which is then translated into the form expected by the solver. JuMP provides an interface to various solvers and supports automatic differentiation, making it highly efficient for large-scale optimisation problems. The main components of a NLP formulation in JuMP are as follows; a solver; a list of variables; constraints that define the feasible region for the solution; and the objective function used to minimise or maximise specific outcomes, such as the number of infections or costs.

All models presented in this study were discretised in time using a basic Euler method, a straightforward and widely used approach for approximating solutions. This discretisation helps maintain consistency in the behaviour of the model and transforms the continuous system into a form suitable for solving the NLP. The IPOPT solver was used to solve the NLP due to its open-source nature and broad support across the optimisation languages used in this study. The resulting NLPs were solved using IPOPT (v3.14.17), interfaced through JuMP (v1.24), chosen for its open-source availability and broad support across optimisation frameworks. Default solver settings were employed, including a convergence tolerance of 10⁻⁶, a feasibility tolerance of 10⁻⁶, and a maximum of 3000 iterations. Linear systems were solved using MUMPS, and IPOPT’s default exact Hessian of the Lagrangian was used. All case studies converged successfully, with constraint violations remaining below the specified feasibility tolerance, and no manual scaling or problem reformulation was required.

Model formulation

To demonstrate the application of direct optimisation methods, we begin with a simple exponential-growth infection model in which a time-dependent control intervention seeks to reduce the number of infected individuals. This example provides a setting to compare the two methodologies: the indirect method, in which the Pontryagin’s Maximum Principle is applied to derive optimality conditions, and the direct method, in which the problem is discretised and solved as a nonlinear programming problem.

Next, we present the application of JuMP with IPOPT across four case scenarios that employ models already established within epidemiological research [4,12]. The first three scenarios evaluate single control intervention strategies on a modified SIR model aimed at minimising total infections in an epidemic through lockdown measures, ’flattening the curve,’ or vaccination efforts. The final scenario evaluates a more complex compartmental model of dengue transmission formulated by Asamoah et al. [12], optimising the combination of four control strategies.

All case scenarios were solved on a MacBook Pro with an Apple M3 Pro chip and 18GB of memory. The computational framework employed JuMP (v1.24), IPOPT (v3.14.17), AMPL (v20241203), amplpy(v0.14.0), rAMPL (v2.0.13.0.20241012), AmplNLWriter (v1.2.3) and Pyomo (v6.8.2). Code illustrating these scenarios can be found at http://github.com/EpiRecipes/EpiPolicies.

Results

Comparison of indirect and direct methodologies

We consider a simple mechanistic model that tracks the number of infected individuals I(t) over time, assuming a nearly constant susceptible population S ≈ N, where N denotes the total population size. The dynamics of infection are described by:

(1)

where is the transmission rate, is the recovery rate, and represents a time-dependent control strategy (i.e., varying social distancing measures or lockdown) aimed at reducing transmission.

The optimal control policy that balances disease mitigation with the cost of intervention is defined by the following objective function:

(2)

where A and B are the infection and control effort weights, respectively.

First, PMP was used to construct the Hamiltonian:

(3)

where is the adjoint variable associated with state I(t). Then, the adjoint equation is given by:

(4)

To obtain the optimal control, we solve the following condition:

(5)

which yields the unconstrained optimal control:

(6)

Taking into account control bounds, the final expression for the optimal control is as follows:

(7)

Using these derivations, the optimal control problem was solved numerically using the forward-backward sweep method [17], which iteratively updates the state and adjoint equations until convergence.

To compare with a direct approach, the optimisation problem was then formulated and solved using the algebraic modelling language JuMP [34] with the the interior-point solver IPOPT [19]. This approach requires the user to specify a solver and define decision variables, constraints, and the objective function. JuMP then translates the algebraic representation of the problem into a standard form that the solver interprets to find an optimal solution.

Code snippets illustrating the implementation of both methods are shown in Fig 1. The direct method not only results in a more concise block of code, but also eliminates the need for extensive preparatory analytical work, which can be considerable when applied to more complex models, compared to the simplified example presented here. Crucially, Fig 2 presents the results obtained from each approach, showing that both methods produced the same optimal solution.

Download:

Fig 1. Code Snippets: FBS vs JuMP–Ipopt.

Code snippets implementing the different approaches to solve the optimal control problem: forward-backward sweep (left) and JuMP with IPOPT solver (right).

https://doi.org/10.1371/journal.pcbi.1014238.g001

Download:

Fig 2. Optimal control solutions: FBS vs JuMP–Ipopt.

Optimal control solutions obtained using the (a) forward-backward sweep method and the (b) JuMP modeling framework with IPOPT.

https://doi.org/10.1371/journal.pcbi.1014238.g002

Case scenario 1: Lockdown.

This scenario examines the optimal control of an SIR model through a lockdown intervention that reduces the infection rate, with the aim of minimising the total number of infected individuals. The state variables are as follows: S represents the number of susceptible individuals, I denotes the number of infected individuals, and C indicates the total number of cases. The infection rate is modified according to a policy denoted as , where . This model is described by the following differential equations (Eq (8)):

(8)

where and are the baseline transmission and recovery rates, respectively.

Britton and Leskela [4] analytically demonstrated that the optimal policy under these conditions is a single “bang-bang” intervention, characterised by a sudden shift from no intervention to the maximum allowable level for a single period. The optimal control problem is defined as the policy that minimises the total number of cases (i.e., C(∞)) while adhering to two main constraints: (a) the value of cannot exceed a specified maximum, , and (b) there is a cost associated with the policy, quantified as the integral of over time, which must remain within a certain limit, i.e., . In the implementation, we calculate C for a long time horizon to approximate C(∞).

The parameter values used in the evaluations with the modified SIR models are listed in Table 1, and they were assumed for the purpose of intervention optimisation only. These values were not derived from real-world data but were instead utilised to assess the performance of the optimisation algorithm within the model. We initially ran the model without the presence of intervention to establish baseline outputs, observing a cumulative incidence of approximately 79% and the peak of infection occurring at around 17.5 days (see Fig 3(a)).

Download:

Table 1. Values of parameters used in evaluations performed using the modified SIR models.

https://doi.org/10.1371/journal.pcbi.1014238.t001

Download:

Fig 3. SIR baseline vs intervention starting at peak of infections (non-optimal).

Comparison of SIR model outputs (a) baseline scenario without intervention and (b) with a lockdown intervention that lasts 20 simulated days, set at and applied at the peak of infections.

https://doi.org/10.1371/journal.pcbi.1014238.g003

This information was used to simulate the impact of a heuristic (non-optimal) baseline intervention strategy: a single lockdown intervention initiated at the peak of infection cases and lasting for a fixed period of 20 days (set at the maximum policy value of 0.5) consistent with the duration suggested by Britton and Leskela [4]. This scenario was chosen as representative of an intervention with a simple operational definition. Fig 3(b) shows that, in this case, the final cumulative incidence under intervention is 63%.

To investigate whether these results represent the optimal values for minimising cumulative incidence in this scenario, the NLP model was run. The results, shown in Fig 4(a), indicate that the optimal start time obtained using JuMP is 14.3 days, which is earlier than the peak observed in the baseline non-intervention model (17.5 days). Furthermore, the optimal output suggests implementing a single lockdown lasting approximately 19.9 days, and the final cumulative incidence is 59%.

Download:

Fig 4. SIR with optimised NPIs.

Comparison of SIR model outputs (a) optimised lockdown scenario, (b) effective reproduction number during lockdown, (c) optimised “flattening the curve” (FtC) scenario, and (d) effective reproduction number during FtC intervention.

https://doi.org/10.1371/journal.pcbi.1014238.g004

Case scenario 2: “Flattening the curve”.

This scenario employs the same system of equations (Eq (8), as in the lockdown scenario. In addition, in this case, the optimal intervention policy, represented as , balances the costs associated with intervention against the need to manage the spread of infection. This policy aims to ensure that the number of infected individuals, denoted as I, does not exceed a predetermined threshold, I_max. This approach is commonly referred to as “flattening the curve” (FtC), and it is easily implemented in the constraints included in the NLP. Both and I(t) are constrained to values between 0 and and I_max, respectively. The objective function in this scenario is to minimise the total cost of intervention instead of the cumulative incidence.

The results shown in Fig 4(c) suggest that the optimal policy following a FtC approach involves a single lockdown that increases in intensity rapidly at or shortly before the maximum tolerable infections (I_max). Once this threshold is reached the intensity of the lockdown is gradually reduced. Fig 4(d) shows that the end of the lockdown coincides with the effective reproduction number, R_t, crossing the value of 1. This ensures that the population of infected individuals remains below the critical level.

Case scenario 3: Vaccination.

From an epidemiological standpoint, in vaccination-based control strategies, the primary goal is to reduce the susceptible population below a threshold required for sustained transmission. This process is mathematically represented by Eq (9), where the main difference is the position of the control variable , which in this case represents vaccination at a per-capita rate.

(9)

The optimal control problem is defined as the policy that minimises the total number of cases (i.e., the final size of the epidemic) while adhering to the following constraints: (a) the vaccination rate, , cannot exceed a maximum value, indicating a limit on the rate of vaccination, and (b) there is a cost associated with the vaccination process, measured as the integral of over time, which cannot exceed a predetermined level.

The directly optimised results shown in Fig 5 suggest that the optimal policy in this scenario is initiating vaccination immediately, and that maintaining a continuous administration at the maximum allowable rate is most effective. This intervention should be maintained until the allocated vaccine resources are depleted. This strategy reduces the susceptible population early in the epidemic, significantly limiting disease transmission and minimising the peak incidence of infections to ∼3%.

Download:

Fig 5. SIR with optimised vaccination.

Comparison of SIR model outputs (a) baseline scenario without intervention and (b) with a lockdown intervention that lasts 20 simulated days, set at and applied at the peak of infections.

https://doi.org/10.1371/journal.pcbi.1014238.g005

Case scenario 4: Multiple control strategies.

To assess the effectiveness of direct optimisation methods in identifying the optimal solution for a combination of multiple interventions, we applied the same methodology to a model proposed by Asamoah et al. [12], which provides a framework for evaluating the combination of different control interventions to control dengue transmission. This model considers two populations: humans and female mosquitos. The human population is divided into five compartments: susceptible S_h, infected (symptomatic) I_h, carrier (asymptomatic) I_hA, partially immune P, and recovered R_h, while the mosquito population is formed by two compartments: susceptible S_v and infected I_v. The model is described by Eq (10, 11).

(10)

(11)

The time-dependent controls in the model are: treated bednets (u₁), vaccination (u₂), treatment with prophylactics (u₃), and insecticides (u₄). All parameter definitions and values were obtained from [12], and restated in Table 2 for reference.

Download:

Table 2. Parameter values and descriptions used on the dengue fever model.

https://doi.org/10.1371/journal.pcbi.1014238.t002

Following the Asamoah et al. [12] study, the objective was to minimise dengue incidence and the intervention cost according to the objective function shown in Eq (12), with weights relevant to diferent compartments (C_1...3), and those relative to each control variable (D_1...4).

(12)

Similar to the case scenarios presented previously, the system of equations was discretised using a simple Euler discretisation and optimised using JuMP and the IPOPT solver. All tested control combinations resulted in an optimal solution, and the number of iterations and the time taken to reach this solution are displayed in Table 3. These results indicate that the JuMP optimisation framework, in conjunction with the IPOPT solver, can return optimal solutions, even in more complex scenarios involving the optimisation of multiple control strategies. In this case, where a weighted sum is used to aggregate the contributions of each strategy to the objective function, the algorithms effectively converge to the optimal solution in less than 30 seconds for all combinations tried (see Table 3).

Download:

Table 3. Comparison of the number of iterations and the time taken to reach an optimal solution when combining control strategies using JuMP with the IPOPT solver.

https://doi.org/10.1371/journal.pcbi.1014238.t003

In the original study by Asamoah et al. [12], the optimal controls were obtained using the indirect method based on PMP, as described previously. For this model, the Hamiltonian takes the form:

(13)

where L is the integrand of the objective function (Eq 12), are the adjoint variables associated with each state, and the derivatives are given by the state equations (Eq 11).

The necessary optimality conditions yield a system of seven adjoint differential equations, , with transversality conditions , together with the characterisation of each optimal control via . Given the complexity of the model, the resulting adjoint equations expand into extensive expressions (see pages 10–11 in [12]). This highlights the substantial analytical effort required by the indirect method when applied to complex systems. We applied the forward-backward sweep method to this model and confirmed that both this and JuMP approaches produced equivalent optimal control trajectories. The code and trajectory comparisons are available in the associated repository.

Comparison between optimisation algorithms.

There could be concerns regarding the use of direct optimisation methods if these are limited to specific programming languages. To address this, we leveraged the accessibility of the IPOPT solver across various platforms.

Table 4 shows the number of iterations and time each algorithm took to find an optimal solution for the lockdown model. The results indicate that the solver maintains a consistent number of iterations and the resulting objective value across platforms. The timing results shown in the table reflect solve times after JIT compilation warmup for JuMP, ensuring fair comparison across platforms. Overall solve times were consistent across platforms, though JuMP reported a denser Lagrangian Hessian matrix, which led to slightly higher function evaluation times. This difference arises due to the automatic differentiation approach used by each platform to compute the Hessian sparsity pattern. In JuMP, the automatic differentiation framework identified a comparatively denser sparsity structure for the Lagrangian Hessian. In contrast, the remaining algorithms depend on the AMPL Solver Library (ASL) [37] for the derivative evaluations, either directly (amplpy, rAMPL, and AMPL IDE), or indirectly (Pyomo and AmplNLWriter), which exploits partial separability to yield a more compact sparsity representation [38].

Download:

Table 4. Comparison of different optimisation algorithms using IPOPT. The optimisation across platforms was performed on the same optimisation model, using the lockdown case scenario as an example (n = 100 runs, dt = 0.5). Hessian nnz denotes the number of non-zero elements of the Lagrangian Hessian matrix. Objective function evaluations (183) and Lagrangian Hessian evaluations (128) were identical across all platforms.

https://doi.org/10.1371/journal.pcbi.1014238.t004

From the optimisation algorithms tested in this study, JuMP and Pyomo are open-source, while amplpy and rAMPL are official interfaces that allow the user to access the license-based optimisation language AMPL and its features from Python and R, respectively. On the other hand, AmplNLWriter is an open-source wrapper that also interacts with AMPL-enabled solvers but is maintained by the JuMP community. Importantly, AmplNLWriter does not require an AMPL license when used with open-source solvers like IPOPT; however, a license may be needed to use AMPL’s proprietary solvers.

By applying the solver to the initial case scenario using multiple programming languages, we demonstrate that users can choose their preferred programming language while utilising either open-source or licensed optimisation platforms, depending on their needs.

Additionally, we compared the performance of JuMP with alternative NLP solvers, including MadNLP [39,40] and UnoSolver [41], two alternative open-source NLP solvers. Applied to Case Scenario 1 and Case Scenario 4, all solvers converged to equivalent objective values, confirming that the solutions are not solver-dependent. Full benchmark results and trajectory comparisons are available in the accompanying repository.

Discussion

This study demonstrates that existing direct nonlinear programming methods can be effectively applied to epidemiological optimal control problems without introducing new optimisation theory. By leveraging modern optimisation tools such as JuMP, previously published compartmental epidemic models can be readily reformulated and solved using direct optimisation approaches. The results show that optimal control strategies can be computed without deriving adjoint equations or implementing custom numerical solvers. Taken together, these findings indicate that direct optimisation methods provide a practical and accessible framework for applying optimal control in epidemiological modelling, with potential relevance for decision-making and policy analysis.

Modelling underpinned a number of key policy decisions in the UK during the COVID-19 pandemic. As in the examples here, the timing, duration, and extent of lockdown were foremost among the policy options considered at several time points [42]. In trying to provide rapid real-time advice, COVID-19 modelling approaches often relied on combining scenarios based on systematic grid searches of assumed plausible parameters [43,44]. Furthermore, the formal consideration of costs to address questions of optimality has been identified as a priority for future modelling in the UK COVID-19 Inquiry. The Core Decision-Making Report noted it was not clear to what extent ‘smarter’ interventions (that maximised the impact on transmission while minimising the economic or social impact) were ever seriously taken forward, thus hampering the ability of decision-makers to assess and balance relative harms [45]. With tight time constraints involved in preparing modelling output for real-time decision-making, the ability to automate the search of policy options in a manner that can both save time and identify optimal choices offers considerable advantages, especially for complex models with long runtimes that are often used in informing decision-making.

The application of numerical simulations and control theory is commonplace in the analysis of complex models, especially in fields dealing with unpredictable dynamics and large-scale systems [18]. While these simulations provide valuable insights into a wide range of complex systems, optimal control theory becomes particularly helpful when managing disease control, as it helps balance resource constraints and optimise treatment or control strategies. In addition, these strategies also aim to reduce the implementation costs associated with large-scale interventions, offering a double benefit of improving public health outcomes while minimising economic impact [1,17].

The continuous nature of many models, including those used in disease control, often necessitates discretisation for computational implementation. This study employed a simple Euler discretisation, a straightforward method for approximating continuous systems. While easy to implement, this method has limitations regarding accuracy and stability. We conducted a timestep sensitivity analysis comparing simple Euler discretisation with an exponential approximation approach (where transitions are modelled as ) across six timestep values for Case Scenario 1. Both methods converged successfully across all timesteps tested, with the simple Euler method showing greater stability (1.58% relative variation) compared to exponential discretisation (22% relative variation). At small timesteps, both methods reached nearly identical optimal solutions (0.7% difference at dt = 0.05). These results demonstrate that for the models considered here, Euler discretisation with appropriately chosen timesteps provides reliable convergence S3 Fig.

In Case Scenario 1, the lockdown intervention was optimised; it is worth noting that other grid-search-like methods could have been employed. For instance, one approach could involve fixing the intervention length to 20 days and optimising only the start of the intervention. The supplementary S1 Fig, shows a comparison of the optimal time found using JuMP with the cumulative incidence obtained by running the model simulations for different intervention start times. However, the use of JuMP allowed us to confirm the optimal intervention frequency and duration.

Compared to Case Scenario 1, where the objective was to minimise the overall number of infections, Case Scenario 2 had a different focus, the goal being to keep the infected population below a certain threshold, which could be influenced by factors such as the current healthcare capacity. This led to a distinct optimal policy, which suggested a single lockdown period that started earlier than in scenario 1. However, unlike in scenario 1, the strength of the intervention gradually decreased over time until the effective reproduction number dropped below one, at which point disease transmission was effectively controlled. While this optimal policy provides a theoretical framework for managing disease transmission, there are significant challenges in translating these findings into actual intervention policies. Fine-tuning the intensity of the intervention over time, as suggested by the model, may not be feasible in real-world settings. Instead, a more practical approach may involve implementing a series of staged interventions with varying intensities, depending on the evolving situation. The impact of the intervention may be uncertain prior to its implementation, and if its efficacy is found to be lower than expected, it may require initiating the intervention well in advance before reaching the infected threshold. Furthermore, determining when to stop the intervention requires knowledge of the effective reproduction number in the absence of the intervention. This requires reliable estimates of R_t and the intensity of the intervention. These uncertainties are in addition to the usual uncertainty in model structure and parameter values of the underlying model.

Similarly, Case Scenario 3 relies on the assumption that a vaccine is available at the start of an epidemic, and in such a case, the optimal policy would be to vaccinate early and at the maximum capacity available until the supply is exhausted, to reduce the susceptible fraction of population as much as possible. However, it is important to recognise that, in reality, even if a vaccine is developed quickly, there are several factors that can delay or limit its availability. The production capacity may be insufficient to meet demand, leading to delays in distribution. Additionally, the vaccine’s efficacy may not be immediately known or consistent across all demographic groups. Variations in immune response based on age, health status, or pre-existing conditions could affect the overall success of the vaccination campaign.

The optimal control trajectories obtained in this study correspond to solutions under specific modelling assumptions. Control variables are normalised and bounded, providing a structured way to explore the relative timing and intensity of interventions, rather than literal representations of policy actions. In practice, the realised impact of an intervention depends on behavioural responses, compliance, enforcement, and contextual factors that are difficult to quantify precisely, and this uncertainty is well recognised [45]. Within this context, the value of direct optimisation lies in its ability to support structured exploration under uncertainty.

Despite the obvious appeal of identifying optimal control strategies, there are challenges associated with solving optimisation problems. We highlight here that indirect methods are more mathematically intensive, and their successful application depends on an in-depth understanding of the system’s dynamics and constraints. In contrast, the direct methods described do not require the explicit construction of the adjoint and control equations, making them more straightforward to define computationally. Nevertheless, they may come with limitations, such as potentially slower convergence or less precision in highly complex systems. Another issue may arise when the constraints of an NLP problem turn out to be infeasible, which is a common problem that the user should be prepared to address. In these cases, the solver may fail to converge to a feasible solution, which may require revisiting and adjusting the model or constraints [18]. Thus, it is essential to have a clear understanding of the system and the ability to adjust the model as necessary to avoid such pitfalls. Therefore, we do not advocate one approach as the standard; rather we aim to demonstrate to the reader how optimisation languages such as JuMP can be a very useful tool to obtain fast optimal approximations or even corroborate the results obtained through indirect methods. Similarly, Silva et al. [46] applied both direct and indirect methods to develop optimal control strategies to minimise the cost of interventions for treating tuberculosis. They utilised AMPL, IPOPT, and PROPT MATLAB Optimal Control Software to compute and compare the numerical results with those obtained through an iterative method, a standard approach for solving systems of ODEs and updating controls in indirect methods.

The complexity of deriving adjoint equations and Hamiltonians increases significantly with model dimension, making indirect methods more challenging for larger models. For instance, the simple exponential infection model discussed earlier requires only a single adjoint equation and one control optimality condition. However, when we consider a 3-compartment SIR model, the situation becomes more complex, requiring three adjoint equations, three boundary conditions, and coupling between state and adjoint variables. In larger models, such as the dengue transmission model (Case Scenario 4), which includes seven compartments (S_h, I_h, I_hA, P, R_h, S_v, I_v) and four control variables (u₁, u₂, u₃, u₄), the analytical derivation becomes much more complicated. In this case, seven adjoint equations must be derived (one for each state variable), along with four control optimality conditions. Additionally, the boundary conditions become more intricate, and the coupling between state and adjoint variables increases nonlinearly as the model dimension grows. This analytical burden becomes increasingly tedious and prone to error as models become more complex, requiring meticulous manual derivation, verification, and implementation. In contrast, direct methods only require the discretisation of the state equations, the definition of variables and constraints, and then the problem can be passed to a solver. The complexity in direct methods scales approximately linearly with the number of time points and compartments. This scalability advantage becomes particularly evident for models that incorporate additional complexities such as age structure, spatial heterogeneity, or multiple pathogen strains.

As for solving NLP optimisation problems, using an interior point solver such as IPOPT has proven to be highly effective in this study and in others [19,33,46–48]. IPOPT can efficiently solve problems with a large number of inequalities and degrees of freedom, making it a good and flexible option for high-dimensional control problems. The algorithm’s ability to scale and find feasible solutions within a reasonable time frame is an important advantage when tackling the optimisation challenges posed by complex control systems. Additionally, advancements in computing power have reduced the computational expense once associated with these methods, enabling faster and more accessible solutions. However, alternative tools such as AMPL, GAMS, and Pyomo also offer similar capabilities, providing users with flexibility in software selection based on preference and the specific requirements of their computational environment.

All presented case studies, including the most complex example (Case Scenario 4: a 7-compartment dengue model with four controls, which, after discretisation, resulted in 5,409 decision variables and 4,215 constraints), converged successfully using default IPOPT settings within reasonable computation times. However, we did not systematically evaluate sensitivity with respect to initialisation strategies, problem scaling, stiffness, solver tolerances, or parameter uncertainty. While JuMP provides a high-level interface for formulating nonlinear optimisation problems, convergence behaviour is ultimately determined by the underlying solver and the numerical properties of the discretised model [31]. Interior-point methods are known to exhibit sensitivity to numerical choices such as starting points and scaling, particularly in more challenging settings [49]. The models considered here are well-behaved. In contrast, more complex formulations incorporating spatial structure, stochastic effects, population heterogeneity, or multiple strains may introduce additional numerical difficulties. In such cases, the direct optimisation framework remains conceptually applicable. Still, increased model complexity primarily raises computational considerations, including larger numbers of decision variables and constraints, potential numerical stiffness, and the need for solver tuning or alternative discretisation strategies. Addressing these challenges may require additional numerical measures, such as custom scaling, higher-order discretisation schemes (i.e., Runge-Kutta [50]), or multiple initialisation strategies to improve robustness and convergence [18,19]. It is worth noting that, while we employed Euler discretisation throughout this study, the framework itself does not specify a specific discretisation approach; modellers can implement alternative schemes as needed, provided the resulting formulation remains differentiable and compatible with automatic differentiation.

While this work focuses on direct nonlinear programming (NLP) transcription methods for optimal control, other approaches can provide complementary capabilities. Model Predictive Control (MPC) formulates a sequence of finite-horizon optimisation problems in a receding-horizon framework, making it well-suited for multivariable dynamic systems with constraints and for adapting to real-time disturbances [51,52]. In contrast, direct NLP transcription discretises the full-time horizon into finite elements and formulates a single finite-dimensional optimisation problem, in which the system dynamics and control variables are solved simultaneously [18,53]. This approach offers advantages in exploiting sparsity and handling complex path and boundary constraints. However, for nonconvex problems, the resulting nonlinear programs are typically solved using optimisation solvers such as IPOPT, which provide local rather than theoretical global optimality [19].

Reinforcement learning (RL) and deep learning–based control methods learn control policies through trial-and-error interaction with an environment and can be effective for systems with stochastic behaviour or large state spaces [54–57]. In these settings, traditional control approaches attempt to precompute control actions for all possible system states, which often become impractical as the problem size increases. This difficulty arises because the complexity of such controllers grows rapidly with the number of system variables and constraints. RL addresses this challenge by using flexible function approximators to represent control policies, allowing it to scale to higher-dimensional problems. However, this flexibility typically requires large amounts of training data and comes with reduced interpretability and weaker safety or stability guarantees compared to model-based methods. In comparison, the direct NLP framework provides a transparent middle ground between classical analytical methods and data-driven approaches, making it well-suited for deterministic models with well-characterised system parameters.

Additionally, this study focused exclusively on differentiable epidemiological models, which allowed us to use gradient-based optimisation solvers. As a result, our findings do not directly address the effectiveness or applicability of non-gradient-based optimisation approaches. However, recent advancements have introduced differentiable frameworks for traditionally non-differentiable models, such as agent-based models (ABMs), which may expand the scope and use of gradient-based optimisation solvers. For instance, Chopra et al. [58], developed GradABM, a scalable and differentiable design for ABMs that enables gradient-based learning through automatic differentiation.

Conclusion

Overall, the NLP optimisation approach illustrated in this study performed very effectively for simple models, consistently delivering accurate results within seconds which showcase its practical value as an accessible tool for rapid analysis and decision-making in epidemiological modelling.

While JuMP and IPOPT have been used in previous studies to solve optimisation problems in the context of infectious disease modelling [48], this study aims to showcase their ease of use and effectiveness to promote the wider adoption of direct methods for addressing optimisation challenges. By employing JuMP and IPOPT, this work illustrates how these tools can effectively deliver an initial solution. This approach can aid policymakers in making informed decisions during the early stages of an epidemic, based on the dynamics of the disease model.

Real-world policy questions typically concern trade-offs between timing, duration, and resource constraints, rather than exact intervention intensities. Direct optimisation methods allow these constraints to be represented explicitly and modified as assumptions evolve. When implemented using frameworks such as JuMP, models can be reformulated and re-solved efficiently to examine how conclusions change under different operational constraints or effectiveness assumptions. This flexibility enables direct optimisation to function as a practical decision-support tool, facilitating comparative and iterative analysis rather than producing fixed solutions.

Regarding ongoing challenges in public health economics, the uncertainty surrounding costs has been consistently highlighted in the evidence presented to the UK COVID Inquiry [45]. As we consider future developments, it is important to integrate cost considerations into our decision-making processes. Despite this need, it is likely that cost estimates will continue to be uncertain. In addition, there may be other considerations that would affect the optimal policy; for example, Zarebski et al. [59] incorporated measures of equity into finding an optimal vaccination program in order to achieve an intervention that was both effective and ethical. Thus, there is a pressing need for efficient methodologies that facilitate the exploration of optimal strategies across a wide array of cost function assumptions. The methods described here offer an opportunity for real-time evaluation and the potential generation of rules of thumb for optimal approaches, based on extensive exploration of cost-benefit analysis under a wide range of disease dynamic scenarios, to support preparedness.

Supporting information

S1 Fig. Impact of intervention timing on fixed lockdown duration.

Final cumulative infected population values (C) obtained from applying a fixed intervention with a length of 20 days at different starting times. The minimal C value is highlighted in orange.

https://doi.org/10.1371/journal.pcbi.1014238.s001

(TIF)

S2 Fig. Optimal intervention strategy across multiple parameter scenarios.

Performance of an optimal intervention strategy to reduce the cummulative infections, as in Case Scenario 1, across various scenarios by sweeping through a range of transmission and recovery parameter values ([0.35,0.55] for and [0.15,0.30] for ). Rather than optimising for each scenario separately, the JuMP model creates a unified intervention profile that reduces the expected total infections across all.

https://doi.org/10.1371/journal.pcbi.1014238.s002

(TIF)

S3 Fig. Timestep sensitivity analysis.

Comparison of simple Euler and exponential discretisation methods across six timestep values (dt = 2.0 to 0.05 days) for Case Scenario 1. (a) Shows the convergence of the optimal objective value with timestep for both methods. (b) Displays the relative difference between methods. (c) Illustrates the scaling of computational time with the number of time points (mean ± standard deviation across 50 repetitions). (d) Compares control trajectories at dt = 0.5 days.

https://doi.org/10.1371/journal.pcbi.1014238.s003

(TIF)

Acknowledgments

The authors sincerely thank Dr. Joshua Asamoah, whose shared files and communication supported the testing of the methods presented in this study.

References

1. Sharomi O, Malik T. Optimal control in epidemiology. Ann Oper Res. 2017;251(1–2):55–71.
- View Article
- Google Scholar
2. World Health Organization. A timeline of WHO’s COVID-19 Response in the WHO European Region: a living document (version 3.0, from 31 December 2019 to 31 December 2021). 2022. Available from: https://iris.who.int/bitstream/handle/10665/351782/WHO-EURO-2022-1772-41523-63024-eng.pdf?sequence=1
3. Godara P, Herminghaus S, Heidemann KM. A control theory approach to optimal pandemic mitigation. PLoS One. 2021;16(2):e0247445. pmid:33606802
- View Article
- PubMed/NCBI
- Google Scholar
4. Britton T, Leskelä L. Optimal Intervention Strategies for Minimizing Total Incidence During an Epidemic. SIAM J Appl Math. 2023;83(2):354–73.
- View Article
- Google Scholar
5. Miclo L, Spiro D, Weibull J. Optimal epidemic suppression under an ICU constraint: An analytical solution. J Math Econ. 2022;101:102669.
- View Article
- Google Scholar
6. Asamoah JKK, Okyere E, Abidemi A, Moore SE, Sun G-Q, Jin Z, et al. Optimal control and comprehensive cost-effectiveness analysis for COVID-19. Results Phys. 2022;33:105177. pmid:35070649
- View Article
- PubMed/NCBI
- Google Scholar
7. Arruda EF, Das SS, Dias CM, Pastore DH. Modelling and optimal control of multi strain epidemics, with application to COVID-19. PLoS One. 2021;16(9):e0257512. pmid:34529745
- View Article
- PubMed/NCBI
- Google Scholar
8. Xia F, Xiao Y, Ma J. The optimal spatially-dependent control measures to effectively and economically eliminate emerging infectious diseases. PLoS Comput Biol. 2024;20(10):e1012498. pmid:39374303
- View Article
- PubMed/NCBI
- Google Scholar
9. Smirnova A, Baroonian M, Ye X. Optimal epidemic control with nonmedical and medical interventions. Mathematics. 2024;12(18):2811.
- View Article
- Google Scholar
10. Ouattara L, Ouedraogo D, Diop O, Guiro A. Analysis and optimal control of a mathematical model of malaria. Nonlinear Dyn Syst Theory. 2024;24.
- View Article
- Google Scholar
11. Jaleta SF, Duressa GF, Deressa CT. A mathematical modeling and optimal control analysis of the effect of treatment-seeking behaviors on the spread of malaria. Front Appl Math Stat. 2025;11:1552384.
- View Article
- Google Scholar
12. Asamoah JKK, Yankson E, Okyere E, Sun G-Q, Jin Z, Jan R, et al. Optimal control and cost-effectiveness analysis for dengue fever model with asymptomatic and partial immune individuals. Results in Physics. 2021;31:104919.
- View Article
- Google Scholar
13. Almeida L, Bellver-Arnau J, Privat Y, Rebelo C. Vector-borne disease outbreak control via instant releases. J Math Biol. 2024;89(6):63. pmid:39532731
- View Article
- PubMed/NCBI
- Google Scholar
14. Charles M, Mfinanga SG, Lyakurwa GA, Torres DFM, Masanja VG. A Mathematical and Optimal Control Model for Rabies Transmission Dynamics Among Humans and Dogs With Environmental Effects. J Appl Math. 2025;2025(1):5559842.
- View Article
- Google Scholar
15. Wang A, Bai D, He J, Smith SR. Optimal control of bi-seasonal hand, foot and mouth disease in mainland China suggests transmission from children and isolating older infected individuals are critical. J Math Biol. 2024;89(4):43. pmid:39331191
- View Article
- PubMed/NCBI
- Google Scholar
16. Caillau J-B, Ferretti R, Trélat E, Zidani H. An algorithmic guide for finite-dimensional optimal control problems. Handbook of Numerical Analysis. vol. 24. Elsevier; 2023. p. 559–626.
17. Lenhart S, Workman JT. Optimal control applied to biological models. Chapman and Hall/CRC; 2007. https://doi.org/10.1201/9781420011418
18. Betts JT. Practical Methods for Optimal Control and Estimation Using Nonlinear Programming. 2nd ed. Society for Industrial and Applied Mathematics; 2010. https://doi.org/10.1137/1.9780898718577
19. Wächter A, Biegler LT. On the implementation of an interior-point filter line-search algorithm for large-scale nonlinear programming. Math Program. 2005;106(1):25–57.
- View Article
- Google Scholar
20. Nocedal J, Wright SJ. Numerical optimization. 2nd ed. Springer series in operations research and financial engineering. New York: Springer;
21. Michalewicz Z. Genetic algorithms data structures = evolution programs. 3rd ed. Berlin: Springer-Verlag; 1996.
22. Kirkpatrick S, Gelatt CD Jr, Vecchi MP. Optimization by simulated annealing. Science. 1983;220(4598):671–80. pmid:17813860
- View Article
- PubMed/NCBI
- Google Scholar
23. Aarts E, Korst J, Michiels W. Simulated Annealing. In: Burke EK, Kendall G, editors. Simulated Annealing. Boston, MA: Springer US; 2005. p. 187–210.
24. Colas C, Hejblum B, Rouillon S, Thiébaut R, Oudeyer P-Y, Moulin-Frier C, et al. EpidemiOptim: A Toolbox for the Optimization of Control Policies in Epidemiological Models. jair. 2021;71:479–519.
- View Article
- Google Scholar
25. Mnih V, Kavukcuoglu K, Silver D, Rusu AA, Veness J, Bellemare MG, et al. Human-level control through deep reinforcement learning. Nature. 2015;518(7540):529–33. pmid:25719670
- View Article
- PubMed/NCBI
- Google Scholar
26. Deb K, Pratap A, Agarwal S, Meyarivan T. A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE Trans Evol Comput. 2002;6(2):182–97.
- View Article
- Google Scholar
27. Yin S, Wu J, Song P. Optimal control by deep learning techniques and its applications on epidemic models. J Math Biol. 2023;86(3):36. pmid:36695914
- View Article
- PubMed/NCBI
- Google Scholar
28. Kingma DP, Ba J. Adam: A method for stochastic optimization. arXiv preprint arXiv:14126980. 2014.
29. Zhang Z. Improved Adam Optimizer for Deep Neural Networks. In: 2018 IEEE/ACM 26th International Symposium on Quality of Service (IWQoS). 2018. p. 1–2.
30. Moritz P, Nishihara R, Jordan M. A linearly-convergent stochastic L-BFGS algorithm. In: Artificial Intelligence and Statistics. PMLR; 2016. p. 249–58.
31. Dunning I, Huchette J, Lubin M. JuMP: A Modeling Language for Mathematical Optimization. SIAM Rev. 2017;59(2):295–320.
- View Article
- Google Scholar
32. Hernandez-Vargas EA, Gonzalez AH, Beck CL, Bi X, Campana FC, Giordano G. Modelling and Control of Epidemics Across Scales. In: 2022 IEEE 61st Conference on Decision and Control (CDC). 2022. p. 4963–80.
33. Biegler LT. An overview of simultaneous strategies for dynamic optimization. Chem Eng Process Process Intensif. 2007;46(11):1043–53.
- View Article
- Google Scholar
34. Lubin M, Dowson O, Dias Garcia J, Huchette J, Legat B, Vielma JP. JuMP 1.0: Recent improvements to a modeling language for mathematical optimization. Math Programm Computat. 2023;15:581–9.
- View Article
- Google Scholar
35. Fourer R, Gay DM, Kernighan BW. AMPL: A Modeling Language For Mathematical Programming. 2nd ed. Belmont (CA): Duxbury Press; 2003.
36. Bisschop J, Meeraus A. On the development of a general algebraic modeling system in a strategic planning environment. Applications. Springer. 2009. p. 1–29.
37. Gay DM. The AMPL Modeling Language: An Aid to Formulating and Solving Optimization Problems. In: Al-Baali M, Grandinetti L, Purnama A, editors. Numerical Analysis and Optimization. Cham: Springer International Publishing; 2015. p. 95–116.
38. Petra CG, Qiang F, Lubin M, Huchette J. On efficient Hessian computation using the edge pushing algorithm in Julia. Optim Method Softw. 2018;33(4–6):1010–29.
- View Article
- Google Scholar
39. Shin S, Coffrin C, Sundar K, Zavala VM. Graph-Based Modeling and Decomposition of Energy Infrastructures. IFAC-PapersOnLine. 2021;54(3):693–8.
- View Article
- Google Scholar
40. Shin S, Anitescu M, Pacaud F. Accelerating optimal power flow with GPUs: SIMD abstraction of nonlinear programs and condensed-space interior-point methods. Electr Power Syst Res. 2024;236:110651.
- View Article
- Google Scholar
41. Vanaret C, Leyffer S. Implementing a unified solver for nonlinearly constrained optimization; 2025. Available from: https://arxiv.org/abs/2406.13454
42. Gravenor MB, Dawson M, Bennett E, Thorpe B, White C, Rahat A. Real-Time Epidemiological Modelling During the COVID-19 Emergency in Wales. In: Aston PJ, editor. Cham: Springer Nature Switzerland; 2025. p. 97–104.
43. Davies NG, Kucharski AJ, Eggo RM, Gimma A, Edmunds WJ, Centre for the Mathematical Modelling of Infectious Diseases COVID-19 Working Group. Effects of non-pharmaceutical interventions on COVID-19 cases, deaths, and demand for hospital services in the UK: a modelling study. Lancet Public Health. 2020;5(7):e375–85. pmid:32502389
- View Article
- PubMed/NCBI
- Google Scholar
44. Keeling MJ, Hill EM, Gorsich EE, Penman B, Guyver-Fletcher G, Holmes A, et al. Predictions of COVID-19 dynamics in the UK: Short-term forecasting and analysis of potential exit strategies. PLoS Comput Biol. 2021;17(1):e1008619. pmid:33481773
- View Article
- PubMed/NCBI
- Google Scholar
45. UK Covid-19 Inquiry. UK Covid-19 Inquiry. 2024. Accessed: 2025-07-17. https://covid19.public-inquiry.uk
46. Silva CJ, Torres DFM. Optimal control for a tuberculosis model with reinfection and post-exposure interventions. Math Biosci. 2013;244(2):154–64. pmid:23707607
- View Article
- PubMed/NCBI
- Google Scholar
47. Word DP, Cummings DAT, Burke DS, Iamsirithaworn S, Laird CD. A nonlinear programming approach for estimation of transmission parameters in childhood infectious disease using a continuous time model. J R Soc Interface. 2012;9(73):1983–97. pmid:22337634
- View Article
- PubMed/NCBI
- Google Scholar
48. Zhu J, Wang Q, Huang M. Optimizing two-dose vaccine resource allocation to combat a pandemic in the context of limited supply: The case of COVID-19. Front Public Health. 2023;11.
- View Article
- Google Scholar
49. Gertz M, Nocedal J, Sartenar A. A starting point strategy for nonlinear interior methods. Appl Math Lett. 2004;17(8):945–52.
- View Article
- Google Scholar
50. Butcher JC. 3. In: Runge–Kutta Methods. John Wiley & Sons, Ltd; 2016. p. 143–331. Available from: https://onlinelibrary.wiley.com/doi/abs/10.1002/9781119121534.ch3
51. Rawlings JB, Mayne DQ, Diehl M. Model predictive control : theory, computation, and design. 2nd ed. Santa Barbara: Nob Hill Publishing; 2024.
52. Köhler J, Müller MA, Allgöwer F. Analysis and design of model predictive control frameworks for dynamic operation—An overview. Annu Rev Control. 2024;57:100929.
- View Article
- Google Scholar
53. Ebeed M, Kamel S, Jurado F. Chapter 7 - Optimal Power Flow Using Recent Optimization Techniques. In: Zobaa AF, Abdel Aleem SHE, Abdelaziz AY, editors. Classical and Recent Aspects of Power System Optimization. Academic Press; 2018. p. 157–83.
54. Kiumarsi B, Vamvoudakis KG, Modares H, Lewis FL. Optimal and Autonomous Control Using Reinforcement Learning: A Survey. IEEE Trans Neural Netw Learn Syst. 2018;29(6):2042–62. pmid:29771662
- View Article
- PubMed/NCBI
- Google Scholar
55. Ohi AQ, Mridha MF, Monowar MM, Hamid MA. Exploring optimal control of epidemic spread using reinforcement learning. Sci Rep. 2020;10(1):22106. pmid:33328551
- View Article
- PubMed/NCBI
- Google Scholar
56. Tian Y, Li X, Wang H, Yuan H, Zhang T. Optimizing spatiotemporal nonpharmaceutical interventions for influenza: An adaptive reinforcement learning approach for regional heterogeneity. Infect Dis Model. 2025;11(1):203–17. pmid:41140927
- View Article
- PubMed/NCBI
- Google Scholar
57. Li X, Yin L, Liu K, Zhu K, Cui Y. Deep-reinforcement-learning-based optimization for intra-urban epidemic control considering spatiotemporal orderliness. Int J Geogr Inf Sci. 2024;1–26.
- View Article
- Google Scholar
58. Chopra A, Rodríguez A, Subramanian J, Krishnamurthy B, Prakash BA, Raskar R. Differentiable agent-based epidemiological modeling for end-to-end learning. In: ICML 2022 Workshop AI for Agent-Based Modelling; 2022. Available from: https://arxiv.org/abs/2207.09714
59. Zarebski AE, Tellioglu N, Stockdale JE, Spencer JA, KhudaBukhsh WR, Miller JC, et al. Including frameworks of public health ethics in computational modelling of infectious disease interventions. arXiv preprint arXiv:250200071. 2025.

[ref1] 1. Sharomi O, Malik T. Optimal control in epidemiology. Ann Oper Res. 2017;251(1–2):55–71.
View Article
Google Scholar

[2] View Article

[3] Google Scholar

[ref2] 2. World Health Organization. A timeline of WHO’s COVID-19 Response in the WHO European Region: a living document (version 3.0, from 31 December 2019 to 31 December 2021). 2022. Available from: https://iris.who.int/bitstream/handle/10665/351782/WHO-EURO-2022-1772-41523-63024-eng.pdf?sequence=1

[ref3] 3. Godara P, Herminghaus S, Heidemann KM. A control theory approach to optimal pandemic mitigation. PLoS One. 2021;16(2):e0247445. pmid:33606802
View Article
PubMed/NCBI
Google Scholar

[6] View Article

[7] PubMed/NCBI

[8] Google Scholar

[ref4] 4. Britton T, Leskelä L. Optimal Intervention Strategies for Minimizing Total Incidence During an Epidemic. SIAM J Appl Math. 2023;83(2):354–73.
View Article
Google Scholar

[10] View Article

[11] Google Scholar

[ref5] 5. Miclo L, Spiro D, Weibull J. Optimal epidemic suppression under an ICU constraint: An analytical solution. J Math Econ. 2022;101:102669.
View Article
Google Scholar

[13] View Article

[14] Google Scholar

[ref6] 6. Asamoah JKK, Okyere E, Abidemi A, Moore SE, Sun G-Q, Jin Z, et al. Optimal control and comprehensive cost-effectiveness analysis for COVID-19. Results Phys. 2022;33:105177. pmid:35070649
View Article
PubMed/NCBI
Google Scholar

[16] View Article

[17] PubMed/NCBI

[18] Google Scholar

[ref7] 7. Arruda EF, Das SS, Dias CM, Pastore DH. Modelling and optimal control of multi strain epidemics, with application to COVID-19. PLoS One. 2021;16(9):e0257512. pmid:34529745
View Article
PubMed/NCBI
Google Scholar

[20] View Article

[21] PubMed/NCBI

[22] Google Scholar

[ref8] 8. Xia F, Xiao Y, Ma J. The optimal spatially-dependent control measures to effectively and economically eliminate emerging infectious diseases. PLoS Comput Biol. 2024;20(10):e1012498. pmid:39374303
View Article
PubMed/NCBI
Google Scholar

[24] View Article

[25] PubMed/NCBI

[26] Google Scholar

[ref9] 9. Smirnova A, Baroonian M, Ye X. Optimal epidemic control with nonmedical and medical interventions. Mathematics. 2024;12(18):2811.
View Article
Google Scholar

[28] View Article

[29] Google Scholar

[ref10] 10. Ouattara L, Ouedraogo D, Diop O, Guiro A. Analysis and optimal control of a mathematical model of malaria. Nonlinear Dyn Syst Theory. 2024;24.
View Article
Google Scholar

[31] View Article

[32] Google Scholar

[ref11] 11. Jaleta SF, Duressa GF, Deressa CT. A mathematical modeling and optimal control analysis of the effect of treatment-seeking behaviors on the spread of malaria. Front Appl Math Stat. 2025;11:1552384.
View Article
Google Scholar

[34] View Article

[35] Google Scholar

[ref12] 12. Asamoah JKK, Yankson E, Okyere E, Sun G-Q, Jin Z, Jan R, et al. Optimal control and cost-effectiveness analysis for dengue fever model with asymptomatic and partial immune individuals. Results in Physics. 2021;31:104919.
View Article
Google Scholar

[37] View Article

[38] Google Scholar

[ref13] 13. Almeida L, Bellver-Arnau J, Privat Y, Rebelo C. Vector-borne disease outbreak control via instant releases. J Math Biol. 2024;89(6):63. pmid:39532731
View Article
PubMed/NCBI
Google Scholar

[40] View Article

[41] PubMed/NCBI

[42] Google Scholar

[ref14] 14. Charles M, Mfinanga SG, Lyakurwa GA, Torres DFM, Masanja VG. A Mathematical and Optimal Control Model for Rabies Transmission Dynamics Among Humans and Dogs With Environmental Effects. J Appl Math. 2025;2025(1):5559842.
View Article
Google Scholar

[44] View Article

[45] Google Scholar

[ref15] 15. Wang A, Bai D, He J, Smith SR. Optimal control of bi-seasonal hand, foot and mouth disease in mainland China suggests transmission from children and isolating older infected individuals are critical. J Math Biol. 2024;89(4):43. pmid:39331191
View Article
PubMed/NCBI
Google Scholar

[47] View Article

[48] PubMed/NCBI

[49] Google Scholar

[ref16] 16. Caillau J-B, Ferretti R, Trélat E, Zidani H. An algorithmic guide for finite-dimensional optimal control problems. Handbook of Numerical Analysis. vol. 24. Elsevier; 2023. p. 559–626.

[ref17] 17. Lenhart S, Workman JT. Optimal control applied to biological models. Chapman and Hall/CRC; 2007. https://doi.org/10.1201/9781420011418

[ref18] 18. Betts JT. Practical Methods for Optimal Control and Estimation Using Nonlinear Programming. 2nd ed. Society for Industrial and Applied Mathematics; 2010. https://doi.org/10.1137/1.9780898718577

[ref19] 19. Wächter A, Biegler LT. On the implementation of an interior-point filter line-search algorithm for large-scale nonlinear programming. Math Program. 2005;106(1):25–57.
View Article
Google Scholar

[54] View Article

[55] Google Scholar

[ref20] 20. Nocedal J, Wright SJ. Numerical optimization. 2nd ed. Springer series in operations research and financial engineering. New York: Springer;

[ref21] 21. Michalewicz Z. Genetic algorithms data structures = evolution programs. 3rd ed. Berlin: Springer-Verlag; 1996.

[ref22] 22. Kirkpatrick S, Gelatt CD Jr, Vecchi MP. Optimization by simulated annealing. Science. 1983;220(4598):671–80. pmid:17813860
View Article
PubMed/NCBI
Google Scholar

[59] View Article

[60] PubMed/NCBI

[61] Google Scholar

[ref23] 23. Aarts E, Korst J, Michiels W. Simulated Annealing. In: Burke EK, Kendall G, editors. Simulated Annealing. Boston, MA: Springer US; 2005. p. 187–210.

[ref24] 24. Colas C, Hejblum B, Rouillon S, Thiébaut R, Oudeyer P-Y, Moulin-Frier C, et al. EpidemiOptim: A Toolbox for the Optimization of Control Policies in Epidemiological Models. jair. 2021;71:479–519.
View Article
Google Scholar

[64] View Article

[65] Google Scholar

[ref25] 25. Mnih V, Kavukcuoglu K, Silver D, Rusu AA, Veness J, Bellemare MG, et al. Human-level control through deep reinforcement learning. Nature. 2015;518(7540):529–33. pmid:25719670
View Article
PubMed/NCBI
Google Scholar

[67] View Article

[68] PubMed/NCBI

[69] Google Scholar

[ref26] 26. Deb K, Pratap A, Agarwal S, Meyarivan T. A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE Trans Evol Comput. 2002;6(2):182–97.
View Article
Google Scholar

[71] View Article

[72] Google Scholar

[ref27] 27. Yin S, Wu J, Song P. Optimal control by deep learning techniques and its applications on epidemic models. J Math Biol. 2023;86(3):36. pmid:36695914
View Article
PubMed/NCBI
Google Scholar

[74] View Article

[75] PubMed/NCBI

[76] Google Scholar

[ref28] 28. Kingma DP, Ba J. Adam: A method for stochastic optimization. arXiv preprint arXiv:14126980. 2014.

[ref29] 29. Zhang Z. Improved Adam Optimizer for Deep Neural Networks. In: 2018 IEEE/ACM 26th International Symposium on Quality of Service (IWQoS). 2018. p. 1–2.

[ref30] 30. Moritz P, Nishihara R, Jordan M. A linearly-convergent stochastic L-BFGS algorithm. In: Artificial Intelligence and Statistics. PMLR; 2016. p. 249–58.

[ref31] 31. Dunning I, Huchette J, Lubin M. JuMP: A Modeling Language for Mathematical Optimization. SIAM Rev. 2017;59(2):295–320.
View Article
Google Scholar

[81] View Article

[82] Google Scholar

[ref32] 32. Hernandez-Vargas EA, Gonzalez AH, Beck CL, Bi X, Campana FC, Giordano G. Modelling and Control of Epidemics Across Scales. In: 2022 IEEE 61st Conference on Decision and Control (CDC). 2022. p. 4963–80.

[ref33] 33. Biegler LT. An overview of simultaneous strategies for dynamic optimization. Chem Eng Process Process Intensif. 2007;46(11):1043–53.
View Article
Google Scholar

[85] View Article

[86] Google Scholar

[ref34] 34. Lubin M, Dowson O, Dias Garcia J, Huchette J, Legat B, Vielma JP. JuMP 1.0: Recent improvements to a modeling language for mathematical optimization. Math Programm Computat. 2023;15:581–9.
View Article
Google Scholar

[88] View Article

[89] Google Scholar

[ref35] 35. Fourer R, Gay DM, Kernighan BW. AMPL: A Modeling Language For Mathematical Programming. 2nd ed. Belmont (CA): Duxbury Press; 2003.

[ref36] 36. Bisschop J, Meeraus A. On the development of a general algebraic modeling system in a strategic planning environment. Applications. Springer. 2009. p. 1–29.

[ref37] 37. Gay DM. The AMPL Modeling Language: An Aid to Formulating and Solving Optimization Problems. In: Al-Baali M, Grandinetti L, Purnama A, editors. Numerical Analysis and Optimization. Cham: Springer International Publishing; 2015. p. 95–116.

[ref38] 38. Petra CG, Qiang F, Lubin M, Huchette J. On efficient Hessian computation using the edge pushing algorithm in Julia. Optim Method Softw. 2018;33(4–6):1010–29.
View Article
Google Scholar

[94] View Article

[95] Google Scholar

[ref39] 39. Shin S, Coffrin C, Sundar K, Zavala VM. Graph-Based Modeling and Decomposition of Energy Infrastructures. IFAC-PapersOnLine. 2021;54(3):693–8.
View Article
Google Scholar

[97] View Article

[98] Google Scholar

[ref40] 40. Shin S, Anitescu M, Pacaud F. Accelerating optimal power flow with GPUs: SIMD abstraction of nonlinear programs and condensed-space interior-point methods. Electr Power Syst Res. 2024;236:110651.
View Article
Google Scholar

[100] View Article

[101] Google Scholar

[ref41] 41. Vanaret C, Leyffer S. Implementing a unified solver for nonlinearly constrained optimization; 2025. Available from: https://arxiv.org/abs/2406.13454

[ref42] 42. Gravenor MB, Dawson M, Bennett E, Thorpe B, White C, Rahat A. Real-Time Epidemiological Modelling During the COVID-19 Emergency in Wales. In: Aston PJ, editor. Cham: Springer Nature Switzerland; 2025. p. 97–104.

[ref43] 43. Davies NG, Kucharski AJ, Eggo RM, Gimma A, Edmunds WJ, Centre for the Mathematical Modelling of Infectious Diseases COVID-19 Working Group. Effects of non-pharmaceutical interventions on COVID-19 cases, deaths, and demand for hospital services in the UK: a modelling study. Lancet Public Health. 2020;5(7):e375–85. pmid:32502389
View Article
PubMed/NCBI
Google Scholar

[105] View Article

[106] PubMed/NCBI

[107] Google Scholar

[ref44] 44. Keeling MJ, Hill EM, Gorsich EE, Penman B, Guyver-Fletcher G, Holmes A, et al. Predictions of COVID-19 dynamics in the UK: Short-term forecasting and analysis of potential exit strategies. PLoS Comput Biol. 2021;17(1):e1008619. pmid:33481773
View Article
PubMed/NCBI
Google Scholar

[109] View Article

[110] PubMed/NCBI

[111] Google Scholar

[ref45] 45. UK Covid-19 Inquiry. UK Covid-19 Inquiry. 2024. Accessed: 2025-07-17. https://covid19.public-inquiry.uk

[ref46] 46. Silva CJ, Torres DFM. Optimal control for a tuberculosis model with reinfection and post-exposure interventions. Math Biosci. 2013;244(2):154–64. pmid:23707607
View Article
PubMed/NCBI
Google Scholar

[114] View Article

[115] PubMed/NCBI

[116] Google Scholar

[ref47] 47. Word DP, Cummings DAT, Burke DS, Iamsirithaworn S, Laird CD. A nonlinear programming approach for estimation of transmission parameters in childhood infectious disease using a continuous time model. J R Soc Interface. 2012;9(73):1983–97. pmid:22337634
View Article
PubMed/NCBI
Google Scholar

[118] View Article

[119] PubMed/NCBI

[120] Google Scholar

[ref48] 48. Zhu J, Wang Q, Huang M. Optimizing two-dose vaccine resource allocation to combat a pandemic in the context of limited supply: The case of COVID-19. Front Public Health. 2023;11.
View Article
Google Scholar

[122] View Article

[123] Google Scholar

[ref49] 49. Gertz M, Nocedal J, Sartenar A. A starting point strategy for nonlinear interior methods. Appl Math Lett. 2004;17(8):945–52.
View Article
Google Scholar

[125] View Article

[126] Google Scholar

[ref50] 50. Butcher JC. 3. In: Runge–Kutta Methods. John Wiley & Sons, Ltd; 2016. p. 143–331. Available from: https://onlinelibrary.wiley.com/doi/abs/10.1002/9781119121534.ch3

[ref51] 51. Rawlings JB, Mayne DQ, Diehl M. Model predictive control : theory, computation, and design. 2nd ed. Santa Barbara: Nob Hill Publishing; 2024.

[ref52] 52. Köhler J, Müller MA, Allgöwer F. Analysis and design of model predictive control frameworks for dynamic operation—An overview. Annu Rev Control. 2024;57:100929.
View Article
Google Scholar

[130] View Article

[131] Google Scholar

[ref53] 53. Ebeed M, Kamel S, Jurado F. Chapter 7 - Optimal Power Flow Using Recent Optimization Techniques. In: Zobaa AF, Abdel Aleem SHE, Abdelaziz AY, editors. Classical and Recent Aspects of Power System Optimization. Academic Press; 2018. p. 157–83.

[ref54] 54. Kiumarsi B, Vamvoudakis KG, Modares H, Lewis FL. Optimal and Autonomous Control Using Reinforcement Learning: A Survey. IEEE Trans Neural Netw Learn Syst. 2018;29(6):2042–62. pmid:29771662
View Article
PubMed/NCBI
Google Scholar

[134] View Article

[135] PubMed/NCBI

[136] Google Scholar

[ref55] 55. Ohi AQ, Mridha MF, Monowar MM, Hamid MA. Exploring optimal control of epidemic spread using reinforcement learning. Sci Rep. 2020;10(1):22106. pmid:33328551
View Article
PubMed/NCBI
Google Scholar

[138] View Article

[139] PubMed/NCBI

[140] Google Scholar

[ref56] 56. Tian Y, Li X, Wang H, Yuan H, Zhang T. Optimizing spatiotemporal nonpharmaceutical interventions for influenza: An adaptive reinforcement learning approach for regional heterogeneity. Infect Dis Model. 2025;11(1):203–17. pmid:41140927
View Article
PubMed/NCBI
Google Scholar

[142] View Article

[143] PubMed/NCBI

[144] Google Scholar

[ref57] 57. Li X, Yin L, Liu K, Zhu K, Cui Y. Deep-reinforcement-learning-based optimization for intra-urban epidemic control considering spatiotemporal orderliness. Int J Geogr Inf Sci. 2024;1–26.
View Article
Google Scholar

[146] View Article

[147] Google Scholar

[ref58] 58. Chopra A, Rodríguez A, Subramanian J, Krishnamurthy B, Prakash BA, Raskar R. Differentiable agent-based epidemiological modeling for end-to-end learning. In: ICML 2022 Workshop AI for Agent-Based Modelling; 2022. Available from: https://arxiv.org/abs/2207.09714

[ref59] 59. Zarebski AE, Tellioglu N, Stockdale JE, Spencer JA, KhudaBukhsh WR, Miller JC, et al. Including frameworks of public health ethics in computational modelling of infectious disease interventions. arXiv preprint arXiv:250200071. 2025.

Exploring epidemic control policies using nonlinear programming and mathematical models

Exploring epidemic control policies using nonlinear programming and mathematical models

This is an uncorrected proof.

Figures

Abstract

Author summary

Introduction

Materials and methods

Optimisation algorithms

Model formulation

Results

Comparison of indirect and direct methodologies

Case scenario 1: Lockdown.

Case scenario 2: “Flattening the curve”.

Case scenario 3: Vaccination.

Case scenario 4: Multiple control strategies.

Comparison between optimisation algorithms.

Discussion

Conclusion

Supporting information

S1 Fig. Impact of intervention timing on fixed lockdown duration.

S2 Fig. Optimal intervention strategy across multiple parameter scenarios.

S3 Fig. Timestep sensitivity analysis.

Acknowledgments

References