Figures
Abstract
Effective reservoir flood control demands real-time decision-making that balances multiple objectives. However, traditional optimization approaches are often too computationally intensive and become intractable when considering dynamically changing preferences of operators, modelled as weights of different objectives. This study aims to develop tractable real-time flood control strategies that maintain performance while reducing computational complexity. We propose two data-driven approaches based on Model Predictive Control (MPC): (1) an explicit MPC using deep neural networks to directly determine optimal outflow schedules, and (2) a switched MPC that produces optimal weights of objectives based on hydrological conditions. Both methods leverage offline learning from an online Parameterized Dynamic MPC framework incorporating state-dependent weights. We tested these approaches on South Korea’s Daecheong multipurpose reservoir using historical flood events with various patterns. The explicit MPC demonstrated reliable performance under conditions similar to its training data. However, it showed frequent changes in outflow schedules and constraint violations for scenarios outside training data. In contrast, the switched MPC maintained robustness across all test scenarios due to a linear optimization process in a receding horizon manner, though with slightly reduced performance compared to the explicit MPC under scenarios inside the range of training data. Most significantly, both approaches reduced computation time from approximately 10 minutes to less than one second, making real-time implementation feasible. This dramatic improvement enables prompt decision-making during rapidly evolving flood events while maintaining near-optimal control performance.
Citation: Koo J-H, Moradvandi A, Abraham E, Jonoski A, Solomatine DP (2025) Flood control of reservoir systems: Learning-based explicit and switched model predictive control approaches. PLOS Water 4(5): e0000361. https://doi.org/10.1371/journal.pwat.0000361
Editor: Bimlesh Kumar, Indian Institute of Technology Guwahati, INDIA
Received: January 16, 2025; Accepted: March 21, 2025; Published: May 9, 2025
Copyright: © 2025 Koo et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: The hydrological and operational data were acquired from Korea Water Resources Public Corporation’s website (http://kwater.or.kr). The data and code of this research are available at https://doi.org/10.4121/b6dd9d97-118d-406e-867d-b821fb6d08d4 under the CC-BY-4.0 license.
Funding: The authors received no specific funding for this work.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Floods are among the most destructive natural disasters, resulting in substantial economic and environmental damage across affected communities. Therefore, flood control is one of the main purposes of most reservoirs. Reservoir flood control, which significantly impacts watershed management, has attracted research attention for decades [1, 2]. However, rapidly changing hydrological conditions, highly uncertain forecasts, and changes in decision makers’ preferences [3, 4] make reservoir flood control challenging. Many researchers have suggested optimization-based control methods because flood control is a control problem, as its name implies [5]. Various optimization-based methodologies have been proposed to address challenges of real-time optimal flood control, particularly subject to uncertainty, either in rainfall or inflow forecasts [6–8]. The complexity of reservoir flood control arises from the system’s nonlinearity, which stems from intricate relationships among different objectives that operators want to achieve, including minimizing changing outflow schedules, minimising peak outflow, minimising Reservoir Water Level (RWL), and minimising opening and closing of spillway gates [9].
The flood control problem can be formulated as a multi-objective problem involving outflow schedules, RWL, and other factors. Various approaches have been proposed to effectively solve this multi-objective optimization problem of reservoir flood control. In classical Model Predictive Control (MPC) approaches, the relative importance of objectives is assumed to be constant throughout the flood event [7, 10]. Therefore, optimal weights, as significant performance factors, should be determined properly and in advance. However, the relative importance of objectives can vary with the hydrological conditions in practice. Under this dynamic preference scenario, reflecting the operators’ preferences in the weights of the objectives [9, 11], the optimal outflow sequence can be chosen from the Pareto front [12]. This approach is flexible in the decision-making process, either by operators or through methods such as multi-criteria decision-making at each time step. The performance of this method, however, depends on the effective derivation of the Pareto fronts in real-time. Nonetheless, generating a Pareto front for the real-time application can be intractable when a nonlinear and nonconvex multi-objective problem has to be solved. Because the search spaces for exploration can grow exponentially [13, 14], getting optimality guarantees for solutions can also be difficult, and it can take longer than the available decision window during flood events.
In our previous work in [9], a Parameterized Dynamic Model Predictive Control (PD-MPC) framework to derive optimal outflow schedules in reservoir flood control problems has been investigated. The PD-MPC formulation explicitly considers the dynamic characteristic of the relative importance of objectives depending on the situation via time-varying relative weights for the different objectives. In this approach, we separated the weight optimization and control action optimization in an alternating fashion until convergence. A Genetic Algorithm (GA) was employed to optimize objective weights and system parameters at the same time using the nonlinear system model and objectives, while the linear MPC problem was solved to optimality for a given weight set. The approach utilizes a single indicator, i.e., the absolute performance evaluator, to obtain optimal weights and parameters, along with optimal control inputs, at each time step. To the best of our knowledge, it was the first attempt to decide weights at each time step depending on hydrological conditions in real-time MPC-based reservoir flood control. Nonetheless, the approach still suffers from the inefficiency of online implementation due to the employed gradient-free algorithm requiring considerable time to achieve sufficient convergence. This can be limiting for the PD-MPC approach in real-time control.
In recent years, data-driven models like Machine Learning (ML), Deep Learning (DL), and Reinforcement Learning (RL) have emerged as effective tools to build efficient meta-(surrogate) models and thus reduce online computational complexity [15–17]. These models can be trained offline to map the nonlinear relationships between inputs and desired outputs [18, 19], enabling efficient online implementation in various fields. For example, an Artificial Neural Network (ANN) was trained offline using the input and output data of a nonlinear MPC controller for a power converter and a robot, allowing the ANN to be implemented on hardware as an efficient explicit MPC that directly maps states to optimal control inputs [20, 21]. In [22, 23], hybrid RL-MPC approaches are proposed where the original MPC non-linear program is truncated to be one step ahead, with cost-to-go functions of the MPC replaced with the value function of the following state from RL. In [24], Multilayer Perceptron (MLP) policies are learned offline from MPC to automatically choose hard-to-optimize decision parameters, and so to parameterize a linear MPC to be solved online.
In real-time reservoir operations, these data-driven approaches are particularly valuable because they can address a practical operational challenge, i.e., computational time under a multi-objective optimization problem. Despite many studies in various fields, it is difficult to find a study on adopting data-driven models for an optimization problem with many objectives and dynamically changing weights of objectives. Due to the complex relationship among objectives, dynamically changing weights can make it difficult to learn the relationship between states and control inputs or optimal weights in a data-driven model. The conventional approach, which collects training data directly from simulation environments and trains an ANN or RL model, is not suitable for this case. This is because, for example, there can be many weight sets which can produce an optimal control input.
In this work, we leverage data-driven approaches to avoid difficulties in dealing with changing weights and solving a highly nonlinear problem online. From a control point of view, this article investigates explicit MPC and switched linear MPC approaches using data-driven models trained on the outputs of PD-MPC. The switched MPC framework allows the MPC system models to switch based on system states and predefined switching rules [25], requiring prior knowledge of these rules [26]. This allows simpler linear MPC problems to be solved online by selecting the appropriate model based on the initial state and the switching rules as a function of the states. In contrast, explicit MPC approaches compute control actions offline for all states by solving an optimal control problem for each state space partition [27], establishing either a mapping table or piecewise affine functions to relate states and control actions.
For our reservoir optimal flood control problem, we develop explicit MPC and switched MPC approaches from the PD-MPC results computed under a large set of inflow scenarios. A DNN model is trained offline to directly map states to control inputs as an explicit MPC controller. Additionally, a data-driven model is used to learn a mapping from system states to optimal parameters (i.e., state-dependent switching rule for optimal objective weights and other system parameters to use in a linear MPC online); this latter approach can be formulated as a switched linear MPC problem.
The novel contributions of this article are: i) we can obtain a consistent optimal weight set for a specific state by adopting the -norm, ii) we present approaches for adopting explicit and switched MPC approaches using data-driven models, and iii) we demonstrate the advantages and disadvantages of switched and explicit MPC approaches for a multi-objective optimization problem with dynamically changing weights. To the best of the authors’ knowledge, this is the first application and comparison of learning-based explicit and switched MPC for reservoir management.
The article is organized as follows. In Sect 1, the PD-MPC framework and the study area are discussed. The control objectives and system constraints used in this research are also formulated. Sect [simulation study]2 presents the methodology and experimental setup. Focusing on approaches for data acquisition using PD-MPC, we describe the proposed data-driven surrogate models for explicit MPC and switched MPC. Sect [Results and Discussion]3 presents results, discussion, assessing the efficiency and characteristics of the surrogate models based on the two considered approaches. Finally, conclusions are drawn.
1. Problem formulation
1.1. Parameterized dynamic model predictive control
Model Predictive Control (MPC) utilizes a system model to predict states of a control system and produces a control input vector which optimizes objectives while explicitly considering constraints over a prediction horizon [28, 29]. By implementing the MPC process in a receding horizon manner, the first control input in a sequence of optimal control inputs is implemented, and then the control inputs are updated based on the new system state, which is generally observed, as described in Fig 1.
Then, the whole sequence of the control inputs is regenerated at , reflecting updated states. (Modified from [30]).
The flood control problem can be formulated as a Parameterized Dynamic Model Predictive Control (PD-MPC) problem [9], where the dynamic preferences of operators are represented by changing the relative importance of objectives [9, 31]. As depicted in Fig 2, the PD-MPC problem solves a multi-objective optimal control problem at each time step, where the weights are also co-optimized to reflect these changing preferences of the operators. The relative importance of the objectives can depend on the state of the system and projected inflows. For example, although frequent changes in schedules may not be desired from one time step to another, they become far less important if the potential for flooding increases due to a high water level driven by inaction.
The diagram in Fig 2 depicts how the PD-MPC approach solves the optimal control problem at the step of a receding horizon implementation:
by alternatingly solving the two sets of problems:
respectively, where is the initial state at time k,
denotes the multi-objectives of a linear MPC and u expresses control inputs for a given weight set, z.
What we call the absolute performance evaluator, , is used to improve the optimal weight set,
by solving a nonlinear simulation-based heuristic optimization, where the corresponding control actions
are generated by the linear MPC and the evaluation is done with a nonlinear system model and objectives. Here, this evaluator assesses additional nonlinear performance criteria that represent the operators’ selection criterion but are not included in
. Using this method, we can decouple finding an optimal weight set and the control action set, which is represented in (1), thereby reducing the complexity of the co-design problem. In other words, in each time step, solving (2b) provides control actions associated with the given weight set, and by solving (2a), we are able to improve the optimal weight set, and the corresponding control action set using the evaluators expressed by (2a). The evaluator can be solved using heuristic optimization methods such as Bayesian Optimization (BO) and GA.
The key aspect of this approach is the dynamic adjustment of the relative importance (weights ) of objective functions based on hydrological conditions, allowing reservoir operators to respond more effectively to changing flood situations. For example, if the reservoir water level approaches a critical threshold, the weight for the objective to maintain RWL in the safe range increases.
In our previous research [9], the GA was applied, which evaluates and explores a set of potential solutions (the population) over each iteration until it meets convergence or time criteria. Another popular approach is BO algorithms, such as the Tree-structured Parzen Estimator (TPE) algorithm [32, 33]. This algorithm is considered efficient because the TPE algorithm explores solutions based on a probabilistic model and is easy to parallelize [34].
In this manuscript, we employ the TPE algorithm, a BO algorithm that makes use of tree-structured search spaces. At each time step, the TPE algorithm derives the optimal parameter vector by learning a surrogate probabilistic model for parameters and their associated objective values, i.e., z and E online. The TPE algorithm utilizes a Kernel Density Estimator, especially a Parzen estimator, as the surrogate model. It employs the Expected Improvement (EI) and weighted random sampling as the acquisition method to explore promising regions of parameters z and update the surrogate model. The exploration terminates when either the specified number of searches is reached, or the evaluation index falls into a predefined threshold. Thereafter, the best parameters and their associated evaluation value are returned. The corresponding control inputs, , can then be computed. Therefore, the TPE algorithm is utilized at each time step in order to determine
, minimizing the return (penalty) value from the absolute performance evaluator, E.
1.2. Reservoir flood control
A total of 13 linear and nonlinear objectives are suggested to describe the practical reservoir flood control problem, including minimizing peak spillway outflow, gate operations, and changes in outflow schedules [9]. Our approach here aims to approximate this rather complex PD-MPC problem derived from these considered objectives. To do so, the switched MPC and explicit MPC approaches are taken into account. To simplify the problem, three objectives for linear MPC and six nonlinear objectives to formulate the evaluator are then considered. This simplification is introduced because it may be difficult to show the performance of our approaches due to the highly complicated relationships among the objectives, when considering all 13 of them. Moreover, it would be reasonable to focus on objectives only directly relating to flood control in practice, as other objectives, such as minimizing environmental effects and securing water for demands, naturally receive lower priorities during flood events.
The three objectives for linear MPC are related to RWL, peak outflow, and outflow schedule changes. Reservoir operators should maintain RWL within a target range not only to ensure the dam’s safety but also to secure enough water to satisfy demands. Therefore, three target water levels are defined: (i) The lower target level, to secure water, so it constrains RWL from decreasing a certain level; (ii) The upper level, to secure the flood control capacity between this level and the FWL; and (iii) the highest level for dam safety, considering uncertainty. Furthermore, to reduce downstream flood risk, minimising peak outflow is also an essential factor, as the high outflows directly affect downstream water levels. Meanwhile, because each outflow schedule is shared with other flood control organizations, the previously shared outflow schedule should be maintained consistently with the minimum number of changes possible. Moreover, limiting the frequency of operations of spillway gates is advantageous to prevent wear and malfunction. This can be achieved implicitly by minimizing changes in consecutive outflow schedules. Additionally, a soft constraint is added to constrain opening spillway gates before the total outflow exceeds the turbine capacity.
Therefore, the overall objective function can be written as follows:
in which
in which z1, z2, z3, and z4 are the objective weights. To scale the different objective values, constants, i.e., here, based on the Maximum Allowed Value Estimate (MAVE) values of the associated objectives are applied [35, 36]. The MAVE values are the spillway outflow capacity Cspill for J1, J2, and J4, the storage amount between FWL and LWL for J2, respectively. The optimization of these objectives is subject to the constraints:
in which k represents each time step for the defined receding horizon MPC, where t at time step k ranges from k to k + N−1. Therefore, the optimal control input vector obtained from the proposed linear MPC over the time span can be written as follows:
where denotes the turbine outflow capacity, and
,
, and
express the total outflow, the spillway gate outflow, and the turbine outflow at time t and time step k, respectively. We refer to
as an outflow schedule at time step k. The control horizon is N. For simplicity, we omit a superscript k, e.g.,
at time step k denotes J1, except when it is essential, e.g., in ([eqn5c]5c) and ([eqn5g]5g) because these equations compare current outflows with outflows at previous time steps.
,
,
,
,
, and
are considered as slack variables.
and
express the amounts of increase and decrease between consecutive outflow schedules at time t, respectively.
,
, and
are also introduced to penalize the violations of target reservoir storage, i.e., the highest target water level, SH, the upper target water level, SU, and the lower target water level, SL.
To address the complementarity between outflows via turbines and spillway gates, is also taken into account. In general, this constraint is formulated by a nonlinear complementarity operator that can be written as [37]:
This nonlinear constraint can be linearised by adding a slack variable in ([eqn5b]5b) and using the penalty function approach to integrate constraint violations in the objective J4. As an example, if
, then Ospill,t = 0 and
is minimized by the difference between the total outflow and the turbines’ capacity outflow. Conversely, if
, then
and
becomes zero. Since this is a soft constraint and this value is independent of other objectives, i.e., there is no trade-off with other objectives; we therefore deem J4 as not dynamically responding to the relative preferences among objectives, and so z4 is set to a constant.
To assign a higher penalty for changes in the near future, a time-dependent weight function, , is defined as follows:
Even though there are nonlinear elements in the reservoir system, such as reservoir storage volume-water level relationship and water level-spillway outflow capacity relationship, we consider only the linear parts of the reservoir system by using reservoir volume directly as state instead of RWL and by assuming that errors from nonlinear factors can be included as additive errors within inflow uncertainty. These can be written as follows:
where St, It, and Ot denote the reservoir storage, inflow, and outflow at time t, respectively. expresses the outflow capacity via spillway gates. The constraints are set for the spillway and turbine discharge capacity, the LWL, and the FWL. The minimum water supply should be more than the demands, Omin, as expressed in ([eqn9d]9d). In fact, the total demand varies monthly; however, in this study, a constant value of
is used, which is the annual average. Nonlinear elements, such as the relationship between RWL and storage amount, are used for the absolute performance evaluator.
1.3. Online selection of the best weight set
Since the weight for the soft constraint does not need to change dynamically because its value is independent of other objectives and constraints, as mentioned earlier, we set z4 to a constant with a reasonably small value for simplicity. Therefore, the optimal set to be determined consists of three weights, as follows:
Using TPE, we can explore search spaces, such as those in Table 1, at each time step of the receding horizon control implementation. The search space is quantized in the normalised range [0,1] with steps of 0.01 interval to sufficiently explore all the objectives’ space. Reasonably, is excluded from the search space. As mentioned, the evaluator plays an important role in providing the best weight set at each time step according to (2a) and Fig 2. We formulate the absolute performance evaluator consisting of six nonlinear objective functions as follows:
where,
where [.] + is the ramp function, i.e., , and p, q, v, and w are constants to be used to normalize each objective value between zero and 10 over typical situations such as when RWL is lower than SH and Ospill is within the peak inflow of the target flood, with values of 2.4, 10, 5, and 75, respectively. l should be very large, chosen here as 5000, to avoid opening the gates when the outflow capacity of turbines is not fully utilized.
and
are used for the values from conditional expressions.
Among the objectives in (12), E1 to E3 have the same meaning as J1 to J3, and E6 as J4 in (4), but they are more freely configured using exponential functions and conditional equations. Moreover, E4 and E5 are considered in the evaluator to account for preferences for i) minimising outflow schedule changes inside a prediction horizon and ii) minimising the number of gate closures in a prediction horizon after opening action. These objectives may not be as critical as E1, E2, and E3 but are related to operators’ preferences [9].
It is worth noting that the objective values for J1 and J3 in the linear MPC can be normalized to be of the same scale by multiplying N for J1. Additionally, changes are smaller than peaks, i.e., . Furthermore, the effect of long-term changes is diminished by
in (8). Therefore, the MAVE values can not scale the objective values of the linear MPC exactly. In contrast, the evaluator’s objective E3 in ([eqn12d]12d) holds equal importance to the other objectives within the evaluator, as various exponential and conditional equations are used to scale objective values, ensuring they reach the same values in worst-case scenarios. Moreover, E3 has the maximum value when there is at least one change in outflow schedules.
1.4. Daecheong reservoir: Description of a case study
Daecheong Reservoir is a multipurpose reservoir located in the upper reaches of the Geum River, flowing through the central region of South Korea, as depicted in Fig 3.
The left figure shows the location of the Geum river basin, and the right figure illustrates the location of the Daecheong Reservoir. The base maps are the digital elevation model (DEM) and the river shape file of Korea from the National Geographic Information Institute (https://www.ngii.go.kr) and the basin shape file from VWorld (https://www.vworld.kr) published by the Han River Flood Control Office (https://www.hrfco.go.kr).
The Daecheong reservoir serves as a vital water source, delivering of drinking water daily to approximately 2 million residents in surrounding areas, while also supporting irrigation, hydroelectric power generation, and flood control. Geographically, it features a complex elongated structure with a dendritic drainage pattern, reaching a maximum width of around
. The watershed flowing into Daecheong Dam covers
, and at full capacity, the reservoir holds
of water, spans a surface area of
, and reaches a maximum depth of 55 m [38]. Annual precipitation and water flow predominantly occur during summer months (June-September), with water levels fluctuating significantlyâ€"up to 15 m—during monsoon seasons. The hydraulic residence time averages about 145 days, characteristic of typical river-type reservoirs [39]. Annual average rainfall is 1230 mm and average inflow is
. The inflow for a five-year frequent flood is
, which is 50 times the annual average inflow.
Although it impacts the flood control in the Geum River basin, Daecheong Reservoir has a relatively limited flood control capacity in comparison to its extensive watershed area. For instance, the 200-year flood inflow is almost which can fill the flood control capacity in almost 6.5 hours. The characteristics of Daecheong Reservoir are provided in Table 2.
Based on the characteristics of the Daecheong reservoir, the threshold SH is set to Elevation Level (EL) 78.5 m, which is the 99.5th percentile of observed RWL during the experiment flood events, and SU and SL are set to EL 76.5 m (the NHWL), and EL 76.0 m in (5), respectively. The control horizon, N in (4), is set to 6 h inspired by 6 h rainfall forecast reported by the meteorological agency of South Korea [9].
Hourly inflow data were collected for 24 years from 1997 to 2020, including 28 observed flood events from the National Water Resources Management Information System (WAMIS) operated by the Ministry of Environment, Korea, and the Korea Water Resources Public Corporation (K-water). These inflow data were calculated data based on measured outflows and changes in measured RWL over time. K-water records RWLs and outflows on an hourly basis, allowing inflows to be calculated using the water balance ([eqn9a]Eq 9a) presented in Sect 1.2. The dataset is noise-corrupted due to measurement errors. By applying a wavelet filter [40], noises can be detected and removed from the dataset [41, 42]. To account for uncertainty in inflow prediction, random normal values, in which the deviation is increased over the prediction time, are added to the filtered inflow data. A moving average filter is then used to prevent significant fluctuations. Details of this process can be found in [9], and a summary of it is given in Appendix A.
2. Simulation study
2.1. Data-driven models for explicit and switched MPC
Taking dynamical parameters into account based on real-time hydrological conditions used in PD-MPC is computationally expensive. For instance, with three dynamical weights and a search space of 100 quantized values per weight, each iteration of PD-MPC requires an average of 8.48 minutes to complete(CPU: AMD EPYCâ"¢ 9654). As the number of dynamical weights increases and the search space grows exponentially, the computational burden becomes too large. Therefore, two well-known methodologies, namely the explicit MPC and the switched MPC approach, are utilized to address this computational challenge of PD-MPC.
In conventional explicit linear MPC, it is assumed that for each predefined partition in the entire state space, there exists an affine function that maps the states to the optimal control inputs [27]. The state space may consist of several polyhedral partitions. Therefore, because of the high dimensionality of either state or control spaces, implementation of explicit MPC may be challenging [27, 43]. In conventional switched MPC approaches, the dynamic system states are first partitioned into predefined parts. Then, based on the given system states, control inputs are calculated by MPC with the corresponding dynamical system [26]. Ensuring the continuity of control inputs in line with the well-defined optimized system partitions and switching rules in this approach is challenging [44].
For flood control, i.e., short-term operation, two specific challenges should be addressed: (i) unavailability of a comprehensive dataset including as many potential operational scenarios like flood events with several high peaks as possible; (ii) the smoothly changing states and the implicit relationship between the states and weights, which make it difficult to formulate switching rules for weights explicitly. Therefore, the implementation of explicit MPC and switched MPC is not straightforward.
In this work, we take a pragmatic approach, using a DNN model as a surrogate model for explicit MPC, trained based on PD-MPC results. In other words, the relationship between the states and the optimal control inputs is modeled using DNN based on the dataset obtained by PD-MPC, as shown in Fig 4. Similarly, for the switched MPC approach, the relationship between states and optimal weights from the PD-MPC results is modeled utilizing a Random Forest classifier and regressor as surrogate models for switching between different linear MPC controllers (i.e., switching between different weights). The detailed process illustrated in Fig 4 will be explained in the following section.
(a) Model development (b) Implementation.
2.2. Data generation for learning surrogate models
2.2.1. Data consistency.
At each time step of a receding horizon implementation, PD-MPC can propose a unique optimal outflow schedule . However, there may be many combinations of the optimal parameter set, i.e.
, that could result in the same schedule. For instance, when a flood is going to occur, the RWL is between SL and SU, and flood control can be achieved by regulating turbine outflow itself, while some objectives are inactive (are zero) in this scenario, resulting in the same optimal outflow schedule with different weight sets corresponding to the zero objectives (i.e., inactive constraint violations). In such cases, PD-MPC would randomly select a set among several available candidates at that time step, as there would be no one-to-one mapping between
and
.
This can be a problem for a surrogate to learn a mapping between states and optimal weights. Therefore, we would want to somehow project the weights corresponding to the objectives relating to inactive constraints to a lower dimension. An regularization could, for example, be used to enforce these non-unique weights to have the same value (in this case, the mean of the weights for the objectives that are non-zero). An
regularisation is typically employed to prevent overfitting in DNN models by adding the
-norm of model parameters to the original loss function. However, we incorporate the
-norm for weights into the objective function of the TPE algorithm to construct preferences for the same weights. The objective function expressed by (11) is then rewritten as follows:
where denotes the weight for the preference for the same weights and small parameter value, and
expresses the
-norm. To normalize a weight set, the average of weights
is subtracted.
It can be implied from (13) that, when there are multiple choices for weights, a set whose elements are as equal as each other should be chosen. Therefore, PD-MPC suggests where its elements are the same, i.e.,
, when
is not sensitive to z and it returns the unique optimal
when it is highly sensitive. It should be noted that if
, whatever each value is, then it is trivial that the optimal control inputs are always equal to the control inputs when
. Thus, we can set
to
when
without trying to find the exact value of each weight. Basically, in the case of large
, PD-MPC proposes a suboptimal z that tends to be closer to the equal weight preference than
. On the other hand, in the case of small
, it always finds an optimal solution for
; however, it may not distinguish situations where
is insensitive to z, thereby failing to ensure data consistency. To select an appropriate
, a sensitivity analysis is conducted using five training flood events, including some extreme cases. A total of 10 candidates are considered for
, ranging from
to
with an additional case where
(meaning
). Moreover, for each event, five uncertain inflow forecasts are applied. To consider a sufficiently diverse set of scenarios, we examine three initial water levels, i.e., EL 76.0 m, EL 76.5 m, and EL 77.0 m, covering a total of 750 cases (
).
The sensitivity analysis results are depicted in Fig 5. Each point represents the average or the maximum penalty of the five uncertain inflow scenarios. Points not marked in the figure, e.g., those for Event No.10 in Fig 5a with ,
, and
, indicate the cases where online PD-MPC iterations are infeasible, meaning that constraint violations occurred at least once. In this figure, as
decreases, the performance of PD-MPC is improved, i.e., penalty values decrease, and no further improvement is observed for smaller
than
.
(a) average values of the sum of penalty values for five different uncertain forecasts. (b) average values of the maximum penalty value of each uncertain forecast. (c) maximum values of the maximum penalty value of each uncertain forecast.
In Fig 6, ZI is an index for weights, where ZI = 1 corresponds to and ZI = 2 is for all other cases. when ZI = 1, it also means that the same
is computed for various zs, indicating that
is insensitive to z. As shown in the figure, in most cases, such as the early stage of a flood event,
is insensitive to z, resulting in several instances with ZI = 1. Therefore, ZI = 1 and ZI = 2 are considered differently in this analysis. In this figure, as
decreases, a more diverse set of weights, i.e., ZI = 2 cases, appears. This indicates that even when the control inputs are less sensitive to weights, more weight sets closer to
can be obtained. Conversely, for larger values of
, ZI = 2 cases are only suggested when control inputs are highly sensitive to weights. When
, most cases are ZI = 2, but it causes the inability to find the consistent weight set when the optimal control inputs are insensitive to weights, as discussed above. For instance, at time steps where multiple zs lead to the same optimal control inputs and none of these zs are in ZI = 1, with
, the optimal weight set is randomly selected among ZI = 2 cases. However, for
greater than zero, a case with less difference between each weight is preferably chosen. Therefore, based on the sensitivity analysis and the discussion above,
can be set to 1e-4.
2.2.2. Simulating multiple events for generating control learning data.
Among the total of 28 flood events, 22, 3, and 3 events are assigned for model training, validation, and testing, respectively. These validation and testing events may not represent all practical cases, but we carefully choose them to cover various inflow patterns, such as multiple peak inflows, significant peak magnitudes, and so on. For this case study, we do not anticipate very different conditions compared to the range of extreme and average scenarios considered. For other applications where different conditions are anticipated, data-driven models can be trained using outputs of simulation models, which can, for example, produce future inflows based on changes in land use and land cover patterns, and climate change scenarios.
Data are collected through PD-MPC simulations [9] under the following conditions. The initial water levels were set from NHWL-0.5 m to +0.5 m at intervals of 0.1 m, using five uncertain inflow predictions. Setting much higher or lower initial water levels than our initial levels allows PD-MPC to stabilize RWL quickly by increasing or decreasing outflows abruptly. Therefore, a narrow range of water levels around the NHWL is set as initial RWLs. Consequently, the training data consists of 1210 cases (21 events 11 initial RWLs
five uncertain inflow cases), and each validation and testing data have 165 cases (three events
11 initial RWLs
five uncertain inflow cases) as presented in Table 3.
It is worth noting that historically observed initial RWLs are lower than our simulation, e.g., Event No.3, which occurred near the end of the flood season, started with an RWL at elevation 72.2 m. This is because operators want to secure enough flood control capacity in case of unexpected huge inflows, even though lower RWLs increase risks to water supply. Thus, lower initial RWLs in historical data often result from the absence of an optimal flood control system, which is what this study aims to address. Therefore, we set the initial RWL in this research to the NHWL, representing the likely initial condition when operators have access to optimal reservoir flood control inputs at each time step.
2.3. Optimal outflow schedule by DNN controller
The efficiency of DNNs can be affected by hyperparameter settings, such as the number of nodes and layers, the learning rate, the activation function, and the batch size. To find the best hyperparameters, a grid search based on mean squared errors is carried out [45]. However, considering the extensive use of ramp functions in the evaluator, the Rectified Linear Unit (ReLU) is selected as the activation function. In addition, the number of layers is also fixed to six based on a six-hour prediction horizon. We utilize the mean square error (MSE) loss function [46–48] with the Adam optimizer [48–51], which is widely used in developing surrogate inflow prediction models. The MSE loss function is particularly suitable for flood inflow prediction due to the huge variance between peak and low inflows. Additionally, it helps in accurately estimating the peak values, which are critical when targeting flood prediction.
To reduce computational burden and searching time and to prevent overfitting during the process of finding optimal hyperparameters, 330 cases for six events among the training events are utilized along with the early stopping technique [52], and the five-fold cross-validation approach. Through the grid search method, we found that batch size had the greatest impact on model performance, followed by learning rate and the number of nodes, respectively. The best hyperparameters are presented in Table 4.
As the results shown in Fig 7 and summarized in Table 5 indicate, the DNN model, mapping the states to the control inputs, is able to mimic accurately the PD-MPC control. Note that the DNN model is trained only for total outflows, i.e., Ototal, for simplicity because spillway outflows can be obtained by abstracting the turbine capacity from total outflows when total outflows are larger than the turbine capacity. The Nash-Sutcliffe model efficiency coefficient, which is commonly used to evaluate the prediction accuracy of hydrological models [53], is consistently more than 0.95, which demonstrates that the model is reliable enough to replace PD-MPC [54]. The Root Mean Squared Error (RMSE) also varies between and
. Considering that the initial inflows for the three flood events in the validation set are
,
, and
, and their respective peak inflows are
,
, and
, we can say that the model is accurate. It should be noted that because
is fixed to the value determined in the previous time step, i.e., k–1, the DNN model output has N–1 values when the prediction horizon is N hours. As expected, predictions for shorter time horizons are more accurate, while performance gradually deteriorates as the prediction time increases.
Performance diminishes with increases over time.
2.4. Training machine learning models for mapping states to preference (weights of objectives)
To simulate the changes in weights based on the states of the system, machine learning models are used to map them together. Given the predominance of ZI = 1 cases (shown in Fig 6) and the challenges of identifying a linear relationship between the states and the weights (shown in Fig 8), constructing a linear relationship to directly assign the best weights based on the states seems to be infeasible. Therefore, a binary classification model is employed to distinguish between the cases of ZI = 1 and ZI = 2. Additionally, a regression model is utilized to estimate the weights for ZI = 2.
Among the available criteria for selecting a binary classification algorithm, accuracy, precision score, and recall score are taken into account in this article. Accuracy represents the ratio of correctly classified instances to the total number of predictions. Precision score refers to the proportion of correctly classified instances for a specific class divided by the total instances classified as a given class by the model. Recall score measures the ratio of instances correctly classified as a particular class to the total instances of that class. When a classifier classifies a case with ZI = 1, the weight set is determined; however, when ZI = 2, the optimal weight set should be estimated using the regressor. In other words, a case needs to be classified as ZI = 2 when there is uncertainty and ZI = 1 when no uncertainty exists. Therefore, ZI = 1 and ZI = 2 are selected with a high precision score and a recall score, respectively.
Furthermore, to address the skewness of data in ZI, we adopt both under-sampling and over-sampling algorithms. The Edited Nearest Neighbours (ENN) is an under-sampling method that removes the nearest neighbors from the majority class data [55]. The Synthetic Minority Over-sampling Technique (SMOTE) [56] is an over-sampling technique where synthetic data are generated by randomly selecting one data point from the minority class and its nearest neighbor in the minority class. The synthetic data are classified by a classification model. Through the application of these techniques, the number of cases for ZI = 1 and ZI = 2 becomes balanced. The number of cases changes to 158134 for ZI = 1 from 187587 in the original data and 150783 from 89833 for ZI = 2.
Classifiers are constructed based on k–nearest Neighbors, Random Forest, and Support Vector algorithms using balanced data. As summarized in Table 6, the validation results indicate that the Random forest model is the best regarding precision and recall scores. For the implementation, a python library imbalanced learning [57] is used for the SMOTEENN algorithm, and the scikit-learn package [58] with default parameters for the classifiers.
To map the nonlinear relationship between the states and the weights when ZI = 2, a Random forest regressor is employed, which uses an ensemble approach with decision trees, which can effectively map the nonlinear relationship between inputs and outputs [18, 59]. This model uses the average value at the final (decision-making) node for regression.
The parameters, i.e., the number of trees and the maximum depth of trees, are selected by grid search based on RMSE values for the validation events, as shown in Fig 9. The parameters are selected at the point where the RMSE curve starts to level off in order to simplify the model. The selected number of trees is then 150, and the maximum depth of each tree is 30. However, as can be seen in Fig 9, no significant difference is observed with different parameter values.
In addition, the Piecewise Affine (PWA) modelling approach, which is known for effectively approximating switched discrete-time nonlinear systems [60, 61], is also employed. They are widely used to strike a balance between simplicity and accuracy [18]. This PWA approximation does not show considerable improvement in terms of the Peak RWL and the penalty values of the evaluator compared to the Random Forest regressor. The Random Forest regressor demonstrates even better performance in terms of the peak outflow and peak evaluator value. Linear Random Forest regressor [62], which uses the Random Forest algorithm, but the final value is calculated by linear regression, showing similar results with the Random Forest regressor. The details on the PWA model and Linear Random Forest regressor can be found in Appendix B.
3. Results and discussion
In this section, the efficiency of data-driven models based on the two aforementioned MPC approaches on the test dataset for flood events is presented and discussed. In this regard, the flood events selected for testing, as shown in Table 3, include cases with two and three peaks of inflow and a large peak. Therefore, except for extreme cases like the Probable Maximum Flood for designing reservoirs, these events could represent situations similar to those encountered by operators during flood control operations.
3.1. Analyses of test events
The average of result indexes for each test event, i.e., the peak outflows, peak RWLs, minimum RWLs, the total evaluator values, and peak evaluator values, are presented in Table 7, Table 8, and Fig 10. ‘Fixed 1’, ‘Fixed 2’, and ‘Fixed 3’ refer to predefined weights, which are ,
, and
, respectively. ‘Fixed 2’ and ‘Fixed 3’ represent the cases with a high weight for J2 in (4), which emphasizes maintaining the RWL in the target water level. J2 with a high weight forces the linear MPC controller to maintain the reservoir water level at the minimum possible level, ensuring operational stability even over extreme conditions. In addition, ‘Fixed 3’ highlights the importance of J3, minimizing changes between outflow schedules.
‘E-MPC’ refers to the explicit MPC implemented using a DNN model, which is directly mapping the states and the outflow schedules. On the other hand, the switched MPC approach, which employs a random forest classifier and regressor as surrogate models for the weight vector, is referred to as ‘S-MPC’.
For cases with predefined weights, the system violates at least one constraint more than once, and the peak RWL reaches the FWL sometimes. For example, although Fixed 3 has a high weight to the water level, in which the weight for J2 is ten times higher than J1 and two times higher than J3, it shows the highest peak RWL in Event No.1 and 25. This result implies that fixing the weights may not respond adequately, i.e., being sensitive enough, to changing hydrological situations, thereby failing to satisfy the constraints and produce reliable results from a long-term perspective. In contrast, E-MPC, which incorporates the DNN model, and S-MPC, which applies the Random Forest models, show stable operation for the test events. In this scenario, the maximum outflow capacity, minimum water supply, and RWL constraints are all satisfied. Even though the best performance belongs to PD-MPC, they also outperform the fixed weight cases with respect to the absolute performance evaluator value.
The Absolute Performance Evaluator quantifies what reservoir operators want to achieve during flood events. While conventional MPC approaches are limited to linear or quadratic objective formulations, making it challenging to express complex operational goals mathematically, the PD-MPC framework overcomes this fundamental limitation by enabling the use of diverse nonlinear equations, including exponential functions and conditional formulations. Therefore, this evaluator can serve as a definitive performance index, providing a single quantitative measure. As presented in Table 7, S-MPC shows a lower average penalty in terms of the evaluator compared to E-MPC. Fixed weight cases generally present higher penalties than both S-MPC and E-MPC, with one exception: in Event No.1, E-MPC exhibits the highest evaluator value. However, regarding the highest penalty during an event period, ‘Peak E’, E-MPC has a lower average penalty than S-MPC. The results indicate that the relative performance of S-MPC and E-MPC is contingent upon specific event characteristics. Note that ‘Total E’ in Tables 7 and 8 represents the sum of penalty values accumulated during an event period. Here, E is interpreted as a penalty since the evaluator’s objectives are formulated as minimization problems.
Considering the peak RWL and peak outflow, E-MPC mimics PD-MPC results accurately. It can be concluded that E-MPC is able to respond adequately to changes in the states. However, E-MPC tends to change the outflow schedule frequently, leading to very high values for E3 in (11). This is because even small changes in the outflow schedule yield an increase in E3 value, and the DNN regressor is less sensitive to minor deviations due to the effect of a loss function, which is the mean squared error.
On the other hand, S-MPC derives state-dependent weights, and the linear MPC, incorporating these weights, generates the optimal outflow schedule in real-time. As a result, in terms of the absolute performance evaluator, switched MPC shows lower penalty values and fewer changes in outflow schedules - an area emphasized by the evaluator, as mentioned in Sect 1.3 - compared to E-MPC. Since the PD-MPC framework is designed to select optimal control inputs based on the evaluator, not the linear MPCâ€(tm)s objectives, we can conclude that switched MPC better replicates the original multi-objective optimization results compared to E-MPC.
However, this improvement is obtained by compromising the peak outflow and peak RWL, since the number of changes, peak outflow, and peak RWL are correlated. This can be explained by errors in these two surrogate models, indicating that the models are not sufficiently sensitive to a given state.
Some errors in the classification of ZI and the regression of weights can be expected for the surrogate models. Firstly, if the classifier fails to identify situations where ZI = 2, i.e., when weights need to change, the model maintains RWLs between the target RWLs and outflows at the minimum amount. It yields subsequent higher RWLs and larger outflows. Furthermore, if the regressor fails to provide the optimal weights, linear MPC may struggle to accurately determine the appropriate adjustments to the outflow.
Moreover, when RWLs exceed EL 78.89 m, which is the maximum RWL in training data, switched MPC surrogate models need to provide optimal weights for states that fall outside the range of the training data. It can result in errors.
Despite errors in the surrogate models (RMSEs are approximately 0.45 in Fig 9) in optimal decision timings and outflows, the switched MPC approach enables linear MPC to produce robust results concerning constraints. Even if the surrogate models cannot present the optimal weights at a certain time step and the system state worsens in terms of the evaluator values, they can promptly respond to the slightly deteriorated new state due to the receding horizon control. Thus, linear MPC makes appropriate decisions with the weights provided by the surrogate model.
3.2. Comparison of explicit MPC and switched MPC
For a deeper understanding of the characteristics of surrogate models within the frameworks of explicit MPC and switched MPC, we define scenarios that these models have not previously been trained on. In general, data-driven models often fail to produce reliable outputs when the inputs are beyond the range of the training data [63, 64]. A scenario with an initial RWL of EL 75.0 m is defined, while, in the training data, the lowest water level is EL 75.95 m. This scenario is simulated for Event No.1 with one peak and a significant peak inflow.
The simulation results of the data-driven models for the aforementioned scenario are shown in Fig 11 and summarized in Table 9. As expected, PD-MPC outperforms other methods in this case. In addition, S-MPC does not violate any constraints. However, E-MPC does not satisfy constraints compared to the other three cases, particularly when the RWL is low; E-MPC suggests negative outflow, e.g., , to increase the reservoir level to the target range (from EL 76.0 m to EL 76.5 m), even lower than the minimum outflow for water supply, i.e. Omin. This is because S-MPC explicitly incorporates constraints through linear MPC, whereas E-MPC only maps the states and the outflow schedules.
3.3. Discussions
Considering the limitation of data-driven models to model complex relationships between states and outflow schedules, or states and weights, which are embedded in the training data, the PD-MPC framework outperforms in all the cases. However, the explicit MPC and the switched MPC outperform in the fixed weight cases, demonstrating applicable results without violating constraints for the test events.
The DNN model based on the explicit MPC approach results in small and frequent changes in the outflow schedule, as shown in Table 8 and Fig 11b. This causes physically impossible outflows and constraint violations for the unseen cases. Despite these shortcomings, they could potentially be addressed by introducing heuristic rules based on an operator’s experience. For instance, operators can design a simple heuristic algorithm in which a proposed outflow is replaced with the minimum outflow when the proposed outflow is less than the constrained minimum outflow. Negligible fluctuations between outflow schedules can also be ignored using these kinds of heuristic rules. Furthermore, as the number of training flood events increases and various uncertain inflow scenarios and initial RWLs are integrated, a surrogate DNN model can cover all potential states to avoid infeasible control inputs.
The model using the switched MPC approach also demonstrates reliable results for flood control. However, the complex relationship between the states and optimal weights poses a challenge due to the selection of a single weight set from multiple sets that produce the same optimal control inputs. As a result, using a conventional data-driven model often leads to significant testing errors. For instance, although not presented here, a DNN model does not outperform our proposed classifier and regressor. These errors cause an overall performance degradation, particularly delaying the timing of changes in the outflow schedule. Moreover, this degradation can cause a switched MPC surrogate model to encounter states far outside the training data. Consequently, the peak RWL is higher, and the peak outflow is larger than the PD-MPC results. The errors can also be mitigated by adopting knowledge-based heuristic rules. As an example, we can design different default weight sets for different states outside training data based on the operator’s experience. The most crucial point to be considered is that online implementation of linear MPC can guarantee that the optimal outflow is stable, not violating constraints, even for unseen cases.
In summary, although PD-MPC outperforms other methods, it requires solving computationally complex nonlinear optimal control problems online for reservoir flood control, which makes it difficult to implement in practice. The switched MPC approach replaces the gradient-free randomized search algorithm, which was time-consuming for solving PD-MPC online. It thus can significantly reduce complexity. In addition, since linear MPC optimization is conducted every time step using updated information, results are robust to errors and avoid constraint violations. The explicit MPC approach has the lowest complexity for implementing online as it directly maps states to control, but a DNN model can present infeasible control inputs and needs more training data to cover every possible situation. The complexity and accuracy of the discussed methods are schematically described in Fig 12.
Even though we can convert an expensive online nonlinear optimization problem into expensive offline training and cheap online implementation, it will be better to make the online nonlinear optimization, i.e., the GA or TPE algorithm itself, cheap. This means the online implementation of the PD-MPC framework becomes more practical. The warm-starting strategy [33, 65] can be a good method to start. Adaptive learning techniques present promising alternatives to traditional data-driven models. For example, reinforcement learning [66] and learning-based MPC approaches [67, 68] have gained significant attention in recent research. These methods can effectively reduce the errors inherent in surrogate models, ultimately near optimality with lower computational complexity.
Another important consideration is how to scale this approach to a multi-reservoir system. In standard MPC approaches, some challenges in scaling the receding horizon MPC approach to complicated multi-reservoir systems have been reported [7, 69]. In our framework, expanding the surrogate switched and explicit MPC approaches to a multi-reservoir system is technically straightforward, given that obtaining the optimal outflow schedules and/or the optimal weights is feasible. In such extensions, the number of system states will grow linearly with the number of reservoirs in the system. Since the switched approach uses linear MPC, the computational complexity of the real-time control will remain manageable. For the explicit MPC, it would be even more computationally efficient. However, the most complex task in such an approach is designing a sufficient number of scenarios for the PD-MPC to generate a sufficient range of control data sets for learning the explicit and switched models.
The uncertainty of inflow forecasts also needs to be considered. Even though we consider the uncertainty of inflows by adding noise to observed inflow data when generating the training data, we deal with it in a deterministic way. However, it is advisable to explore stochastic/robust MPC [70] to consider uncertainty explicitly. In addition, we acknowledge that we cannot present the comprehensive impacts of the inherent inaccuracy of data-driven models in reservoir flood control, despite our detailed explanations of how S-MPC and E-MPC respond to the errors of surrogate models within the studied flood scenarios. Many methodologies for uncertainty quantification of data-driven models, e.g., sensitivity analysis [71], Bayesian-based approaches, bootstrap-based approaches, and the Monte-Carlo method, have been studied [72]. However, incorporating these methodologies for comprehensive uncertainty analysis is beyond the scope of this research.
4. Conclusions
In this article, we propose data-driven surrogate models in the form of explicit and switched Model Predictive Control (MPC) approaches for a flood control problem with dynamically changing weights of multi-objectives, which is the first study based on the author’s best knowledge. These approaches have the advantage of low computational complexity for online implementation compared to a Parameterized Dynamic Model Predictive Control (PD-MPC) framework proposed in our previous work [9]. The PD-MPC formulation explicitly considers the dynamic characteristic of the relative importance of objectives depending on the situation via time-varying relative weights for the different objectives, which are optimized online at each time step of the MPC by solving a highly nonlinear optimization problem. Through numerical experiments on the Daecheong multipurpose reservoir in South Korea, we demonstrate that both approaches outperform linear MPC approaches with predefined fixed weights and produce compatible results with PD-MPC. Especially, the switched MPC approach is robust in terms of model errors despite some constraint violations of the explicit MPC DNN model under previously unseen events. However, because the highly complex relationship between states and weights leads to the performance degradation of data-driven models for the switched MPC, testing these strategies for other case studies where there may be a larger size of training data may be useful.
Overall, we expect this study to contribute to widening the possibility of the application of real-time optimal reservoir flood control and lay the foundation for practical implementation in the field. The switched MPC approach, for example, offers a computationally efficient method for real-time reservoir operation to effectively balance multiple objectives, such as minimizing flood risk downstream, maintaining reservoir levels, and minimizing outflow schedule changes, even under uncertain inflow conditions we simulated. Additionally, the insights into the dynamic weighting of objectives based on real-time hydrological conditions and states can aid in developing more adaptive and robust flood control strategies.
Even though our research focuses on a single reservoir in South Korea, we believe our approach could be applicable to reservoirs with various hydrological conditions. The methodology presented in this study may be adaptable to different reservoir systems with their own unique characteristics and operational requirements, as the fundamental concepts of model predictive control and surrogate modelling potentially extend to diverse hydrological settings. The framework potentially offers sufficient flexibility to accommodate region-specific constraints and objectives, suggesting its potential utility for addressing water resource management challenges in different geographical contexts.
References
- 1. Giuliani M, Lamontagne JR, Reed PM, Castelletti A. A State‐of‐the‐Art Review of Optimal Reservoir Control for Managing Conflicting Demands in a Changing World. Water Resources Research. 2021;57(12).
- 2. Labadie JW. Optimal Operation of Multireservoir Systems: State-of-the-Art Review. J Water Resour Plann Manage. 2004;130(2):93–111.
- 3. Lee S, Kang T, Lee KS. An operational model of a reservoir system simulation for real‐time flood control in the Han River Basin. J Flood Risk Management. 2015;10(4):499–510.
- 4. Cheng C, Chau KW. Fuzzy iteration methodology for reservoir flood control operation1. J American Water Resour Assoc. 2001;37(5):1381–8.
- 5. Castelletti A, Ficchì A, Cominola A, Segovia P, Giuliani M, Wu W, et al. Model Predictive Control of water resources systems: A review and research agenda. Annual Reviews in Control. 2023;55:442–65.
- 6. Jain SK, Shilpa LS, Rani D, Sudheer KP. State-of-the-art review: Operation of multi-purpose reservoirs during flood season. Journal of Hydrology. 2023;618:129165.
- 7. Myo Lin N, Tian X, Rutten M, Abraham E, Maestre JM, van de Giesen N. Multi-Objective Model Predictive Control for Real-Time Operation of a Multi-Reservoir System. Water. 2020;12(7):1898.
- 8. Breckpot M, Agudelo OM, De Moor B. Flood Control with Model Predictive Control for River Systems with Water Reservoirs. J Irrig Drain Eng. 2013;139(7):532–41.
- 9. Koo JH, Abraham E, Jonoski A, Solomatine DP. Balancing operator’s risk averseness in model predictive control of a reservoir system; 2024. Available from: https://arxiv.org/abs/2407.04506
- 10. Chiang P-K, Willems P. Combine Evolutionary Optimization with Model Predictive Control in Real-time Flood Control of a River System. Water Resour Manage. 2015;29(8):2527–42.
- 11. Kong Y, Mei Y, Wang X, Ben Y. Solution Selection from a Pareto Optimal Set of Multi-Objective Reservoir Operation via Clustering Operation Processes and Objective Values. Water. 2021;13(8):1046.
- 12. Moridi A, Yazdi J. Optimal Allocation of Flood Control Capacity for Multi-Reservoir Systems Using Multi-Objective Optimization Approach. Water Resour Manage. 2017;31(14):4521–38.
- 13. Liu D, Bai T, Deng M, Huang Q, Wei X, Liu J. A parallel approximate evaluation-based model for multi-objective operation optimization of reservoir group. Swarm and Evolutionary Computation. 2023;78:101288.
- 14. Pan I, Babaei M, Korre A, Durucan S. Artificial Neural Network based surrogate modelling for multi- objective optimisation of geological CO2 storage operations. Energy Procedia. 2014;63:3483–91.
- 15. Karg B, Lucia S. Efficient Representation and Approximation of Model Predictive Control Laws via Deep Learning. IEEE Trans Cybern. 2020;50(9):3866–78. pmid:32574145
- 16. Nubert J, Kohler J, Berenz V, Allgower F, Trimpe S. Safe and Fast Tracking on a Robot Manipulator: Robust MPC and Neural Network Control. IEEE Robot Autom Lett. 2020;5(2):3050–7.
- 17. Hertneck M, Kohler J, Trimpe S, Allgower F. Learning an Approximate Model Predictive Controller With Guarantees. IEEE Control Syst Lett. 2018;2(3):543–8.
- 18. Bemporad A. A Piecewise Linear Regression and Classification Algorithm With Application to Learning and Model Predictive Control of Hybrid Systems. IEEE Trans Automat Contr. 2023;68(6):3194–209.
- 19. Li L, Rong S, Wang R, Yu S. Recent advances in artificial intelligence and machine learning for nonlinear relationship analysis and process control in drinking water treatment: A review. Chemical Engineering Journal. 2021;405:126673.
- 20. Wang D, Shen ZJ, Yin X, Tang S, Liu X, Zhang C, et al. Model Predictive Control Using Artificial Neural Network for Power Converters. IEEE Trans Ind Electron. 2022;69(4):3689–99.
- 21. Hirose N, Tajima R, Sukigara K. MPC policy learning using DNN for human following control without collision. Advanced Robotics. 2018;32(3):148–59.
- 22.
Cai W, Kordabad A, Esfahani H, Lekkas A, Gros S. MPC-based reinforcement learning for a simplified freight mission of autonomous surface vehicles. In: Proceedings of the 60th IEEE Conference on Decision and Control (CDC). IEEE. 2021. 2990–5.
- 23. Arroyo J, Manna C, Spiessens F, Helsen L. Reinforced model predictive control (RL-MPC) for building energy management. Applied Energy. 2022;309:118346.
- 24. Song Y, Scaramuzza D. Policy Search for Model Predictive Control With Application to Agile Drone Flight. IEEE Trans Robot. 2022;38(4):2114–30.
- 25.
Liberzon D. Switching in systems and control vol. 190. Springer; 2003.
- 26. Zhang L, Zhuang S, Braatz RD. Switched model predictive control of switched linear systems: Feasibility, stability and robustness. Automatica. 2016;67:8–21.
- 27.
Alessio A, Bemporad A. A survey on explicit model predictive control. Nonlinear model predictive control: towards new challenging applications. 2009. p. 345–69.
- 28. Schwenzer M, Ay M, Bergs T, Abel D. Review on model predictive control: an engineering perspective. Int J Adv Manuf Technol. 2021;117(5–6):1327–49.
- 29.
Kouvaritakis B, Cannon M. Model predictive control. Switzerland: Springe; 2016; 38(13-56):7.
- 30. Alasali F, Haben S, Foudeh H, Holderbaum W. A Comparative Study of Optimal Energy Management Strategies for Energy Storage with Stochastic Loads. Energies. 2020;13(10):2596.
- 31. Wang H, Olhofer M, Jin Y. A mini-review on preference modeling and articulation in multi-objective optimization: current status and challenges. Complex Intell Syst. 2017;3(4):233–45.
- 32. Bergstra J, Bardenet R, Bengio Y, Kégel B. Algorithms for hyper-parameter optimization. Adv Neural Inf Process Syst. 2011;24.
- 33.
Watanabe S. Tree-structured parzen estimator: understanding its algorithm components and their roles for better empirical performance. arXiv. 2023. https://arxiv.org/abs/2304.11127
- 34.
Bergstra J, Yamins D, Cox D. Making a science of model search: Hyperparameter optimization in hundreds of dimensions for vision architectures. In: International Conference on Machine Learning. PMLR; 2013. p. 115–23.
- 35. Aydin BE, Oude Essink GHP, Delsman JR, van de Giesen N, Abraham E. Nonlinear model predictive control of salinity and water level in polder networks: Case study of Lissertocht catchment. Agricultural Water Management. 2022;264:107502.
- 36. Xu M, van Overloop PJ, van de Giesen NC. On the study of control effectiveness and computational efficiency of reduced Saint-Venant model in model predictive control of open channel flow. Advances in Water Resources. 2011;34(2):282–90.
- 37. Powell K, Eaton A, Hedengren J, Edgar T. A Continuous Formulation for Logical Decisions in Differential Algebraic Systems using Mathematical Programs with Complementarity Constraints. Processes. 2016;4(1):7.
- 38. Chung SW, Hipsey MR, Imberger J. Modelling the propagation of turbid density inflows into a stratified lake: Daecheong Reservoir, Korea. Environmental Modelling & Software. 2009;24(12):1467–82.
- 39. Park Y, Kim S, Chon K, Lee H, Kim JH, Shin J-K. Impacts of heavy rain and floodwater on floating debris entering an artificial lake (Daecheong Reservoir, Korea) during the summer. DWT. 2021;219:399–404.
- 40. Nourani V, Hosseini Baghanam A, Adamowski J, Kisi O. Applications of hybrid wavelet–Artificial Intelligence models in hydrology: A review. Journal of Hydrology. 2014;514:358–77.
- 41. Sahay RR, Srivastava A. Predicting Monsoon Floods in Rivers Embedding Wavelet Transform, Genetic Algorithm and Neural Network. Water Resour Manage. 2013;28(2):301–17.
- 42. Kisi O, Shiri J. Precipitation Forecasting Using Wavelet-Genetic Programming and Wavelet-Neuro-Fuzzy Conjunction Models. Water Resour Manage. 2011;25(13):3135–52.
- 43. Ferreau HJ, Bock HG, Diehl M. An online active set strategy to overcome the limitations of explicit MPC. Intl J Robust & Nonlinear. 2007;18(8):816–30.
- 44. Zhu F, Antsaklis PJ. Optimal control of hybrid switched systems: A brief survey. Discrete Event Dyn Syst. 2014;25(3):345–64.
- 45. Huang I-H, Chang M-J, Lin G-F. An optimal integration of multiple machine learning techniques to real-time reservoir inflow forecasting. Stoch Environ Res Risk Assess. 2021;36(6):1541–61.
- 46. Shelke M, Londhe SN, Dixit PR, Kolhe P. Reservoir Inflow Prediction: A Comparison between Semi Distributed Numerical and Artificial Neural Network Modelling. Water Resour Manage. 2023;37(15):6127–43.
- 47. Ahmad M, Al Mehedi MA, Yazdan MMS, Kumar R. Development of Machine Learning Flood Model Using Artificial Neural Network (ANN) at Var River. Liquids. 2022;2(3):147–60.
- 48. Maddu R, Pradhan I, Ahmadisharaf E, Singh SK, Shaik R. Short-range reservoir inflow forecasting using hydrological and large-scale atmospheric circulation information. Journal of Hydrology. 2022;612:128153.
- 49.
Kingma DP, Ba J. Adam: a method for stochastic optimization. 2014. https://arxiv.org/abs/14126980
- 50. Kao I-F, Zhou Y, Chang L-C, Chang F-J. Exploring a Long Short-Term Memory based Encoder-Decoder framework for multi-step-ahead flood forecasting. Journal of Hydrology. 2020;583:124631.
- 51. Le X-H, Ho HV, Lee G, Jung S. Application of Long Short-Term Memory (LSTM) Neural Network for Flood Forecasting. Water. 2019;11(7):1387.
- 52. Zhang J, Chen F, Cui Z, Guo Y, Zhu Y. Deep Learning Architecture for Short-Term Passenger Flow Forecasting in Urban Rail Transit. IEEE Trans Intell Transport Syst. 2021;22(11):7004–14.
- 53. Jain SK, Sudheer K. Fitting of hydrologic models: a close look at the Nash–Sutcliffe index. J Hydrol Eng. 2008;13(10):981–6.
- 54. D. N. Moriasi, J. G. Arnold, M. W. Van Liew, R. L. Bingner, R. D. Harmel, T. L. Veith. Model Evaluation Guidelines for Systematic Quantification of Accuracy in Watershed Simulations. Transactions of the ASABE. 2007;50(3):885–900.
- 55. Wilson DL. Asymptotic Properties of Nearest Neighbor Rules Using Edited Data. IEEE Trans Syst Man Cybern. 1972;3(3):408–21.
- 56. Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP. SMOTE: synthetic minority over-sampling technique. J Artif Intell Res. 2002;16:321–57.
- 57. Lema A, Zˇtre G, Nogueira F, Aridas CK. Imbalanced-learn: a python toolbox to tackle the curse of imbalanced datasets in machine learning. J Mach Learn Res. 2017;18(17):1–5.
- 58. Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O. Scikit-learn: machine learning in Python. J Mach Learn Res. 2011;12:2825–30.
- 59. Breiman L. Random forests. Mach Learn. 2001;45:5–32.
- 60. Moradvandi A, Lindeboom REF, Abraham E, De Schutter B. Models and methods for hybrid system identification: a systematic survey. IFAC-PapersOnLine. 2023;56(2):95–107.
- 61. Wang M, Qiu J, Yan H, Tian Y, Li Z. Data-driven control for discrete-time piecewise affine systems. Automatica. 2023;155:111168.
- 62. Ao Y, Li H, Zhu L, Ali S, Yang Z. The linear random forest algorithm and its advantages in machine learning assisted logging regression modeling. Journal of Petroleum Science and Engineering. 2019;174:776–89.
- 63. Badillo S, Banfai B, Birzele F, Davydov II, Hutchinson L, Kam-Thong T, et al. An Introduction to Machine Learning. Clin Pharmacol Ther. 2020;107(4):871–85. pmid:32128792
- 64.
Wang H, Lei Z, Zhang X, Zhou B, Peng J. Machine learning basics. Deep learning. 2016. p. 98–164.
- 65. Feurer M, Springenberg J, Hutter F. Initializing bayesian hyperparameter optimization via meta-learning. In: Proceedings of the AAAI Conference on Artificial Intelligence. vol. 29; 2015.
- 66.
Li Y. Deep reinforcement learning: an overview. arXiv. 2017. https://arxiv.org/abs/1701.07274
- 67. Hewing L, Wabersich KP, Menner M, Zeilinger MN. Learning-Based Model Predictive Control: Toward Safe Learning in Control. Annu Rev Control Robot Auton Syst. 2020;3(1):269–96.
- 68. Yang S, Wan MP, Chen W, Ng BF, Dubey S. Model predictive control with adaptive machine-learning-based model for building energy efficiency and comfort optimization. Applied Energy. 2020;271:115147.
- 69. Lin NM, Rutten M. Optimal Operation of a Network of Multi-purpose Reservoir: A Review. Procedia Engineering. 2016;154:1376–84.
- 70. Saltık MB, Özkan L, Ludlage JHA, Weiland S, Van den Hof PMJ. An outlook on robust model predictive control algorithms: Reflections on performance and computational aspects. Journal of Process Control. 2018;61:77–102.
- 71. Pianosi F, Beven K, Freer J, Hall JW, Rougier J, Stephenson DB, et al. Sensitivity analysis of environmental models: A systematic review with practical workflow. Environmental Modelling & Software. 2016;79:214–32.
- 72. Kasiviswanathan KS, Sudheer KP. Quantification of the predictive uncertainty of artificial neural network based river flow forecast models. Stoch Environ Res Risk Assess. 2012;27(1):137–46.