Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Dam deformation forecasting using SVM-DEGWO algorithm based on phase space reconstruction

  • Mingjun Li ,

    Roles Investigation, Methodology, Writing – original draft, Writing – review & editing

    lmjunwhu@163.com

    Affiliations China Power Construction Group Zhongnan Survey Design & Research Institute Co., Ltd., Changsha, China, College of Water Conservancy and Hydropower, Hohai University, Nanjing, China

  • Jiangyang Pan,

    Roles Supervision, Visualization

    Affiliation China Power Construction Group Zhongnan Survey Design & Research Institute Co., Ltd., Changsha, China

  • Yaolai Liu,

    Roles Visualization

    Affiliation China Power Construction Group Zhongnan Survey Design & Research Institute Co., Ltd., Changsha, China

  • Yazhou Wang,

    Roles Software

    Affiliation Institute of HydroEcology, State Key Laboratory of Water Resources and Hydropower Engineering Science, Wuhan University, Wuhan, China

  • Wenchuan Zhang,

    Roles Visualization

    Affiliation Chang Jiang Survey, Planning, Design and Research CO., LTD., Wuhan, China

  • Junxing Wang

    Roles Visualization

    Affiliation College of Water Conservancy and Hydropower, Hohai University, Nanjing, China

Abstract

A hybrid model integrating chaos theory, support vector machine (SVM) and the difference evolution grey wolf optimization (DEGWO) algorithm is developed to analyze and predict dam deformation. Firstly, the chaotic characteristics of the dam deformation time series will be identified, mainly using the Lyapunov exponent method, the correlation dimension method and the kolmogorov entropy method. Secondly, the hybrid model is established for dam deformation forecasting. Taking SVM as the core, the deformation time series is reconstructed in phase space to determine the input variables of SVM, and the GWO algorithm is improved to realize the optimization of SVM parameters. Prior to this, the effectiveness of DEGWO algorithm based on the fusion of the difference evolution (DE) and GWO algorithm has been verified by 15 sets of test functions in CEC 2005. Finally, take the actual monitoring displacement of Jinping I super-high arch dam as examples. The engineering application examples show that the PSR-SVM-DEGWO model established performs better in terms of fitting and prediction accuracy compared with existing models.

1. Introduction

Dam is one of the most important engineering measures to regulate the spatio-temporal distribution of water resources and rationally allocate water resources, and it is also the key component of flood control engineering system [1, 2]. The safety status of dam engineering not only directly affects the full utilization of the benefits of the hydropower station, but also affects the life and property safety of the downstream people, even the ecological environment and social stability [3].

As a comprehensive response to dam behavior, deformation is an important indicator to evaluate dams’ safety [4]. By analyzing the measured data of dam displacement in time, and then establishing the corresponding prediction model, the deformation behavior and development trend of the dam can be accurately identified, and most hidden dangers can be further discovered to avoid the occurrence of catastrophic accidents.

According to different construction methods, conventional dam deformation prediction models are mainly divided into statistical models, deterministic models and hybrid models [5]. However, conventional models are difficult to adapt to the complex nonlinear relationship between multi-factors and effect sizes, and the accuracy of model predictions is hard to guarantee. With the development of machine learning (ML) technology, ML-based models are widely used to explain the structural behavior of dams [6]. Moreover, the existing prediction models only consider the main influencing factors, such as water pressure, temperature and aging, but do not consider the chaotic components that may be included in the deformed time series, which further limits the improvement of fitting accuracy [7].

Chaos is an irregular random behavior with initial sensitivity and ergodicity. Some studies [8, 9] have shown that there is chaos in the measured displacement data of dams. The efficient extraction of the chaotic component contained in the observation data is of great practical significance for improving the accuracy of prediction models. Zhang [9] proposed a new method based on empirical mode decomposition and phase space reconstruction theory to analyze the time-varying characteristics of dams. Su [10] combined SVM with phase space reconstruction, wavelet analysis, particle swarm optimization and other methods to establish a dam deformation prediction model. Gu [11] reconstructed the phase plane of the trend effect component of the dam service performance change through chaos technology. Wei [12] considered the chaotic effect of the residual sequence and proposed a new dam deformation prediction model. Obviously, when the phase space reconstruction method is used to design the input variable, the machine learning algorithm shows high applicability in improving the prediction accuracy. However, models based on machine learning algorithms are highly dependent on the adjustment of parameters, which can easily affect the stability of the prediction results. Therefore, using different heuristic search algorithms in the training process has become a popular method.

SVM are widely used in dam behavior prediction because of their obvious advantages in solving low-sample high-dimensional nonlinear problems [13]. Ren [14] used SVM modeling to effectively capture the complex relationships in deformation prediction. Hu [15] established a deformation prediction model for high arch dams during the initial operation period based on a least square support vector machine. The SVM can transform the original nonlinear problem into a high-latitude linear problem through the kernel parameters, so it is well adapted to the nonlinear deformation prediction. However, the selection of the kernel parameters and penalty factors of SVM will directly affect its performance in dealing with nonlinear problems.

Grey Wolf Optimizer (GWO) is a new intelligent optimization algorithm proposed by Mirjalili with reference to the social hierarchy and hunting behavior of grey wolves [16]. The algorithm realizes intelligent optimization through the process of tracking, encircling, hunting, and attacking the grey wolf population [17]. Studies have shown that the GWO algorithm is better than the other evolutionary algorithm in terms of quality, speed and stability of the final solution [1719]. However, the possibility of premature convergence reduces the probability of the algorithm finding the global optimum. The initial population is an important factor influencing the optimization performance of an intelligent algorithm, but it is difficult to guarantee the diversity of the initial population with conventional GWO algorithms. To solve this problem, a hybrid GWO algorithm (HGWO) is proposed. It uses a differential evolution (DE) algorithm to generate a richer initial population. Use the proposed DEGWO algorithm to optimize the optimal kernel parameters and penalty factors of SVMs.

Based on the identification of the chaotic characteristics of the dam deformation observation data, this study combines SVMs and other methods to establish a dam deformation prediction model. The structure of this article is as follows. Section 2 introduces a variety of methods to identify chaotic characteristics of time series. In Section 3, a new dam deformation prediction model is proposed on the basis of phase space reconstruction. An hybrid GWO algorithm based on the fusion of the DE and GWO algorithm is introduced to optimize the parameter settings of SVM. The performance of the DEGWO algorithm is tested through 6 sets of test functions in CEC2005. In Section 4, taking the measured displacement data of Jinping I super high arch dam as an example, the method proposed in this paper and other common methods are used to establish the deformation prediction models. The prediction accuracy of models are compared and analyzed. Section 5 summarizes the main conclusions reached here.

2. Identification of chaotic characteristics

Chaotic systems are deterministic and sensitive to initial conditions [8]. It is worth noting that it is neither random nor disordered. At present, the identification of chaotic characteristics of time series is mainly based on phase space reconstruction, which can obtain more hidden information by recovering the chaotic attractor in the so-called high-dimensional phase space. The Lyapunov exponent, Correlation dimension and Kolmogorov entropy of the singular attractor are calculated to correctly distinguish the chaotic system from the random system [20]. When the correlation dimension D2 exists at a certain value, the maximum Lyapunov exponent λmax is greater than 0 and the Kolmogorov entropy K2 is a finite positive value, it can be judged that the time series has chaotic characteristics.

2.1. Phase space reconstruction

The reconstruction of the phase space is the basis for the quantitative analysis of chaotic time series, in which the embedding dimension and the delay coordinate are the two most critical parameters [21]. The time delay method is currently the most commonly used method. For univariate chaotic time series {xi,i = 1,2,⋯,n}.

(1)

According to Takens Theorem, the appropriate choice of the embedding dimension m and the delay time τ can restore the dynamics properties of the original state space in the sense of topological equivalence.

2.1.1. Delay time.

The mutual information method [22] is introduced to determine the delay time τ of the measured displacement sequence of the dam, as shown below. (2) where P(xi) is the normalized distribution of xi, P(xi) is the normalized distribution of xi, P(xi,xi+τ) is the joint distribution of xi and xi. The time at which the first minimum point appears in the τ~I(τ) curve is often selected as optimal the delay time.

2.1.2. Embedding dimension.

The embedding dimension m is determined by the Cao method [23]. The distance a(t,m) between the phase point and the nearest neighbor point is shown below.

(3)

Where ytm+1 is the tth vector in the reconstructed phase space with the embedding dimension m+1, and yfm+1 is the nearest neighbor to ytm+1. yfm is the nearest neighbor to ytm in the reconstructed phase space with the embedding dimension m.

The average value E(m) of a(t,m) is calculated as follows.

(4)

Then, the change of E(m) is as follows.

(5)

When m > m0, if E1(m) no longer changes, m0 represents the minimum embedding dimension. In order to avoid the situation that the change of E1(m) is difficult to judge, the Cao method adds another definition.

(6)(7)

For chaotic time series, there will always be some value of m, so that E2(m)≠1. By observing whether E1(m) tends to be stable and whether the value of m can achieve E2(m)≠1 to determine the minimum m of the reconstructed phase space.

2.2. Lyapunov exponent method

The maximum Lyapunov exponent (λmax) is usually regarded as an indicator of chaotic motion [24]. λmax > 0 indicates that the system is in a chaotic state. The specific process of calculating λmax by the wolf method is as follows.

For the initial point Y(t0) in the phase space, the distance between it and the nearest neighbor Y0(t0) is L(t0). As time evolves, when the distance between two points exceeds the specified value ε, that is (8)

Keep the point Y(t1), and find a point Y1(t1) near Y(t1) to ensure that ensure that the following conditions are met, that is (9)

And the angle between L’(t1) and L(t1) is as small as possible.

Record the total number of iterations M when Y(t) reaches the end of the time series, and the maximum Lyapunov exponent λmax is calculated as follows.

(10)

2.3. Correlation dimension method

The correlation dimension D2 is mainly determined by the Grassberger Procaccia algorithm [25]. Suppose r is the radius of the sphere centered on yi and yj, then the correlation integral C(r) is given by: (11)

Where θ(⋅) is the Heaviside function: (12)

Where D2 is the correlation dimension (13)

Thus, draw logCn(r)/logr curve, and then the value of D2 can be determined according to the slope of the curve. As the embedding dimension m gradually increases, the slope of the curve converges, and the limit of convergence is the correlation dimension D2. The slope of the curve of a stochastic system will continue to increase with the increase of d, and there will be no convergence phenomenon.

2.4. Kolmogorov entropy method

Kolmogorov entropy [26] describes the generation rate of chaotic orbital information over time. Kolmogorov entropy reflects the chaos level of nonlinear dynamic systems, and the K2 entropy proposed by Grassberger and Procaccia is most commonly used as its estimate. K2 > 0 is a sufficient condition for the nonlinear system to be a chaotic system, and the K2 entropy can be estimated by the correlation integral method.

(14)

When the embedding dimension m is continuously increasing at intervals of Δm, the stable estimation of Kolmogorov entropy can be obtained through the equal slope regression of Eq (12). It should be noted that the minimum value of m must be an integer greater than D2.

In a phase space with embedding dimension i, there is (15) (16)

Where: (17)

Let a = D2, and for the embedding dimensions i and im, there is (18)

Where Δb = bi-bi+m.

The Kolmogorov entropy estimate K2 can be used to judge the motion properties of the nonlinear system: K2 = 0 means the nonlinear system performs regular motion, K2 > 0 means the nonlinear system performs chaotic motion, and K2 < 0 means the nonlinear system performs random motion.

3. Chaotic time series prediction

For the dam deformation time series xi(i = 1,2,…,n), when the delay time τ and the embedding dimension m have been determined, the phase space reconstruction results of the series xi are as follows.

(19)

After determining the input variables and output variables of the model, a novel model based on phase space reconstruction for dam deformation predicting is proposed.

To clarify the influence of the reconstructed phase space as an input variable on the prediction performance of the model, a conventional model with dam deformation influence factors as input variables should be established. Early studies [3, 27, 28] have shown that water level, temperature and aging are the main factors affecting dam deformation, as shown below.

(20)

In order to match the consistency of the model and avoid the larger data information overwhelming the smaller data information, the input data of SVM is normalized. After completing the SVM training process, the output data of the SVM needs to be denormalized.

Normalization equation is as follows.

(21)

Anti-normalization equation is as follows.

(22)

Where, xi is a sample data; xmin and xmax respectively represent the minimum and maximum sample data; xi is the normalized data.

The mean square error (MSE), mean absolute percentage error (MAPE) and square correlation coefficient (R2) are used to evaluate the performance of predictive models, as shown below [10].

(23)(24)(25)

Where, yD and represent predicted values and predicted average values, y and represent measured values and measured average values, and N represents the number of observed samples. The closer the R2 is to 1, the smaller the MSE and the MAPE, the better the prediction effect of the model.

3.1. Support vector machine

SVM [10] usually needs to establish a suitable function f(x) to describe the nonlinear relationship between the characteristic value xi and the target value yi, as shown below.

(26)

Where, w is the coefficient vector, φ(xi) is the transformation function, w and b represent the weight and bias respectively. w and b are estimated by minimizing the regularized hazard function, as shown below (27)

Where: (28)

Where, is the regularization term, C is the penalty coefficient, and Lε(yi,f(xi)) is the ε-insensitive loss function.

The optimization object can be deducted as follows: (29)

Subject to (30)

Where, ξ+ and ξ represent slack variables.

The key is to establish the Lagrangian function.

(31)

Subject to (32)

Therefore (33)

Where K(xi,xj) represents kernel function, including polynomial, radial basis function and sigmoid etc.

3.2. Hybrid Grey Wolf Optimizer (HGWO, DEGWO)

Grey Wolf Optimizer (GWO) is a new intelligent optimization algorithm proposed by Mirjalili with reference to the social hierarchy and hunting behavior of gray wolves [29]. The GWO algorithm realizes the optimization of the intelligent algorithm through the process of tracking, encircling, hunting, and attacking the grey wolf population. The algorithm is characterized by simple principle, few adjustment parameters, easy implementation, and strong global search capability.

Many scholars have improved and applied research on the GWO algorithm from a specific perspective for specific problems. These improvements are mainly concentrated in the following aspects: (1) Improve the initial population for addressing the problem that the random generation method cannot guarantee the initial population diversity [30]. (2) Improve the search mechanism to keep the GWO algorithm away from the local optimum [31, 32]. (3) Adjust the way of parameter changes to balance the algorithm’s global and local search capabilities [3335]. (4) Hybrid algorithm, which combines the advantages of multiple algorithms to improve the algorithm’s performance.

A hybrid GWO (DEGWO) algorithm is proposed, which uses a differential evolution (DE) algorithm to generate a richer initial population. The performance of the DEGWO algorithm is tested using 15 CEC2005 benchmark functions. The test results show that compared with the whale optimization algorithm (WOA), particle swarm optimization (PSO) algorithm and the original GWO algorithm, the DEGWO algorithm has higher execution efficiency.

3.2.1. Grey Wolf Optimizer.

Grey wolves [36, 37] have a very strict social dominance hierarchy, which is mainly divided into four parts: α,β,δ and ω. α is the best solution, followed by β and δ, and the remaining solutions belong to ω. The top three best wolves that are closest to their prey are α, β and δ, and they guide ω to search for prey in promising search areas. During the hunting process, the wolf will update its position around α, β and δ, as shown below.

(34)(35)

Where t is the current iteration number, is the current position of the prey, is the current position of the wolf, and is the distance between the wolf and the prey.

The coefficient vectors and are as follows.

(36)(37)

Where and are two vectors randomly generated between [0, 1], and the convergence factor linearly decreases from 2 to 0 in the iterative process.

Update the position of grey wolves.

(38)(39)(40)

Where , , represent the position of α,β,δ respectively, represents the location of the current solution, , , and represent randomly generated vectors, and , , and represent the distances of α, β and δ from other grey wolves, respectively. , and are random vectors, is the final position of ω.

3.2.2. Differential evolution.

Differential Evolution Algorithm (DE) is an efficient group-based heuristic search algorithm [38]. It mainly obtains the optimal solution through three operations of mutation, crossover and selection. The population size is N, D represents the dimensionality in the solution space, xi = (x1,i, x2,i,, xD,L) is the individual vector, and G = 0,1,,Gmax is the iteration time. is the i-th individual in the G-th generation. and record the lower and upper limit of search space.

The initial population P0 is mainly randomly generated within the upper and lower limit (xL,xU), the j-th index of the i-th individual is obtained by Eq (39).

(41)

The mutation operation generates a new mutation vector , as shown below.

(42)

Where are randomly generated integers, and F is the magnification ratio of the control difference vector, which is a real number with a varying range between [0, 2].

The crossover operation is as follows: (43)

Where CR is the cross probability, which takes a value between [0, 1].

Finally, the selection operation is performed. For specific problems, all mutation crossover individuals are evaluated. For specific problems, all mutation crossover individuals are evaluated. If the fitness of the current individual exceeds the previous generation, it means that the mutation crossover operation is successful, and the current individual is retained; if the current individual’s fitness is not as good as the previous generation, the better individual is retained. The individual with the optimal fitness will become the optimal value of this generation of individuals. When the termination condition is met, the evolution will stop, otherwise the next round will continue.

3.2.3. Hybrid grey wolf optimization algorithm (HGWO, DEGWO).

The conventional GWO algorithm randomly generates the initial population, which may fall into the dilemma of local optimization [39]. The DE algorithm generates group intelligence through mutual cooperation and competition between individuals. Based on the respective advantages and disadvantages of the GWO and DE algorithms, a more efficient hybrid algorithm (DE-GWO, HGWO) is proposed. The pseudo -code of the prosed DEGWO algorithm is shown in Fig 1. Specific steps are as follows.

  1. Set the relevant parameters of the DEGWO algorithm, such as the population size N, the maximum number of iterations tmax, the upper and lower limits of the search range ub and lb, etc.
  2. Initialize the parameters a, A and C. Generate intermediates (variant populations) through evolutionary mutation operations, and then generate initial population individuals through competitive selection operations, and set the iteration time t = 1.
  3. Calculate the objective function value of a single grey wolf individual and determine the best three individuals as Xα, Xβ and Xδ respectively.
  4. Calculate the distance between other grey wolf individuals and the optimal Xα, Xβ and Xδ according to Eq (38), and update the position of each grey wolf according to Eq (39).
  5. Update the values of a, A and C. Crossover and competition operations are applied to retain better individual positions and generate new individuals respectively.
  6. Update the position of the first three grey wolves Xα, Xβ and Xδ.
  7. Determine whether the maximum number of iterations tmax has been reached. If yes, exit and output the current objective function value of Xα; otherwise, t = t + 1, and move to the third step to continue.

Input: the population size N, the maximum number of iterations tmax, the upper and lower limits of the search range ub and lb, etc.

Output: The global optimum

Begin

    Initialize the parent population and offspring population

    Calculate the fitness of each agent by support vector machine (SVM)

    Xα = the best search agent

    Xβ = the second search agent

    Xγ = the third best search agent

    While (t< tmax)

        For each search agent

            Update the parent individual position of the current search agent by GWO algorithm

        end for

    Update a, A and C

    Produce mutant by Mutation of Differential evolution (DE)

    Cross mutant and produce an offspring population by Crossover of the DE algorithm

    Produce a new search agent

    Selection the better fitness

    Update the Xα, Xβ, Xγ of the parent individual

    t = t+1

    End while

Return the global optimum

3.2.4. Simulation experiment test.

In this section, we use 12 sets of test functions in CEC 2005 [40] to prove the effectiveness of the DEGWO algorithm 5 groups of unimodal benchmark functions, 3 groups of multimodal benchmark functions and 4 groups of Fixed-dimension benchmark functions, as shown in Table 1 and Figs 213. The parameters of all intelligent algorithms are set as follows: population size N = 30, particle dimension D = 10, and maximum number of iterations T = 50. Note that the algorithm is run 20 times on each benchmark function.

Table 2 shows the final calculation results of PSO, GWO, WOA and DEGWO on the benchmark function. By comparing the average value (AVE) and standard deviation (STD), it is found that whether it is a single-peak benchmark function, a multi-peak benchmark function or a fixed-dimensional test function, the AVE and STD calculated by the DEGWO algorithm are in most cases smaller than those calculated by the other four algorithms. Figs 1425 records the fitness curve changes of PSO, GWO, WOA and DEGWO algorithms in each test function, from which we can see the specific situation of each algorithm’s convergence. It can be seen from the downward trend of the curve in Figs 1425 that the grey wolf individuals in the DEGWO algorithm can better update their positions as the number of iterations increases, and cooperate to achieve the purpose of improving the optimization results. Results show that compared with the other four algorithms, DEGWO has better convergence performance and can obtain more excellent solutions.

3.3. Construction process of PSR-SVM-DEGWO model predicting dam deformation

A hybrid model combining chaos theory and SVM is proposed, and the DEGWO algorithm is used to select optimal parameters for concrete dam deformation analysis and prediction. Use the method mentioned in Section 2 to identify the chaotic characteristics of the deformed time series, and then determine the input variables of the support vector machine. According to the DEGWO algorithm introduced in Section 3.2, SVM is optimized for parameters. By performing training operations, a PSR-SVM-DEGWO prediction model can be established. The specific process is shown in Fig 26.

thumbnail
Fig 26. Construction process of dam deformation prediction model based on PSR-SVM-DEGWO.

https://doi.org/10.1371/journal.pone.0267434.g026

  1. Reconstruct the phase space of the observed displacement time series. The mutual information method and the Cao method are adopted to determine the optimal delay time τ and the minimum embedding dimension m, and the phase space is reconstructed accordingly.
  2. Identification of chaotic characteristics of the observed displacement time series. Estimate the maximum Lyapunov exponent λmax, correlation dimension D2 and Kolmogorov entropy K2. λmax > 0, K2 is a finite positive value and the saturation of the correlation dimension indicating that the observed displacement time series has the chaotic characteristics.
  3. Determine the input and output variables according to Eq (19).
  4. Use the DEGWO algorithm to find the optimal SVM parameters based on the training samples, generate the optimal values of C and σ, and complete the SVM-DEGWO training process.
  5. According to the prediction samples, the trained SVM will be used for prediction, and the three indicators of Eqs (23)–(25) are used to evaluate the prediction effect of the model.

4. Case study

4.1. Engineering overview

The Jinping I Dam [41] is a concrete arch dam located in Sichuan Province, China. It is the highest arch dam in the world, and the maximum height of the dam is 305.0 m. The normal water storage level is 1880.0 m, and the dead water level is 1800.0 m. The layout of the dam vertical line monitoring system is shown in Fig 27.

The monitoring points PL13-2 and PL13-3 are taken as examples. From June 2014 to June 2017, the horizontal displacement and water level changes of the measuring point are shown in Fig 28. Take the observation data from June 2014 to October 2017 as the training set, and the rest as the prediction set. As shown in Fig 28, the symbol (-) indicates the displacement to the downstream, and the symbol (+) indicates the displacement to the upstream. During the study period, the water level change range is [1750m,1900m], and the displacement showed obvious periodic changes with the water level change, and the change range is [-5mm,45mm].

thumbnail
Fig 28. The observed horizontal displacement and water level of the measured points.

https://doi.org/10.1371/journal.pone.0267434.g028

4.2. Prediction model construction

The delay time τ of the observed displacement time series is estimated by the mutual information method, as shown in Figs 29 and 30. According to the principle of mutual information, the optimal delay times of the measuring points PL13-2 and PL13-3 are τ1 = 10 and τ2 = 11, respectively.

thumbnail
Fig 29. I(τ) ~ τ curve for the observed displacement data series of the PL13-2.

https://doi.org/10.1371/journal.pone.0267434.g029

thumbnail
Fig 30. I(τ) ~ τ curve for the observed displacement data series of the PL13-3.

https://doi.org/10.1371/journal.pone.0267434.g030

According to the determined τ, the Cao method is adopted to calculate m. The change curves of E1(m) and E2(m) as m increases are shown in Figs 31 and 32. When m is greater than the minimum embedding dimension, the change of E1(m) starts to become smaller, and thus the optimal embedding dimension is determined. As shown in Figs 31 and 32, the minimum embedding dimension of the measuring points PL13-2, and PL13-3 are m1 = 4 and m2 = 5, respectively. It can also be found that there are some values of m such that E2(m)≠1, which can prove that the observed displacement data series comes from a deterministic process.

thumbnail
Fig 31. E1(m)~m and E2(m)~m curves for the observed displacement data series of the PL13-2.

https://doi.org/10.1371/journal.pone.0267434.g031

thumbnail
Fig 32. E1(m)~m and E2(m)~m curves for the observed displacement data series of the PL13-3.

https://doi.org/10.1371/journal.pone.0267434.g032

According to the determined τ, the G-P method is adopted to calculate the correlation dimension of the observed displacement data series. When the value of the embedding dimension d ranges from 1 to 10, the corresponding double logarithmic curve of lnC(r)~ln(r) is shown in Figs 33 and 34. As shown in Figs 33 and 34 that the correlation index is saturated to a constant. According to the definition, it is concluded that the observed displacement data series is a chaotic series. And the correlation dimension of the measuring points PL13-2, and PL13-3 are D2-1 = 1.2043 and D2-2 = 1.1217, respectively.

thumbnail
Fig 33. Double logarithmic curve of correlation dimension of the PL13-2.

https://doi.org/10.1371/journal.pone.0267434.g033

thumbnail
Fig 34. Double logarithmic curve of correlation dimension of the PL13-3.

https://doi.org/10.1371/journal.pone.0267434.g034

The wolf method is adopted to calculate the maximum Lyapunov exponent λmax of the observed displacement data series. And the maximum Lyapunov exponent λmax of the measuring points PL13-2, and PL13-3 are λmax−2 = 0.0191 and λmax−3 = 0.0108, respectively. λmax > 0 indicates that the observed displacement data series has chaotic characteristics.

The correlation integral method is used to calculate the Kolmogorov entropy estimate K2 of the observed displacement data series. According to the double logarithmic curve of lnC(r)~ln(r), as shown in Figs 33 and 34, the double logarithmic curve of log2(r)~log2(C(r)) can be derived, so that the Kolmogorov entropy estimate K2 of the measuring points PL13-2 and PL13-3 can be obtained when the delay time is determined respectively, and K2-2 = 0.0043, K2-3 = 0.0044. K2 takes a finite positive value, which indicates the chaotic characteristics of the observed displacement data series.

Based on the above, it can be concluded that the observed displacement data series of the measuring point PL13-2 and PL13-3 has chaotic characteristics, and a chaotic prediction model of dam displacement can be established.

The DEGWO algorithm proposed is adopted to seek the optimal parameters of SVM. The phase space of reconstructed observation displacement and the influencing factors of dam deformation are used as input variables to evaluate the predicting performance of the DEGWO and GWO algorithms. The population size N = 20, the maximum number of iterations tmax = 200, ε = 0.01, the penalty factor C∈[0.01,100], and the kernel parameter γ∈[0.01,1000]. The calculation is terminated when the number of iterations reaches 200.

The fitness curves of the GWO and DEGWO algorithm are shown in Figs 3538. For the conventional model with dam deformation influencing factors as input variables, the optimal parameters [C, γ] of SVM obtained by the DEGWO algorithm are [4.6616, 0.0100] (PL13-2) and [4.5948, 24.2515] (PL13-3). For the chaotic model with the reconstructed observed deformation phase space as the input variable, the optimal parameters [C, γ] of the SVM obtained by the DEGWO algorithm are [6.0011, 0.0100] (PL13-2) and [24.2515, 24.2515] (PL13-3).

thumbnail
Fig 35. The fitness curves of the GWO and DEGWO model (conventional) of the PL13-2.

https://doi.org/10.1371/journal.pone.0267434.g035

thumbnail
Fig 36. The fitness curves of the GWO and DEGWO model (chaotic) of the PL13-2.

https://doi.org/10.1371/journal.pone.0267434.g036

thumbnail
Fig 37. The fitness curves of the GWO and DEGWO model (conventional) of the PL13-3.

https://doi.org/10.1371/journal.pone.0267434.g037

thumbnail
Fig 38. The fitness curves of the GWO and DEGWO model (chaotic) of the PL13-4.

https://doi.org/10.1371/journal.pone.0267434.g038

It can be seen from Figs 3538, whether it is a conventional model with dam deformation influencing factors as input variables or a chaotic model with reconstructed observation displacement data series phase space as the input variable, the DEGWO algorithm reduces the number of iterations and can find the solution closest to the best goal faster. The main explanation is that the DEGWO algorithm enriches the diversity of the initial population and improves the global search capability, thereby accelerating the convergence speed and convergence accuracy.

4.3. Results

For the PSR-SVM-DEGWO-based dam observation displacement prediction model, the relevant information is introduced as follows. The SVM is at the heart of this innovative combination model. The input variable is the reconstructed phase space of the measured displacement data sequence, and the DEGWO algorithm is used to realize the parameter optimization of SVM. In addition, the kernel function of SVM is a radial basis function.

In order to better analyze the predictive performance of the PSR-SVM-DEGWO model, the PSR-SVM and PSR-SVM-GWO models that take the reconstructed phase space of the observed displacement data sequence as input variables are established respectively. In addition, the SVM, SVM-GWO and SVM-DEGWO models with the factors affecting dam deformation as input variables are established to explore the influence of different input variables on the accuracy of model prediction. T-test is adopted to test whether there is a significant difference between the existing method and the proposed PSR-SVM-DEGWO method. The h represents whether the hypothesis is accepted at the significance level. When h = 1, it means that the two sets of data compared with each other have significant differences. At this time, the comparison between the two algorithms is meaningful. The p represents the set standard of significant difference, which is set to 0.05 here. When p is less than 0.05, the results have a significant difference. ci represents the data interval with 95% confidence level.

For the measuring point PL13-2, the prediction performance of the SVM, SVM-GWO, SVM-DEGWO, PSR-SVM, PSR-SVM-GWO and PSR-SVM-DEGWO models are shown in Table 3 and Fig 39. From Table 3, we can see that when the influencing factors of dam deformation are used as the input variables of the model, the square correlation coefficient (R2) is ranked from large to small as SVM-DEGWO model> SVM-GWO model> SVM model, mean absolute percentage error (MAPE) is ranked from small to large SVM-DEGWO model< SVM model< SVM-GWO model, and mean square error (MSE) is ranked from small to large SVM-DEGWO model< SVM-GWO model< SVM model. The SVM-DEGWO model has the largest R2 of 0.9961, and the smallest MAPE and MSE are 0.3070 and 1.9001, respectively. When the reconstructed phase space of the dam deformation time series is used as the input variable of the model, R2 is sorted from large to small as PSR-SVM-DEGWO model> PSR-SVM-GWO model> PSR-SVM model. The ranking of MAPE and MSE from small to large is PSR-SVM-DEGWO model <PSR-SVM-GWO model <PSR-SVM model. The PSR-SVM-DEGWO model has the largest R2 of 0.9993, and the smallest MAPE and MSE are 0.2491 and 0.1399, respectively.

The t-test is used to test whether there are significant differences between the calculation results of the other five algorithms and the results of the proposed PSR-SVM-DEGWO algorithm. It can be seen from Table 3 that all the results meet the condition h = 1 and p<0.05, which represents the existence of significant differences.

As shown in Fig 39, the deformation shows obvious periodic regular changes, but compared to the conventional arch dam below 200m, the deformation fluctuation range of Jinping I super high arch dam is much larger. SVM, SVM-GWO, SVM-DEGWO, PSR-SVM, PSR-SVM-GWO and PSR-SVM-DEGWO models can effectively predict the trend of dam displacement. And the PSR-SVM-DEGWO model has the highest prediction accuracy and the smallest fluctuation range of the prediction error. For the conventional model, the deviation between the predicted value of the model and the observed value is larger than the deviation of the PSR models.

For the measuring point PL13-3, the prediction performance of the SVM, SVM-GWO, SVM-DEGWO, PSR-SVM, PSR-SVM-GWO and PSR-SVM-DEGWO models are shown in Table 4 and Fig 40. From Table 4, we can see that when the influencing factors of dam deformation are used as the input variables of the model, the square correlation coefficient (R2) is also ranked from large to small as SVM-DEGWO model> SVM-GWO model> SVM model, mean absolute percentage error (MAPE) is ranked from small to large SVM model< SVM-DEGWO model< SVM-GWO model, and mean square error (MSE) is ranked from small to large SVM-DEGWO model< SVM-GWO model< SVM model. The SVM-DEGWO model has the largest R2 of 0.9971, and the smallest MAPE and MSE are 0.1381 and 2.0937, respectively. When the reconstructed phase space of the dam deformation time series is used as the input variable of the model, R2 is sorted from large to small as PSR-SVM-DEGWO model> PSR-SVM-GWO model> PSR-SVM model. The ranking of MAPE and MSE from small to large is PSR-SVM-DEGWO model <PSR-SVM-GWO model <PSR-SVM model. The PSR-SVM-DEGWO model has the largest R2 of 0.9987, and the smallest MAPE and MSE are 0.0485 and 0.3347, respectively. The t-test is used to test whether there are significant differences between the calculation results of the other five algorithms and the results of the proposed PSR-SVM-DEGWO algorithm. It can be seen from Table 4 that all the results meet the condition h = 1 and p<0.05, which represents the existence of significant differences.

As shown in Fig 40, the deformation shows obvious periodic regular changes, but compared to the conventional arch dam below 200m, the deformation fluctuation range of Jinping I super high arch dam is much larger. SVM, SVM-GWO, SVM-DEGWO, PSR-SVM, PSR-SVM-GWO and PSR-SVM-DEGWO models can effectively predict the trend of dam displacement. And the PSR-SVM-DEGWO model has the highest prediction accuracy and the smallest fluctuation range of the prediction error. For the conventional model, the deviation between the predicted value of the model and the observed value is larger than the deviation of the PSR models.

The calculation results show: ①The applicability of the GWO optimized SVM algorithm in dam deformation prediction; ②The DEGWO algorithm proposed in this paper has more outstanding optimization ability in optimizing SVM parameters than the conventional GWO; ③For ultra-high arch dams, the PSR model with the reconstructed phase space as the input variable has higher prediction accuracy and smaller prediction error than its corresponding conventional model with dam deformation influence factors as input variables.

In general, the SVM, SVM-GWO, and SVM-DEGWO models can better predict the change trend of dam deformation. Regardless of whether it is a conventional model with the dam deformation influence factors as input variables or the PSR model with the reconstructed observation displacement data series phase space as the input variable, the prediction accuracy of all the model can meet the engineering requirements, but the prediction accuracy of the PSR model is higher. The calculation results also prove the applicability of the GWO algorithm in the field of dam deformation prediction and the more prominent optimization ability of DEGWO compared to GWO. The t-test results show that the calculation results of the other five algorithms are significantly different from the results of the proposed PSR-SVM-DEGWO algorithm. The result of t test also shows that the algorithm comparison is meaningful, no matter it is for the measuring point PL13-2 or PL13-3.

5. Conclusions

This research proposes an innovative model combining chaos theory, support vector machine, difference algorithm and gray wolf algorithm, namely the PSR-SVM-DEGWO model, to predict dam deformation. And taking the measured displacement data of the Jinping I super high arch dam as examples, the prediction effect of the PSR-SVM-DEGWO model is compared and verified. The main conclusions are as follows.

  1. As the correlation dimension of the deformation time series tends to be saturated (D2-2 = 1.2043, D2-3 = 1.1217), the largest Lyapunov exponent (λmax-2 = 0.0191, λmax-3 = 0.0108) is greater than 0 and the Kolmogorov entropy estimate (K2-2 = 0.0043, K2-3 = 0.0044) is a finite positive value, it can be seen that there is chaos in the deformation observation data of the dam.
  2. The optimization performance of the DEGWO algorithm is superior to that of the GWO algorithm. Using the DE algorithm to ensure the initial population diversity can effectively improve the grey wolf optimization algorithm’s ability to find high-quality solutions. Simulation tests show that the convergence speed of the DEGWO algorithm is faster and the convergence accuracy is higher.
  3. It is verified by the example of Jinping I super high arch dam that SVM, SVM-GWO and SVM-DEGWO models can effectively predict the dam deformation trend, but the SVM-DEGWO model has the best prediction performance, which is reflected in the higher accuracy of the model prediction.
  4. The predictive performance of the PSR-SVM, PSR-SVM-GWO and PSR-SVM-DEGWO models with the reconstructed observation data sequence phase space as the input variable is superior to that of the corresponding SVM, SVM-GWO and SVM-DEGWO conventional models with deformation influence factors as input variables. When the conventional model predicts the deformation of an ultra-high arch dam, although the accuracy meets the requirements, the predicted value will gradually deviate from the measured value as time goes by. On the contrary, it is difficult to observe such large deviations in models that adopt the reconstructed phase space of the observation data sequence as the input variable. Among all the models calculated in this paper, the PSR-SVM-DEGWO model has the best prediction performance.

References

  1. 1. Chen B, Hu T, Huang Z, Fang C. A spatio-temporal clustering and diagnosis method for concrete arch dams using deformation monitoring data. Structural Health Monitoring. 2018;18(5–6):1355–71.
  2. 2. Huaizhi SU, Jiang HU, Zhongru WU. A study of safety evaluation and early-warning method for dam global behavior. Structural Health Monitoring: An International Journal. 2011;11(3):269–79.
  3. 3. Salazar F, Morán R, Toledo MÁ, Oñate E. Data-Based Models for the Prediction of Dam Behaviour: A Review and Some Methodological Considerations. Archives of Computational Methods in Engineering. 2015;24(1):1–21.
  4. 4. Lin C, Li T, Liu X, Zhao L, Chen S, Qi H. A deformation separation method for gravity dam body and foundation based on the observed displacements. Structural Control and Health Monitoring. 2019;26(2).
  5. 5. Li B, Yang J, Hu D. Dam monitoring data analysis methods: A literature review. Structural Control and Health Monitoring. 2019;27(3).
  6. 6. Mata J, Salazar F, Barateiro J, Antunes A. Validation of Machine Learning Models for Structural Dam Behaviour Interpretation and Prediction. Water. 2021;13(19).
  7. 7. Zhou C, Jiang S-H, Huang J, Huang F. Prediction of groundwater levels using evidence of chaos and support vector machine. Journal of Hydroinformatics. 2017;19(4):586–606.
  8. 8. Su H, Wen Z, Chen Z, Tian S. Dam safety prediction model considering chaotic characteristics in prototype monitoring data series. Structural Health Monitoring: An International Journal. 2016;15(6):639–49.
  9. 9. Zhang ZJ, Chongshi GU1, Bao TF, Zhang L, Hong Y1, Resources SKLoH-W, et al. Application analysis of empirical mode decomposition and phase space reconstruction in dam time-varying characteristic. 2010.
  10. 10. Su H, Li X, Yang B, Wen Z. Wavelet support vector machine-based prediction model of dam deformation. Mechanical Systems and Signal Processing. 2018; 110:412–27.
  11. 11. Gu H, Yang M, Gu CS, Cao WH, Huang XF, Su HZ. An analytical approach of behavior change for concrete dam by panel data model. Steel and Composite Structures. 2020;36(5):521–31.
  12. 12. Wei BW, Liu B, Yuan DY, Mao Y, Yao SY. Spatiotemporal hybrid model for concrete arch dam deformation monitoring considering chaotic effect of residual series. Engineering Structures. 2021;228.
  13. 13. Djemai S, Brahmi B, Bibi MOJN. A primal–dual method for SVM training. Neurocomputing. 2016;211(oct.26):34–40.
  14. 14. Ren QB, Li MC, Song LG, Liu H. An optimized combination prediction model for concrete dam deformation considering quantitative evaluation and hysteresis correction. Advanced Engineering Informatics. 2020;46.
  15. 15. Hu J, Ma FH. Comparison of hierarchical clustering based deformation prediction models for high arch dams during the initial operation period. Journal of Civil Structural Health Monitoring. 2021;11(4):897–914.
  16. 16. Li SX, Wang JS. Dynamic Modeling of Steam Condenser and Design of PI Controller Based on Grey Wolf Optimizer. Mathematical Problems in Engineering. 2015;2015(1519):1–9.
  17. 17. Wang M, Chen H, Li H, Cai Z, Zhao X, Tong C, et al. Grey wolf optimization evolving kernel extreme learning machine: Application to bankruptcy prediction. Engineering Applications of Artificial Intelligence. 2017;63(aug.):54–68.
  18. 18. Liao K, Wu Y, Miao F, Li L, Xue Y. Using a kernel extreme learning machine with grey wolf optimization to predict the displacement of step-like landslide. Bulletin of Engineering Geology and the Environment. 2019;79(2):673–85.
  19. 19. Wei Y, Ni N, Liu D, Chen H, Wang M, Li Q, et al. An Improved Grey Wolf Optimization Strategy Enhanced SVM and Its Application in Predicting the Second Major. Mathematical Problems in Engineering. 2017;2017:1–12.
  20. 20. Jiang Y, Bao X, Hao S, Zhao H, Wu X. Monthly Streamflow Forecasting Using ELM-IPSO Based on Phase Space Reconstruction. Water Resources Management. 2020(6).
  21. 21. Kennel MB, Brown R, Abarbanel H. Determining embedding dimension for phase-space reconstruction using a geometrical construction. Physical Review A. 1992;45(6):3403–11. pmid:9907388
  22. 22. Wang W, Lei Y, Zhang J. On the Relation Between Identifiability, Differential Privacy and Mutual-Information Privacy. IEEE Transactions on Information Theory. 2016;62(9):5018–29.
  23. 23. Cao L. Practical method for determining the minimum embedding dimension of a scalar time series. Physica D Nonlinear Phenomena. 1997;110(1–2):43–50.
  24. 24. Rosenstein MT, Collins JJ, Luca C. A practical method for calculating largest Lyapunov exponents from small data sets. Physica D-nonlinear Phenomena. 1993;65(1–2):117–34.
  25. 25. Grassberger P, Procaccia I. Characterization of Strange Attractors. Physical Review Letters. 1983;50(5):346.
  26. 26. Benettin G, Galgani L, Strelcyn JM. Kolmogorov Entropy and Numerical Experiments. Physical Review A. 1976;14(6).
  27. 27. Li M, Wang J. An Empirical Comparison of Multiple Linear Regression and Artificial Neural Network for Concrete Dam Deformation Modelling. Mathematical Problems in Engineering. 2019;2019:1–13.
  28. 28. Yin W, Zhao E, Gu C, Huang H, Yang Y. A Nonlinear Method for Component Separation of Dam Effect Quantities Using Kernel Partial Least Squares and Pseudosamples. Advances in Civil Engineering. 2019;2019:1–12.
  29. 29. Mirjalili S, Mirjalili SM, Lewis A. Grey Wolf Optimizer. Advances in Engineering Software. 2014;69(3):46–61.
  30. 30. Madhiarasan M, Deepa SN. Long-Term Wind Speed Forecasting using Spiking Neural Network Optimized by Improved Modified Grey Wolf Optimization Algorithm. International Journal of Advanced Research. 2016;4(7):356–68.
  31. 31. Saremi S, Mirjalili SZ, Mirjalili SM. Evolutionary population dynamics and grey wolf optimizer. Neural Computing and Applications. 2015;26(5):1257–63.
  32. 32. Gholizadeh S. Optimal Design Of Double Layer Grids Considering Nonlinear Behaviour By Sequential Grey Wolf Algorithm. Journal of Optimization in Civil Engineering. 2015.
  33. 33. Long W, Liang X, Cai S, Jiao J, Zhang W. A modified augmented Lagrangian with improved grey wolf optimization to constrained optimization problems. Neural Computing & Applications. 2016.
  34. 34. Elsayed A, Shazly K, Editor J. Modified Hybrid Optimizer for Global Engineering Optimization. Computational Intelligence and Soft Computing. 2020.
  35. 35. Yang JC, Long WJAM, Materials. Improved Grey Wolf Optimization Algorithm for Constrained Mechanical Design Problems. Applied Mechanics & Materials. 2016;851:553–8.
  36. 36. Zhou Z, Wang C, Zhu Z, Wang Y, Yang D. Sliding mode control based on a hybrid grey-wolf-optimized extreme learning machine for robot manipulators. Optik. 2019;185:364–80.
  37. 37. Muangkote N, Sunat K, Chiewchanwattana S, editors. An improved grey wolf optimizer for training q-Gaussian Radial Basis Functional-link nets. 2014 International Computer Science and Engineering Conference (ICSEC); 2014.
  38. 38. Zhou Z, Zhang R, Wang Y, Zhu Z, Zhang J. Color Difference Classification Based on Optimization Support Vector Machine of Improved Grey Wolf Algorithm. Optik. 2018;170.
  39. 39. Lu H. Cognitive Internet of Things: Frameworks, Tools and Applications. Studies in Computational Intelligence. 2020.
  40. 40. Mukherjee S, Osuna E, Girosi F, editors. Nonlinear prediction of chaotic time series using support vector machines. Neural Networks for Signal Processing [1997] VII Proceedings of the 1997 IEEE Workshop; 1997.
  41. 41. Hu J, Ma F. Zoned deformation prediction model for super high arch dams using hierarchical clustering and panel data. Engineering Computations. 2020;ahead-of-print(ahead-of-print).