Dam deformation forecasting using SVM-DEGWO algorithm based on phase space reconstruction

A hybrid model integrating chaos theory, support vector machine (SVM) and the difference evolution grey wolf optimization (DEGWO) algorithm is developed to analyze and predict dam deformation. Firstly, the chaotic characteristics of the dam deformation time series will be identified, mainly using the Lyapunov exponent method, the correlation dimension method and the kolmogorov entropy method. Secondly, the hybrid model is established for dam deformation forecasting. Taking SVM as the core, the deformation time series is reconstructed in phase space to determine the input variables of SVM, and the GWO algorithm is improved to realize the optimization of SVM parameters. Prior to this, the effectiveness of DEGWO algorithm based on the fusion of the difference evolution (DE) and GWO algorithm has been verified by 15 sets of test functions in CEC 2005. Finally, take the actual monitoring displacement of Jinping I super-high arch dam as examples. The engineering application examples show that the PSR-SVM-DEGWO model established performs better in terms of fitting and prediction accuracy compared with existing models.


Introduction
Dam is one of the most important engineering measures to regulate the spatio-temporal distribution of water resources and rationally allocate water resources, and it is also the key component of flood control engineering system [1,2]. The safety status of dam engineering not only directly affects the full utilization of the benefits of the hydropower station, but also affects the life and property safety of the downstream people, even the ecological environment and social stability [3].
As a comprehensive response to dam behavior, deformation is an important indicator to evaluate dams' safety [4]. By analyzing the measured data of dam displacement in time, and then establishing the corresponding prediction model, the deformation behavior and development trend of the dam can be accurately identified, and most hidden dangers can be further discovered to avoid the occurrence of catastrophic accidents. According to different construction methods, conventional dam deformation prediction models are mainly divided into statistical models, deterministic models and hybrid models [5]. However, conventional models are difficult to adapt to the complex nonlinear relationship between multi-factors and effect sizes, and the accuracy of model predictions is hard to guarantee. With the development of machine learning (ML) technology, ML-based models are widely used to explain the structural behavior of dams [6]. Moreover, the existing prediction models only consider the main influencing factors, such as water pressure, temperature and aging, but do not consider the chaotic components that may be included in the deformed time series, which further limits the improvement of fitting accuracy [7].
Chaos is an irregular random behavior with initial sensitivity and ergodicity. Some studies [8,9] have shown that there is chaos in the measured displacement data of dams. The efficient extraction of the chaotic component contained in the observation data is of great practical significance for improving the accuracy of prediction models. Zhang [9] proposed a new method based on empirical mode decomposition and phase space reconstruction theory to analyze the time-varying characteristics of dams. Su [10] combined SVM with phase space reconstruction, wavelet analysis, particle swarm optimization and other methods to establish a dam deformation prediction model. Gu [11] reconstructed the phase plane of the trend effect component of the dam service performance change through chaos technology. Wei [12] considered the chaotic effect of the residual sequence and proposed a new dam deformation prediction model. Obviously, when the phase space reconstruction method is used to design the input variable, the machine learning algorithm shows high applicability in improving the prediction accuracy. However, models based on machine learning algorithms are highly dependent on the adjustment of parameters, which can easily affect the stability of the prediction results. Therefore, using different heuristic search algorithms in the training process has become a popular method.
SVM are widely used in dam behavior prediction because of their obvious advantages in solving low-sample high-dimensional nonlinear problems [13]. Ren [14] used SVM modeling to effectively capture the complex relationships in deformation prediction. Hu [15] established a deformation prediction model for high arch dams during the initial operation period based on a least square support vector machine. The SVM can transform the original nonlinear problem into a high-latitude linear problem through the kernel parameters, so it is well adapted to the nonlinear deformation prediction. However, the selection of the kernel parameters and penalty factors of SVM will directly affect its performance in dealing with nonlinear problems.
Grey Wolf Optimizer (GWO) is a new intelligent optimization algorithm proposed by Mirjalili with reference to the social hierarchy and hunting behavior of grey wolves [16]. The algorithm realizes intelligent optimization through the process of tracking, encircling, hunting, and attacking the grey wolf population [17]. Studies have shown that the GWO algorithm is better than the other evolutionary algorithm in terms of quality, speed and stability of the final solution [17][18][19]. However, the possibility of premature convergence reduces the probability of the algorithm finding the global optimum. The initial population is an important factor influencing the optimization performance of an intelligent algorithm, but it is difficult to guarantee the diversity of the initial population with conventional GWO algorithms. To solve this problem, a hybrid GWO algorithm (HGWO) is proposed. It uses a differential evolution (DE) algorithm to generate a richer initial population. Use the proposed DEGWO algorithm to optimize the optimal kernel parameters and penalty factors of SVMs.
Based on the identification of the chaotic characteristics of the dam deformation observation data, this study combines SVMs and other methods to establish a dam deformation prediction model. The structure of this article is as follows. Section 2 introduces a variety of methods to identify chaotic characteristics of time series. In Section 3, a new dam deformation prediction model is proposed on the basis of phase space reconstruction. An hybrid GWO algorithm based on the fusion of the DE and GWO algorithm is introduced to optimize the parameter settings of SVM. The performance of the DEGWO algorithm is tested through 6 sets of test functions in CEC2005. In Section 4, taking the measured displacement data of Jinping I super high arch dam as an example, the method proposed in this paper and other common methods are used to establish the deformation prediction models. The prediction accuracy of models are compared and analyzed. Section 5 summarizes the main conclusions reached here.

Identification of chaotic characteristics
Chaotic systems are deterministic and sensitive to initial conditions [8]. It is worth noting that it is neither random nor disordered. At present, the identification of chaotic characteristics of time series is mainly based on phase space reconstruction, which can obtain more hidden information by recovering the chaotic attractor in the so-called high-dimensional phase space. The Lyapunov exponent, Correlation dimension and Kolmogorov entropy of the singular attractor are calculated to correctly distinguish the chaotic system from the random system [20]. When the correlation dimension D 2 exists at a certain value, the maximum Lyapunov exponent λ max is greater than 0 and the Kolmogorov entropy K 2 is a finite positive value, it can be judged that the time series has chaotic characteristics.

Phase space reconstruction
The reconstruction of the phase space is the basis for the quantitative analysis of chaotic time series, in which the embedding dimension and the delay coordinate are the two most critical parameters [21]. The time delay method is currently the most commonly used method. For univariate chaotic time series {x i ,i = 1,2,� � �,n}.
According to Takens Theorem, the appropriate choice of the embedding dimension m and the delay time τ can restore the dynamics properties of the original state space in the sense of topological equivalence.
2.1.1. Delay time. The mutual information method [22] is introduced to determine the delay time τ of the measured displacement sequence of the dam, as shown below.
where P(x i ) is the normalized distribution of x i , P(x i +τ) is the normalized distribution of x i +τ, P(x i, x i+τ ) is the joint distribution of x i and x i +τ. The time at which the first minimum point appears in the τ~I(τ) curve is often selected as optimal the delay time.

Embedding dimension.
The embedding dimension m is determined by the Cao method [23]. The distance a(t,m) between the phase point and the nearest neighbor point is shown below.
Where y t m+1 is the tth vector in the reconstructed phase space with the embedding dimension m+1, and y f m+1 is the nearest neighbor to y t m+1 . y f m is the nearest neighbor to y t m in the reconstructed phase space with the embedding dimension m.
The average value E(m) of a(t,m) is calculated as follows.
Then, the change of E(m) is as follows.
When m > m 0 , if E 1 (m) no longer changes, m 0 represents the minimum embedding dimension. In order to avoid the situation that the change of E 1 (m) is difficult to judge, the Cao method adds another definition.
For chaotic time series, there will always be some value of m, so that E 2 (m)6 ¼1. By observing whether E 1 (m) tends to be stable and whether the value of m can achieve E 2 (m)6 ¼1 to determine the minimum m of the reconstructed phase space.

Lyapunov exponent method
The maximum Lyapunov exponent (λ max ) is usually regarded as an indicator of chaotic motion [24]. λ max > 0 indicates that the system is in a chaotic state. The specific process of calculating λ max by the wolf method is as follows.
For the initial point Y(t 0 ) in the phase space, the distance between it and the nearest neighbor Y 0 (t 0 ) is L(t 0 ). As time evolves, when the distance between two points exceeds the specified value ε, that is Keep the point Y(t 1 ), and find a point Y 1 (t 1 ) near Y(t 1 ) to ensure that ensure that the following conditions are met, that is And the angle between L'(t 1 ) and L(t 1 ) is as small as possible.
Record the total number of iterations M when Y(t) reaches the end of the time series, and the maximum Lyapunov exponent λ max is calculated as follows.

Correlation dimension method
The correlation dimension D 2 is mainly determined by the Grassberger Procaccia algorithm [25]. Suppose r is the radius of the sphere centered on y i and y j , then the correlation integral C (r) is given by: Where θ(�) is the Heaviside function: Where D 2 is the correlation dimension Thus, draw logC n (r)/logr curve, and then the value of D 2 can be determined according to the slope of the curve. As the embedding dimension m gradually increases, the slope of the curve converges, and the limit of convergence is the correlation dimension D 2 . The slope of the curve of a stochastic system will continue to increase with the increase of d, and there will be no convergence phenomenon.

Kolmogorov entropy method
Kolmogorov entropy [26] describes the generation rate of chaotic orbital information over time. Kolmogorov entropy reflects the chaos level of nonlinear dynamic systems, and the K 2 entropy proposed by Grassberger and Procaccia is most commonly used as its estimate. K 2 > 0 is a sufficient condition for the nonlinear system to be a chaotic system, and the K 2 entropy can be estimated by the correlation integral method.
When the embedding dimension m is continuously increasing at intervals of Δm, the stable estimation of Kolmogorov entropy can be obtained through the equal slope regression of Eq (12). It should be noted that the minimum value of m must be an integer greater than D 2 .
In a phase space with embedding dimension i, there is Where: Let a = D 2 , and for the embedding dimensions i and i +Δm, there is Where The Kolmogorov entropy estimate K 2 can be used to judge the motion properties of the nonlinear system: K 2 = 0 means the nonlinear system performs regular motion, K 2 > 0 means the nonlinear system performs chaotic motion, and K 2 < 0 means the nonlinear system performs random motion.

Chaotic time series prediction
For the dam deformation time series x i (i = 1,2,. . .,n), when the delay time τ and the embedding dimension m have been determined, the phase space reconstruction results of the series x i are as follows.
X ¼ After determining the input variables and output variables of the model, a novel model based on phase space reconstruction for dam deformation predicting is proposed.
To clarify the influence of the reconstructed phase space as an input variable on the prediction performance of the model, a conventional model with dam deformation influence factors as input variables should be established. Early studies [3,27,28] have shown that water level, temperature and aging are the main factors affecting dam deformation, as shown below.
In order to match the consistency of the model and avoid the larger data information overwhelming the smaller data information, the input data of SVM is normalized. After completing the SVM training process, the output data of the SVM needs to be denormalized.
Normalization equation is as follows.
Anti-normalization equation is as follows.
Where, x i is a sample data; x min and x max respectively represent the minimum and maximum sample data; x i ' is the normalized data.
The mean square error (MSE), mean absolute percentage error (MAPE) and square correlation coefficient (R 2 ) are used to evaluate the performance of predictive models, as shown below [10].
j y D ðiÞ À yðiÞ yðiÞ j ð24Þ Where, y D and � y D represent predicted values and predicted average values, y and � y represent measured values and measured average values, and N represents the number of observed samples. The closer the R 2 is to 1, the smaller the MSE and the MAPE, the better the prediction effect of the model.

Support vector machine
SVM [10] usually needs to establish a suitable function f(x) to describe the nonlinear relationship between the characteristic value x i and the target value y i , as shown below.
Where, w is the coefficient vector, φ(x i ) is the transformation function, w and b represent the weight and bias respectively. w and b are estimated by minimizing the regularized hazard function, as shown below Where: Where, 1 2 kwk 2 is the regularization term, C is the penalty coefficient, and L ε (y i ,f(x i )) is the ε-insensitive loss function. The optimization object can be deducted as follows: Subject to ( Where, ξ + and ξ − represent slack variables.
The key is to establish the Lagrangian function.

PLOS ONE
Dam deformation forecasting using SVM-DEGWO al-gorithm based on phase space reconstruction Subject to Therefore Where K(x i ,x j ) represents kernel function, including polynomial, radial basis function and sigmoid etc.

Hybrid Grey Wolf Optimizer (HGWO, DEGWO)
Grey Wolf Optimizer (GWO) is a new intelligent optimization algorithm proposed by Mirjalili with reference to the social hierarchy and hunting behavior of gray wolves [29]. The GWO algorithm realizes the optimization of the intelligent algorithm through the process of tracking, encircling, hunting, and attacking the grey wolf population. The algorithm is characterized by simple principle, few adjustment parameters, easy implementation, and strong global search capability.
Many scholars have improved and applied research on the GWO algorithm from a specific perspective for specific problems. These improvements are mainly concentrated in the following aspects: (1) Improve the initial population for addressing the problem that the random generation method cannot guarantee the initial population diversity [30]. (2) Improve the search mechanism to keep the GWO algorithm away from the local optimum [31,32]. (3) Adjust the way of parameter changes to balance the algorithm's global and local search capabilities [33][34][35]. (4) Hybrid algorithm, which combines the advantages of multiple algorithms to improve the algorithm's performance.
A hybrid GWO (DEGWO) algorithm is proposed, which uses a differential evolution (DE) algorithm to generate a richer initial population. The performance of the DEGWO algorithm is tested using 15 CEC2005 benchmark functions. The test results show that compared with the whale optimization algorithm (WOA), particle swarm optimization (PSO) algorithm and the original GWO algorithm, the DEGWO algorithm has higher execution efficiency.
3.2.1. Grey Wolf Optimizer. Grey wolves [36, 37] have a very strict social dominance hierarchy, which is mainly divided into four parts: α,β,δ and ω. α is the best solution, followed by β and δ, and the remaining solutions belong to ω. The top three best wolves that are closest to their prey are α, β and δ, and they guide ω to search for prey in promising search areas. During the hunting process, the wolf will update its position around α, β and δ, as shown below.
Where t is the current iteration number, X ! pðtÞ is the current position of the prey, X ! ðtÞ is the current position of the wolf, and D ! is the distance between the wolf and the prey.
The coefficient vectors A ! and C ! are as follows.
Where r ! 1 and r ! 2 are two vectors randomly generated between [0, 1], and the convergence factor a ! linearly decreases from 2 to 0 in the iterative process.
Update the position of grey wolves.
The initial population P 0 is mainly randomly generated within the upper and lower limit (x L ,x U ), the j-th index of the i-th individual is obtained by Eq (39).
The mutation operation generates a new mutation vector v G i , as shown below.
Where r 1 ; r 2 ; r 3 2 f1; 2; � � � ; Ng are randomly generated integers, and F is the magnification ratio of the control difference vector, which is a real number with a varying range between [0, 2].
The crossover operation is as follows:

PLOS ONE
Where CR is the cross probability, which takes a value between [0, 1]. Finally, the selection operation is performed. For specific problems, all mutation crossover individuals are evaluated. For specific problems, all mutation crossover individuals are evaluated. If the fitness of the current individual exceeds the previous generation, it means that the mutation crossover operation is successful, and the current individual is retained; if the current individual's fitness is not as good as the previous generation, the better individual is retained. The individual with the optimal fitness will become the optimal value of this generation of individuals. When the termination condition is met, the evolution will stop, otherwise the next round will continue.

PLOS ONE
3. Calculate the objective function value of a single grey wolf individual and determine the best three individuals as X α, X β and X δ respectively. 4. Calculate the distance between other grey wolf individuals and the optimal X α, X β and X δ according to Eq (38), and update the position of each grey wolf according to Eq (39).

5.
Update the values of a, A and C. Crossover and competition operations are applied to retain better individual positions and generate new individuals respectively.
6. Update the position of the first three grey wolves X α, X β and X δ .
7. Determine whether the maximum number of iterations t max has been reached. If yes, exit and output the current objective function value of X α ; otherwise, t = t + 1, and move to the third step to continue.  Table 1 and Figs 2-13. The parameters of all intelligent algorithms are set as follows: population size N = 30, particle dimension D = 10, and maximum number of iterations T = 50. Note that the algorithm is run 20 times on each benchmark function. Table 2 shows the final calculation results of PSO, GWO, WOA and DEGWO on the benchmark function. By comparing the average value (AVE) and standard deviation (STD), it is found that whether it is a single-peak benchmark function, a multi-peak benchmark function or a fixed-dimensional test function, the AVE and STD calculated by the DEGWO algorithm are in most cases smaller than those calculated by the other four algorithms.

Construction process of PSR-SVM-DEGWO model predicting dam deformation
A hybrid model combining chaos theory and SVM is proposed, and the DEGWO algorithm is used to select optimal parameters for concrete dam deformation analysis and prediction. Use the method mentioned in Section 2 to identify the chaotic characteristics of the deformed time series, and then determine the input variables of the support vector machine. According to the DEGWO algorithm introduced in Section 3.2, SVM is optimized for parameters. By performing training operations, a PSR-SVM-DEGWO prediction model can be established. The specific process is shown in Fig 26.  1. Reconstruct the phase space of the observed displacement time series. The mutual information method and the Cao method are adopted to determine the optimal delay time τ and the minimum embedding dimension m, and the phase space is reconstructed accordingly.
2. Identification of chaotic characteristics of the observed displacement time series. Estimate the maximum Lyapunov exponent λ max , correlation dimension D 2 and Kolmogorov entropy K 2 . λ max > 0, K 2 is a finite positive value and the saturation of the correlation dimension indicating that the observed displacement time series has the chaotic characteristics.
3. Determine the input and output variables according to Eq (19). 4. Use the DEGWO algorithm to find the optimal SVM parameters based on the training samples, generate the optimal values of C and σ, and complete the SVM-DEGWO training process.
5. According to the prediction samples, the trained SVM will be used for prediction, and the three indicators of Eqs (23)-(25) are used to evaluate the prediction effect of the model.

Engineering overview
The

Prediction model construction
The delay time τ of the observed displacement time series is estimated by the mutual information method, as shown in Figs 29 and 30. According to the principle of mutual information, the optimal delay times of the measuring points PL13-2 and PL13-3 are τ 1 = 10 and τ 2 = 11, respectively.
According to the determined τ, the Cao method is adopted to calculate m. The change curves of E 1 (m) and E 2 (m) as m increases are shown in Figs 31 and 32. When m is greater than the minimum embedding dimension, the change of E 1 (m) starts to become smaller, and thus the optimal embedding dimension is determined. As shown in Figs 31 and 32, the minimum embedding dimension of the measuring points PL13-2, and PL13-3 are m 1 = 4 and m 2 = 5, respectively. It can also be found that there are some values of m such that E 2 (m)6 ¼1, which can prove that the observed displacement data series comes from a deterministic process.
According to the determined τ, the G-P method is adopted to calculate the correlation dimension of the observed displacement data series. When the value of the embedding dimension d ranges from 1 to 10, the corresponding double logarithmic curve of lnC(r)~ln(r) is The wolf method is adopted to calculate the maximum Lyapunov exponent λ max of the observed displacement data series. And the maximum Lyapunov exponent λ max of the measuring points PL13-2, and PL13-3 are λ max−2 = 0.0191 and λ max−3 = 0.0108, respectively. λ max > 0 indicates that the observed displacement data series has chaotic characteristics.
The correlation integral method is used to calculate the Kolmogorov entropy estimate K 2 of the observed displacement data series. According to the double logarithmic curve of lnC(r)~ln (r), as shown in Figs 33 and 34, the double logarithmic curve of log 2 (r)~log 2 (C(r)) can be derived, so that the Kolmogorov entropy estimate K 2 of the measuring points PL13-2 and PL13-3 can be obtained when the delay time is determined respectively, and K 2-2 = 0.0043, K 2-3 = 0.0044. K 2 takes a finite positive value, which indicates the chaotic characteristics of the observed displacement data series.
Based on the above, it can be concluded that the observed displacement data series of the measuring point PL13-2 and PL13-3 has chaotic characteristics, and a chaotic prediction model of dam displacement can be established.
The DEGWO algorithm proposed is adopted to seek the optimal parameters of SVM. The phase space of reconstructed observation displacement and the influencing factors of dam deformation are used as input variables to evaluate the predicting performance of the

PLOS ONE
The minimum embdding dimension m 2 =5

PLOS ONE
Dam deformation forecasting using SVM-DEGWO al-gorithm based on phase space reconstruction improves the global search capability, thereby accelerating the convergence speed and convergence accuracy.

Results
For the PSR-SVM-DEGWO-based dam observation displacement prediction model, the relevant information is introduced as follows. The SVM is at the heart of this innovative

PLOS ONE
Dam deformation forecasting using SVM-DEGWO al-gorithm based on phase space reconstruction combination model. The input variable is the reconstructed phase space of the measured displacement data sequence, and the DEGWO algorithm is used to realize the parameter optimization of SVM. In addition, the kernel function of SVM is a radial basis function. In order to better analyze the predictive performance of the PSR-SVM-DEGWO model, the PSR-SVM and PSR-SVM-GWO models that take the reconstructed phase space of the observed displacement data sequence as input variables are established respectively. In addition, the SVM, SVM-GWO and SVM-DEGWO models with the factors affecting dam deformation as input variables are established to explore the influence of different input variables on the accuracy of model prediction. T-test is adopted to test whether there is a significant difference between the existing method and the proposed PSR-SVM-DEGWO method. The h represents whether the hypothesis is accepted at the significance level. When h = 1, it means that the two sets of data compared with each other have significant differences. At this time, the comparison between the two algorithms is meaningful. The p represents the set standard of significant difference, which is set to 0.05 here. When p is less than 0.05, the results have a significant difference. ci represents the data interval with 95% confidence level.
For the measuring point PL13-2, the prediction performance of the SVM, SVM-GWO, SVM-DEGWO, PSR-SVM, PSR-SVM-GWO and PSR-SVM-DEGWO models are shown in Table 3  The t-test is used to test whether there are significant differences between the calculation results of the other five algorithms and the results of the proposed PSR-SVM-DEGWO algorithm. It can be seen from Table 3 that all the results meet the condition h = 1 and p<0.05, which represents the existence of significant differences.
As shown in Fig 39, the deformation shows obvious periodic regular changes, but compared to the conventional arch dam below 200m, the deformation fluctuation range of Jinping I super high arch dam is much larger. SVM, SVM-GWO, SVM-DEGWO, PSR-SVM, PSR-SVM-GWO and PSR-SVM-DEGWO models can effectively predict the trend of dam displacement. And the PSR-SVM-DEGWO model has the highest prediction accuracy and the smallest fluctuation range of the prediction error. For the conventional model, the deviation between the predicted value of the model and the observed value is larger than the deviation of the PSR models.   proposed PSR-SVM-DEGWO algorithm. It can be seen from Table 4 that all the results meet the condition h = 1 and p<0.05, which represents the existence of significant differences. As shown in Fig 40, the deformation shows obvious periodic regular changes, but compared to the conventional arch dam below 200m, the deformation fluctuation range of Jinping I super high arch dam is much larger. SVM, SVM-GWO, SVM-DEGWO, PSR-SVM, PSR-SVM-GWO and PSR-SVM-DEGWO models can effectively predict the trend of dam displacement. And the PSR-SVM-DEGWO model has the highest prediction accuracy and the smallest fluctuation range of the prediction error. For the conventional model, the deviation between the predicted value of the model and the observed value is larger than the deviation of the PSR models.
The calculation results show: ①The applicability of the GWO optimized SVM algorithm in dam deformation prediction; ②The DEGWO algorithm proposed in this paper has more outstanding optimization ability in optimizing SVM parameters than the conventional GWO; ③For ultra-high arch dams, the PSR model with the reconstructed phase space as the input variable has higher prediction accuracy and smaller prediction error than its corresponding conventional model with dam deformation influence factors as input variables.
In general, the SVM, SVM-GWO, and SVM-DEGWO models can better predict the change trend of dam deformation. Regardless of whether it is a conventional model with the dam deformation influence factors as input variables or the PSR model with the reconstructed observation displacement data series phase space as the input variable, the prediction accuracy of all the model can meet the engineering requirements, but the prediction accuracy of the PSR model is higher. The calculation results also prove the applicability of the GWO algorithm in the field of dam deformation prediction and the more prominent optimization ability of DEGWO compared to GWO. The t-test results show that the calculation results of the other five algorithms are significantly different from the results of the proposed PSR-SVM-DEGWO algorithm. The result of t test also shows that the algorithm comparison is meaningful, no matter it is for the measuring point PL13-2 or PL13-3.

Conclusions
This research proposes an innovative model combining chaos theory, support vector machine, difference algorithm and gray wolf algorithm, namely the PSR-SVM-DEGWO model, to predict dam deformation. And taking the measured displacement data of the Jinping I super high arch dam as examples, the prediction effect of the PSR-SVM-DEGWO model is compared and verified. The main conclusions are as follows.
1. As the correlation dimension of the deformation time series tends to be saturated (D 2-2 = 1.2043, D 2-3 = 1.1217), the largest Lyapunov exponent (λ max-2 = 0.0191, λ max-3 = 0.0108) is greater than 0 and the Kolmogorov entropy estimate (K 2-2 = 0.0043, K 2-3 = 0.0044) is a finite positive value, it can be seen that there is chaos in the deformation observation data of the dam.
2. The optimization performance of the DEGWO algorithm is superior to that of the GWO algorithm. Using the DE algorithm to ensure the initial population diversity can effectively improve the grey wolf optimization algorithm's ability to find high-quality solutions. Simulation tests show that the convergence speed of the DEGWO algorithm is faster and the convergence accuracy is higher.
3. It is verified by the example of Jinping I super high arch dam that SVM, SVM-GWO and SVM-DEGWO models can effectively predict the dam deformation trend, but the SVM-DEGWO model has the best prediction performance, which is reflected in the higher accuracy of the model prediction.
4. The predictive performance of the PSR-SVM, PSR-SVM-GWO and PSR-SVM-DEGWO models with the reconstructed observation data sequence phase space as the input variable is superior to that of the corresponding SVM, SVM-GWO and SVM-DEGWO conventional models with deformation influence factors as input variables. When the conventional model predicts the deformation of an ultra-high arch dam, although the accuracy meets the requirements, the predicted value will gradually deviate from the measured value as time goes by. On the contrary, it is difficult to observe such large deviations in models that adopt the reconstructed phase space of the observation data sequence as the input variable. Among all the models calculated in this paper, the PSR-SVM-DEGWO model has the best prediction performance.