Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Debris flow volume prediction model based on back propagation neural network optimized by improved whale optimization algorithm

  • Bo Ni,

    Roles Software

    Affiliation Department of Civil Engineering, Chongqing Three Gorges University, Wanzhou, 404100, Chongqing, China

  • Li Li ,

    Roles Formal analysis

    lily6636694@163.com

    Affiliation Department of Civil Engineering, Chongqing Three Gorges University, Wanzhou, 404100, Chongqing, China

  • Hanjie Lin,

    Roles Methodology, Writing – original draft

    Affiliation Department of Civil Engineering, Chongqing Three Gorges University, Wanzhou, 404100, Chongqing, China

  • Yue Qiang,

    Roles Resources

    Affiliation Department of Civil Engineering, Chongqing Three Gorges University, Wanzhou, 404100, Chongqing, China

  • Hengbin Wu,

    Roles Writing – review & editing

    Affiliation Department of Civil Engineering, Chongqing Three Gorges University, Wanzhou, 404100, Chongqing, China

  • Zhongxu Zhang,

    Roles Validation

    Affiliation Department of Civil Engineering, Chongqing Three Gorges University, Wanzhou, 404100, Chongqing, China

  • Yi Zhang

    Roles Investigation

    Affiliation Department of Civil Engineering, Chongqing Three Gorges University, Wanzhou, 404100, Chongqing, China

Abstract

Debris flow is a sudden natural disaster in mountainous areas, which seriously threatens the lives and property of nearby residents. Therefore, it is necessary to predict the volume of debris flow accurately and reliably. However, the predictions of back propagation neural networks are unstable and inaccurate due to the limited dataset. In this study, the Cubic map optimizes the initial population position of the whale optimization algorithm. Meanwhile, the adaptive weight adjustment strategy optimizes the weight value in the shrink-wrapping mechanism of the whale optimization algorithm. Then, the improved whale optimization algorithm optimizes the final weights and thresholds in the back propagation neural network. Finally, to verify the performance of the final model, sixty debris flow gullies caused by earthquakes in Longmenshan area are selected as the research objects. Through correlation analysis, 4 main factors affecting the volume of debris flow are determined and inputted into the model for training and prediction. Four methods (support vector machine regression, XGBoost, back propagation neural network optimized by artificial bee colony algorithm, back propagation neural network optimized by grey wolf optimization algorithm) are used to compare the prediction performance and reliability. The results indicate that loose sediments from co-seismic landslides are the most important factor influencing the flow of debris flows in the earthquake area. The mean absolute percentage error, mean absolute error and R2 of the final model are 0.193, 29.197 × 104 m3 and 0.912, respectively. The final model is more accurate and stable when the dataset is insufficient and under complexity. This is attributed to the optimization of WOA by Cubic map and adaptive weight adjustment. In general, the model of this paper can provide reference for debris flow prevention and machine learning algorithms.

1 Introduction

Catastrophic earthquakes usually cause strong disturbances in the bedrock of the slope, which produces a large amount of loose material [1]. Therefore, the large amount of loose material from the Wenchuan earthquake provided excellent material source conditions for debris flows in the study area of this paper [2]. Moreover, debris flow disasters will not only cause damage to roads and bridges, but also lead to river blockage [36]. In general, the size of the mudslide volume is a direct characterization of the mudslide hazard [7]. Therefore, reliable and accurate prediction of debris flow volume is of great significance for debris flow prevention and control projects, emergency rescue and disaster relief [8].

The estimation of debris flow volume is mainly divided into two categories. The first category is mainly based on empirical formulas based on actual survey data. For example, Ma, Chang, Cao and Gartner have established different empirical formulas for different study areas [912]. The main contributions of the above four researchers are shown in Table 1.

Table 1 shows that the volume of loose material is the main factor influencing the volume of debris flows. This is particularly evident in the seismic zone. In addition, the basin area and rainstorm are also important factors affecting the volume of debris flow. However, empirical formulas have regional applicability. This means that an empirical formula for one study area may not be applicable to another study area. Moreover, the strong subjectivity of the researchers and the poor logic between the data may lead to large errors in the results of the empirical formula [13]. Therefore, using the traditional empirical formula to estimate the volume of debris flow is unreliable and has poor adaptability.

With the development of machine learning, artificial intelligence algorithms are gradually applied to natural disaster prediction. [14] Therefore, the second prediction method based on machine learning is applied to debris flow events. Zhao used machine learning algorithm and time series algorithm to predict debris flow events in advance [15]. The results show that the Extra Trees (ET) model accurately predicts mudslide events 35 minutes in advance. In addition, Kern said that future work should use machine learning models to improve debris flow volume prediction [16]. Lee used artificial neural network (ANN) to predict the volume of debris flow under extreme rainfall in central South Korea [17]. The prediction results provide a favorable reference for the control design of debris flow-prone areas in South Korea. In general, the machine learning model has good effect and potential in debris flow volume prediction, so more research is needed.

Back propagation neural network (BPNN), as the most widely used and mature neural network model, has been widely used in various fields of civil engineering [18,19]. However, when machine learning algorithms are used to predict geohazard events, the predictions are highly correlated with the quality of the dataset [20]. This is a challenge for neural networks. Because the neural network needs a large enough data set to output reliable and accurate prediction results [21]. As a result, BPNN may output unstable and low-accuracy results when there is insufficient data. However, obtaining data on geohazards through physical surveys would consume a great deal of human and financial resources. At present, optimization algorithms have been shown to improve the prediction accuracy and reliability of BPNN based on original data [22,23]. After comparing whale optimization algorithm (WOA), bacterial foraging optimization (BFO), particle swarm optimization (PSO) and genetic algorithm (GA), Tair et al. found that WOA algorithm has the best performance [24]. Currently, WOA has proven to be robust enough to find a good enough solution in a reasonable amount of time [25]. In addition, compared with the existing algorithms, WOA shows better or equivalent performance [26]. But the initial population position and the weight value in the shrinking mechanism of WOA will affect its performance. As one of the solutions, chaos theory is often used to improve the particle diversity of swarm intelligence algorithms [27]. RANI et al. used five chaotic maps (Chebyshev, Cubic, ICMIC, Neuron and Sine maps) to calculate the minimum values of eight functions (ShiftedSphere, Rastigin, Griewank, Exponential, Rosenbrock, Salamon, Csendes and Qing) [28]. The results show that the best performing chaotic mappings are Cubic and ICMIC. And Cubic outputs the minimum value with the smallest deviation error. Therefore, Cubic chaotic map is used as one of the methods to improve WOA. In addition, the adaptive weight adjustment strategy can be used to optimize the weight values in the shrinking boundary mechanism of WOA. This method can balance the early search ability and the later development ability of the algorithm [29]. In this paper, Cubic map and adaptive weight adjustment are used to optimize WOA. Therefore, the improved whale optimization algorithm (CA-WOA) is developed by the above two methods. Subsequently, the CA-WOA optimizes the weights and thresholds in BPNN, and the final CA-WOA-BPNN model is obtained. Finally, 60 debris flow data from Longmen Mountain are selected as a research sample in this study. This is because complex and inadequate sample data can be used to see whether the predictions are accurate and reliable.

The aim of this paper is to solve the problem of low accuracy and instability of BPNN under insufficient data and data complexity in the study of debris flow volume prediction. In Section 2, The relevant analysis, model construction process, comparative models, and performance indicators are introduced. In Section 3, the study area and models’ results are presented. In section 4, the content of this article and future work that needs further improvement are discussed in detail. In section 5, the conclusions of the CA-WOA-BPNN prediction model used in the research field of this article are introduced.

2 Method

2.1 Analysis of relationship

Correlation analysis is used to evaluate the correlation and importance between the influencing factors of the model, to select more relevant evaluation indicators. This paper uses Pearson correlation coefficient (PCC) and maximum information coefficient (MIC) for feature selection.

PCC is a well—known correlation measurement method with a range of [–1,1] [30]. Among them, 1 represents a completely positive correlation; -1 represents a completely negative correlation; 0 represents irrelevant. PCC as shown in Formula (1).

(1)

Where Xi and Yi are the samples, and and are the average values of the samples.

PCC as a linear analysis method, the factors with low correlation coefficient will not be effectively identified. However, the prediction accuracy of BPNN may be closely related to the factors with low correlation coefficient. The Maximum Information Coefficient (MIC) can be used to overcome this problem, which covers all the functional relationships. The prediction accuracy of the model is improved by combining MIC and PCC for feature selection. Mutual information (MI) is a measure of mutual trust between variables in information theory, as shown in Formula (2).

(2)

Where p(x, y) is the joint distribution of (X, Y), p(x) and p(y) are the marginal distribution.

The maximum information coefficient (MIC) is shown in Formula (3).

(3)

Where a and b are the number of mesh partitions; B is the upper limit of the grid, usually B = n0.6, n is the number of samples.

2.2 Back Propagation Neural Network (BPNN)

2.2.1 Standard BPNN.

The BPNN is a feed-forward network trained by error back propagation algorithm, and it is also the most widely used network at present [31]. The network is composed of input layer, hidden layer and output layer. The weights between the layers are obtained by the forward propagation of the signal and the back propagation of the error. Then the BPNN model is established. The BPNN used in this paper is a single hidden layer network, as shown in Fig 1. It contains d input neurons x; q hidden layer neurons b; l output neurons y.

In the Fig 1, vih is the connection weight between the input layer neuron xi and the hidden layer neuron bn; whj is the connection weight between the hidden layer neuron bn and the output layer yi.

The input received by the hidden layer neuron bn is shown in Formula (4).

(4)

The input received by the output layer neuron yi is shown in Formula (5).

(5)

The training sample error is denoted as Ek. Gradient descent is used as the adjustment strategy of neural network. Adjust the parameters in the negative gradient direction of the target. The learning rate η is assigned to the error Ek, and the weight value is adjusted, as shown in Formulas (6) and (7).

(6)(7)

The goal of the algorithm training is to make the cumulative error E of the training set meet the training accuracy requirements, as shown in Formula (8).

(8)

On the one hand, the oscillation of the network will be affected by the training rate in the neural network. On the other hand, the training rate is proportional to the convergence rate. The allowable error is generally set to 0.001–0. 00001.When the result error of more than two iterations is less than allowable error, the neural network iteration ends. In addition, the number of trainings is usually set to 1000 times. In summary, the key hyperparameters of BPNN in this paper are determined. The allowable error value is 0.000001; the training rate is 0.01; the number of trainings is 1000 times.

2.2.2 Best hidden layer node.

The selection of the number of hidden layers in neural networks has always been a hot topic in neural network research [32]. Too many hidden layer nodes will lead to an increase in training time and too much training [33]. On the contrary, too few hidden layer nodes will lead to insufficient network performance due to the complexity of data [33]. In this paper, the cycle method is used to determine the number of hidden layer nodes and training error. When initializing the setting, the mean square error (MSE) is set to a larger number. This step can find the minimum error and the best hidden layer node in the loop. The calculation of hidden layer nodes is shown in Formula (9) [34].

(9)

Where h is the number of hidden layer nodes; m is the number of input layer nodes; n is the number of output layer nodes; a is generally an integer between 1 and 10.

2.3 Whale Optimization Algorithm (WOA)

As a meta-heuristic optimization algorithm, WOA is mainly inspired by humpback whale hunting behavior [25]. Humpback whales have a special way of preying, known as bubble-net predation. The standard WOA simulates the unique search method and hunting mechanism of humpback whales. Its three important stages mainly include hunting prey, bubble net predation and searching prey. In WOA, each solution is considered as a whale. Whales try to fill new places in the search space, which is considered a reference to the best elements in the group. WOA searches the optimal solution in the solution space iteratively to find the best parameter configuration. In each iteration, WOA adjusts the weight and bias values according to the current solution space position and fitness value. The humpback whale foam hunting model is shown in Fig 2 [35].

2.3.1 Surround prey.

When the whale is hunting, it is necessary to find the location of the prey in advance when the prey is surrounded. But the location is usually unknown. Therefore, the WOA algorithm assumes that the solution with the smallest individual fitness value in the population is the position of the prey or the nearest position to the prey. Then, the positions of other search individuals are updated according to the optimal solution. The formula is as follows: (10)

Where X is the position vector of the search body; X* is the position vector of the current optimal solution; A is used to adjust the global exploration and local search ability of the algorithm; D is the distance vector between the current optimal solution and the search body. The calculation formulas of the A and D are as follows: (11) (12)

Where a decreases linearly from 2 to 0 with the number of iterations. The formula is as follows: (13)

Where t is the current number of iterations; T is the maximum number of iterations; r is a random vector in [0,1]; C is the coefficient vector. The formula is as follows: (14)

2.3.2 Bubble net hunting.

WOA includes two position update strategies, namely the shrink-wrapping mechanism and the spiral position update. Among them, the shrink-wrapping mechanism is realized by the value of the convergence factor a in Formula (11). When |A| ≤ 1, the individual of the updated position moves from the original position to the target position. At this time, the shrink-wrapping mechanism is implemented by Formula (10). The spiral position update is to simulate the spiral motion state of the whale, as shown in Formula (15).

(15)

Where b is a constant used to limit the logarithmic spiral shape; l is a random number between [–1,1];

Since the position of the whale is spirally updated, the encirclement is also reduced. The above synchronization process is realized by the probability p (p is a random number between [0,1]) in the WOA algorithm. This step is used to determine whether to perform the shrink-wrapping mechanism or the spiral position update. The formula is: (16)

2.3.3 Prey search.

When |A| > 1, the positions of other individuals are no longer updated by the target position in WOA. Instead, the position of the search body in the group is randomly found to replace the target position. This will enhance the global search ability of the algorithm. The update formula is as follows: (17) (18)

Where Xrand represents the location of a random searcher in the current population.

In this paper, the initial population size of WOA is set to 30; the maximum number of iterations is set to 50; the independent variable interval is set to [–3,3]; the value of b in Formula (15) is set to 1.

2.4 Cubic map

The randomly generated initial population may cause WOA to fall into local optimum [36]. Because the algorithm group lacks diversity. In this study, Cubic map is added to the initial population position of WOA algorithm to optimize the randomly generated initial population position. Compared with the original random search, it can search the search space thoroughly with faster speed and probability. The Cubic map is shown in Formula (19).

(19)

Where ρ is the control parameter, xk is the initial population generation method for WOA.

In this study, the value of ρ is 1.

2.5 Adaptive weight adjustment

In this study, an index-based adaptive weight adjustment method is proposed to optimize the weight value in the shrink-wrapping mechanism in WOA. The weight value changes with the number of iterations. To improve the weight value in the contraction closure mechanism in WOA. Adaptive weight adjustment is shown in Formula (20).

(20)

Where ωmin is the minimum weight value; ωmax is the maximum weight value; mm is the adjustment coefficient; t is the current number of iterations; maxgen is the number of iterations; (−t/maxgen) decreases with the increase of the number of iterations. In this study, the value of mm is 1.

Firstly, Pearson correlation coefficient (PCC) and maximum information coefficient (MIC) are used to screen influencing factors to improve model accuracy, and then the data is normalized. Subsequently, Cubic map optimizes the initial population position in WOA. At the same time, the adaptive weight adjustment optimizes the weight value in the shrink-wrap mechanism in WOA. Finally, the improved WOA optimizes the weights and thresholds in BPNN to obtain the final CA-WOA-BPNN model. The whole workflow is shown in Fig 3, and the pseudo-code of the final model is shown in S1 Appendix.

2.6 Contrast model

To compare the prediction accuracy and stability of the final model, other machine learning algorithms need to be introduced for comparison [37]. SVR and XGBoost have the advantages of less time-consuming and high precision when the samples are not sufficient. Therefore, these two algorithms are applicable to the study area of this paper. Based on the Optuna framework, the key hyperparameters of SVR and XGBoost are adjusted. Optuna is an automatic hyper-parameter adjustment framework, including grid search method, random search method, Bayesian optimization algorithm and so on. [38]. The Bayesian optimization method is an effective optimization algorithm, which has been widely used in parameter optimization, hyper-parameter optimization and other fields [39].

In SVR, c is the penalty coefficient, which is understood as the tolerance of error. The gamma is a parameter attached to the selection of RBF function as kernel function. It implicitly determines the data distribution mapped to the new feature space.

In XGBoost, n _ estimators are the number of basic learners. The larger the number, the stronger the learning ability of the model, but the easier the model is to over-fit. The max _ depth is the depth of the tree and is an important pruning parameter. The learning _ rate is the step size of the iterative decision tree, also known as the learning rate. It controls the iteration rate of the algorithm and is often used to prevent overfitting. The gamma is used as a threshold to determine whether a leaf node should be further segmented.

On the other hand, to compare the performance of WOA and improved WOA, other meta-heuristic algorithms are selected for comparison. Artificial bee colony (ABC) and grey wolf optimization (GWO) are meta-heuristic algorithms that simulate the hunting behavior of other organisms in nature [40,41]. In addition, these two methods have been widely used in various optimization algorithm fields [4245]. Therefore, this paper chooses ABC and GWO as comparative optimization algorithms to observe the performance of this method. The parameters of these two meta-heuristic algorithms are the same as those of the WOA algorithm in this paper (see Section 1.3 for details).

The key hyper-parameters of SVR, XGBoost, ABC and GWO are shown in Table 2.

2.7 Performance metrics

The Mean Absolute Error (MAE), Mean Absolute Percentage Error (MAPE) and coefficient of determination (R2) are used to evaluate the model. The smaller the MAE and MAPE, and the closer the R2 is to 1, the performance of the model is excellent. The MAE, MAPE, and R2 are shown in Formulas (21), (22), and (23).

(21)(22)
  1. Where At is the real value of the sample; Ft is the predicted value of the model.
(23)

Where SSR is the sum of regression squares; SST = SSR+SSE; SSE is the sum of residual squares. The calculations of SSR and SSE are shown in Formulas (24) and (25).

(24)(25)

Where is the predicted value; is the average value of the true value; and y is the true value.

Variance and standard deviation can reflect the degree of dispersion between data. Therefore, it is necessary to evaluate the final optimization function of each meta-heuristic algorithm (the weight value and threshold of BPNN in this paper). Calculate the variance (S) and standard deviation (S2) of the W1, W2 and B1. Among them, W1 is the weight value from the input layer to the middle layer; W2 is the weight value from the middle layer to the output layer; B1 is the neuron threshold of the middle layer. The calculation formulas of standard deviation (S) and variance (S2) are shown in Formulas (26) and (27).

(26)(27)

Where n is how many values are generated in each independent operation; is the average value.

3 Cases and results

3.1 Research area

The Longmenshan Mountain is located in a steep mountain range on the edge of the Qinghai-Tibet Plateau in southwestern China. The Longmenshan fault zone starts from the Luding area in the southwest and passes through Wenchuan, Guanxian, Beichuan, Yangpingguan, and Mianxian in the northeast. It passes through the Qinling Mountains for more than 600 kilometers. The Wenchuan earthquake occurred on the Longmenshan fault zone. This makes the Longmenshan fault zone become one of the key research areas [46].The debris flow in the study area is distributed in the rugged mountains of 407 ~ 6100 m. The slope direction is basically along the northeast direction, the slope is as high as 69°, and more than half of the slope is greater than 36° [47].

The rocks in the mountain area are mainly composed of basalt, granite, phyllite, dolomite, limestone, sandstone and shale. The age is from Precambrian to Cretaceous, with strong fracture and weathering characteristics [46]. In addition, in the above geomorphological features, it is easy to produce large thrust earthquakes with coseismic landslip [48].

The loose material produced by the earthquake is prone to debris flow disaster under the action of heavy rainfall or ice and snow melting. Therefore, after the Wenchuan earthquake, landslides or debris flow events occur every monsoon season. Previous studies have shown that under the action of earthquakes, debris flow events will occur many times in the catchment area of the study area [46].Therefore, it is necessary to make predictions about the region.

In other debris flow volume prediction studies, catchment area (A), topographic relief (H), channel length (L), and average channel gradient (J) are used as the main factors [49,50]. The study area in this paper is on the seismic zone. A large number of loose materials produced by the Wenchuan earthquake in 2008 provided excellent source conditions for debris flows. Therefore, total volume of co-seismic landslide debris (V) and distance from seis-mic fault (D) are added to the research factors in this paper [46]. According to the above six influencing factors, sixty typical debris flow data in this area are selected, as shown in Table 3 [46].

In the above table, the A is catchment area, the H is topographic relief, the L is channel length, the D is distance from seismic fault, the J is average channel gradient, the V is total volume of co-seismic landslide debris, and the V0 is the volume of debris flow.

3.2 Data processing

3.2.1 The normalized processing.

The sixty debris flow evaluation factors in Table 2 are taken as the decision variable matrix Z, where Z = {zij} (i = 1, 2…, m; j = 1, 2…, n). The m represents the number of debris flow, m = 60; n represents the number of evaluation factors, n = 6. Due to the different dimensions of each evaluation factor, the results will be affected, and the decision matrix needs to be normalized. The evaluation factors are normalized using the minimum-maximum and mapped to the [0, 1] interval, the normalization as shown in Formula (28).

(28)

Where max () represents the maximum value; min () represents the minimum value.

3.2.2 Analysis of relationship.

PCC and MIC are used to analyze the correlation between the basic data in Table 3. A, H, L, D, J, V and V0 are then sorted according to their importance. The results are shown in Fig 4 and Table 4.

thumbnail
Fig 4. Correlation analysis of each influencing factor.

The left longitudinal axis and the symmetrical line are the influencing factors, and the redder the color, the greater the correlation.

https://doi.org/10.1371/journal.pone.0297380.g004

thumbnail
Table 4. The importance between 6 factors and V0.

https://doi.org/10.1371/journal.pone.0297380.t004

The above results show that the PCC of V and V0 is 0.89, which is the largest positive correlation coefficient. The PCC of other factors are A and V0 (0.71), L and V0 (0.58), H and V0 (0.39), D and V0 (0.06) and J and V0 (-0.35), respectively. In addition, the MICs of D and J are only 0.25 and 0.34, respectively.

Therefore, combined with the results of PCC and MIC, only V, A, L and H are selected as input factors to improve the prediction accuracy of the model. The input factors ratio is 8:2, that is, forty-eight as the training set and twelve data as the prediction set.

3.3 Model prediction results and optimization algorithm analysis

3.3.1 Model prediction results.

Each model is run independently for 5 times, and each prediction result is brought into Formulas (21)–(23) for 5 calculations. Subsequently, the average, median, optimal and worst values are used to calculate the results of five calculations to represent the prediction accuracy and stability of the model, as shown in Fig 5. In addition, the training set and prediction set of each model are the same.

thumbnail
Fig 5. The evaluation results of each model after five independent operations.

https://doi.org/10.1371/journal.pone.0297380.g005

In Fig 5, the smaller the MAE and MAPE, the larger the R2, and the better the prediction performance of the model.

From the two metrics of MAE and MAPE, the prediction accuracy of CA-WOA-BPNN model is the highest, followed by WOA-BPNN. Then there are SVR and XGBoost, and the performance of these two models is almost the same. The worst are ABC-BPNN, GWO-BPNN and BPNN. It is worth noting that the MAE and MAPE of ABC-BPNN and GWO-BPNN are larger than those of BPNN.

From the fitting degree (R2) of the model, the model fitting degree of CA-WOA-BPNN is also the best, followed by WOA-BPNN. Then there are SVR and XGBoost, and the R2 of these two models is almost the same. In addition, the R2 of ABC-BPNN and GWO-BPNN is better than that of BPNN, which is different from the evaluation results of MAE and MAPE.

In general, the model (CA-WOA-BPNN) in this paper has the highest prediction accuracy and the best fitting degree.

3.3.2 Objective function results of meta-heuristic algorithm.

The standard deviation and variance of the objective function (W1, W2 and B1) output by the four meta-heuristic algorithms (ABC, GWO, WOA and CA-WOA) are shown in Fig 6. Among them, the neural network parameters in ABC-BPNN, GWO-BPNN and WOA-BPNN are the same, and the hidden layer is set to 9. Different from the above three algorithms, CA-WOA-BPNN is set to automatically find the number of hidden layers. Moreover, the search interval of all the 4 meta-heuristic algorithms is [–3,3].

thumbnail
Fig 6. Variance and standard deviation of W1, W2 and B1.

In Fig 6, the left side is the standard deviation, the right side is the variance, the vertical axis is the value; and the horizontal axis is the number of runs.

https://doi.org/10.1371/journal.pone.0297380.g006

The greater the standard deviation and variance, the greater the degree of dispersion between the data. In Fig 6, the variance and standard deviation of ABC-BPNN and GWO-BPNN are almost greater than those of WOA-BPNN and CA-WOA-BPNN. This shows that the dispersion between the weight value and the threshold calculated by WOA-BPNN and CA-WOA-BPNN is smaller. In addition, the results in Fig 5 show that the prediction performance of WOA-BPNN and CA-WOA-BPNN is better than that of ABC-BPNN and GWO-BPNN. Combined with the results in Fig 6, the optimization performance of ABC and GWO is weaker than that of WOA and CA-WOA under the same search interval and the data in this paper.

The above results show that the prediction accuracy and reliability of CA-WOA-BPNN are better than the other six algorithms. Under the same parameter setting and data set, the optimization effect of WOA on neural network is obviously better than that of ABC and GWO. In addition, the improved WOA can find better weights and thresholds in BPNN. In general, the prediction results of CA-WOA-BPNN can provide reference for debris flow prevention and emergency rescue under insufficient data.

4 Discussion

Through correlation analysis, four main factors affecting the volume of debris flow in Longmenshan area are determined. They are co-seismic landslide debris (V), catchment area (A), channel length (L) and topographic relief (H). Among them, the total volume of co-seismic landslide debris has the greatest correlation with debris flow volume. Because the study area is in the seismic zone, the landslide material produced by the Wenchuan earthquake provides an excellent source condition for the debris flow. In addition, Fig 4 shows that the other three input factors (A, L and H) are also highly correlated with V. Therefore, it is necessary to start the preliminary work of predicting the volume of mudslides with a correlation analysis.

The Fig 5 shows that there is almost no difference between the mean, median, optimal and worst values of CA-WOA-BPNN, WOA-BPNN, XGBoost and SVR. This means that the prediction results of their four models are more stable. On the contrary, the prediction results of BPNN, GWO-BPNN and ABC-BPNN are unstable. In addition, the prediction performance of SVR and XGBoost is inferior to that of CA-WOA-BPNN. This shows that in the absence of data and complex data relationships, SVR and XGBoost need more learning samples for reference. Even though these two models have been proven to exhibit good predictive performance in the absence of data. On the contrary, the CA-WOA-BPNN prediction model proposed in this paper is more suitable for this research field. It has high accuracy and reliability in the absence of data and strong correlation between data. Especially in the prediction of debris flow volume, only high precision is often not enough. Good stability is also one of the criteria to be considered. Because only one low-precision prediction result may make the disaster relief personnel bear greater risks. Therefore, the CA-WOA-BPNN in this paper is more suitable for the study area.

In Fig 6, the deviation between GWO and ABC output results is greater than that of CA-WOA and WOA. In order to find out the cause of this problem, all the output results of W1 are drawn, as shown in Fig 7. Fig 7 shows that the scatter points of ABC-BPNN and GWO-BPNN are basically evenly distributed between [–3,3]. On the contrary, the scatter points of WOA-BPNN are clustered at—3. The situation of CA-WOA-BPNN is similar to that of WOA-BPNN, and its scatter points are clustered between [0,1]. This is due to the optimization of WOA by Cubic map and adaptive weight adjustment strategy. At the same time, Fig 5 shows that the prediction performance of CA-WOA-BPNN is the best. This indicates that the weight and threshold of CA-WOA output are more suitable for the data set of this paper. This also proves that the optimization of Cubic map and adaptive weight adjustment strategy is effective.

thumbnail
Fig 7. W1 output by meta-heuristic algorithms.

https://doi.org/10.1371/journal.pone.0297380.g007

The optimal prediction results of three models (BPNN, WOA-BPNN, CA-WOA-BPNN) are selected. Then the absolute values of the difference between the predicted value and the real value are used as the index to draw Fig 8. The Fig 8 shows that the errors of WOA-BPNN and CA-WOA-BPPN are much smaller than those of BPNN. Especially in the prediction of large debris flow volume (No.11). In addition, the maximum errors of BPNN, WOA-BPNN and CA-WOA-BPNN are NO.11, NO.5 and NO.7, respectively. These analyses show that the improved BPNN is superior to BPNN. Especially in the prediction of large volume debris flow. Moreover, CA-WOA-BPNN does not misjudge small-volume debris flows as large-volume. This is beneficial for disaster relief personnel to carry out rescue after debris flow disaster. Because the correct prediction results can provide a reliable reference for disaster relief.

thumbnail
Fig 8. The difference between the predicted value and the true value of BPNN, WOA-BPNN and CA-WOA-BPNN.

https://doi.org/10.1371/journal.pone.0297380.g008

In general, CA-WOA-BPNN can achieve high accuracy and stability under insufficient samples and complex data. Under the same parameter settings, the optimization effect of CA-WOA is better than that of WOA, ABC and GWO. This is benefit by Cubic map and adaptive weight adjustment strategy. Therefore, CA-WOA-BPNN overcomes the instability and low prediction accuracy of BPNN under insufficient and complex data. In other words, this method can provide a favorable reference for debris flow prevention and disaster relief. However, this paper still has some limitations:

  1. The performance of machine learning algorithms is related to the quality of data sets. Therefore, how to improve the quality of data sets is the key to achieve efficient prediction. For example, using data simulation software generate data similar to the study area. Then, it is used as input for learning. Finally, the predicted value of the output is compared with the real value. This method can not only test whether the generated data is reliable, but also can expand the data set in a low-cost way.
  2. As the data increases, the running time of the model will be longer. Therefore, it is necessary to focus on how to shorten the running time. Research shows that fuzzy logic can tolerate data inaccuracy and reduce running time [51]. In addition, it can also improve the generalization ability of neural networks [51]. Therefore, fuzzy logic is also a reference for further research.
  3. The optimization results of ABC and GWO are unsatisfactory. Therefore, how to adjust the hyperparameters of these two algorithms needs to be further explored.

5 Conclusion

In this paper, the sixty debris flows in Longmenshan area are taken as research cases. Correlation analysis is used to identify the four main factors affecting the study area. Then, the Cubic map and adaptive weight adjustment are used to optimize the whale optimization algorithm (WOA). Subsequently, the improved WOA is used to optimize the Back Propagation Neural Network (BPNN). Finally, the CA-WOA-BPNN debris flow volume prediction model is established to carry out the study. The following conclusions are obtained:

  1. In the Longmenshan area, co-seismic landslide debris is the main factor influencing the volume of debris flows. This is because loose material from earthquakes provides excellent source conditions for debris flows.
  2. CA-WOA-BPNN outperforms the other six models in terms of reliability and stability. The model can overcome the instability of BPNN under insufficient and complex data. This is attributed to the optimization of WOA by Cubic map and adaptive weight adjustment strategies. In which, Cubic map optimizes the initial population position of WOA. In addition, adaptive weight adjustment optimizes the weight value of the shrink-wrap mechanism in WOA. Overall, CA-WOA-BPNN can provide an effective reference for debris flow volume prediction studies with insufficient samples and complex data.
  3. In future studies, more attention needs to be paid to model runtime, dataset quality, and hyperparameter selection.

References

  1. 1. Tang C., Zhu J., Ding J., Cui X. F., Chen L., Zhang J. S. (2011). Catastrophic debris flows triggered by a 14 August 2010 rainfall at the epicenter of the Wenchuan earthquake. Landslides, 8, 485–497.
  2. 2. Huang R., Li W. (2014). Post-earthquake landsliding and long-term impacts in the Wenchuan earthquake area, China. Engineering Geology, 182, 111–120.
  3. 3. Yang Z., Zhao X., Chen M., Zhang J., Yang Y., Chen W., Wu Q. (2023). Characteristics, Dynamic Analyses and Hazard Assessment of Debris Flows in Niumiangou Valley of Wenchuan County. Applied Sciences, 13(2), 1161. https://doi.org/10.3390/app13021161
  4. 4. Yan Y., Hu S., Zhou K., Jin W., Ma N., Zeng C. (2023). Hazard characteristics and causes of the “7.22” 2021 debris flow in Shenshuicao gully, Qilian Mountains, NW China. Landslides, 20(1), 111–125. https://doi.org/10.1007/s10346-022-01992-7
  5. 5. Xu Q., Fan X., Huang R., Yin Y., Hou S., Dong X., Tang M. (2010). A catastrophic rockslide-debris flow in Wulong, Chongqing, China in 2009: background, characterization, and causes. Landslides, 7, 75–87. https://doi.org/10.1007/s10346-009-0179-y
  6. 6. Hu K. H., Cui P., Zhang J. Q. (2012). Characteristics of damage to buildings by debris flows on 7 August 2010 in Zhouqu, Western China. Natural Hazards and Earth System Sciences, 12(7), 2209–2217. https://doi.org/10.5194/nhess-12-2209-2012
  7. 7. Fuchs S., Kaitna R., Scheidl C., Hübl J. (2008). The application of the risk concept to debris flow hazards. Geomechanik und Tunnelbau: Geomechanik und Tunnelbau, 1(2), 120–129. https://doi.org/10.1002/geot.200800013
  8. 8. Tsai M. P., Hsu Y. C., Li H. C., Shu H. M., Liu K. F. (2011). Application of simulation technique on debris flow hazard zone delineation: a case study in the Daniao tribe, Eastern Taiwan. Natural Hazards and Earth System Sciences, 11(11), 3053–3062. https://doi.org/10.5194/nhess-11-3053-2011
  9. 9. Ma C., Hu K., Tian M. (2013). Comparison of debris-flow volume and activity under different formation conditions. Natural hazards, 67, 261–273. https://doi.org/10.1007/s11069-013-0557-6
  10. 10. Chang C. W., Lin P. S., Tsai C. L. (2011). Estimation of sediment volume of debris flow caused by extreme rainfall in Taiwan. Engineering Geology, 123(1–2), 83–90. https://doi.org/10.1016/j.enggeo.2011.07.004
  11. 11. Cao C., Zhang W., Chen J., Shan B., Song S., Zhan J. (2021). Quantitative estimation of debris flow source materials by integrating multi-source data: a case study. Engineering Geology, 291, 106222. https://doi.org/10.1016/j.enggeo.2021.106222
  12. 12. Gartner J. E., Cannon S. H., Santi P. M., Dewolfe V. G. (2008). Empirical models to predict the volume of debris flows generated by recently burned basins in the western US. Geomorphology, 96(3–4), 339–354. https://doi.org/10.1016/j.geomorph.2007.02.033
  13. 13. Arattano M., Bertoldi G., Cavalli M., Comiti F., D’Agostino V., Theule J. (2015). Comparison of methods and procedures for debris-flow volume estimation. In Engineering Geology for Society and Territory-Volume 3: River Basins, Reservoir Sedimentation and Water Resources (pp. 115–119). Springer International Publishing. https://doi.org/10.1007/978-3-319-09054-2_22
  14. 14. Ogie, R. I., Rho, J. C., Clarke, R. J. (2018, December). Artificial intelligence in disaster risk communication: A systematic literature review. In 2018 5th International Conference on Information and Communication Technologies for Disaster Management (ICT-DM) (pp. 1–8). IEEE. https://doi.org/10.1109/ICT-DM.2018.8636380
  15. 15. Zhao Y., Meng X., Qi T., Li Y., Chen G., Yue D., Qing F. (2022). AI-based rainfall prediction model for debris flows. Engineering Geology, 296, 106456. https://doi.org/10.1016/j.enggeo.2021.106456
  16. 16. Kern A. N., Addison P., Oommen T., Salazar S. E., Coffman R. A. (2017). Machine learning based predictive modeling of debris flow probability following wildfire in the intermountain Western United States. Mathematical Geosciences, 49, 717–735. https://doi.org/10.1007/s11004-017-9681-2
  17. 17. Lee D. H., Cheon E., Lim H. H., Choi S. K., Kim Y. T., Lee S. R. (2021). An artificial neural network model to predict debris-flow volume caused by extreme rainfall in the central region of South Korea. Engineering Geology, 281, 105979. https://doi.org/10.1016/j.enggeo.2020.105979
  18. 18. Yu G.Q., Zhang M.S., Wang G.L., et al. (2012) Comparison and application of support vector machine and BP neural network in predicting average velocity of debris flow [J]. Journal of Water Resources, 43 (S2): 105–110. (In Chinese) https://link.cnki.net/doi/10.13243/j.cnki.slxb.2012.s2.006
  19. 19. Ferentinou M., Fakir M. (2018). Integrating rock engineering systems device and artificial neural networks to predict stability conditions in an open pit. Engineering Geology, 246, 293–309. https://doi.org/10.1016/j.enggeo.2018.10.010
  20. 20. Dikshit A., Pradhan B., Alamri A. M. (2021). Pathways and challenges of the application of artificial intelligence to geohazards modelling. Gondwana Research, 100, 290–301. https://doi.org/10.1016/j.gr.2020.08.007
  21. 21. Buchaniec S., Gnatowski M., Brus G. (2021). Integration of classical mathematical modeling with an artificial neural network for the problems with limited dataset. Energies, 14(16), 5127. https://doi.org/10.3390/en14165127
  22. 22. Tian J., Li Y., Zhang Lei., et al. (2023) Adaptive Control of Temporary Support Force Based on PSO-BP Neural Network [J]. Industrial and Mining Automation, 49 (07): 67–74. (In Chinese) https://link.cnki.net/doi/10.13272/j.issn.1671-251x.2022100017
  23. 23. He Y. B., Guo H., Zhang B. Q., Zhu Q., Tang H. M., Li Y. H. (2023) A prediction method for silicon content in molten iron based on GA-BP neural network. Journal of Iron and Steel Research 1–10 (In Chinese) https://link.cnki.net/doi/10.13228/j.boyuan.issn1001-0963.20230158
  24. 24. Tair M., Bacanin N., Zivkovic M., Venkatachalam K. (2022). A chaotic oppositional whale optimisation algorithm with firefly search for medical diagnostics. Computers, Materials & Continua 72 (1), 959–982 (2022). https://doi.org/10.32604/cmc.2022.024989
  25. 25. Bacanin, N., Antonijevic, M., Bezdan, T., Zivkovic, M., Rashid, T. A. (2022, February). Wireless sensor networks localization by improved whale optimization algorithm. In Proceedings of 2nd International Conference on Artificial Intelligence: Advances and Applications: ICAIAA 2021 (pp. 769–783). Singapore: Springer Nature Singapore. https://doi.org/10.1007/978-981-16-6332-1_62
  26. 26. Mirjalili S., Lewis A. (2016). The whale optimization algorithm. Advances in engineering software, 95, 51–67. https://doi.org/10.1016/j.advengsoft.2016.01.008
  27. 27. Chen K., Zhou F., Liu A. (2018). Chaotic dynamic weight particle swarm optimization for numerical function optimization. Knowledge-Based Systems, 139, 23–40. https://doi.org/10.1016/j.knosys.2017.10.011
  28. 28. Rani G. S., Jayan S., Alatas B. (2023). Analysis of chaotic maps for global optimization and a hybrid chaotic pattern search algorithm for optimizing the reliability of a bank. IEEE Access,11, 24497–24510. https://doi.org/10.1109/ACCESS.2023.3253512.
  29. 29. Zhang X. Y., Zhou K. Q., Li P. C., Xiang Y. H., Zain A. M., Sarkheyli-Hägele A. (2022). An Improved Chaos Sparrow Search Optimization Algorithm Using Adaptive Weight Modification and Hybrid Strategies. IEEE Access,10, 96159–96179. https://doi.org/10.1109/ACCESS.2022.3204798
  30. 30. Adler J., Parmryd I. (2010). Quantifying colocalization by correlation: the Pearson correlation coefficient is superior to the Mander’s overlap coefficient. Cytometry Part A, 77(8), 733–742. https://doi.org/10.1002/cyto.a.20896
  31. 31. Chen H., Zeng Z. (2013). Deformation prediction of landslide based on improved back-propagation neural network. Cognitive computation, 5, 56–62. https://doi.org/10.1007/s12559-012-9148-1
  32. 32. Wang R. B., Xu H. Y., Li B., Feng Y. (2018) Research on the Method for Determining the Number of Hidden Layer Nodes in BP Neural Networks. Computer Technology and Development, 28 (4), 31–35. (In Chinese) https://link.cnki.net/urlid/61.1450.TP.20171205.1436.132
  33. 33. Gao D. W., Wang P., Cai Z. C. (2003) Optimization of Implicit Layer Nodes and Training Times in Artificial Neural Networks. Journal of Harbin Institute of Technology, 35 (2), 207–209. (In Chinese) https://kns.cnki.net/kcms2/article/abstract?v=Eo9-C_M6tLmAFkkjA4H5FX6xP29oLlTExhxpjbXgcOM94UtYUndG5xhJtc7vGpRPyLV5ELVbCSbQd5j-XD8o5nrpLH6WO3EC2XeZaF3MUeiQG7hk7NWdK3vjcRPEYolB0pVWV_XqJyE=&uniplatform=NZKPT&language=CHS
  34. 34. Zhang L., Jiang J. H., Liu P., Liang Y. Z., Yu R. Q. (1997). Multivariate nonlinear modelling of fluorescence data by neural network with hidden node pruning algorithm. Analytica Chimica Acta, 344(1–2), 29–39. https://doi.org/10.1016/S0003-2670(96)00628-9
  35. 35. Gharehchopogh F. S., Gholizadeh H. (2019). A comprehensive survey: Whale Optimization Algorithm and its applications. Swarm and Evolutionary Computation, 48, 1–24. https://doi.org/10.1016/j.swevo.2019.03.004
  36. 36. Tang A. D., Han T., Xu D. W., Xie L. (2021) Chaos Multi Elite Whale Optimization Algorithm. Journal of Beijing University of Aeronautics and Astronautics, 47 (7), 1481–1494. (In Chinese) https://link.cnki.net/doi/10.13700/j.bh.1001-5965.2020.0585
  37. 37. Qian Z. S., Yu Q.Y., Zhang D., Yao C. S., Qin L. Y., Cheng Y. W. (2023) Chain based multipath coverage test case generation combining SVM and XGBoost. Journal of Software 1–26 (In Chinese) https://link.cnki.net/doi/10.13328/j.cnki.jos.006905
  38. 38. Akiba, T., Sano, S., Yanase, T., Ohta, T., Koyama, M. (2019, July). Optuna: A next generation hyperparameter optimization framework. In Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery & data mining (pp. 2623–2631). https://doi.org/10.1145/3292500.3330701
  39. 39. Shahriari B., Swersky K., Wang Z., Adams R. P., De Freitas N. (2015). Taking the human out of the loop: A review of Bayesian optimization. Proceedings of the IEEE, 104(1), 148–175. https://doi.org/10.1109/JPROC.2015.2494218
  40. 40. Banharnsakun A., Achalakul T., Sirinaovakul B. (2011). The best-so-far selection in artificial bee colony algorithm. Applied soft computing, 11(2), 2888–2901. https://doi.org/10.1016/j.asoc.2010.11.025
  41. 41. Mirjalili S., Mirjalili S. M., Lewis A. (2014). Grey wolf optimizer. Advances in engineering software, 69, 46–61. https://doi.org/10.1016/j.advengsoft.2013.12.007
  42. 42. Li L., Fu Y., Fung J. C., Qu H., Lau A. K. (2021). Development of a back-propagation neural network and adaptive grey wolf optimizer algorithm for thermal comfort and energy consumption prediction and optimization. Energy and Buildings, 253, 111439. https://doi.org/10.1016/j.enbuild.2021.111439
  43. 43. Maroufpoor S., Bozorg-Haddad O., Maroufpoor E. (2020). Reference evapotranspiration estimating based on optimal input combination and hybrid artificial intelligent model: Hybridization of artificial neural network with grey wolf optimizer algorithm. Journal of Hydrology, 588, 125060. https://doi.org/10.1016/j.jhydrol.2020.125060
  44. 44. Awan S. M., Aslam M., Khan Z. A., Saeed H. (2014). An efficient model based on artificial bee colony optimization algorithm with Neural Networks for electric load forecasting. Neural Computing and Applications, 25, 1967–1978. https://doi.org/10.1007/s00521-014-1685-y
  45. 45. Hsieh T. J., Hsiao H. F., Yeh W. C. (2011). Forecasting stock markets using wavelet transforms and recurrent neural networks: An integrated system based on artificial bee colony algorithm. Applied soft computing, 11(2), 2510–2525. https://doi.org/10.1016/j.asoc.2010.09.007
  46. 46. Huang J., Hales T. C., Huang R., Ju N., Li Q., & Huang Y. (2020). A hybrid machine-learning model to estimate potential debris-flow volume. Geomorphology, 367, 107333. https://doi.org/10.1016/j.geomorph.2020.107333
  47. 47. Fan X., Domènech G., Scaringi G., Huang R., Xu Q., Hales T. C., Francis O. (2018). Spatio-temporal evolution of mass wasting after the 2008 M w 7.9 Wenchuan earthquake revealed by a detailed multi-temporal inventory. Landslides, 15, 2325–2341. https://doi.org/10.1007/s10346-018-1054-5
  48. 48. Fan X., Scaringi G., Korup O., West A. J., van Westen C. J., Tanyas H., Huang R. (2019). Earthquake‐induced chains of geologic hazards: Patterns, mechanisms, and impacts. Reviews of geophysics, 57(2), 421–503. https://doi.org/10.1029/2018RG000626
  49. 49. deHaas T., Densmore A.L. (2019). Debris-flow volume quantile prediction from catchment morphometry. Geology 47, 791–794 https://doi.org/10.1130/G45950.1
  50. 50. Marchi L., Brunetti M.T., Cavalli M., Crema S. (2019). Debris-flow volume in northeastern Italy: relationship with drainage area and size probability. Earth Surf. Process. Land-forms 44, 933–943. https://doi.org/10.1002/esp.4546
  51. 51. Rajasekar V., Predić B., Saracevic M., Elhoseny M., Karabasevic D., Stanujkic D., Jayapaul P. (2022). Enhanced multimodal biometric recognition approach for smart cities based on an optimized fuzzy genetic algorithm. Scientific Reports, 12(1), 622. pmid:35022474