Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Wind energy resource assessment based on joint wolf pack intelligent optimization algorithm

Abstract

Wind energy is a clean and renewable energy source with great potential for development, but the intermittent and stochastic characteristics of wind speed have brought great challenges to the effective development and utilisation of wind energy resources, resulting in high development costs. Therefore, how to accurately assess the wind energy resources and effectively predict the wind speed has become a key issue to be solved in the current wind energy field. In view of this, the study proposes the Weibull model to model the wind speed data, and then introduces the wolf pack intelligent optimisation algorithm and improves it through the pollination mechanism to improve the accuracy of wind energy resource assessment. Secondly, considering the complexity and diversity of wind speed data characteristics, data decomposition technique, autoregressive moving average (ARIMA) model and cuckoo search algorithm are used to achieve data preprocessing, serial data modelling and hybrid prediction. The experimental results show that the Weibull model has good fitting accuracy for wind speed data, with residual sum of squares, RMSE, and average coefficient of determination of 0.05, 0.014, and 0.96, respectively, accurately reflecting the statistical characteristics of wind speed data. The wind speed prediction performance of the hybrid prediction model is good, with a maximum deviation of no more than 3% from the true value, which is significantly better than the compared VMD-ISOA-KELM model and CNN-BLSTM model, and its prediction error is relatively small. The hybrid prediction model has a smaller relative error value compared to a single algorithm, with a maximum value of less than 0.2. It has better prediction performance than the combination model, with a coefficient of determination approaching 1.0, a fitting accuracy of 0.994, a mean square error of 0.1947, a root mean square error of 0.3847, and an average absolute percentage error of 15.23%. And the research method can effectively evaluate the status of wind energy resources, with low time complexity at different data scales, taking no more than 5 seconds, and improving operational efficiency. This research method can provide strong technical support and reference basis for the development and utilisation of wind energy resources, and help to promote the sustainable development of wind energy industry.

1. Introduction

1.1. Motivation and purpose

The resource shortage and ecological environment problems caused by excessive consumption of fossil fuels have gradually shifted people’s focus to renewable energy. Wind energy, as a sustainable renewable energy, has abundant energy storage, which can better meet energy needs [1]. In wind power development, the site selection is often related to various factors, among which wind energy resources are the primary consideration factor and closely related to the economic benefits of wind farms. Therefore, wind energy resource assessment has become an important prerequisite for determining the location of wind farms [2,3]. Wind energy resource assessment is based on collected meteorological station data and wind speed data for calculation, evaluating indicators such as average wind speed and wind direction frequency, and achieving preliminary analysis of wind energy resources. At a more microscopic level, wind energy resource assessment should consider terrain and terrain trends, surface roughness, obstacle occlusion, and other factors. Based on the corrected detection data, a distribution map of wind energy resources is drawn, providing reference for wind turbine deployment and power generation calculation [4]. In 2002, China issued the “Wind Power Resource Evaluation Method for Wind Farms”. Although it provides a theoretical basis for resource evaluation, there are still many problems in practical application, such as resource evaluation deviation, wind farms not meeting expected power generation, and high maintenance cost. Therefore, it is urgent to develop active and effective wind power resource evaluation methods [5].

1.2. Literature review

Wind energy, due to its rich energy storage and clean environmental friendliness, can greatly reduce the dependence on fossil fuels. It is widely used in the field of energy utilization. According to relevant data statistics, as of 2019, wind power installation has shown strong growth momentum. Previous studies have mostly applied neural networks and intelligent algorithms to wind speed prediction problems. B. Kosovic et al. addressed the uncertainty quantification and probability issues in wind speed prediction through simulation integration methods to combine predicted data with fuzzy logic artificial intelligence systems. This method could better evaluate the characteristics of data changes [6]. Therefore, A. S. Devi et al. applied the Long Short-Term Memory network (LSTM) and enhanced forgetting gate network model to time series data processing. The Cuckoo Search (CS) algorithm was applied to achieve parameter optimization. The hybrid strategy could effectively improve prediction accuracy, overcome existing limitations, and provide reference basis for power management [7]. M. S. Nazir et al. believed that strengthening wind power generation prediction was of great value for grid control design. Previous research mostly used Artificial Aeural Networks (ANN) for wind power prediction, which had better prediction accuracy than other methods [8]. Wind speed prediction is an important part of wind energy resource evaluation, which directly affects the power generation and economic benefits of wind farms. The above content indicates that the comprehensive algorithm has good prediction accuracy and application effect, but there is less consideration for data feature classification. Wind energy resource assessment usually needs to consider factors such as frequency distribution of wind speed, wind direction stability, and wind energy density. However, some scholars have proposed methods for dealing with uncertainty and optimizing design, which can provide reference for the efficient utilization of wind energy resources. For example, P. Sharma and R K. Mishra et al. used machine learning and deep learning models to estimate the power of photovoltaic solar power plants. The results showed that the proposed method had good regression results, smaller error metrics than other models, and better seasonal prediction results [9]. Previous studies have often overlooked the importance of data preprocessing in predicting wind power generation processes, and insufficient attention to missing values has led to limited prediction accuracy. Therefore, the study proposes a wind energy resource evaluation method based on wolf pack intelligent optimization algorithm and predictive statistical model. Based on the advantages of intelligent algorithms, a hybrid prediction algorithm is proposed from linear and nonlinear models to better consider differences in data features. The traditional Wolf Pack Algorithm (WPA) utilizes a bottom-up design approach and a collaborative search path to optimize the objective function, which is prone to problems such as local optima, limited interactivity, and convergence speed. The research improves the wandering and besieging behavior of wolf packs. Specifically, the factors of wolf pack renewal are incorporated into wandering behavior. A walking mechanism from coarse to fine is designed to improve the wandering behavior of wolf packs. Meanwhile, the flower pollination mechanism is integrated into the prey capture stage of the WPA, making the siege behavior have an adaptive adjustment mechanism. The Levy flight mechanism and ratio mutation operation can also be used to avoid the local extremum problem and achieve wolf pack position updates. Wind speed has characteristics such as dynamism, uncertainty, and complexity, and both linear and nonlinear statistical methods have certain shortcomings in prediction, resulting in poor sample processing performance. Based on the performance characteristics and data analysis of wind energy resources, this study evaluates wind energy resources from two aspects: the design of wolf pack intelligent optimization algorithm and the construction of hybrid prediction statistical model.

1.3. Contribution and paper organization

The research mainly conducts predictive analysis on wind energy resource evaluation from five aspects. Part I is a literature review and discussion of current wind power prediction technologies and related analysis algorithms. Part II proposes the wolf pack intelligent optimization algorithm and predictive statistical model to achieve wind speed prediction, considering wind speed characteristics and sequence data. Part Ⅲ tests and analyzes the effectiveness of wind energy resource evaluation under this fusion method. Part Ⅳ is a discussion to overview the research results. Part Ⅴ is a conclusion to provide an overview summary of the content, existing shortcomings, and future improvement directions. The combination prediction method is used to analyze the correlation between wind direction data information. Overall, this combination prediction method consists of two parts. Part I focuses on the linear data features, and completes the analysis and prediction of wind speed linear data through steps such as wind speed time series denoising, sequence auto-regression modeling, and data difference order processing. Part II focuses on nonlinear data features. An improved Back Propagation Neural Network (BPNN) based on cuckoo optimization algorithm is taken as the nonlinear prediction module to analyze the nonlinear features in the data. After processing the data using the improved WPA, a mixed linear and nonlinear prediction model can be imported into the wind speed time series to achieve prediction and evaluation based on the data characteristics, improving application performance and reference value. The innovation of the research lies in the parameter selection based on statistical distribution models, considering the linear and nonlinear characteristics of wind speed time series, and proposing a hybrid model under uncertainty prediction. It is expected to provide reference value for wind energy resource assessment and wind speed analysis. By combining linear and nonlinear prediction methods with hybrid prediction models, the overall prediction accuracy of wind speed and energy resource assessment is significantly improved. It can not only capture the diversity and complexity of wind speed data, but also enhance prediction robustness and reliability through error complementarity and model integration.

The research contribution lies in exploring the wind speed prediction results based on the characteristics of wind speed data. By improving the WPA, the global search capability has been enhanced, avoiding local optima. Considering the diversity and complexity of wind speed data, a mixed linear and nonlinear prediction model is constructed to improve prediction accuracy. A hybrid model combining wolf pack intelligent optimization algorithm and predictive statistical model is adopted to analyze linear and nonlinear data features, enhancing the prediction robustness and reliability. This provides new ideas and references for wind energy resource assessment and wind speed analysis, and helps promote the effective development and utilization of wind energy resources.

Traditional wind energy resource assessment methods often rely on a single statistical model or optimization algorithm, which makes it difficult to effectively handle the high-dimensional, nonlinear, and uncertain nature of wind speed data. Wind speed data has characteristics such as intermittency, randomness, and non-linearity. Although a single statistical model (such as the Weibull model) can capture the statistical characteristics of wind speed, its accuracy is limited when processing complex data; However, a single optimization algorithm (such as traditional WPA) may have global search capabilities, but it is prone to getting stuck in local optima. Therefore, in order to improve the limitations of single intelligent algorithms and single statistical models in wind energy resource assessment, an innovative hybrid model combining wolf pack optimization algorithm (WPA) and statistical models has been proposed, which fully utilizes the powerful global search and dynamic optimization capabilities of WPA, as well as the efficient modeling ability of statistical models, aiming to more accurately evaluate the distribution and potential of wind energy resources. In this hybrid model, the Weibull model can accurately describe the statistical distribution characteristics of wind speed, enabling modeling and preprocessing of wind speed data. WPA has strong global search and dynamic optimization capabilities, which can effectively explore the solution space and approach the optimal solution. The improvement of pollination mechanism on WPA can enhance its global search ability, optimize model parameters, and further improve prediction accuracy. Previous algorithm models often focused on deterministic prediction when predicting wind speed, without considering the uncertainty of wind speed information. Their application methods often have disadvantages such as poor stability. The hybrid model proposed in the study can better handle high-dimensional data and nonlinear relationships in wind energy resource evaluation, and is suitable for situations with multiple data, uncertain wind energy data, different scales, and high prediction demand. This model gives full play to the advantages of intelligent algorithms in global optimization, and combines the characteristics of linear models and nonlinear models to form a comprehensive prediction framework that adapts to different data characteristics. Its accuracy and applicability in wind speed prediction are significantly better than the traditional single model or simple comprehensive model. The research aims to overcome the limitations of a single method by constructing a hybrid model of WPA and statistical models, achieve high-precision evaluation of wind energy resources and wind speed prediction, provide a more effective and reliable wind energy resource evaluation method, provide scientific basis for wind farm site selection and planning, and promote the sustainable development and utilization of wind energy resources.

2. Related works

Wind power prediction is a key technology for its safe operation. Y. Wang et al. designed a hybrid prediction approach based on historical data such as wind power and wind speed using auto-regressive moving average prediction and Support vector machine (SVM). The Particle Swarm Optimization (PSO) was used to solve for the optimal weight value. The hybrid model exhibited good predictive performance and high computational accuracy in case studies [10]. Wind power exerts a crucial function in the current global energy. Therefore, M. Santhosh et al. conducted a comprehensive study on current prediction technologies and performance. The development of related prediction technologies was explained [11]. A. Ahmadpour et al. explored large-scale complex hydrological models with uncertain sources. An uncertainty quantification platform was designed to analyze complex regional hydro-logical systems [12]. Zhang et al. decomposed the wind speed data twice using an improved hybrid model based on its trend fluctuation characteristics. Based on the PSO, the LSTM was subjected to sequence optimization processing and denoising. The linear data was trained using an auto-regressive moving average model. The variational strategy could effectively extract wind speed data features. The optimization algorithm could effectively identify parameter problems, with high wind energy prediction accuracy [13].

The uncertainty of weather factors may increase the prediction error of wind power generation. Therefore, to enhance the prediction accuracy, D.T. Viet et al. used PSO and genetic algorithm to adjust the parameters of the artificial neural network for the first and second time, respectively. The approach had good accuracy and applicability in testing [14]. A. Kisvari et al. proposed a driving approach on the basis of gated recursive deep learning models to address the variability of wind power prediction. The data was preprocessed, resampled, monitored, and adjusted for parameter features. The results showed that this method could achieve lower computational costs when predicting wind power. The computational accuracy was significantly better than other comparative algorithms [15]. M. A. Hossain combined empirical mode decomposition method, Emperor Butterfly optimization algorithm, and LSTM to achieve data prediction analysis and model parameter setting for wind power generation prediction. The hybrid model could improve prediction accuracy by an average of over 30% compared with the benchmark model, effectively reducing Root Mean Square Error (RMSE) and Mean Absolute Percentage Error (MAPE) [16]. F. K. Karim et al. used Recurrent Neural Network (RNN) to achieve wind power prediction, and processed the data for prediction based on dynamic fitness. Compared with other comparative algorithms, the proposed algorithm had better prediction accuracy, and performed well in indicators such as relative RMSE, Mean Absolute Error (MAE), determination coefficient, and Pearson correlation coefficient, with significant application effects [17]. B. Bochenek used different machine learning methods to predict the daily wind power generation of wind farms nationwide. Numerical weather prediction models and extreme gradient enhancement methods were created to train data. The RMSE of wind turbines under this prediction model was less than 5%, with high prediction accuracy and seasonal differences [18].

The modeling of wind uncertainty aims to reveal the probability distribution characteristics of wind field parameters. Wind prediction is the process of collecting and analyzing meteorological data such as wind speed and direction, using prediction models to estimate and predict wind conditions for a period of time in the future, which aims to improve the accuracy of wind farm output power prediction. Probability distribution models, stochastic process models, or set theory methods are commonly used to describe the uncertainty of wind field parameters, while wind prediction often relies on statistical models, artificial intelligence algorithms, and hybrid models to obtain data. Both rely on meteorological data and statistical analysis methods, which can provide decision-making basis for the operation and planning of wind power systems. The uncertainty modeling of wind usually adopts methods such as probability theory and mathematical statistics, while wind prediction relies on advanced prediction models and algorithms. Although the focus of the two is different, they are often combined and applied to the utilization of wind energy resources. For example, M A. Hossain combined ensemble decomposition patterns with intelligent algorithms to predict wind power data. Y. Zhang et al. focused more on changes in wind speed characteristics and time series situations. An ensemble learning hybrid prediction model was proposed to predict data and wind power. Previous models often focused on deterministic prediction when predicting wind speed, without considering the uncertainty of wind speed information, and their application methods often have disadvantages such as poor stability. Therefore, with the help of integrated thinking and based on data analysis features, an innovative hybrid prediction model is designed by combining wolf pack intelligent optimization algorithms with predictive statistical models. This model fully utilizes the advantages of intelligent algorithms in global optimization, combining the characteristics of linear and nonlinear models to form a comprehensive prediction framework that can adapt to different data features. The proposed hybrid prediction model is significantly superior to the traditional single model or simple integrated model in the accuracy and applicability of wind speed prediction. This indicates that the research not only proposes a new methodology in theory, but also verifies its effectiveness in practice, providing scientific basis for practical applications such as wind farm site selection and power generation estimation, and promoting the efficient development and utilization of wind energy resources.

3. Wind energy resource assessment design under hybrid algorithm model

Based on the differences in the distribution and variation characteristics of wind energy resources, wind energy resource evaluation is carried out from two aspects: the design of wolf pack intelligent optimization algorithm and the construction of hybrid prediction statistical model. Specifically, the Weibull distribution function is first applied to wind speed modeling, and the flower pollination mechanism is used to improve the WPA for modeling and solving. Secondly, considering the complexity of wind speed prediction, the study combines linear and nonlinear prediction methods to analyze the correlation between wind direction data information and improve the accuracy of evaluation and prediction.

3.1. Wind energy resource evaluation based on wolf pack intelligent optimization algorithm

Wind speed has random fluctuations, and due to different environments, wind speed also exhibits different characteristics. Generally speaking, wind speed changes throughout the day often have periodicity. On land, the wind speed reaches its minimum and maximum peaks at 6am and 2 pm, while at sea level, the wind speed is higher at night. Wind speed, as an important reference for wind energy resource assessment, can generally be calculated on the basis of wind speed observation data and fitted probability distributions to obtain wind frequency distribution. The Weibull distribution is often applied in wind speed modeling. The calculation process of statistical distribution parameters in wind speed resource assessment is relatively complex. This study first uses the wolf pack intelligent optimization algorithm to optimize the parameters of the selected optimal statistical distribution, and evaluates wind energy resources using the established statistical distribution model. The optimal statistical distribution based on the strategy of selecting the minimum RMSE is determined for fitting accuracy. Wind speed usually has non-negative and right skewed characteristics, which means that wind speed data is concentrated in low wind speed areas, but occasionally high wind speeds may occur. This feature is very consistent with the shape of the Weibull distribution model. Weibull distribution is a probability distribution commonly used to describe life and reliability data, and its shape and scale parameters can be flexibly adjusted to adapt well to the actual wind speed distribution. The Weibull distribution model is a widely used statistical distribution model in wind speed resource assessment, which requires fewer parameters and is convenient for parameter estimation and interpretation applications. The Weibull distribution, as a model for wind speed data rather than other potential models, is based on the applicability and accuracy of the Weibull distribution in wind energy analysis. This model can well describe and simulate the wind speed distribution in nature, expressing the wind speed changes from minimum to maximum. Numerous meteorological studies and practical applications have shown that Weibull distribution can accurately predict wind energy resources, especially in evaluating wind energy potential and feasibility analysis of wind power generation. The Weibull distribution model can directly calculate wind energy density and has good adaptability and fitting accuracy. The wind speed involved in the research is not less than 6m/s, so the Weibull dual parameter distribution model can be selected, and the optimal distribution parameters can be determined using maximum likelihood estimation and goodness of fit testin. Due to fewer restrictions and simple parameter settings in application, it is extensively used in wind speed distribution law calculation, wind energy resource evaluation, and other aspects. Equation (1) shows the Weibull probability density function [19].

(1)

In Equation (1), signifies the scale parameter. signifies the shape parameter. signifies the wind speed (m/s). is a natural constant in mathematics. The mathematical expectation and variance of the Weibull distribution is shown in Equation (2).

(2)

In Equation (2), is the mathematical expectation. is the gamma function. is the variance [20]. When the Weibull shape parameter is 2, the Weibull distribution can be transformed into a Rayleigh distribution. Considering that traditional methods for estimating parameters of Weibull distribution often exhibit performance limitations and iterative calculation problems, intelligent optimization algorithms are used for solving research in this study. Swarm intelligence algorithms have good application advantages in data processing, such as E S. M. El Kenawy et al. believe that metaheuristic methods such as grey lag goose optimization algorithm have great potential in engineering applications and can effectively perform statistical processing on data [21]. The WPA algorithm relies on the principles of wolf pack behavior selection and process to solve high-dimensional and multi-modal numerical functions, including wolf pack partitioning, intelligent behavior, and wolf pack change mechanism design. Wolf packs can be divided into wolf, wolf, wolf, and wolf according to their social hierarchy. According to the optimal solution solving method, the wolf pack position is updated during the algorithm iteration process. In the stage of surrounding prey, the position update formula can be expressed as equation (3).

(3)

In Equation (3), signifies the distance between the individual and the prey. is the number of iterations. is the prey location. represents the individual position of the wolf. is the coefficient vector [22]. If the individual position of a wolf is set as a solution, the position of wolf is updated based on wolf (optimal solution), wolf (sub-optimal solution), and wolf (third optimal solution). Attacking and searching for prey can be achieved through iterative processes to achieve local and global search. WPA utilizes a bottom-up design approach and collaborative search paths to optimize the objective function. However, the traditional WPA is prone to problems such as local optima, interactivity, and limited convergence speed [23]. Therefore, the WPA has been improved. Firstly, the walking behavior of the wolf pack is improved by incorporating the wolf pack update position into the walking behavior. A gradually detailed change walking mechanism is designed, as shown in Equation (4).

(4)

In Equation (4), represents the forward direction. is an individual wolf pack. signifies the maximum number of iterations. signifies the update factor. is the stride of the wolf pack when performing attack behavior at a distance of . denotes the position coordinates of the wolf in the dimension. Secondly, the Flower Pollination Algorithm (FPA) is introduced to enhance the global search ability of wolf pack individuals. The FPA, as a meta-heuristic swarm intelligence optimization algorithm, can simulate the process of cross pollination using the Levi flight mechanism. Its cross pollination and self-pollination can correspond to the global search and local search, respectively. The integration of FPA mechanism into the prey capture stage of WPA can optimize the convergence performance, and its mathematical model can be expressed as Equation (5).

(5)

In Equation (5), represents the solution of the -th generation. signifies the solution of the -th generation. signifies the global optimal solution. signifies the Levi step size [24]. In the local search mechanism, double random values are used to avoid local extremum, and its mathematical model is shown in Equation (6).

(6)

In Equation (6), is the random solution of all individuals in the population except for individual . signifies a uniformly distributed random number in [0,1]. Meanwhile, in the intelligent algorithm optimization, the mutation operation is introduced to randomly perturb the solution to increase the diversity of the solution set. The study uses polynomial mutation process to update the position of wolf pack individuals. Equation (7) is the mutation method.

(7)

In Equation (7), is the disturbance factor. is the upper boundary of the position boundary. signifies the lower boundary of the position boundary. Fig 1 shows the flowchart of the improved Flower Pollination Algorithm -Wolf Pack Algorithm (FPA-WPA).

In Fig 1, when the objective function value of the updated wolf pack position of is less than the function value of , the polynomial mutation is performed on its position. Then, the objective function values of the individual before and after mutation are compared. If it is better, it is accepted. Otherwise, the original solution is retained. In the FPA-WPA algorithm, the input variables include wind speed data, Weibull distribution parameters (including shape parameters (k) and scale parameters), algorithm parameters (wolf pack size, maximum iteration times, global and local search ratios of pollination algorithm, walk compensation of wolf pack algorithm, siege step size, etc.), and fitness function. The core idea of FPA-WPA algorithm is to combine the global search capability of FPA and the local search capability of WPA to optimize the parameters of the predictive statistical model, thereby improving the accuracy of wind energy resource assessment. The fitness function is used to evaluate the advantages and disadvantages of each candidate solution. The fitness function based on the error function of the predictive statistical model is analyzed. Its mathematical expression is shown in Equation (8).

(8)

In Equation (8), represents the sample size. is the actual value. is the predicted value. is the mean square error. By minimizing MSE and optimizing the parameters of the predictive statistical model, the prediction accuracy of the model can be improved. The fitness function is used to evaluate the strengths and weaknesses of each solution (wolf pack individual) and is run as follows: firstly, an initial population (wolf pack individual) is generated based on the input wind speed data and Weibull parameters. Afterwards, for each individual, its corresponding fitness value is calculated. The pollination mechanism will update the position of the solution through global search (heteroflowering) and local search (self-pollination) to enhance the global search capability, adjust the position of the wolf pack solution, and gradually approach the optimal solution. The calculation of the fitness value and the optimisation of the position are repeated until the maximum number of iterations is reached or the convergence condition is satisfied, and the optimal solution and its fitness value can be returned. The study uses the FPA-WPA algorithm to iteratively search for the optimal solution, which can improve the accuracy of parameter estimation and more accurately describe the distribution characteristics of wind speed. Improving the pollination mechanism of flowers can accelerate the iteration speed and enhance computational efficiency using the grey wolf algorithm. The FPA-WPA algorithm, due to its global search capability, is able to find the global optimal solution in a larger solution space, more accurately estimate the shape and scale parameters of the Weibull distribution, and more accurately describe the distribution characteristics of wind speed. The condition for the objective function of FPA-WPA algorithm to reach the fitness value is to find the optimal solution while satisfying the maximum number of iterations.

3.2. Design of wind energy resource evaluation based on hybrid prediction model

Wind speed prediction plays an important role in wind energy resource assessment, and wind speed modeling design is one of the foundations for designing hybrid prediction methods. The accuracy of wind speed prediction directly affects the reliability of wind energy resource assessment. After combining it with the prediction of linear and nonlinear statistical data, a more comprehensive prediction model can be constructed to better capture the volatility and uncertainty of wind speed. The study utilizes the Weber distribution model and FPA-WPA improved algorithm to solve for the optimal values of parameter solutions in the aforementioned content. This result can provide a foundation for understanding the characteristics of wind energy resources and ensure the accuracy of wind speed prediction results, achieving wind energy resource assessment. Given the characteristics of wind speed, predicting wind speed can have many adverse effects. Traditional physics methods are based on mechanics knowledge and establish fluid dynamics models using predicted data from generator sets. These models can take into account the wind speed, direction, pressure, and weather information of the location during the prediction process. However, it needs to establish complex prediction models, with high computational load [25,26]. Due to the limitations of data prediction, linear statistical methods and nonlinear statistical methods inevitably have different problems, and the sample processing effect is poor. Therefore, this study considers using combination prediction methods to analyze the correlation between wind direction data information, and comprehensively process them to obtain the final prediction result. Gaussian white noise is adopted into the time series of meteorological data. The sequence components are calculated and reconstructed to reduce the interference of noisy data on the sequence. The original wind speed time series is . represents the white noise added in step . The two types of mixed signals can be expressed as Equation (9).

(9)

In Equation (9), signifies the total positive noise raw data. signifies the total negative noise raw data [27]. and are decomposed to obtain their corresponding independent intrinsic mode components. Integrating the two can cancel out white noise. The obtained residual sequence is repeatedly decomposed to obtain new component results and residual sequences. Regression analysis models provide effective technical tools for resource assessment and prediction, such as S. G. Khaled et al. using random forest regression models for data association analysis of groundwater resources. The results indicate that the model is helpful for decision-making and management of water resource reserves [28]. Considering the spatiotemporal distribution characteristics of wind power, the study chose the autoregressive moving average model (ARMA) as the linear combination module to model the processed sequence data.. The ARMA combines auto-regressive model and moving average model, which can perform auto-correlation analysis on known parameter linear models. It performs well in state estimation and signal processing, as shown in Equation (10).

(10)

In Equation (10), is the time series. represents the time. is the interference term. signifies the auto-regressive order. signifies the moving average order. is a non-zero undetermined coefficient. is the predicted object [29]. During the modeling process, the predicted time series data is subjected to first-order differential processing to transform non-stationary data into a stationary sequence with a mean of zero, as shown in Equation (11).

(11)

In Equation (11), is the regression coefficient. The auto-correlation function of this model may exhibit “truncation” or “tailing” characteristics due to different conditions. After completing time series recognition, the model is ordered, that is, the order of auto-regressive and moving average is determined. The minimum information criterion method is used for processing. The function expression of the criterion method is shown in Equation (12).

(12)

In Equation (12), is the criterion setting. is the estimated variance of the residual sequence in the model. is the sample length. To model the preprocessed time series linearly and nonlinearly, the time series can be represented as Equation (13).

(13)

In Equation (13), is the linear part of the sequence. is the nonlinear part of the sequence. Fig 2 displays the prediction model process.

thumbnail
Fig 2. Schematic diagram of the prediction model process.

https://doi.org/10.1371/journal.pone.0326035.g002

In the nonlinear prediction part, the study utilizes the CS to adjust the weights and thresholds of BPNN. Then, an improved hybrid prediction model Cuckoo Search-Back Propagation Neural Network (CS-BP) is obtained to improve the prediction accuracy. The CS algorithm simulates the parasitic breeding mechanism of certain species of cuckoo birds to achieve optimal solution. Its assumptions include as follows. The cuckoo bird represents the solution with an egg nest, and the replacement of the nest is achieved through random selection and hatching of bird eggs. The optimal solution is obtained through fixed nests. The Levy flight search mechanism serves as the calculation basis for this algorithm [30]. However, this calculation basis is affected by various practical factors in practical operations. Therefore, the study introduces a jump path and variables that conform to a Gaussian distribution for calculation, resulting in Equation (14).

(14)

In Equation (14), are Gaussian distribution variables. The amount of path adjustment length is adaptively and dynamically adjusted to update the current step size based on the results of the previous iteration. BAPP utilizes weights to connect neurons and reduces the uncertainty of information description through gradient descent and back propagation. Back propagation is the process of gradually increasing prediction errors through layer processing, and adjusting layer weights and thresholds in the network to achieve adaptive learning of the algorithm [31,32]. Due to the algorithm design, BPNN has a slower convergence speed, and the CS algorithm is used for global optimization processing. The BPNN weight and its position vector are used to generate a randomized nest, as displayed in Equation (15).

(15)

In Equation (15), is the generated random number. is the individual upper limit. is the individual lower limit. The fitness function is used to achieve training replacement of the network and obtaining the initial position of the optimal bird’s nest. Comparing the fitness values of adjacent positions can update the optimal position [33]. The obtained values are continuously compared and updated with the set mutation probability until the condition is satisfied. The output position vector is used as the weight threshold of BPNN. Fig 3 displays the design flowchart of the hybrid prediction system.

The study introduces decomposition techniques in the data preprocessing section to denoise sequential data. Then, the data is processed based on the improved wolf pack intelligence algorithm and prediction model. The final hybrid prediction model is then input into a time series to evaluate the prediction results. Due to the consideration of linear and nonlinear data characteristics in the hybrid prediction model designed for research, the selected data information features and sequence features include periodic wind speed time series features (daily cycle and seasonal cycle) as well as wind speed, wind direction, and related meteorological impact variables with complex fluctuations and autocorrelation. As the research mainly analyzes wind speed for wind energy resource assessment, the time interval and span data related to wind speed are the basis, and also cover wind speed anomalies under certain emergencies such as typhoons and hurricanes. The study considers using combination prediction methods to analyze the correlation between wind direction data information, and carries out steps such as denoising, sequence auto-regression modeling, and data difference order processing on wind speed time series data to complete linear wind speed data prediction. Subsequently, the improved BPNN based on the cuckoo optimization algorithm is used as the nonlinear prediction module to analyze the nonlinear features in the data. For non-stationary data, differential processing can be performed to eliminate the trend components in the data, making the time series more stable. Based on the improved WPA for data processing, linear and nonlinear hybrid prediction models can be imported into wind speed time series for integrated evaluation.

4. Evaluation results of wind energy resources under hybrid algorithm model

The wind energy resource evaluation method based on wolf pack intelligent optimization algorithm and predictive statistical model is empirically analyzed. The design is carried out from three aspects: performance testing, accuracy analysis, and empirical testing of the prediction system. The proposed prediction model is compared with other models to test the superiority of the constructed prediction method.

4.1. Performance testing of intelligent optimization algorithm

Due to its high volatility and regional environmental impact, wind energy poses certain problems in wind power generation systems. Wind resource prediction is still an important focus of current research. To test the prediction model proposed in the study, the selected wind speed data comes from empirical data of wind turbines in the Bohai Bay area from 2019 to 2022, with a time interval of five minutes. A total of 1500 sets of data are collected each month. The data is separated into training and testing samples in a 6:4 ratio. The parameters of cuckoo optimisation BP network are chosen in such a way that the higher the number of nests, the higher the number of iterations required, and the higher the probability of variation affects its performance in converging to the optimal solution. In reference [34], the probability of variation of bird nests is chosen as 0.25 and the number of bird nests is 25.The study fits the wind speed data with the help of four statistical distribution models and the results are judged by the minimum root mean square error, and the parameters are improved with the help of an intelligent optimisation algorithm. There are few studies on determining the optimal parameters of Weibull distribution with the help of optimisation algorithms and there is no unanimity on the most accurate parameters of Weibull distribution. Therefore, the parameters of the four statistical distribution models are examined using theoretical values. Due to the few parameters of the wolf pack optimisation algorithm, its core parameters are determined with the help of the error results, in which the group size is selected as 50 based on the results of the grid search test, the maximum number of iterations is determined by observing the loss function curves, the step size factor is dynamically adjusted (0.5 → 0.2) to adapt to the stochastic and intermittent characteristics of the wind data, and the distance threshold (0.1) is set according to the standard deviation of the wind data, and a fixed value is selected to reduce the complexity of parameter tuning. fixed value to reduce the tuning complexity. The initial value of the linear model weights for the hybrid prediction model is 0.6, and the initial value of the nonlinear model weights is 0.4, and the adjustment of its parameter values can be calculated based on the sliding window prediction error. Firstly, the wind speed data is preprocessed to remove outliers and missing values, ensuring data quality. Subsequently, the probability density function of the Weibull distribution function is calculated using formula (1), and the parameter estimation is calculated using the intelligent optimization algorithm. After obtaining the parameters of the Weibull distribution, the fitting result is verified to ensure that the model accurately reflects the distribution characteristics of wind speed data. Firstly, the fitting accuracy of the Weibull distribution parameters is compared. The comparative models are Gamma distribution, Lognormal distribution, and Rayleigh distribution. The fitting results are shown in Fig 4.

thumbnail
Fig 4. Fitting effect of different distribution parameters.

https://doi.org/10.1371/journal.pone.0326035.g004

The sum of squared residuals can reflect the sensitivity of fitting data to errors and the dispersion of sample observations. The RMSE can reflect the magnitude and fluctuation of prediction errors. The determination coefficient represents the percentage change in the explanatory variable of the model, which can intuitively explain the variation of variables. The comparison indicators of sub-graphs (a), (b), and (c) in Fig 4 are residual sum of squares, RMSE, and determination coefficient, respectively. The results in the figure demonstrated that the designed Weibull distribution method fitted the data well, with high and stable fitting accuracy. Its average values in sum of squared residuals, RMSE, and determination coefficient were 0.05, 0.014, and 0.96, respectively. If the value of the judgment coefficient tends to 1, it indicates that its fitting accuracy is higher. Under the RMSE index, the fitting range of the Weibull distribution was [0.0115, 0.0248], the Gamma distribution was [0.013, 0.04], the fitting range of the Lognormal distribution was [0.025, 0.076], and the Rayleigh distribution was [0.041, 0.063]. The fitting effect of Lognormal distribution was poor, and the fluctuation range of its judgment coefficient curve changed significantly. Before the model prediction, the effectiveness of the proposed improved intelligent algorithm and prediction model is tested, as displayed in Fig 5.

thumbnail
Fig 5. Convergence curves of various algorithms under test functions.

https://doi.org/10.1371/journal.pone.0326035.g005

The study uses unimodal and multimodal functions to test the algorithm ability to break out of local extremes, with a testing dimension of 50. The comparative algorithms include Salp Swarm Algorithm (SSA), Adaptive Parameter-GWO (A-GWO), WPA, and the proposed FPA-WPA. The results indicated that the FPA-WPA exhibited relatively smooth convergence curves on two test functions, with relatively few overall turning points and fast convergence speed. The optimal target values reached 102.5 and 102, corresponding to 510 and 100 iterations, respectively. The convergence speed of SSA and A-GWO was improved compared with WPA, but their stability performance in the later stage of iteration was poor. The convergence results in Fig 5 indicate that although FPA-WPA outperforms SSA, A-GWO, and WPA in terms of convergence speed and stability, its final optimization results may still fall into local optima, especially when dealing with high-dimensional multimodal functions. In contrast, SSA and A-GWO show significant fluctuations in the later stages of iteration, reflecting the limitations of parameter adaptation mechanisms in complex optimization problems. This phenomenon suggests that existing intelligent algorithms still have shortcomings in balancing exploration and development. In the future, dynamic mutation strategies or hybrid hierarchical optimization can be introduced to enhance global search capabilities. Subsequently, the proposed hybrid prediction algorithm was compared with other combination models for deterministic prediction, including the Multi-target Elman Network (MEN) model combining multi-objective optimization with Elman neural network [35], the Multi-scale Hybrid Ensemble Decomposition Method (MHED) [36], the VMD-ISOA-KELM model [37], and the Convolutional Neural Network-Bidirectional Long Short-Term Memory (CNN-BLSTM) [38]. The study selects one hour advance prediction experiments on the dataset to validate the advantages of the research model. The predicted results are shown in Fig 6.

thumbnail
Fig 6. Convergence curves of various algorithms under test functions.

https://doi.org/10.1371/journal.pone.0326035.g006

The results in Fig 6 showed that there were significant differences in the prediction results of different combination models in the 60 min advance deterministic prediction results. Specifically, the MEN model exhibited significant lag linearity and large fluctuations on the dataset, with poor overall predictive ability. The overall prediction performance of VMD-ISOA-KELM model and CNN-BLSTM model was good in less than 30 minutes, with a maximum deviation of no more than 8% from the true value. However, in the later stage, both models showed significant prediction deviations, indicating a lack of stability. The MHED model performed poorly in predicting rapid increases or decreases in wind speed. The wind speed prediction curve of the FPA-WPA model proposed in the study has good similarity with the true value, and its wind speed prediction results show small fluctuations and errors in different time periods, with a maximum deviation of no more than 3%, indicating its good ability to grasp and predict wind speed fluctuation characteristics. FPA-WPA has the smallest prediction error, indicating that it can effectively capture the temporal characteristics of wind speed; the lag of the MEN model and the increase in the long-term deviation of VMD-ISOA-KELM and CNN-BLSTM reflect the inadequacy of the traditional methods in modelling non-stationary time series. It is worth noting that the prediction ability of the existing models for sudden changes in wind speed (e.g., extreme weather) has not been fully validated, and future research could incorporate uncertainty quantification or physical model fusion to further optimise the performance in terms of high-dimensional optimisation and non-smooth prediction.

4.2. Performance testing of predictive model

Wind speed has characteristics such as dynamism, uncertainty, and complexity. Both linear and nonlinear statistical methods have certain shortcomings in prediction, and the sample processing effect is poor. Therefore, the study considers using the combination prediction method to analyze the correlation between wind direction data information. Overall, it consists of two parts. The first part focuses on the linear data features, and completes the analysis and prediction of wind speed linear data through steps such as wind speed time series denoising, sequence auto-regression modeling, and data difference order processing. The first part focuses on nonlinear data features, using an improved BPNN model based on cuckoo optimization algorithm as the nonlinear prediction module to analyze the nonlinear features in the data. After processing the data using the improved WPA, a mixed linear and nonlinear prediction model can be integrated into the wind speed time series to achieve prediction and evaluation based on the data characteristics, improving application performance and reference value. When conducting simulation analysis on wind data, the effectiveness is first discussed. In EMD mode, auxiliary white noise is added to solve the pattern aliasing, and adaptive white noise is added at each stage of data decomposition. Residual signals are calculated to obtain each pattern component. Firstly, the local mean of the raw data is calculated. Then, the first residual component and the intrinsic mode component are repeatedly calculated until all decomposed components and residual components are obtained. That is, the prediction model is subjected to data auto-correlation testing and the sequence is tested for white noise using Q-statistics, as shown in Fig 7.

thumbnail
Fig 7. Auto-correlation test and white noise test results of difference sequences in the CS-BP prediction model.

https://doi.org/10.1371/journal.pone.0326035.g007

The dashed lines in Fig 7a and b represent the confidence interval (95% level) and reference line, respectively. In Fig 7, the data sample points with orders greater than 0 were all within the confidence interval. The white noise statistical results were lower than the reference line. The differential sequence of the model has significant correlation and no noise interference, which can be used for subsequent data statistics and extraction. Subsequently, in the neural network prediction model, the tansig function is used as the transfer function, and purelin is used as the output layer function. The BPNN code is written using Matlab software. The data is selected for training prediction, as shown in Fig 8.

The results in Fig 8 indicated that at a sample size of 4,000, the ES exhibited MAE, RMSE, and MAPE values all less than 0.8, 0.7, and 8. Its prediction accuracy was good when the smoothing index was 0.5. The time series processing techniques proposed in the study were compared with TS Fastformer [39], EEMD-ARIMA [40], VMD-PSO-LSTM [41], and the results are shown in Table 1.

thumbnail
Table 1. Data results under different time series processing techniques.

https://doi.org/10.1371/journal.pone.0326035.t001

The results in Table 1 indicate that the CS-BP model has the lowest root mean square value, which is 1.1987m.s-1, indicating that the difference between its predicted results and actual values is very small. The root mean square value of the VMD-PSO-LSTM model is the highest, at 1.2538m.s-1. The difference between the predicted results and the actual values is relatively large, while the root mean square values of other models exceed 1.5m/s-1. The prediction accuracy of the TS Fastformer model does not exceed 90%, and the corresponding benchmark percentage is relatively high, indicating significant data processing errors. In terms of training time, the CS-BP model has a higher usage rate, indicating better performance in sequence data processing. Although the usage rate of VMD-PSO-LSTM and EEMD-ARIMA models exceeds 85%, their training time is relatively long, which may be related to their parameter counts. Overall, the training performance of the CS-BP model is good. Subsequently, performance analysis was conducted on the time series results of the proposed CS-BP prediction model. And the results were compared with the actual values, as shown in Fig 9.

The results in Fig 9 indicated that the CS-BP prediction model exhibited good predictive performance, with good consistency between the curve changes at different sample points and the true values. The curve deviation was relatively small, and the overall prediction error did not exceed 3%. The relative error results showed that the error amplitude of the CS-BP was less affected by sample points, and the maximum value did not exceed 0.6.

4.3. Wind energy resource evaluation and detection results based on hybrid prediction model

A hybrid prediction model is proposed for wind energy resource evaluation. Prediction evaluation indicators include MAPE, RMSE, and other indicators. A low error index value indicates higher predictive performance. Firstly, the statistical characteristics of wind speed in the selected area are analyzed, as displayed in Fig 10.

thumbnail
Fig 10. Wind speed statistics for different months in the region.

https://doi.org/10.1371/journal.pone.0326035.g010

The results in Fig 10 indicated that the overall wind speed in the region fluctuated within the range of [0, 14] m/s, and there were differences in the wind speed variation curves among different months, with obvious randomness. The wind speed fluctuations in June and December were more obvious, and the overall wind speed in June changed significantly compared with the mean in March, with its maximum value approaching 16m/s. The time series data is subjected to modal decomposition, and the number of modes is 4. The decomposition sequence results are obtained, as shown in Fig 11.

From Fig 11, after decomposition, the time series data was divided into various modal parts. The curve of component 1 fluctuated significantly, mainly showing the nonlinear characteristics of the original sequence data. The other components were distributed up and down with zero as the baseline, which could better identify the sequence data. After testing, the decomposed sequence had stationarity. Based on the characteristics of different modal components, linear and nonlinear predictions are performed. Subsequently, a comparison is made between a single algorithm and the proposed combination algorithm. The results are shown in Fig 12.

thumbnail
Fig 12. Comparison results of prediction model with single algorithm and combination algorithm.

https://doi.org/10.1371/journal.pone.0326035.g012

The results in Fig 12a indicated that the proposed hybrid prediction model had a relatively small relative error value compared with the FPA-WPA and CS-BP, with a maximum value not exceeding 0.2. The fluctuation of the error curve was smaller than that of the other two algorithms. The maximum relative error values of FPA-WPA and CS-BP reached 0.34 and 0.31, which were less than 0.4 and also had lower error values. In Fig 12b, the proposed prediction model roughly followed the true value curve, with less node deviation. The hybrid model proposed in references [14, 42] had a small error compared with the true values, but they were significantly affected by the number of sample points, with a node deviation of at least 0.5%. The prediction indicators of several combination models are further analyzed, as displayed in Table 2.

thumbnail
Table 2. Prediction indicators results of combination model.

https://doi.org/10.1371/journal.pone.0326035.t002

The results in Table 2 indicated that the MAE, RMSE, and MAPE exhibited by the research model were 0.1947, 0.3847, and 15.23, which were much higher than the other two comparative algorithms, with maximum differences reaching 0.1509m/s, 0.0475m/s, and 3.02%. The fitting accuracy of these three combination models was good, but the model proposed in the study tended towards 1.0 in the determination coefficient. Subsequently, the proposed prediction method and other combination models are used to analyze the wind speed prediction results, as shown in Table 3.

thumbnail
Table 3. Wind speed prediction results of the combination model.

https://doi.org/10.1371/journal.pone.0326035.t003

The results in Table 3 indicated that the MAE, RMSE, and MAPE values exhibited by the research model in wind speed prediction were 0.1835, 0.4017, and 14.98, with a fitting accuracy of 0.994. The MAE difference between other comparison models, such as VMD-ISOA-KELM and CNN-BLSTM, and the research model did not exceed 0.03, and performed relatively well in RMSE and MAPE indicators, with a fitting accuracy of over 0.95. However, overall, the wind speed prediction results of the proposed model are good, with a judgment coefficient tending towards 1.0. Statistical testing was conducted on the above results to obtain Table 4.

In Table 4, the research method showed a 2% and 4% increase in MAE and RMSE under noisy data, which is much smaller than other algorithms. Moreover, its performance in terms of memory usage, optimization time, and computational efficiency is also better than other comparative algorithms. The statistical test results are significant, demonstrating good application performance and effectiveness. The study selects annual wind speed data from three regions in the Bohai Bay for wind energy resource evaluation. It is assumed that the entry and exit wind speeds of wind turbines are 2.5m/s and 25m/s, respectively, as displayed in Table 5.

thumbnail
Table 5. Energy prediction results of hybrid prediction models in different regions.

https://doi.org/10.1371/journal.pone.0326035.t005

In Table 5, the average wind power density can reflect the wind energy capacity of the region. The effective wind power density can evaluate the performance of wind power generation. The utilization rate is linked with the wind speed of the wind turbine. The higher the utilization rate indicates the higher the wind speed, demonstrating that it is larger and can reflect the potential of wind power generation applications. The utilization rates of these three regions were all greater than 0.96. In region A, it reached 388.547W/m2, while the effective wind power density values in regions B and C were both greater than 1,012W/m2. The above results indicate that the wind energy resources in these three regions are relatively abundant. The wind energy indicators can be computed using relevant formulas and Weibull distribution parameters, indicating that the evaluation method proposed in the study can effectively support regional wind energy resource assessment. When the model is applied to different regions or larger datasets, it may encounter insufficient generalization ability, which requires cross validation or other generalization strategies to improve the adaptability of the model. The time series data of wind energy resources usually have a long time span. As the length of the time series increases, the model may require more complex structures or update mechanisms to handle the time-varying characteristics of wind energy resources. The study analyzes the 24-hour wind power of wind farms in the Bohai Bay area, and the results are shown in Table 6.

The results in Table 6 indicate that the difference between the predicted and actual power values of the wind farm is small, and the absolute value of the maximum power difference does not exceed 20MW. The curve trends in the time ranges of (5, 8) and (18, 24) are relatively similar. The above results indicate that the method proposed in the research can effectively evaluate the situation of wind energy resources and has a good application effect on predicting wind power. The above content has been demonstrated on algorithm convergence, predictive performance testing, etc. To further test the effectiveness of the research method, a time complexity comparison is made between the research method and a single processing method. The data scale comes from wind speed data in Bohai Bay. The scale quantities are 500, 1,000, and 1,500, and the variable dimensions are 5, 10, and 20. The single processing method is the Weibull distribution model and the wolf pack intelligent algorithm. The results are shown in Table 7.

In Table 7, when the data size was less than 1,500, the running time of the research method was generally higher than that of the Weibull distribution model and lower than that of the traditional WPA. The running time was 3.2 seconds when the data scale was 1,000 and the variable dimension was 10. The Weibull distribution model took only 2.8 seconds, while the traditional WPA took 6.5 seconds to run. The Weibull distribution model is a statistical model with low computational complexity, which is suitable for simple scenarios. Although the research method has a longer running time, it combines optimization algorithms and prediction models, which can handle more complex nonlinear problems and is suitable for multidimensional and large-scale data in wind energy resource assessment. The research method reduces unnecessary calculations in the WPA by improving the walking behavior and position update strategy, thereby improving efficiency. Overall, the running time of the research algorithm has a relatively small increase, and it is relatively less affected by the data size. Moreover, the running time at a data size of 1,500 is generally less than 10 seconds.

5. Discussion

Wind energy resource assessment is an important step in determining the potential and feasibility of wind energy development in a certain region, and wind speed prediction plays a crucial role in this process. By analyzing historical wind speed data and predicting future wind speeds, the spatiotemporal distribution characteristics of wind energy resources can be understood, providing scientific basis for the planning and design of wind energy projects. The accuracy of wind speed prediction also directly affects the economic benefits of wind farms. Accurate wind speed prediction can help wind farm operators arrange the operation and maintenance plans of wind turbines reasonably, reduce operating costs, and improve power generation efficiency. On the contrary, it may lead to frequent shutdowns or excessive wear of wind turbines, thereby reducing the economic benefits of wind farms. Wind speed and wind prediction are mostly related to their current atmospheric conditions. The structural advantages of deep neural networks are used to predict data information. However, due to the randomness and immediacy of wind power models, there are still difficulties in model construction. Regarding wind power resource assessment and wind power forecasting, X. Deng et al. utilized complementary ensemble empirical decomposition algorithm and Whale Optimization Algorithm (WOA) to construct a predictive hybrid model on the basis of wind power time series signals. This prediction approach could reduce the computational complexity, with good stability and small reconstruction errors [43]. A. Alkesaiberi et al. used machine learning models to predict wind power time series data. Bayesian optimization was used to optimize Gaussian process regression models, and ensemble learning models. The results indicated that the model could effectively predict wind power while considering lagged data [42].

In response to the limitations of current wind energy resource evaluation methods, a hybrid evaluation method based on wolf pack intelligent optimization algorithm and predictive statistical model was proposed. The scale parameter in the wind speed model determined the threshold size of the wind speed range, and its value was related to the maximum wind speed value in the sample. In the grey wolf optimization algorithm, the initial position of the grey wolf was usually randomly initialized. This paper used the grey wolf optimization algorithm to characterize the initial conditions for wind speed: the population size was 3, the initial position was randomly generated, the total number of iterations was 100, and the search space boundary conditions were [0.1, 10.0] (m/s). The wind speed data was scaled to [0, 1]. The randomness of the initial conditions may lead to an uneven distribution of the algorithm in the search domain, thus affecting the search performance and convergence speed. When the initial position was far from the global optimal solution, the algorithm might require more iterations to converge to the optimal solution, which led to inaccurate prediction of the Weibull distribution scale parameters. The effect of the initial conditions on the convergence speed could also affect the accurate prediction of the Weibull distribution scale parameters. Its performance was tested and application results were analyzed. The proposed Weibull distribution fitted the data well, with an average of 0.05, 0.014, and 0.96 in residual sum of squares, RMSE, and determination coefficient, respectively. Under the RMSE, the fitting range of Weibull distribution ([0.0115, 0.0248]) was better than that of Gamma distribution ([0.013, 0.04]), Lognormal distribution ([0.0225, 0.076]), and Rayleigh distribution ([0.041, 0.063]). The FPA-WPA proposed in the study showed a fast convergence speed, and the prediction model passed the stationarity test. Moreover, the overall prediction error of the CS-BP prediction model at different sample points did not exceed 3%, and the error amplitude was relatively small due to the influence of sample points, with a maximum value not exceeding 0.6. For the application effectiveness, the decomposed time series data had good recognizability, and the proposed hybrid prediction model had a relatively small relative error value compared with the FPA-WPA and CS-BP, with a maximum value not exceeding 0.2. The maximum relative error values of the FPA-WPA and CS-BP reached 0.34 and 0.31, respectively. Compared with other combination models, the predicted model had roughly the same trend as the true value curve, with MAE, RMSE, and MAPE values of 0.1947, 0.3847, and 15.23. However, the other two models had at least 0.5% node deviation. Compared with the research model, the maximum difference in MAE, RMSE, and MAPE values was 0.1509m/s, 0.0475m/s, and 3.02%. The research contribution objectives mainly focus on the application of wolf pack intelligent optimization algorithms and predictive statistical models in wind energy resource evaluation, aiming to improve the accuracy and reliability of wind energy resource evaluation, promote the sustainable development and utilization of wind energy resources, and promote the healthy development of the wind power industry.

6. Conclusions and future research

The evaluation method proposed in this study can effectively assist various regions in conducting wind energy resource assessments. The hybrid model proposed in this study can effectively evaluate wind energy resources and predict wind speed with high prediction accuracy. However, the applicability of the Weibull model may be limited by specific geographical and climatic conditions, such as certain extreme or special meteorological conditions. The hybrid prediction model composed of improved WPA and CS may have high computational complexity. However, in practical wind speed prediction applications, there may be problems such as high computational cost and complex parameter optimization and adjustment.

Future research can focus on improving model generalization ability (such as introducing transfer learning techniques, designing adaptive mechanisms, etc.), optimizing computational complexity (using parallel or distributed computing techniques, developing lightweight models, etc.), enhancing adaptability to extreme conditions, combining deep learning techniques, and developing real-time prediction systems. The current research mainly relies on historical data for prediction, lacking the ability for real-time prediction and dynamic optimization. In the future, real-time data collection and processing systems can be developed to achieve real-time prediction of wind speed and energy. An intelligent wind energy resource assessment and prediction platform can be constructed by combining IoT technology. In the future, the paper will make breakthroughs in algorithm design, wind energy mechanism research, and application scenarios, providing stronger technical support for the development and utilization of wind energy resources, and promoting the sustainable development of the renewable energy industry.

Supporting information

References

  1. 1. Bazionis IK, Georgilakis PS. Review of deterministic and probabilistic wind power forecasting: models, methods, and future research. Electricity. 2021;2(1):13–47.
  2. 2. Nzubechukwu Chukwudum O, Adebayo Olusegun A, Emmanuel Chigozie A, Peter Efosa O, Donald Obinna D, Odulaja BA. AI-driven solutions in renewable energy: A review of data science applications in solar and wind energy optimization. World J Adv Res Rev. 2023;20(3):401–17.
  3. 3. Singh U, Rizwan M, Alaraj M, Alsaidan I. A machine learning-based gradient boosting regression approach for wind power production forecasting: a step towards smart grid environments. Energies. 2021;14(16):5196.
  4. 4. Khan PW, Byun Y-C, Lee S-J, Kang D-H, Kang J-Y, Park H-S. Machine learning-based approach to predict energy consumption of renewable and nonrenewable power sources. Energies. 2020;13(18):4870.
  5. 5. Schneemann J, Theuer F, Rott A, Dörenkämper M, Kühn M. Offshore wind farm global blockage measured with scanning lidar. Wind Energ Sci. 2021;6(2):521–38.
  6. 6. Kosovic B, Haupt SE, Adriaansen D, Alessandrini S, Wiener G, Delle Monache L, et al. A comprehensive wind power forecasting system integrating artificial intelligence and numerical weather prediction. Energies. 2020;13(6):1372.
  7. 7. Devi AS, Maragatham G, Boopathi K, Rangaraj AG. RETRACTED ARTICLE: Hourly day-ahead wind power forecasting with the EEMD-CSO-LSTM-EFG deep learning technique. Soft Comput. 2020;24(16):12391–411.
  8. 8. Nazir MS, Alturise F, Alshmrany S, Nazir HafizMJ, Bilal M, Abdalla AN, et al. Wind generation forecasting methods and proliferation of artificial neural network: a review of five years research trend. Sustainability. 2020;12(9):3778.
  9. 9. Sharma P, Mishra RK, Bhola P, Sharma S, Sharma G, Bansal RC. Enhancing and optimising solar power forecasting in dhar district of india using machine learning. Smart Grids and Energy. 2024;9(1).
  10. 10. Wang Y, Wang D, Tang Y. Clustered hybrid wind power prediction model based on ARMA, PSO-SVM, and clustering methods. IEEE Access. 2020;8:17071–9.
  11. 11. Santhosh M, Venkaiah C, Vinod Kumar DM. Current advances and approaches in wind speed and wind power forecasting for improved renewable energy integration: a review. Engineering Reports. 2020;2(6).
  12. 12. Song X, Zhan C, Kong F, Xia J. Advances in the study of uncertainty quantification of large-scale hydrological modeling system. J Geogr Sci. 2011;21(5):801–19.
  13. 13. Zhang Y, Li R, Zhang J. Optimization scheme of wind energy prediction based on artificial intelligence. Environ Sci Pollut Res Int. 2021;28(29):39966–81. pmid:33763837
  14. 14. Viet DT, Phuong VV, Duong MQ, Tran QT. Models for Short-Term Wind Power Forecasting Based on Improved Artificial Neural Network Using Particle Swarm Optimization and Genetic Algorithms. Energies. 2020;13(11):2873.
  15. 15. Kisvari A, Lin Z, Liu X. Wind power forecasting – A data-driven method along with gated recurrent neural network. Renewable Energy. 2021;163:1895–909.
  16. 16. Hossain MA, Gray E, Lu J, Islam MR, Alam MS, Chakrabortty R, et al. Optimized Forecasting Model to Improve the Accuracy of Very Short-Term Wind Power Prediction. IEEE Trans Ind Inf. 2023;19(10):10145–59.
  17. 17. Karim FK, Khafaga DS, Eid MM, Towfek SK, Alkahtani HK. A novel bio-inspired optimization algorithm design for wind power engineering applications time-series forecasting. Biomimetics (Basel). 2023;8(3):321. pmid:37504209
  18. 18. Bochenek B, Jurasz J, Jaczewski A, Stachura G, Sekuła P, Strzyżewski T, et al. Day-ahead wind power forecasting in poland based on numerical weather prediction. Energies. 2021;14(8):2164.
  19. 19. Chien FS, Kamran HW, Albashar G, Iqbal W. Dynamic planning, conversion, and management strategy of different renewable energy sources: a sustainable solution for severe energy crises in emerging economies. Int J Hydrogen Energ. 2021;46(11):7745–58.
  20. 20. Teimourian H, Abubakar M, Yildiz M, Teimourian A. A comparative study on wind energy assessment distribution models: a case study on weibull distribution. Energies. 2022;15(15):5684.
  21. 21. El-kenawy E-SM, Khodadadi N, Mirjalili S, Abdelhamid AA, Eid MM, Ibrahim A. Greylag Goose Optimization: Nature-inspired optimization algorithm. Expert Syst Appl. 2024;238:122147.
  22. 22. Chen X, Cheng F, Liu C, Cheng L, Mao Y. An improved Wolf pack algorithm for optimization problems: Design and evaluation. PLoS One. 2021;16(8):e0254239. pmid:34437547
  23. 23. Mahela OP, Khan B, Haes Alhelou H, Tanwar S. Assessment of power quality in the utility grid integrated with wind energy generation. IET Power Electronics. 2020;13(13):2917–25.
  24. 24. Mergos PE, Yang X-S. Flower pollination algorithm parameters tuning. Soft comput. 2021;25(22):14429–47. pmid:34539232
  25. 25. Al-Janabi S, Alkaim AF, Adel Z. An innovative synthesis of deep learning techniques (DCapsNet & DCOM) for generation electrical renewable energy from wind energy. Soft Comput. 2020;24(14):10943–62.
  26. 26. Barik AK, Jaiswal S, Das DC. Recent trends and development in hybrid microgrid: a review on energy resource planning and control. Int J Sustain Energ. 2021;41(4):308–22.
  27. 27. Pryor SC, Barthelmie RJ. A global assessment of extreme wind speeds for wind energy applications. Nat Energy. 2021;6(3):268–76.
  28. 28. Khaled K, Singla MK. Predictive Analysis of Groundwater Resources Using Random Forest Regression. JAIM. 2025;09(01):11–9.
  29. 29. Zhang Y, Pan G. A hybrid prediction model for forecasting wind energy resources. Environ Sci Pollut Res Int. 2020;27(16):19428–46. pmid:32215801
  30. 30. Morales Pedraza J. The role of renewable energy in the transition to green, low-carbon power generation in Asia. GLCE. 2023;1(2):68–84.
  31. 31. Liu X, Zhang L, Zhang Z, Zhao T, Zou L. Ultra Short Term Wind Power Prediction Model Based on WRF Wind Speed Prediction and CatBoost. IOP Conf Ser: Earth Environ Sci. 2021;838(1):012001.
  32. 32. Yousuf MU, Al‐Bahadly I, Avci E. Wind speed prediction for small sample dataset using hybrid first‐order accumulated generating operation‐based double exponential smoothing model. Energ Sci Eng. 2022;10(3):726–39.
  33. 33. Reiss MA, MacNeice PJ, Muglach K, Arge CN, Möstl C, Riley P, et al. Forecasting the Ambient Solar Wind with Numerical Models. II. An Adaptive Prediction System for Specifying Solar Wind Speed near the Sun. ApJ. 2020;891(2):165.
  34. 34. Zhang Y, Zhang W, Guo Z, Zhang S. An effective wind speed prediction model combining secondary decomposition and regularised extreme learning machine optimised by cuckoo search algorithm. Wind Energy. 2022;25(8):1406–33.
  35. 35. Wang J, Yang Z. Ultra-short-term wind speed forecasting using an optimized artificial intelligence algorithm. Renewable Energy. 2021;171:1418–35.
  36. 36. Pang J, Dong S. A novel ensemble system for short-term wind speed forecasting based on hybrid decomposition approach and artificial intelligence models optimized by self-attention mechanism. Energy Conversion and Management. 2024;307:118343.
  37. 37. Chen X, Li Y, Zhang Y, Ye X, Xiong X, Zhang F. A novel hybrid model based on an improved seagull optimization algorithm for short-term wind speed forecasting. Processes. 2021;9(2):387.
  38. 38. Lawal A, Rehman S, Alhems LM, Alam MdM. Wind Speed Prediction Using Hybrid 1D CNN and BLSTM Network. IEEE Access. 2021;9:156672–9.
  39. 39. Lee S, Hong J, Liu L, Choi W. TS-Fastformer: Fast Transformer for Time-series Forecasting. ACM Trans Intell Syst Technol. 2024;15(2):1–20.
  40. 40. Liu X, Zhang Y, Zhang Q. Comparison of EEMD-ARIMA, EEMD-BP and EEMD-SVM algorithms for predicting the hourly urban water consumption. J Hydroinform. 2022;24(3):535–58.
  41. 41. Yang S, Yang H, Li N, Ding Z. Short-Term Prediction of 80–88 km Wind Speed in Near Space Based on VMD–PSO–LSTM. Atmosphere. 2023;14(2):315.
  42. 42. Alkesaiberi A, Harrou F, Sun Y. Efficient wind power prediction using machine learning methods: a comparative study. Energies. 2022;15(7):2327.
  43. 43. Deng X, Shao H, Hu C, Jiang D, Jiang Y. Wind power forecasting methods based on deep learning: a survey. Comput Model Eng Sci. 2020;122(1):273–301.