Figures
Abstract
In light of global climate change and the objective of carbon neutrality, the carbon market has become an important tool for the international community to combat climate change. Nonetheless, due to the complexity and non-linear nature of the carbon price, its accurate prediction has always been a research difficulty. This work presents a hybrid model incorporating comprehensive feature screening, optimized quadratic decomposition, and the Optuna-Attention-LSTM prediction method, aiming to improve the accuracy and stability of carbon price prediction. First, the improved complete ensemble empirical mode decomposition with adaptive noise (ICEEMDAN) is used to decompose the carbon price time series once, extract high-frequency and low-frequency components, and denoise the high-frequency components using stacked denoising autoencoder (SDAE). Then, the variational mode decomposition (VMD) method is subsequently employed to execute a secondary decomposition on the reconstructed signal, with the decomposition hyperparameters optimized via crested porcupine optimization (CPO). Subsequently, Boruta and least absolute shrinkage and selection operator (Lasso) regression are employed to identify significant external features; finally, a long short-term memory (LSTM) model integrated with an attention mechanism is utilized for prediction, and optuna is introduced to optimize the hyperparameters. This paper evaluates the performance of the proposed model using the carbon markets of Guangdong, Hubei, and Shanghai in China as examples. The experimental results indicate that compared with the traditional model, the proposed model achieves average reductions of 67.30%, 47.68%, 48.42%, and 48.79% in the mean squared error (MSE), root mean squared error (RMSE), mean absolute error (MAE), and mean absolute percentage error (MAPE), respectively, demonstrating higher prediction accuracy and robustness. Shapley additive explanations (SHAP) analysis results indicate that carbon prices in the Guangdong carbon market are dominated by macroeconomic and regional environmental factors, while those in the Hubei carbon market are mainly driven by changes in the energy market. The Shanghai carbon market, on the other hand, is more significantly influenced by global carbon market dynamics and international trade activities. The research not only verifies the efficacy of the decomposition ensemble prediction framework, but also provides a scientific basis for decision-making for carbon market participants and policymakers.
Citation: Yi Y (2025) Forecasting regional carbon prices in china with a hybrid model based on quadratic decomposition and comprehensive feature screening. PLoS One 20(6): e0326926. https://doi.org/10.1371/journal.pone.0326926
Editor: Jie Zhang, Newcastle University, UNITED KINGDOM OF GREAT BRITAIN AND NORTHERN IRELAND
Received: March 11, 2025; Accepted: June 8, 2025; Published: June 30, 2025
Copyright: © 2025 Yaoyang Yi. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data underlying the findings of this study have been uploaded as Supporting Information files.
Funding: The author(s) received no specific funding for this work.
Competing interests: The authors have declared that no competing interests exist.
1. Introduction
The swift progression of industrialization and the extensive use of traditional fossil fuels have further exacerbated the greenhouse effect, with far-reaching impacts on the energy mix and human society [1]. In response to global warming and environmental degradation, countries have put forward targets and commitments to achieve carbon neutrality through policy announcements and executive orders. 1997 saw the adoption of the Kyoto Protocol, which for the first time brought greenhouse gas emissions under legal constraints, marking the birth of the global carbon market [2]. The Paris Agreement adopted in 2015 not only continues the legal framework of the Kyoto Protocol, but also draws on the successful experience of the European Union’s carbon market, marking the beginning of a new phase in global climate governance [3].
To this end, China has introduced a dual carbon objective to attain peak carbon emissions by 2030 and realize carbon neutrality by 2060 [4]. China’s carbon market is currently in a transition stage from early to mature development. Market mechanisms are still imperfect, and relevant laws, regulations, and policy systems need to be strengthened. Since 2011, nine pilot carbon emission trading programmes have been launched in Beijing, Tianjin, Shanghai, Chongqing, Guangdong, Hubei, Shenzhen, Sichuan, and Fujian. As a developing carbon trading market, it is vulnerable to influences such as weather, energy prices, and international carbon prices [5]. These factors interact with each other, increasing the complexity of traditional modeling and forecasting. Therefore, an appropriate feature selection method is crucial in carbon price forecasting. It can not only enhance the precision of predictions, but also improve the model’s interpretability [6]. In addition, the highly accurate decomposition and integration prediction framework can effectively track evolving patterns and operational dynamics of carbon price, enhancing the ability of feature extraction, representation and learning, which enhances predictive performance [7].
1.1. Literature review
1.1.1. Forecasting methods in the carbon market.
As carbon trading systems are complex mechanisms, a common theme of discussion in the wider literature is how to enhance the precision of carbon market forecasts and the interpretability of the results. In recent years, carbon market forecasting methods have been classified into three main categories: econometric models, artificial intelligence models, and hybrid models. Conventional statistical and econometric methods are extensively utilized for time series forecasting. For example, Liu et al. [8] found that the GARCH-MIDAS model combining economic policy uncertainty (EPU) and realized volatility (RV) is more effective in predicting China’s carbon volatility than the model based on RV only. Niu and Liu [9] studied the volatility of EU quota futures using the mixed data sampling method based on the GJR-GARCH model. They found that accounting for the asymmetric effect of volatility and the influence of macroeconomic variables can substantially enhance the model’s predictive capability. In addition, Zhang et al. [10] extended the classical autoregressive model and found that the EPU index possesses substantial long-term predictive power for the volatility of EU carbon allowance (EUA) futures.
Although traditional econometric models can intuitively measure the importance of characteristics, they falter in capturing the non-linear aspects of carbon prices. Artificial intelligence models not only have the arithmetic calculation capabilities of traditional econometric models, but also can capture historical information, volatility and nonlinear characteristics. Therefore, they significantly improve the accuracy and performance of predictive analysis. Their advantages in predicting financial time series have now been widely proven [11]. Xu et al. [12] used two traditional econometric models alongside three machine learning algorithms to predict carbon returns in China’s carbon trading market, and found that the random forest model had higher accuracy in predicting carbon returns in China than traditional econometric model. Gong et al. [13] chose extreme gradient boosting (XGBoost), light gradient boosting machine (LightGBM) and deep neural network (DNN) models to study how climate change impacts carbon futures returns. The findings indicate that the prediction accuracy markedly enhances when climate factors are incorporated into machine learning or deep learning models. Li et al. [14] used multivariate LSTM, multi-layer perceptron, support vector regression and recurrent neural network (RNN) for predicting Hubei emissions allowances (HBEA) and Guangdong emissions allowances (GDEA), and compare the prediction results with each other, which show that multivariate LSTM performs better for carbon price forecasting. Pan et al. [15] constructed an LSTM model that introduced particle swarm optimization to predict carbon prices and extracted effective keywords from Google search volume indices as inputs, which improved the prediction accuracy. Wang et al. [16] proposed a new hybrid prediction model based on a stacked ensemble learning algorithm. which integrates the echo state network, stacked autoencoder, support vector machine (SVM), and LSTM models as base learners, and a convolutional neural network (CNN) – bidirectional LSTM hybrid model as the meta-learner. This model effectively extracts time series features, and achieves high prediction accuracy and good generalisation ability.
AI models outperform traditional econometrics models in predicting nonlinear and non-stationary data. However carbon price data usually contain a large amount of noise and volatility, posing challenges for single models to handle complex, multilevel variations while ensuring predictive stability. For this reason, some scholars have proposed hybrid models and gradually improved them. Huang et al. [17] introduced a new decomposition-ensemble paradigm VMD-GARCH/LSTM-LSTM model, and found that the prediction errors of all decomposition-ensemble models are smaller than that of a single model, which confirms that decomposition-ensemble methods are effective in carbon price forecasting, and the study also shows that nonlinear integration algorithms can boost the prediction performance of multi-scale models. Wang et al. [18] developed a deep learning-based framework for forecasting carbon prices in Hubei province’s carbon trading pilot, and the study showed that principal component analysis and complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN) methods significantly improve the accuracy of carbon price prediction.
In hybrid modeling, secondary decomposition has the obvious advantage of capturing the details that cannot be identified by primary decomposition and can better separate the noise from the useful signals. Sun and Huang [19] first decomposed the time series data into multiple intrinsic mode functions (IMFs) using empirical mode decomposition (EMD), and then applied variational mode decomposition (VMD) to the first intrinsic mode function (IMF1) for further decomposition. Their findings showed that model incorporating the quadratic decomposition algorithm achieved significantly higher prediction accuracy compared to those using undecomposed data or solely EMD methods. Compared to the EMD decomposition method, in order to extract the features of the signal more efficiently, Ozan [20] used multiple machine learning models optimized by CEEMDAN, VMD, and genetic algorithms, and found that the secondary decomposition markedly enhances model robustness and prediction accuracy by comparing the key performance metrics of the primary decomposition and secondary decomposition models. This outcome aligns with the findings of Sun and Huang. Zhou et al. [21] proposed a hybrid framework that combines CEEMDAN, sample entropy, VMD and LSTM. The study showed that the integrated VMD-LSTM method can effectively predict high-frequency time series extracted by CEEMDAN. In addition, through verification and comparison with different datasets, the study further proves that LSTM outperforms gated recurrent unit and DNN in carbon price forecasting. In order to further address the modal mixing issue in decomposition, Yang et al. [22] first performed a preliminary decomposition of carbon prices using ICEEMDAN, and then used multiscale fuzzy entropy to divide the decomposition results into high, medium, and low complexity components. After merging the high complexity components, they performed a second decomposition on the merged components using complete ensemble empirical mode decomposition. It is shown that ICEEMDAN surpasses other decomposition methods when dealing with data from Shenzhen and Hubei, and the secondary decomposition markedly diminishes data complexity, establishing a basis for subsequent precise predictions. Xu et al. [23] also employed an ICEEMDAN method, combined with a wavelet decomposition (WD) to perform a secondary decomposition of the original monthly runoff series. Subsequently, they utilised a seagull optimisation algorithm (SOA) optimised SVM to predict each component. This model framework effectively improved the prediction accuracy.
1.1.2. Factors influencing the carbon market.
There is a wide range of literature that explores the factors affecting carbon prices, and these factors can be analyzed from both internal and external perspectives. In terms of internal factors, Sadefo and Njong [24] used a self-refinement to coarse reconstruction algorithm to decompose the subsequences and residuals extracted from the original carbon price into high-frequency, low-frequency, and trend components, aiming to deeply study the characteristics and basic laws of carbon prices in different frequency bands. Similarly, Zhu et al. [25] used improved random forest using particle swarm optimisation, extreme learning machine and LSTM to predict sequences of differing complexity, following the extraction of diverse features from the original carbon price series, and then combined and reconstructed them to better capture the characteristics of each component.
Although decomposition based on the historical data series of carbon price can effectively improve the prediction accuracy and generalization ability, it is limited by its disregard for external influences such as energy price and economic factors. To compensate for this deficiency, Li and Liu [26] classified external factors into structural and non-structural factors, and used Baidu search index as a non-structural data source. They highlighted that incorporating non-structural data can enhance the informational scope and consequently augment predictive accuracy. Similarly, Huang and He [27] refined the Baidu search index in feature selection by selecting the Baidu search index corresponding to low carbon economy, carbon trading, carbon emissions, carbon exchange rate, emission reduction, and carbon neutrality as unstructured influencing factors. Zhang et al. [28] also considered 20 factors related to carbon prices, including Baidu search index, when constructing a mixed model for forecasting carbon price. Finally, the Lasso method was used to extract variables strongly linked to carbon prices. The experimental results showed that the model performed well in terms of prediction accuracy and stability. Li et al. [29] selected text data from three sources: news reports, policy documents, and social commentary. They then integrated China’s daily time-series data on carbon emissions and the text data into a multimodal feature set using a convolutional neural network inceptionv2 and a text convolutional neural network optimised with an attention mechanism. This study comprehensively considered the factors influencing carbon emissions.
As for the structural factors, using external influences such as macroeconomics, energy prices, and international carbon markets as model inputs can substantially enhance the precision of carbon price forecasts [30–31]. Mao and Yu [32] used the SHAP method to analyze the key indicators of carbon price forecasting, and found that the carbon price was affected by the multiple influences of macroeconomic conditions, international carbon market prices, exchange rates, and domestic and international major energy prices. Zhang et al. [33] on the other hand, constructed a dynamic self-learning integrated forecasting framework, choosing Brent crude oil futures price and natural gas futures price as fuel and energy factors, and stoxx 600 index and goldman sachs commodity index (GSCI) as economic factors. The study demonstrates that the model achieves both high forecasting accuracy and stability. In addition, Hu and Cheng [34] selected the main external factors with the largest information coefficient based on the maximum information coefficient (MIC) method, revealing that the Guangdong carbon market price is heavily affected by the EUA price. Wang et al. [35] also selected features using the MIC, covering five aspects: fossil energy, macroeconomics, exchange rates, industrial development, and climate. The study showed that variables reflecting the overall economic situation, such as stock indices, exchange rates, and industrial development indices, have a significant positive contribution to the medium-term and long-term trend of carbon prices, while recent fossil fuel prices have a negative impact on the short-term trend of carbon prices due to supply and demand. It is worth noting that Wang et al. [36] also selected internal carbon market data such as opening price, maximum price and minimum price, and weather pollution such as air quality index (AQI) and PM2.5 as factors affecting the carbon price, and proposed an intelligent optimization decomposition and integration model based on multi-feature fusion, which achieves excellent prediction performance. Zheng et al. [37] also divided the factors affecting carbon emissions into three aspects. Among them, the carbon market aspect is reflected by the EU allowance futures carbon price (EUAP), EU allowances futures carbon trading volume and the Hubei carbon trading volume. The economic environment aspect is reflected by the dollar against the RMB exchange rate and CSl 300 index, etc., and the daily AQI of Wuhan City to reflect the natural environment. The study found that Brent crude oil futures settlement price and EUAP have a positive impact on carbon prices, while CSl 300 index has a negative impact on carbon prices.
1.2. Research gaps and contributions
Through the above literature review, it can be seen that although some scholars have achieved certain results in the research on carbon price forecasting, there are also shortcomings. First, most of the literature only focuses on the influence of conventional factors such as economy and energy, without taking domestic industry factors into account. Second, some studies do not consider whether the decomposition is sufficient in carbon price forecasting, and there are still problems of modal mixing and noise interference. Moreover, many studies still use the traditional hyperparameter optimization method for parameter adjustment, which cannot dynamically change the search range according to the actual results, and the search efficiency is low. Finally, some studies did not assign appropriate weights to different inputs in the final prediction, thus ignoring the contribution of important factors.
To this end, this paper proposes a hybrid model combining comprehensive feature screening, an optimized quadratic decomposition framework, and an Optuna-Attention-LSTM prediction method, providing the following contributions.
- In this paper, we find the best XGBoost model parameters suitable for filling the missing values in each column through grid search and cross-validation. After finding the best combination of parameters, we use the trained XGBoost model to predict the missing values in the data. The model uses the known data to learn the pattern of missing values and fills in the unknown parts, thus supplementing the missing data in the three carbon markets.
- In this paper, while considering external factors such as environmental and climate factors, energy factors, etc., we innovatively introduce domestic industry factors, adopt the feature selection method based on Boruta’s algorithm and LightGBM model, and synthesize it with Lasso’s regression feature selection method, and take the intersection of the filtered features and put it into the input of the final prediction, and at the same time take into account the nonlinear and linear feature.
- The study combines ICEEMDAN and VMD to better extract the nonlinear features of carbon price, improves the quality of the reconstructed signal by noise reduction of the high-frequency component obtained by ICEEMDAN through SDAE, and introduces CPO algorithm to optimize the parameters in the process of VMD, which effectively improves the accuracy of decomposition.
- In terms of model prediction, the optuna automated hyper-parameter optimization framework is innovatively introduced and combined with the attention mechanism to predict the prices of the three carbon markets through LSTM, improving prediction accuracy and stability.
2. Methodology
2.1. XGBoost
Iteratively filling in time series missing values based on machine learning models such as XGBoost can effectively reduce bias and improve the accuracy of data recovery in multiple scenarios [38]. As an algorithm based on gradient boosting decision trees, XGBoost incrementally enhances predictive accuracy by combining multiple weak learners.
To fill the target features containing missing values, XGBoost uses other features as input to train a model that can effectively predict these target features. The loss function is expressed as:
Where denotes the true value of the target feature containing the missing values, and
is the predicted value of the model, and the loss function
in this paper is in the form of negative mean square error, and
is the regularization term, and the input feature set consists of other features through which the model predicts the target variable containing missing values.
To optimize the model, the hyperparameters are adjusted by a grid search to minimize the loss on the validation set. The optimization objective expression is given as:
Where denotes the loss of the validation set of the model in the kth-fold cross-validation, and K is the number of cross-validation folds. By traversing all possible combinations in the hyperparameter space
, the lattice search selects the optimal parameters that minimize the loss
, thus achieving accurate prediction of missing values.
2.2. Feature screening
2.2.1. LightGBM based Boruta feature selection method.
In the Boruta framework, Random Forests are usually used to compute the importance of features, for example, Ghosh et al. [39] proposed an ensemble feature selection (EFS) method incorporating the Boruta and regularized random forest algorithms for filtering explanatory features. The effectiveness of combining the EFS process with advanced AI-based prediction models is well demonstrated. In this paper, we introduce the LightGBM model to be used as a base model in the Boruta algorithm to evaluate the relative significance of original and shadow features.
Let the closing price of the three carbon markets be the target variable, denoted as , where
denotes the target value of the ith sample, and
denotes the value of the feature over all samples. Boruta’s algorithm introduces shadow features, defined as:
Among them, the shadow feature are obtained by randomly permuting the values of the original feature
and do not contain any predictive information, so the shadow feature can be used as a benchmark for the uninformative features.
For a given feature , its importance score
is defined as the cumulative split gain of the feature across all decision trees.
Where denotes the total number of decision trees in the LightGBM model,
denotes the number of nodes in the jth tree,
is an indicator function. When
is used as the splitting feature at the kth node of the jth tree,
Otherwise
. Gain
denotes the gain of the split at the kth node of the jth tree for the feature
.
By using the above formula, the importance scores of all shadow features are calculated and the maximum value is taken as the upper limit of the importance of the shadow feature, denoted as:
Boruta’s algorithm performs multiple rounds of iterations, regenerating the shadow features and calculating the feature importance scores in each round, and then comparing them with the importance scores of the original features until all features are labeled as “confirmed” or “rejected”, or until the maximum number of preset iterations is reached. The stability and reliability of the filtering results are ensured through multiple rounds of iterations. The final retained feature set is defined as:
2.2.2. feature selection method based on Lasso regression.
The main reason for using linear models for feature selection in this paper is that financial data is noisy and nonlinear models may be too flexible and easy to extract irrelevant patterns from the noise, leading to poor feature selection [40].
Lasso regression controls the sparsity of features by adding a regularization term to the objective function of ordinary least squares regression. The optimization objective function for Lasso regression is defined as:
Where is the number of samples,
is the target variable,
is the target value of the ith sample, and
is the value of the jth feature of the ith sample.
is the intercept term of the model.
is the regression coefficient corresponding to the feature.
is the regularisation parameter, which controls the strength of
regularisation.
In the optimization process, Lasso regression imposes sparsity constraints on some of the characteristic coefficients, resulting in a sparse vector of regression coefficients . Ultimately, the model retains the set of features corresponding to non-zero coefficients as valid features. Let the solution of the feature coefficients be
, then the final selected feature set is defined as:
2.3. Secondary decomposition
2.3.1. ICEEMDAN decomposition.
ICEEMDAN is further optimized on the basis of CEEMDAN algorithm, and its core idea is to add noise to the original signal and extract the IMF at each step. it can effectively capture the essence of the sequence and reduce the volatility of the data to improve the prediction accuracy [41], the steps of ICEEMDAN can be summarized as follows.
Step 1. Introduce a set of white noise signals for each sample (where
denotes the number of the noise sample, and (
), and superimpose it with the original signal
superimposed. Form the added noise signal.
Where is the noise amplitude.
For each noise-added signal, EMD is used to extract, and calculate the average value of the first mode in all the added noise signals.
Step 2. For each subsequent modal function , ICEEMDAN is extracted by recursion.
The sum of the extracted modal functions is removed from the original signal to obtain the k-1th-order residual signal
. Then, a new white noise
is added to each
to form a new noise-added residual signal
.
Then, for each , the kth-order modal function
is extracted using EMD, and the kth-order modal mean of all the noise-added residuals is calculated to obtain the kth-order intrinsic modal function.
Step 3. When the residual becomes a monotonic function or meets a preset energy threshold, the decomposition process is terminated and the final residual part no longer has a significant oscillatory component. At this point, the signal
is represented as:
Where is the total number of modes obtained from the decomposition, and
is the residual term.
2.3.2. SDAE.
Stacked denoising autoencoder is a deep learning model for unsupervised learning. In this paper, stacked autoencoder is used for noise reduction of high frequency components after ICEEMDAN decomposition, the model consists of two layers of encoder and decoder, and the signal noise reduction is achieved by layer-by-layer dimensionality reduction and reconstruction.
Step 1. Input the matrix consisting of high frequency components IMF1 through IMF4 of size .
Layer 1 encoder: input 2340 dimensional data, dimensionality reduction to 1170, then to 585. Where is the weight matrix, and
is the bias vector
is the activation function Tanh.
Layer 1 decoder: reconstructs the 585-dimensional representation to a 2340-dimensional input size. Where is the transpose of the encoder’s weight matrix, and
is the bias vector of the decoder, and
is the activation function Tanh.
Layer 2 encoder: receives the output of the layer 1 decoder and once again reduces the 2340 dimensional data to 1170, and finally to 585. Where is the weight matrix of the second layer encoder, and
is the corresponding bias.
Layer 2 decoder: restore the 585-dimensional representation to 2340 dimensions to generate the final denoised signal.
Step 2. The input data is reduced in dimension by two layer encoder and restored by two layer decoder to extract the signal characteristics and remove noise. The forward propagation process is as follows:
Where denotes the input high frequency component containing noise, and
denotes the denoised signal after reconstruction.
step3. To realize the denoising effect, the model uses MSE as a loss function for calculating the error between the reconstructed signal and the target noise-free signal.
Where is the ith sample value in the target noise-free signal,
is the ith sample value in the denoised reconstructed signal, and
is the number of samples. During training, the loss value
is dynamically calculated for each iteration, and the Adam optimizer is used to minimize
and gradually update the network parameters
and
to reduce the reconstruction error.
2.3.3. VMD.
VMD is an adaptive signal processing method designed to decompose a complex signal into a set of IMFs with different center frequencies and narrowband characteristics, thereby effectively extracting features from the original time series [42]. Different from the traditional EMD, the VMD is based on the principle of bandwidth minimization and adopts a variational approach to decompose the signal, ensuring that the decomposed modes have good localization properties in the frequency domain.
The process of VMD is as follows:
Where is the kth modal function,
is the center frequency of the modal function
, and
is the composite function of the Dirac function and the Hilbert transform, which is used to obtain the analytic signal of the signal,
indicates the convolution operation, and
is the frequency shift factor, which shifts the modal function to near the center frequency
.
Where is a balance parameter that controls the trade-off between bandwidth minimization and reconstruction accuracy, and
is the Lagrange multiplier for imposing constraints on the decomposition.
VMD employs the alternating direction multiplier method (ADMM) to address the above optimization problem. ADMM gradually approaches the global optimal solution by separately optimizing the modal function and the center frequency
, as well as the Lagrange multiplier
, in each iteration. Fixing
and
, the update formula for the modal function
is solved by minimizing the partial derivative of the Lagrangian function with respect to each
.
Where denotes the inverse Fourier transform and
is the frequency domain representation of the modal function
.
Then, fixing and
, minimizing the partial derivative of the Lagrangian function with respect to
, we obtain an updated formula for the central frequency.
Finally the Lagrange multipliers are updated to progressively satisfy the constraints. Where is the step parameter, which is used to control the speed of convergence of the Lagrange multipliers.
The VMD algorithm, by iterating the above steps repeatedly, makes the modal function and center frequency
gradually converge to a steady state, thus obtaining a modal decomposition result that satisfies the bandwidth minimization condition.
2.3.4. CPO.
CPO is a biomimetic optimization algorithm that simulates the group defense behavior of the crown porcupines to achieve a global optimal solution search in the balance of exploration and development. The defense mechanism of the crown porcupines includes vision, sound, smell and physical attacks, among which vision and sound correspond to the exploration operator, and smell and physical attacks correspond to the development operator. In addition, CPO also introduces the cyclic population reduction technique to preserve population diversity and enhance the convergence speed [43].
Step1. In the initialization phase, the positions of the individuals are randomly distributed within the given upper and lower bounds, and the position of the th individual in the population is initialized as follows:
Where and
are the lower and upper bounds of the jth dimension respectively, and rand is a random number in the range [0,1].
Step2. In each iteration, the CPO updates the position of an individual through two mechanisms: exploration and exploitation.
The exploration phase simulates the defensive extension behavior of the crown porcupine with the individual position update equation.
Where is a coefficient that controls the exploration range,
is a random number used to increase the diversity of individuals,
is a random individual in the population, and
is the average of the current population position.
The development phase corresponds to the clustering behavior between individuals, and its position update formula is:
Where is the individual with optimal fitness in the current population, and
is the control parameter for the convergence step size, and
is a [0,1] random number.
Step 3. After updating the position, if the individual exceeds the boundary, the boundary reflection mechanism is used to keep it within the reasonable range.
Step 4. At the end of each iteration, the fitness of every individual in the population is assessed, and the prevailing optimal solution is retained as the global optimal solution of the population. If the new solution surpasses the global optimal solution, the global optimal solution is updated.
2.4. Optuna-Attention-LSTM predictive hybrid modeling
2.4.1. LSTM.
LSTM is an improved model of RNN. Compared with traditional RNN, LSTM significantly enhances the learning ability of long-term dependencies through a structured network design, and shows stronger predictive accuracy in tasks involving long sequence data [44]. The LSTM effectively controls the flow of information at each time step by introducing a structure with input gates, forget gates and output gates, as shown in Fig 1, enabling the model to retain important information in long sequence data and suppress cumulative errors. The computational process of the LSTM unit consists of four steps:
(1) Oblivion gate: based on the hidden state of the previous time step and the current input
, the output
of the forgetful gate is calculated to determine the part of the previous momentary cell state
that needs to be forgotten.
(2) Input gate: determines the importance of the current input . The input gate
and the candidate memory state
are calculated as follows:
(3) Updating the cell state: combining the outputs of the forgetting gate and the input gate, the cell state at the current moment is updated.
(4) Output gate: calculate the output gate , and based on the current cell state
update the hidden state
. Where
denotes the sigmoid activation function, and
denotes the element level multiplication.
2.4.2. Attention mechanism.
The Attention mechanism allocates weights to each component of the input sequence, allowing the model to concentrate on critical elements and enhancing its capacity to capture temporal or spatial relationships. The mechanism obtains weighted output by calculating the similarity between the vector , vector
, and vector
. The calculation formula is:
Where is the query matrix, and
is the key matrix, and
the value matrix, and
denotes the dimension of the key vector, which is used as a scaling factor to stabilize the range of values.
2.4.3. Optuna.
Optuna is an automated hyperparameter optimization framework. Its “define-by-run” feature allows users to flexibly define the hyperparameter space and search for the optimal hyperparameter combination in a parallelized manner. In addition, the framework enables users to query and document the hyperparameter values throughout the optimization process [45].
Optuna defines the hyperparameter optimization problem as the minimization of an objective function. In this framework, the goal is to identify a set of hyperparameters that minimizes the objective function
.
Where denotes the model performance metric determined by the set of hyperparameters
, and
denotes the combination of hyperparameters sampled from the defined parameter space. Optuna uses the Tree-structured Parzen Estimator algorithm to quickly converge to the optimal solution by dynamically adjusting the posterior probability distribution to guide the sampling of the next set of hyperparameters.
3. The proposed approach
This paper proposes a hybrid model that integrates comprehensive feature screening, an optimized quadratic decomposition framework, and the Optuna-Attention-LSTM prediction method.
- (1). In this paper, the ICEEMDAN model is first employed to decompose the carbon price time series into IMF sub-series of different frequencies, until decomposition to the residual. Furthermore, the g test reconstruction algorithm is used to differentiate between high and low frequencies in the IMF. To mitigate noise interference in high-frequency components, the SDAE method is employed for denoising. Subsequently, the denoised high-frequency components are combined with the low-frequency components following primary decomposition to reconstruct the signal.
- (2). Then the reconstructed signal is decomposed quadratically by the VMD method, and the CPO algorithm is introduced for parameter optimization by substituting the data fitting penalty alpha, the number of modes K, and the bandwidth penalty tau into the adaptive function, and the optimal solution is continuously iteratively updated to control the VMD, after that, VIMF1 and other components are obtained.
- (3). The feature engineering is mainly segmented into two phases: the first phase first summarizes the external factors affecting the carbon price based on previous research and adds several domestic industry factors, then screens different factors in each type by Boruta; the second phase uses Lasso regression to re-screen the features, and finally, by taking the intersection of the features screened in the two stages, the intersection of the two stages is taken to determine the model’s final input features.
- (4). Finally, the decomposed VIMF sequence and the screened external factors are predicted by LSTM model. Attention mechanism is introduced to allocate varying weights to different inputs during prediction, and optuna is used for hyperparameter optimization. Then, the predicted values of all components are added to derive the final prediction result. The overall model process is shown in Fig 2.
4. Empirical analysis
4.1. Data description
The data used in this study mainly includes data related to carbon prices and external factors influencing them. Data sources include the Choice financial terminal, the Wind database, and http://www.tanjiaoyi.com.
4.1.1. Closing price data.
The study used daily carbon prices in Hubei, Guangdong, and Shanghai from April 4, 2014 to September 30, 2024 as the experimental data, and the first 85% of the dataset was chosen as the training set and the last 15% as the test set. Fig 3 shows the trend of the original price series across the three carbon markets.
4.1.2. Influence factor data.
The model in this paper takes into consideration the factors influencing the carbon price, including structural and non-structural factors.
- Structural factors
The structural factors covered in this paper include environmental and climatic factors, energy factors, carbon market factors, exchange rate and monetary factors, domestic industry factors, and economic factors.
- (1). Environmental and climatic factors
In this paper, daily AQI, Daily maximum temperature and Daily minimum temperature are used to reflect the impact of environment and climate on carbon price. In particular, the relevant data of Guangdong province and Hubei province are replaced by AQI, maximum air temperature and minimum air temperature data of Guangzhou city and Wuhan city, respectively.
- (2). Energy factors
In this paper, the Brent crude oil futures settlement price, Daqing crude oil spot price, Thermal coal futures closing price, China coking coal composite index, and NYMEX natural gas closing price are selected to represent the impact of energy on the carbon price, in which the Thermal coal futures closing price comes from Zhengzhou commodity exchange.
- (3). Carbon market factors
In this paper, EUA futures closing price, EUA futures volume, GDEA volume, HBEA volume, and Shanghai emission allowance (SHEA) volume are chosen to represent the EU carbon market and the impact of internal factors of the carbon market on carbon price.
- (4). Exchange rates and monetary factors
In the paper, the USD to CNY exchange rate, the USD to EUR exchange rate, and the RMB index are chosen to reflect the influence of exchange rate and monetary factors on carbon prices in the three markets.
- (5). Domestic industry factors
In this paper, the Chemicals index, Textile index, Steel index, Energy index, Agricultural and sideline products index, Rubber and plastic index, Non-ferrous metals index, and Commodity price index are selected to reflect the impact of domestic industries on the carbon prices of the three markets.
- (6). Economic factors
In the paper, NASDAQ index, S&P 500 index, SSE composite index volume, SSE composite index closing price, CSI 300 index volume, CSI 300 index closing price, SHFE: Gold futures closing price, SHFE: Silver futures closing price, and BDI index are selected to react to the influence of economic factors on the price of carbon market.
- Unstructural factors
Unstructured indicators can capture market sentiment, public attention and behavioral changes that are difficult to be reflected in traditional structured data, making the information more comprehensive. This study uses Baidu search index as unstructured data, collecting daily search volumes for relevant keywords in Guangdong, Hubei, and Shanghai. Keywords include carbon, low carbon, low carbon economy, energy conservation, emission reduction, carbon sink, carbon emission, carbon trading, carbon emission trading, carbon footprint, carbon neutrality. Finally, the search volume of all keywords in the region is summed as the unstructured data for the region. This study selects data from the same period to match all variables.
4.1.3. Data preprocessing.
This paper first employs the XGBoost method to impute all missing values, using the complete features as input. By learning the complex nonlinear relationships among features, it effectively predicts missing data and significantly improves the accuracy of imputation. Subsequently, to eliminate the impact of different feature dimensions, normalisation is applied to scale all data to the [0, 1] range, standardising the feature scale. Next, to construct training samples suitable for predicting carbon emission allowance closing prices, a sliding window length of 10 is set, corresponding to two weeks of trading days. Finally, the processed data set is divided into a training set comprising the first 85% and a test set comprising the remaining 15% for model training and performance evaluation. During parameter tuning, we used a time series cross-validation method to divide the training set into five folds in chronological order. In each fold, the training set was always earlier than the validation set to match the actual scenario of time series modelling and avoid information leakage.
4.2. Measuring indicators
In order to assess the performance of the prediction model in predicting the carbon prices in Guangdong, Hubei and Shanghai, this paper calculates the following four different statistical metrics: MSE, RMSE, MAE and MAPE as shown in Table 1, where n is the total number of samples and is the ith sample of the true value, and
is the ith sample of the predicted value. Lower values of these metrics indicate reduced errors and better prediction accuracy.
4.3. Secondary decomposition process
4.3.1. ICEEMDAN decomposition and denoising.
This study employs the ICEEMDAN method to decompose the original carbon price series from three carbon markets into modal components of differing frequencies. As illustrated in Fig 4, the IMF1 sequence has the highest frequency, and the subsequent IMF frequency gradually decreases, and the fluctuations gradually level off. From the figure, it can be seen that the trend of the low-frequency components and the trend of the original series as a whole are roughly consistent, and the trend of the low-frequency components is more gentle.
Next, the g test is applied to differentiate the high-frequency and low-frequency components in the subsequence to improve the plausibility of sequence reconstruction. Table 2 shows the g values of the sub-sequences after the ICEEMDAN decomposition of the carbon prices in the three markets. For the Guangdong carbon market, it can be seen that the g-values of IMF1 to IMF3 are low, and the percentages are all below 10%, which are typical high-frequency components. While IMF4 belongs to a turning point and no longer behaves as a typical high-frequency component, the percentage is significantly higher than that of IMF3 but still lower than that of IMF6, which is a transition component and may contain part of the high-frequency signals, so it is also considered within the high-frequency component, and the rest are low-frequency components. For the Hubei carbon market, it can be seen that the g-values from IMF1 to IMF3 are lower, and IMF4 belongs to a turning point, which is a transition component, so it is also considered in the high-frequency component, and the rest are low-frequency components. For Shanghai carbon market, it can be seen that the g-values from IMF1 to IMF4 are low, and the percentages are all below 3%, then IMF1, IMF2, IMF3 and IMF4 are high-frequency components, and the rest are low-frequency components.
Next, a stacked auto-encoder is used to perform noise reduction on the high-frequency components IMF1 to IMF4, after which the noise-reduced IMF1, IMF2, IMF3, IMF4, and the low-frequency components are summed up to obtain the reconstructed signals of the primary decomposition.
4.3.2. Optimized VMD decomposition.
In order to effectively eliminate mode aliasing. The VMD decomposition is started after the primary decomposition, and CPO is introduced to optimize the alpha, tau, and K parameters in the decomposition process to determine the best decomposition parameters. Fig 5 shows the convergence curves of the three carbon markets, which all show the effectiveness of the CPO algorithm in application. The detailed parameter settings for CPO algorithm are shown in Table 3.
Finally, the optimal K value of Guangdong carbon market and Shanghai carbon market is determined to be 9, which is decomposed into 9 modal components, and the optimal K value of Hubei carbon market is 8, which is decomposed into 8 modal components. Fig 6 specifically shows the time domain waveforms of different modal components.The sensitivity analysis of VMD parameters for the three carbon markets is detailed in Tables 4–5, and Table 6.
Through the VMD decomposition, the raw signal of Hubei carbon market is decomposed into VIMF1 to VIMF8, where VIMF1 and VIMF2 correspond to the low-frequency trend of the signal, VIMF3 to VIMF5 capture the mid-frequency dynamics of the signal, and VIMF6 to VIMF8 separate the high-frequency oscillations and noise in the signal. The original signals of Guangdong carbon market and Shanghai carbon market are decomposed into VIMF1 to VIMF9, where VIMF1 to VIMF3 show the low-frequency trend components, VIMF3 to VIMF6 gradually capture the mid-frequency characteristics, and VIMF7 to VIMF9 mainly reflect the high-frequency components. Overall, the optimized VMD decomposition achieves an accurate separation of the signal in the time and frequency domains.
4.4. Screening of external factors
4.4.1. Comprehensive feature screening.
By selecting the top and stable elements, the Boruta algorithm identified 13 key factors in the Guangdong carbon market, including the CSI 300 index closing price, NASDAQ index, etc. By Lasso regression method, SSE composite index volume, SSE composite index closing price and other factors were screened in Guangdong carbon market. Finally, through the comprehensive feature screening by Boruta algorithm and Lasso regression, we determined the intersection of the selected feature variables of the two methods as shown in Fig 7. These commonly contained features are finally used as inputs to the forecasting model, including USD to CNY exchange rate, Daqing crude oil spot price, etc.
Hubei carbon market and Shanghai carbon market also screened the final input external factors through the above process as shown in Figs 8 and 9. The results of the final screening for the three carbon markets are detailed in Table 7.
4.5. Forecasting process
In prediction if the selected time step is too short, it may not be able to cover enough historical information, and if the time step is too long more noise may be introduced, so in this paper, we set up to predict the value of the next time point with the data of the previous 10 time steps and use a two-layer LSTM, the first layer of the LSTM is used to generate the hidden state of the time step, and the second layer is used to further extract the features, and we also add an attention mechanism to pay attention to the important part of the LSTM output sequence. LSTM units increase will enhance the model complexity, while the dropout rate can prevent the model complexity from being too high leading to overfitting, in order to balance the model’s learning ability and generalization ability, this paper adopts optuna to tune LSTM units and dropout rate, and set to carry out 20 experiments. The tuning results are shown in Figs 10, 11 and 12, and the specific parameters listed in Table 8.
Following this, the model is retrained using the optimized parameters and predictions are made on the test set. The prediction results of the three carbon markets are shown in Figs 13, 14 and 15, from which it can be observed that the prediction results are closer to the actual data, indicating the effectiveness of the model proposed in this paper.
Figs 16–17 and 18 show that for the prediction of low and high value regions, the model performs stably with an approximate normal distribution of errors and an average error close to 0, verifying the efficiency and reliability of the model.
4.6. Error evaluation and comparative analysis
In order to demonstrate the superiority of the proposed model, it was compared against eight benchmark models: LSTM, Attention-LSTM, Optuna-Attention-LSTM, ICEEMDAN-Optuna-Attention-LSTM, ICEEMDAN-VMD-Optuna-Attention-LSTM, ICEEMDAN-SDAE-VMD-CPO-Optuna-Attention-LSTM, Lasso-ICEEMDAN-SDAE-VMD-CPO-Optuna-Attention-LSTM, Features-ICEEMDAN-SDAE-VMD-CPO-Bayesian-Attention-LSTM. The model proposed in this study is Features-ICEEMDAN-SDAE-VMD-CPO-Optuna-Attention-LSTM. Attention-LSTM denotes the LSTM model that only incorporates the attention mechanism, and Optuna-Attention-LSTM performs Optuna parameter tuning of the LSTM on the basis of incorporating the attention mechanism, the ICEEMDAN-Optuna-Attention-LSTM adds a primary decomposition of the carbon price to the previous model, ICEEMDAN-VMD-Optuna-Attention-LSTM is a quadratic decomposition prediction hybrid model, ICEEMDAN-SDAE-VMD-CPO-Optuna-Attention-LSTM is the optimized quadratic decomposition prediction hybrid model, Lasso-ICEEMDAN-SDAE-VMD-CPO-Optuna-Attention-LSTM adds features selected by Lasso to the input of the previous model, while Features-ICEEMDAN-SDAE-VMD-CPO-Bayesian-Attention-LSTM adds features selected by comprehensive feature screening to the input, but uses Bayesian optimisation in the prediction process. As illustrated in Figs 19, 20 and 21, the prediction outcomes of the Features-ICEEMDAN-SDAE-VMD-CPO-Optuna-Attention-LSTM model closely align with actual values in most cases, indicating that the model has a strong prediction ability. In order to show the superiority of the model more intuitively, Tables 9–11 demonstrate the results of the prediction error assessment of various models on the test set. The results show that compared with the baseline model, the Features-ICEEMDAN-SDAE-VMD-CPO-Optuna-Attention-LSTM model has the smallest error values. In the three carbon markets, the four indicators of MSE, RMSE, MAE, and MAPE were reduced by 67.30%, 47.68%, 48.42%, and 48.79% on average, respectively. In addition, through gradual optimization, all stages of improvement measures significantly enhance the model’s prediction performance, and from the base model LSTM to the final model Features-ICEEMDAN-SDAE-VMD-CPO-Optuna-Attention-LSTM, the four error indicators showed a gradual downward trend in carbon price predictions for Guangdong, Hubei, and Shanghai, suggesting that the hybrid framework can significantly enhances model prediction accuracy through the synergistic effects of decomposition, noise reduction, parameter optimization, and fusion of external factors. It is further shown that the optimized quadratic decomposition prediction hybrid model with comprehensive feature screening proposed in this paper is reasonably effective in carbon price prediction, a finding consistent with the above conclusion.The parameter settings for the comparative experiments are detailed in Table 12.
In this comparative experiment, although the basic LSTM model can capture the temporal dependencies in time series, it lacks the ability to identify and focus on key information, resulting in limited prediction accuracy. After introducing the attention mechanism, the model can focus on important time step information in a targeted manner, better adapting to the uneven importance distribution characteristic of carbon emission allowance price fluctuations. Through hyperparameter optimisation using the optuna framework, the LSTM units and dropout rate can be further optimised to effectively suppress overfitting. On this basis, ICEEMDAN decomposition was used to decompose the carbon emission quota price sequence, effectively capturing different frequency components. Further noise reduction of the high-frequency components using SDAE better preserved key information. Subsequently, VMD was introduced to refine the signal decomposition and improve the accuracy of feature extraction. Next, features were selected using a comprehensive feature screening method that took into account both non-linear and linear features, thereby retaining key variables while effectively reducing the data dimension. Finally, the Features-ICEEMDAN-SDAE-VMD-CPO-Optuna-Attention-LSTM model demonstrates the best prediction performance.
4.7. Model interpretability analysis
We use SHAP analysis to explain the decision-making process in the model. SHAP Comprehensive charts clearly show the contribution of selected features to the model’s prediction results.
As shown in Figs 22, 23 and 24, In the Guangdong carbon market, the SHFE: Gold futures closing price and GD AQI index contribute the most to the prediction results, indicating that macroeconomic conditions and regional environmental status play a core role in carbon market pricing. Additionally, the Rubber and plastic index and Agricultural and sideline products index also hold significant importance. Furthermore, changes in the USD to CNY exchange rate and CSI 300 index closing price also influence the trend of carbon emission rights prices to some extent. In contrast, EUA futures closing price and Daqing crude oil spot price have a relatively minor impact on the Guangdong carbon market.
In the Hubei carbon market, the Energy index is the most critical influencing factor, contributing the most to fluctuations in carbon emission rights prices, reflecting the direct driving role of energy market changes on carbon market prices. Secondly, the EUA futures closing price and the Non-ferrous metals index also demonstrate significant influence, indicating that the performance of the international carbon trading market and industrial production activities have a significant linkage effect on local carbon emission rights prices. Additionally, the SSE composite index closing price also influences carbon emission rights prices to a certain extent. In contrast, the HBEA volume has a relatively minor direct impact on carbon emission rights prices.
In the Shanghai carbon market, the EUA futures closing price is the most important influencing factor, indicating that global carbon market changes have a direct and significant driving effect on Shanghai carbon market prices. The BDI also has high importance, reflecting the indirect link between international trade activity levels and carbon emission demand. The CSI 300 index closing price also exhibits strong influence, indicating that the macroeconomic environment plays a key role in the carbon emission rights market. Additionally, the Rubber and plastic index and Steel index also exert a certain influence on the forecast results. Overall, Shanghai carbon emission rights prices are jointly influenced by global carbon market dynamics and local economic conditions.
5. Conclusion
Accurately predicting carbon prices not only aids market participants make better decisions, but also promotes the stable evolution of the carbon market. Therefore, this study proposes an optimized quadratic decomposition forecasting hybrid model with comprehensive feature screening, combining the advantages of ICEEMDAN, SDAE, VMD, CPO, Optuna, Attention, and LSTM. The model integrates internal factors and external influences on carbon price in the final input, and innovatively incorporates domestic industry factors into the structural factors. Empirical validation in three carbon markets, Guangdong, Hubei and Shanghai, and comparison with the benchmark model show that the proposed model has superior forecasting performance. The findings of this paper are as follows: (1) The comprehensive feature screening approach not only combines the advantages of nonlinear and linear feature selection, but also improves the robustness, generalization ability and interpretability of carbon price forecasting. (2) Introducing external influences into the decomposition and reconstruction framework can capture the direct effects of external changes on carbon prices more accurately, thus improving the prediction accuracy of the model. (3) The optimized secondary decomposition technique can decompose the original signal of carbon price more thoroughly and remove the residual noise, which provides a stable and reliable foundation for the subsequent prediction model. (4) The Optuna-Attention-LSTM model proposed in this paper performs well, avoids problems such as insufficient weighting of key features, effectively solves the inefficiency of traditional tuning and meets the demand for automated tuning.
Although the predictive hybrid model proposed in this study performs well, it still has certain limitations. First, with the continuous development of subsequent models and algorithms, the predictive performance can be further improved by introducing more advanced methods. Second, the applicability of this model needs to be further expanded. This study only predicts the next time step of carbon prices and does not consider long-term trends. Additionally, the model framework does not account for the impact of policy changes or extreme weather events. Future research could incorporate economic policy uncertainty and climate policy uncertainty to explore their effects on model predictions. However, since policy uncertainty data is primarily monthly, it remains to be explored whether a mixed-frequency model is suitable for predicting daily carbon prices.
References
- 1. Rao A, Lucey B, Kumar S. Climate risk and carbon emissions: examining their impact on key energy markets through asymmetric spillovers. Energy Economics. 2023;126:106970.
- 2. Su CW, Wei S, Wang Y, Tao R. How does climate policy uncertainty affect the carbon market?. Technological Forecasting and Social Change. 2024;200:123155.
- 3. Lyu C, Scholtens B. Integration of the international carbon market: A time-varying analysis. Renewable and Sustainable Energy Reviews. 2024;191:114102.
- 4. Tang TH, Bao HX. Political uncertainty and carbon emission trading: Evidence from China. Cities. 2024;145:104713.
- 5. Yu L, Li C, Wang J, Sun H. A hybrid model for predicting the carbon price in Beijing: A pilot low-carbon city in China. Frontiers in Physics. 2024;12:1427794.
- 6. Zhao Y, Huang YH, Wang ZJ, Liu XF. Carbon futures price forecasting based on feature selection. Engineering Applications of Artificial Intelligence. 2024;135:108646.
- 7. Bai Y, Deng SY, Pu ZQ, Li C. Carbon price forecasting using leaky integrator echo state networks with the framework of decomposition- reconstruction-integration. Energy. 2024;305:132338.
- 8. Liu T, Guan XY, Wei YG, Xue S, Xu L. Impact of economic policy uncertainty on the volatility of China’s emission trading scheme pilots. Energy Economics. 2023;121:106626.
- 9. Niu HW, Liu TY. Forecasting the volatility of European Union allowance futures with macroeconomic variables using the GJR-GARCH-MIDAS model. Empirical Economics. 2024:1–22.
- 10. Zhang LX, Luo Q, Guo XZ, Muhammad U. Medium-term and long-term volatility forecasts for EUA futures with country-specific economic policy uncertainty indices. Resour Policy. 2022;77:102644.
- 11. Guo LL, Huang XY, Li YJ, Li HJ. Forecasting crude oil futures price using machine learning methods: Evidence from China. Energy Economics. 2023;127:107089.
- 12. Xu YY, Dai YF, Guo LL, Chen JJ. Leveraging machine learning to forecast carbon returns: Factors from energy markets. Applied Energy. 2024;357:122515.
- 13. Gong X, Li MJ, Guan KQ, Sun CW. Climate change attention and carbon futures return prediction. Journal of Futures Markets. 2023;43(9):1261–88.
- 14. Li H, Huang X, Zhou D, Cao A, Su M, Wang Y, et al. Forecasting Carbon Price in China: A Multimodel Comparison. Int J Environ Res Public Health. 2022;19(10):6217. pmid:35627753
- 15. Pan D, Zhang C, Zhu D, Hu S. Carbon price forecasting based on news text mining considering investor attention. Environ Sci Pollut Res Int. 2023;30(11):28704–17. pmid:36401005
- 16. Wang WC, Gu M, Li Z, Hong YH, Zang HF, Xu DM. A stacking ensemble machine learning model for improving monthly runoff prediction. Earth Science Informatics. 2025;18(1):120.
- 17. Huang YM, Dai XY, Wang QW, Zhou DQ. A hybrid model for carbon price forecasting using GARCH and long short-term memory network. Applied Energy. 2021;285:116485.
- 18. Wang YF, Qin L, Wang QR, Chen YQ, Yang Q, Xing L, et al. A novel deep learning carbon price short-term prediction model with dual-stage attention mechanism. Applied Energy. 2023;347:121380.
- 19. Sun W, Huang CC. A carbon price prediction model based on secondary decomposition algorithm and optimized back propagation neural network. Journal of Cleaner Production. 2020;243:118671.
- 20. Nadirgil O. Carbon price prediction using multiple hybrid machine learning models optimized by genetic algorithm. J Environ Manage. 2023;342:118061. pmid:37201388
- 21. Zhou FT, Huang ZH, Zhang CH. Carbon price forecasting based on CEEMDAN and LSTM. Applied Energy. 2022;311:118601.
- 22. Yang H, Yang XD, Li GH. Forecasting carbon price in China using a novel hybrid model based on secondary decomposition, multi-complexity and error correction. Journal of Cleaner Production. 2023;401:136701.
- 23. Xu DM, Wang X, Wang WC, Chau KW, Zang HF. Improved monthly runoff time series prediction using the SOA-SVM model based on ICEEMDAN-WD decomposition. Journal of Hydroinformatics. 2023;25(3):943–70.
- 24. Sadefo KJ, Njong BJ. Time-frequency analysis and machine learning models for carbon market forecasting. Annals of Operations Research. 2023;:1–20.
- 25. Zhu Y, Chen Y, Hua Q, Wang J, Guo Y, Li Z, et al. A hybrid model for carbon price forecasting based on improved feature extraction and non-linear integration. Mathematics. 2024;12(10):1428.
- 26. Li JM, Liu DH. Carbon price forecasting based on secondary decomposition and feature screening. Energy. 2023;278:127783.
- 27. Huang Y, He Z. Carbon price forecasting with optimization prediction method based on unstructured combination. Sci Total Environ. 2020;725:138350. pmid:32304964
- 28. Zhang X, Zong Y, Du P, Wang S, Wang J. Framework for multivariate carbon price forecasting: A novel hybrid model. J Environ Manage. 2024;369:122275. pmid:39217908
- 29. Li YY, Wang ZC, Liu SY. Enhance carbon emission prediction using bidirectional long short-term memory model based on text-based and data-driven multimodal information fusion. Journal of Cleaner Production. 2024;471:143301.
- 30. Feng MD, Duan YH, Wang X, Zhang JY, Ma LL. Carbon price prediction based on decomposition technique and extreme gradient boosting optimized by the grey wolf optimizer algorithm. Scientific Reports. 2023;13(1):18447.
- 31. Zhang KF, Yang XL, Wang T, Jesse T, Tan ZC, Yu HS. Multi-step carbon price forecasting using a hybrid model based on multivariate decomposition strategy and deep learning algorithms. Journal of Cleaner Production. 2023;405:136959.
- 32. Mao Y, Yu X. A hybrid forecasting approach for China’s national carbon emission allowance prices with balanced accuracy and interpretability. J Environ Manage. 2024;351:119873. pmid:38159311
- 33. Zhang W, Wu ZB, Zeng XJ, Zhu CH. An ensemble dynamic self-learning model for multiscale carbon price forecasting. Energy. 2023;263:125820.
- 34. Hu B, Cheng Y. Predicting regional carbon price in China based on multi-factor HKELM by combining secondary decomposition and ensemble learning. PLoS One. 2023;18(12):e0285311. pmid:38085727
- 35. Wang BL, Wang ZC, Yao ZY. Enhancing carbon price point-interval multi-step-ahead prediction using a hybrid framework of autoformer and extreme learning machine with multi-factors. Expert Systems with Applications. 2025;126467.
- 36. Wang JJ, Dong J, Zhang X, Li YN. A multiple feature fusion-based intelligent optimization ensemble model for carbon price forecasting. Process Safety and Environmental Protection. 2024;187:1558–75.
- 37. Zheng G, Li K, Yue X, Zhang Y. A multifactor hybrid model for carbon price interval prediction based on decomposition-integration framework. J Environ Manage. 2024;363:121273. pmid:38850918
- 38. Mir AA, Kearfott KJ, Çelebi FV, Rafique M. Imputation by feature importance (IBFI): A methodology to envelop machine learning method for imputing missing patterns in time series data. PLoS One. 2022;17(1):e0262131. pmid:35025953
- 39. Ghosh I, Chaudhuri TD, Alfaro-Cortés E, Gámez M, García N. A hybrid approach to forecasting futures prices with simultaneous consideration of optimality in ensemble feature selection and advanced artificial intelligence. Technological Forecasting and Social Change. 2022;181:121757.
- 40. Pabuccu H, Barbu A. Feature selection with annealing for forecasting financial time series. Financial Innovation. 2024;10(1):87.
- 41. Liang YH, Lin Y, Lu Q. Forecasting gold price using a novel hybrid model with ICEEMDAN and LSTM-CNN-CBAM. Expert Systems with Applications. 2022;206:117847.
- 42. Zhao Y, Zhang WG, Gong X, Liu XF. Carbon futures return forecasting: a novel method based on decomposition-ensemble strategy and Markov process. Applied Soft Computing. 2024;163:111869.
- 43. Abdel-Basset M, Mohamed R, Abouhawwash M. Crested porcupine optimizer: A new nature-inspired metaheuristic. Knowledge-Based Systems. 2024;284:111257.
- 44. Zhan Z, Kim SK. Versatile time-window sliding machine learning techniques for stock market forecasting. Artificial Intelligence Review. 2024;57(8):209.
- 45. Chen CH, Lin YL, Pai PF. Forecasting flower prices by long short-term memory model with optuna. Electronics. 2024;13(18):3646.