Carbon price prediction based on a scaled PCA approach

Xiaolu Wei; Hongbing Ouyang

doi:10.1371/journal.pone.0296105

Abstract

Carbon price prediction is of great importance to regulators and participants in the carbon trading market. It is the basis for developing policies related to the carbon trading market and for stabilizing that market. Given the numerous factors that influence carbon prices in China, dimensionality reduction is needed to improve prediction accuracy and efficiency. However, traditional dimensionality reduction methods do not fully consider the role of the influencing factors, which limits their usefulness. In this paper, a new dimensionality reduction method, namely scaled principal component analysis (s-PCA), is employed to improve the prediction accuracy of carbon prices. Firstly, a library of factors that influence carbon prices is constructed from three perspectives: technical indicators, financial indicators and commodities indicators. Then, the s-PCA method is used to reduce the dimensionality of the factors influencing the carbon price. Next, two different methods are used to predict carbon prices: a traditional regression method and a Long Short-Term Memory (LSTM) method. Finally, the economic value of the s-PCA method is examined by constructing investment portfolios. The empirical results for the Hubei Emissions Exchange show that the s-PCA model outperforms the other competing models both in and out of sample. In addition, the LSTM model could improve the performance of the s-PCA model in carbon price prediction. From a market timing perspective, investors can achieve a greater return and a larger Sharpe ratio using the s-PCA method than using the other comparative methods and the buy-and-hold strategy. Therefore, the s-PCA method is effective and robust in predicting carbon prices.

Citation: Wei X, Ouyang H (2024) Carbon price prediction based on a scaled PCA approach. PLoS ONE 19(1): e0296105. https://doi.org/10.1371/journal.pone.0296105

Editor: Sathishkumar Veerappampalayam Easwaramoorthy, Sunway University, MALAYSIA

Received: September 15, 2023; Accepted: December 5, 2023; Published: January 2, 2024

Copyright: © 2024 Wei, Ouyang. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: The data for technical indicators used in this paper were obtained by looking up the website http://k.tanjiaoyi.com, which have been provided in a Supporting Information file. The data for financial indicators and commodity indicators used in this paper were collected through Wind Platform, which is a third party for finance data collection and analysis. Therefore, the authors have no right to share this part of data. The researchers who are interested in the confidential data could subscribe to the Wind platform and get access to the relevant data through the link https://www.wind.com.cn/ (contact number: +86 400-820-9463), from which the authors obtained the data.

Funding: This work was supported by the China Postdoctoral Science Foundation (Grant Number: 2020M682378) awarded to Xiaolu Wei; the Modern Economics Research Centre, Huazhong University of Science and Technology and the Huazhong University of Science and Technology Double First-Class Funds for Humanities and Social Sciences (Grant Number: 2018WKZDJC004) awarded to Hongbing Ouyang for the project “Application of Artificial Intelligence in Financial Decisions”

Competing interests: The authors have declared that no competing interests exist.

Introduction

Global warming and glacial melting caused by carbon emissions are becoming prominent problems, which seriously threaten human food supply and living environment. China, as the largest carbon emitter, plays an essential role in global climate change and is under increasing pressure to control carbon emissions due to its rapid economic growth. Therefore, China comprehensively promotes energy transformation and accelerates the construction of a clean and low-carbon modern energy system. Since 2011, China has established seven major carbon emission exchanges, including the Guangzhou Carbon Emissions Exchange, Shenzhen Emissions Exchange, Beijing Environmental Exchange, Shanghai Environmental Energy Exchange, Hubei Carbon Emissions Exchange, Tianjin Emissions Exchange and Chongqing Carbon Emissions Exchange, which are considered to be important measures to regulate the allocation of carbon emissions and ease the pressure on carbon emissions. With the development of carbon emission trading markets, carbon emission trading in China has gradually become active. Carbon price prediction is increasingly important to understand the development of China’s carbon market and to make decisions about carbon reduction. Therefore, it is essential to choose the appropriate method to improve the accuracy of predicting carbon price.

As the carbon trading market has matured, research on carbon price prediction has increased sharply at home and abroad. The models for predicting carbon price mainly include the generalized autoregressive conditional heteroscedasticity (GARCH) model [1], the least square support vector machine (LSSVM) model [2], the long short-term memory (LSTM) model [3], the empirical mode decomposition (EMD) model [4], the extreme learning machine (ELM) model [5] and ensemble learning methods [6,7]. However, studies on carbon price prediction still have some shortcomings: (a) Most of the existing studies predict carbon prices based on technical indicators and ignore the influence of other factors, which has certain limitations. Therefore, this paper predicts carbon prices by constructing a factor library containing technical indicators, financial indicators and commodities indicators. (b) Because a large number of factors affect the carbon price, dimensionality reduction methods, which can reduce these factors to a few combinations, are very useful. However, one recognized weakness of traditional dimensionality reduction methods, such as principal component analysis (PCA), is that they ignore the target information completely. In this paper, we employ the scaled principal component analysis (s-PCA) approach proposed by Huang et al. [8] to predict carbon prices. The s-PCA approach is a variant of the PCA approach that further incorporates supervised learning. Specifically, before extracting diffusion indexes, the s-PCA approach uses the regression coefficient of the prediction target on each predictor to scale the corresponding predictor. Therefore, the s-PCA approach has the potential to improve predictability by considering the target information in the process of dimensionality reduction. (c) Most of the existing studies have explored the effectiveness of dimensionality reduction methods through linear regression models. However, the results of these studies are not robust due to the non-linear and non-stationary nature of the carbon price series. In this paper, we use both a linear regression model and an LSTM model to further explore the effectiveness of the s-PCA approach in carbon price prediction. (d) Most of the existing studies focus on the European carbon allowance (EUA) and rarely analyze China's carbon trading market. Given this, this paper conducts an empirical study on carbon price prediction in China.

In this paper, we investigate the carbon price predictability of the s-PCA model based on a factor library containing technical indicators, financial indicators and commodities indicators. The carbon price data of Hubei Carbon Emissions Exchange is used as the research object for empirical study. To demonstrate the superiority of the s-PCA model, it is compared with the PCA model and PLS model. The main contributions and innovations of this paper are listed below:

The s-PCA model is applied to predict carbon price. The diffusion indexes obtained from s-PCA are adopted to predict carbon price, which improves the computational efficiency and prediction accuracy.
We construct a factor library of carbon prices, including technical indicators, financial indicators and commodities indicators, to further improve the interpretability of carbon price prediction.
Combining the s-PCA model with linear regression method and LSTM model, two hybrid models of carbon price prediction are proposed in this paper, which might provide a new idea to test the effectiveness of the dimensionality reduction method, and effectively make up for the shortcomings of carbon price prediction based on linear regression models.
Hubei Carbon Emissions Exchange is used as the research sample for empirical analysis, while two other related models are used for performance comparison to prove the effectiveness of the s-PCA model. In addition, the evaluation metrics used in this paper include R2, RMSE, MAE, and DM.

The remainder of this paper is structured as follows: Section 2 describes the methods used in this paper. Section 3 provides the empirical results and discussion based on the carbon price data of the Hubei Emission Exchange. Section 4 presents a series of robustness tests to verify the prediction performance of the s-PCA model. Section 5 describes the results of market timing, which confirm the economic value of the s-PCA model employed in this paper. Finally, Section 6 provides the conclusions and future work.

Data and methodology

Data

Carbon prices.

To accelerate progress towards the emission peak and carbon neutrality, China has established eight carbon emissions trading pilots, including Shenzhen, Guangdong, Hubei, Tianjin, Shanghai, Chongqing, Beijing and Fujian. Among them, the Hubei Carbon Emissions Exchange has the largest trading scale and the highest market participation. The cumulative volume of carbon emission allowances (CEA) traded in the Hubei Carbon Emissions Exchange has reached 360 million tons, with a cumulative turnover of 8.7 billion yuan. In addition, there are a total of 332 emission control enterprises in the Hubei Carbon Emissions Exchange, accounting for more than 70% of the secondary industry's total value. Given its value for in-depth research, the Hubei Carbon Emissions Exchange is chosen as the research object for empirical analysis in this paper. Table 1 shows the statistical descriptions of the carbon price. Fig 1 depicts the general trend of the carbon price in the Hubei Carbon Emissions Exchange. It can be seen that the carbon price of Hubei is highly non-linear and volatile.

Fig 1. The carbon price of Hubei.

https://doi.org/10.1371/journal.pone.0296105.g001

Table 1. Statistical descriptions of the carbon price in Hubei.

https://doi.org/10.1371/journal.pone.0296105.t001

We collect the daily carbon price data of the Hubei Carbon Emissions Exchange from http://k.tanjiaoyi.com/. The data cover the sample period from April 28, 2014 to March 23, 2022. Observations with zero transaction volume are removed from the sample. The cleaned sample is divided into three parts. The first part is the training set (60% of the sample), which is used to train the prediction model. The second part is the validation set (20% of the sample), which is used to tune hyperparameters. The third part is the testing set (the remaining 20% of the sample), which is used to evaluate the performance of the prediction model.
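The cleaning and splitting steps can be sketched as follows. This is a minimal illustration, assuming the split is chronological (earliest observations for training, latest for testing), which the text does not state explicitly; the function name `clean_and_split` and the toy arrays are hypothetical.

```python
import numpy as np

def clean_and_split(price, volume, train_frac=0.6, val_frac=0.2):
    """Drop zero-volume days, then split in time order (assumed convention)."""
    price = np.asarray(price, dtype=float)
    volume = np.asarray(volume, dtype=float)
    kept = price[volume > 0]                 # remove days without transactions
    n = len(kept)
    n_train = int(round(n * train_frac))
    n_val = int(round(n * val_frac))
    train = kept[:n_train]
    val = kept[n_train:n_train + n_val]
    test = kept[n_train + n_val:]            # remaining observations
    return train, val, test

# Toy data: 12 days, two of them with zero transaction volume.
price = np.linspace(20.0, 40.0, 12)
volume = np.array([5, 0, 3, 8, 0, 1, 2, 4, 6, 7, 9, 1])
train, val, test = clean_and_split(price, volume)
```

A chronological split avoids using future prices to tune a model that is later evaluated on earlier dates.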

Indicators selection.

Based on an extensive review of the literature, this paper finds that the factors affecting carbon prices fall mainly into three categories: technical factors, financial factors, and commodity factors. Therefore, this paper employs 71 technical indicators, 13 financial indicators and 25 commodity indicators to predict the carbon price. The data for these indicators are collected from the Wind platform and other official websites.

Specifically, the 71 technical indicators are based on five popular technical strategies. The first strategy is the momentum (MOM) rule, which constructs a buy or sell signal by comparing the current carbon price and the price k days ago, (2.1) where P_t denotes the carbon price for day t. Following Wang et al. [9], we analyze MOM technical indicators with k=1, 3, 6, 9, 12.

The second strategy is the filtering (FR) rule, in which a buy or sell signal is given by (2.2) (2.3) where we use ten FR indicators with μ = 5, 10 and k=1, 3, 6, 9, 12.

The third strategy is the moving average (MA) rule, which compares two moving average values and generates a trading signal at the end of day t, (2.4) where (2.5)

In this paper, we use six MA indicators with s = 1, 3, 6 and l = 9, 12.

The fourth strategy is the oscillator (OSLT) rule, in which a buy or sell signal is produced by (2.6) (2.7) where (2.8)

Up denotes the magnitude of the upward stock price movement over k days, Down denotes the magnitude of the downward stock price movement over k days, and Up+Down denotes the total magnitude of the stock price movement over the period. We use ten OSLT indicators with μ = 5, 10 and k = 1, 3, 6, 9, 12.

The fifth strategy is the support resistance (SR) rule, where a trading signal is given by (2.9) (2.10)

In this study, we analyze ten SR indicators with μ = 5, 10 and k = 1, 3, 6, 9, 12.
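Two of the five rules above, MOM and MA, can be sketched in code. This is an illustration under stated assumptions rather than the exact definitions used in this paper: the signal is assumed to be 1 (buy) when the current price, or the short moving average, is at least as large as its comparison value and 0 (sell) otherwise, and the function names `mom_signal`, `moving_average` and `ma_signal` are hypothetical.

```python
import numpy as np

def mom_signal(price, k):
    """Assumed MOM rule: 1 if P_t >= P_{t-k}, else 0; NaN for the first k days."""
    price = np.asarray(price, dtype=float)
    sig = np.full(price.shape, np.nan)
    sig[k:] = (price[k:] >= price[:-k]).astype(float)
    return sig

def moving_average(price, window):
    """Simple trailing moving average; NaN until a full window is available."""
    price = np.asarray(price, dtype=float)
    out = np.full(price.shape, np.nan)
    csum = np.cumsum(np.insert(price, 0, 0.0))
    out[window - 1:] = (csum[window:] - csum[:-window]) / window
    return out

def ma_signal(price, s, l):
    """Assumed MA rule: 1 if the short MA (length s) >= the long MA (length l), else 0."""
    ma_s = moving_average(price, s)
    ma_l = moving_average(price, l)
    sig = np.full(len(ma_s), np.nan)
    valid = ~np.isnan(ma_l)                  # the long MA needs l observations
    sig[valid] = (ma_s[valid] >= ma_l[valid]).astype(float)
    return sig

price = [10, 11, 12, 11, 10, 9]
mom_1 = mom_signal(price, k=1)
ma_1_3 = ma_signal(price, s=1, l=3)
```

The FR, OSLT and SR rules additionally depend on the threshold μ and are not sketched here.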

Moreover, the 13 financial indicators and 25 commodity indicators are selected from previous typical literature where these indicators show considerable predictive power in carbon price forecasting. The 13 financial indicators include secondary market interest rates for 3-month Treasury bill, 10-year national bond rate [9], S&P 500 index, Dow Jones Composite Index, Shanghai Composite Index, Shenzhen Composite Index, 5-Year Bond Index Yield, WilderHill New Energy Global Innovation Index (NEX), WilderHill Clean Energy Index (CEI) [10], AAA-rated corporate bond spreads, daily spread of 1-year Treasury bill and 10-year government bond [11], USD/CNY and China’s Economic Policy Uncertainty Index [12].

The 25 commodity indicators include ICE-UK natural gas continuous futures price (UKGP), Asia gas price (JKM), S&P GSCI Gas oil index excess return (GGO) [10], ICE-coal Rotterdam continuous futures price (GP), ICE-Brent crude oil continuous futures price (BOP) [13], S&P GSCI Crude oil index excess return (GCO), EUA price, China Electricity Price index and 17 S&P GSCI non-energy commodity indexes (GGOL, GSIL, GALU, GCOP, GLEA, GNIC, GZIN, GCOC, Gcof, Gcor, GCOT, Gsoy, Gsug, Gwhe, GFC, GLH, GLC) [14].

In addition, it should be emphasized that the collection and analysis methods complied with the terms and conditions of all data sources.

Methodology

PCA.

Principal component analysis (PCA) was first introduced for non-random variables by Pearson (1901), and then extended to random vectors [15]. Nowadays, it is the most widely used dimension-reduction method [16].

PCA is an algorithm that transforms the columns of a dataset into a new set of features called principal components, which contain fewer variables and retain as much information about the original variables as possible. Specifically, the PCA extracts diffusion indexes as a weighted sum of predictors X_i,t, which can be expressed as follows: (2.11) With the PCA diffusion indexes, we can predict the target as: (2.12)

In this way, a large chunk of the information across the full dataset is effectively compressed into fewer feature columns. This allows for dimensionality reduction and the ability to visualize the separation of classes or clusters if any. However, PCA is an unsupervised learning technique that ignores the prediction target in the prediction process. Therefore, the forecasting result of PCA may not be stable. In the extreme cases, when factors are strong, PCA cannot distinguish between the target-relevant and irrelevant latent factors. When the factors are weak, PCA could fail to extract the signals from the large amount of noise, resulting in biased forecasts when all factors are used [8].
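The PCA forecasting procedure described above can be sketched as follows. This is a minimal illustration on synthetic data, not the authors' implementation: the function names `pca_factors` and `predictive_ols`, the one-step horizon and the simulated single-factor data are all assumptions.

```python
import numpy as np

def pca_factors(X, n_factors=1):
    """Diffusion indexes: first principal components of the standardized T x N panel."""
    Z = (X - X.mean(axis=0)) / X.std(axis=0)
    _, _, Vt = np.linalg.svd(Z, full_matrices=False)
    return Z @ Vt[:n_factors].T              # weighted sums of the predictors

def predictive_ols(F, y, h=1):
    """OLS of y_{t+h} on a constant and F_t; returns [intercept, slopes...]."""
    A = np.column_stack([np.ones(len(F) - h), F[:-h]])
    coef, *_ = np.linalg.lstsq(A, y[h:], rcond=None)
    return coef

# Synthetic example: one latent factor drives all predictors and next-day y.
rng = np.random.default_rng(0)
T, N = 400, 20
f = rng.standard_normal(T)
X = np.outer(f, rng.uniform(0.5, 1.5, N)) + 0.5 * rng.standard_normal((T, N))
y = np.zeros(T)
y[1:] = 2.0 * f[:-1] + 0.3 * rng.standard_normal(T - 1)

F = pca_factors(X)
alpha, beta = predictive_ols(F, y)
y_hat = alpha + beta * F[:-1, 0]             # fitted values for y[1:]
```

Note that the target y never enters `pca_factors`, which is the unsupervised property discussed above.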

s-PCA.

The scaled PCA (s-PCA) is a novel dimensionality reduction method proposed by Huang et al. [8], which modifies the traditional PCA by considering the prediction target. In particular, the s-PCA tends to down-weight predictors with weak forecasting power and overweight those with strong forecasting power. As a result, the s-PCA could overcome the deficiencies of PCA by identifying predictors that are particularly useful for predicting the target, and thereby obtain more significant forecasts.

Specifically, the s-PCA extracts diffusion indexes in two steps. In the first step, we develop a panel of scaled predictors, (γ_1X_1,t,⋯,γ_NX_N,t), where the scaling coefficient γ_i denotes the estimated slope obtained by regressing the prediction target y_t+h on the corresponding (standardized) indicator X_i,t: (2.13)

In the second step, we apply PCA to the scaled predictors to extract s-PCA diffusion indexes as the new predictors: (2.14)

Finally, we could predict the target using the s-PCA diffusion indexes as: (2.15)

Because the target y_t+h depends on the predictors instead of the loadings, the s-PCA method has a large chance to outperform the PCA method, especially when all factors are used. Kelly et al. [17], Pelger [18], Gu et al. [19], Lettau and Pelger [20,21] applied similar methods and demonstrated that the s-PCA can yield satisfactory results in various areas.
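The two s-PCA steps can be sketched as below. This is an illustrative sketch under stated assumptions, not the authors' code: the function name `spca_factors` and the synthetic data (5 predictors driven by a target-relevant factor f, 15 driven by an irrelevant factor g) are hypothetical, and the scaling slopes are estimated on the full sample for simplicity.

```python
import numpy as np

def spca_factors(X, y, h=1, n_factors=1):
    """Scaled PCA: scale each standardized predictor by its predictive slope, then run PCA."""
    Z = (X - X.mean(axis=0)) / X.std(axis=0)
    Z_lag, y_lead = Z[:-h], y[h:]
    # Step 1: slope gamma_i from regressing y_{t+h} on predictor i (with intercept).
    zc = Z_lag - Z_lag.mean(axis=0)
    yc = y_lead - y_lead.mean()
    gamma = (zc * yc[:, None]).sum(axis=0) / (zc ** 2).sum(axis=0)
    # Step 2: PCA on the scaled panel (gamma_1 X_1, ..., gamma_N X_N).
    scaled = Z * gamma
    scaled = scaled - scaled.mean(axis=0)
    _, _, Vt = np.linalg.svd(scaled, full_matrices=False)
    return scaled @ Vt[:n_factors].T, gamma

rng = np.random.default_rng(1)
T = 500
f = rng.standard_normal(T)                   # factor that predicts y
g = rng.standard_normal(T)                   # factor unrelated to y
X = np.column_stack([
    f[:, None] + 0.5 * rng.standard_normal((T, 5)),
    g[:, None] + 0.5 * rng.standard_normal((T, 15)),
])
y = np.zeros(T)
y[1:] = f[:-1] + 0.5 * rng.standard_normal(T - 1)

F_spca, gamma = spca_factors(X, y)
corr_spca = abs(np.corrcoef(F_spca[:, 0], f)[0, 1])
```

In this toy setting the irrelevant factor g dominates the raw panel, yet the scaling step shrinks its columns so that the first s-PCA diffusion index tracks f.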

PLS.

Like the s-PCA, partial least squares (PLS) is a supervised learning method that uses the prediction target to discipline its dimension reduction [22–24].

Specifically, the PLS method extracts diffusion indexes in two steps. In the first step, we regress each predictor (X_1,t,⋯,X_N,t) on the prediction target: (2.16)

In the second step, we extract PLS diffusion indexes by running a time-series regression for each predictor (X_1,t,⋯,X_N,t) on the corresponding coefficient estimated in Eq (2.16): (2.17)

Finally, we could predict the target using the PLS diffusion indexes estimated in Eq (2.17): (2.18)

The PLS method ties the dependent variable closely to the independent variables and thus may achieve satisfactory results. Kelly and Pruitt [23] and Light et al. [25] found that the PLS method exhibited strong forecasting power even with relatively small data.
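A two-pass construction of a single PLS diffusion index can be sketched as follows. This is a simplified sketch and an assumption about how Eqs (2.16) and (2.17) might be implemented: the second pass is written here as a period-by-period regression of the predictors on the first-pass slopes without an intercept, and the name `pls_factor` and the synthetic data are hypothetical.

```python
import numpy as np

def pls_factor(X, y, h=1):
    """Two-pass PLS sketch: returns one diffusion index and the first-pass slopes."""
    Z = (X - X.mean(axis=0)) / X.std(axis=0)
    Z_lag, y_lead = Z[:-h], y[h:]
    zc = Z_lag - Z_lag.mean(axis=0)
    yc = y_lead - y_lead.mean()
    # Pass 1: slope phi_i from regressing predictor i on the target y_{t+h}.
    phi = (zc * yc[:, None]).sum(axis=0) / (yc ** 2).sum()
    # Pass 2: for each t, slope from regressing X_{i,t} on phi_i across predictors.
    F = Z @ phi / (phi ** 2).sum()
    return F, phi

rng = np.random.default_rng(2)
T = 1000
f = rng.standard_normal(T)                   # factor that predicts y
g = rng.standard_normal(T)                   # factor unrelated to y
X = np.column_stack([
    f[:, None] + 0.5 * rng.standard_normal((T, 5)),
    g[:, None] + 0.5 * rng.standard_normal((T, 15)),
])
y = np.zeros(T)
y[1:] = f[:-1] + 0.5 * rng.standard_normal(T - 1)

F_pls, phi = pls_factor(X, y)
corr_pls = abs(np.corrcoef(F_pls, f)[0, 1])
```

As with the s-PCA, predictors unrelated to the target receive first-pass slopes close to zero and contribute little to the extracted index.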

LSTM.

As a special form of recurrent neural network (RNN), the long short-term memory (LSTM) neural network is able to handle the long-term dependence of time-series data well. The LSTM neural network structure contains a series of recurrently connected sub-networks (i.e., memory modules), each of which contains one or more self-connecting cells, as well as a system of three gating units controlling information flow (input gate, output gate, and forget gate). Specifically, the execution steps in an LSTM network can be summarized as follows:

Firstly, determine the information that needs to be extracted from the cell through the forget gate (f_t): (2.19) where σ is a sigmoid activation function that sets the information flow weight to a value between 0 and 1. A value of 0 means that the information is completely deleted and a value of 1 means that all information is retained. x_t is the current input vector and h_t is the current hidden layer vector. b_f, W_f, and U_f are the bias, input weights, and recurrent weights of the forget gate, respectively.

Next, update the state of information in the cell. Let g_t be an input gate between 0 and 1 controlled by the sigmoid activation function: (2.20)

Then the updated cell state C_t on the basis of C_t−1 is: (2.21)

Lastly, the message output controlled by the output gate o_t is: (2.22) whereby the detailed output gate controlled by sigmoid activation function is: (2.23)

In conclusion, LSTM model contains not only the external loop between the hidden layer cells involved in RNN, but also the self-loop within the cells. Because of this special structure, LSTM model can reflect the nonlinearity of the financial time series data and the complex interactions between features. Therefore, LSTM model may have higher prediction accuracy compared to traditional econometric models and other machine learning algorithms.
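The gate computations summarized above can be illustrated with a single forward step of an LSTM cell. This sketch uses the common textbook formulation with a tanh candidate state, which may differ in detail from Eqs (2.19) to (2.23); apart from b_f, W_f and U_f, the parameter names (`b_g`, `W_g`, `U_g`, `b_c`, `W_c`, `U_c`, `b_o`, `W_o`, `U_o`) and the helper `init_params` are hypothetical.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x_t, h_prev, c_prev, p):
    """One LSTM step: returns the new hidden state h_t and cell state c_t."""
    f_t = sigmoid(p["b_f"] + p["W_f"] @ x_t + p["U_f"] @ h_prev)   # forget gate
    g_t = sigmoid(p["b_g"] + p["W_g"] @ x_t + p["U_g"] @ h_prev)   # input gate
    cand = np.tanh(p["b_c"] + p["W_c"] @ x_t + p["U_c"] @ h_prev)  # candidate state
    c_t = f_t * c_prev + g_t * cand                                # cell update
    o_t = sigmoid(p["b_o"] + p["W_o"] @ x_t + p["U_o"] @ h_prev)   # output gate
    h_t = o_t * np.tanh(c_t)                                       # gated output
    return h_t, c_t

def init_params(n_in, n_hidden, rng):
    """Small random weights and zero biases for the four blocks (f, g, c, o)."""
    p = {}
    for name in "fgco":
        p["b_" + name] = np.zeros(n_hidden)
        p["W_" + name] = 0.1 * rng.standard_normal((n_hidden, n_in))
        p["U_" + name] = 0.1 * rng.standard_normal((n_hidden, n_hidden))
    return p

rng = np.random.default_rng(0)
params = init_params(n_in=3, n_hidden=4, rng=rng)
h, c = np.zeros(4), np.zeros(4)
for x_t in rng.standard_normal((10, 3)):     # run over a short input sequence
    h, c = lstm_step(x_t, h, c, params)
```

In practice the weights would be learned by backpropagation through time in a deep learning library; the step function only shows how the three gates control the cell state.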

Empirical analysis

Model evaluation metrics

The coefficient of determination (R2), the root mean square error (RMSE) and the mean absolute error (MAE) are three widely used metrics to evaluate the performance of the prediction model. Among them, R2 indicates the degree of fitting to the actual value, whose value ranges between 0 and 1. Models with R2 values closer to 1 perform better. RMSE indicates the deviation between the predicted value and the true value. MAE measures the average absolute error between the predicted value and the true value. The smaller the RMSE and MAE values are, the better the performance of the prediction model is. The three evaluation metrics are calculated as follows:

$$R^2 = 1 - \frac{\sum_{t=1}^{N}(y_t - \hat{y}_t)^2}{\sum_{t=1}^{N}(y_t - \bar{y})^2} \tag{3.1}$$

$$RMSE = \sqrt{\frac{1}{N}\sum_{t=1}^{N}(y_t - \hat{y}_t)^2} \tag{3.2}$$

$$MAE = \frac{1}{N}\sum_{t=1}^{N}\left|y_t - \hat{y}_t\right| \tag{3.3}$$

where $y_t$, $\hat{y}_t$ and $\bar{y}$ denote the true value at time t, the predicted value at time t and the average value, respectively. N is the number of data points.

In addition, we follow Campbell and Thompson [26] and employ the out-of-sample R2 ($R^2_{OS}$) to further evaluate the out-of-sample performance of the prediction model, which is defined as follows:

$$R^2_{OS} = 1 - \frac{\sum_{t=1}^{q}(y_{m+t} - \hat{y}_{m+t})^2}{\sum_{t=1}^{q}(y_{m+t} - \bar{y}_{m+t})^2} \tag{3.4}$$

where $y_{m+t}$, $\hat{y}_{m+t}$ and $\bar{y}_{m+t}$ represent the true value, the predicted value of the prediction model, and the benchmark prediction of the historical average model at time m+t, respectively. m is the total length of the training period and validation period. q is the length of the test period. A positive $R^2_{OS}$ statistic implies that the prediction model performs better than the benchmark model.
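The four evaluation metrics can be computed with a few lines of code. This is a minimal sketch using the standard definitions; the function names are hypothetical, and the historical average benchmark is assumed to be an expanding mean of all observations available before each forecast date.

```python
import numpy as np

def r2(y, y_hat):
    y, y_hat = np.asarray(y, float), np.asarray(y_hat, float)
    return 1.0 - np.sum((y - y_hat) ** 2) / np.sum((y - y.mean()) ** 2)

def rmse(y, y_hat):
    y, y_hat = np.asarray(y, float), np.asarray(y_hat, float)
    return np.sqrt(np.mean((y - y_hat) ** 2))

def mae(y, y_hat):
    y, y_hat = np.asarray(y, float), np.asarray(y_hat, float)
    return np.mean(np.abs(y - y_hat))

def historical_average(y, m):
    """Assumed benchmark: forecast for index t is the mean of y[:t], for t = m, ..., len(y)-1."""
    y = np.asarray(y, float)
    return np.array([y[:t].mean() for t in range(m, len(y))])

def r2_os(y_test, y_hat, y_bench):
    """Out-of-sample R2 relative to a benchmark forecast."""
    y_test, y_hat, y_bench = (np.asarray(a, float) for a in (y_test, y_hat, y_bench))
    return 1.0 - np.sum((y_test - y_hat) ** 2) / np.sum((y_test - y_bench) ** 2)

# Toy example: four observations, the last two form the test period (m = 2, q = 2).
y = np.array([1.0, 2.0, 3.0, 4.0])
in_sample_fit = np.array([1.0, 2.0, 3.0, 5.0])
bench = historical_average(y, m=2)           # means of [1, 2] and [1, 2, 3]
oos = r2_os(y[2:], [3.0, 5.0], bench)
```

A positive value of `oos` means the model forecasts have a smaller squared error than the historical average over the test period.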

Results and discussion

This paper employs a new dimensionality reduction method named s-PCA to predict carbon prices. In this section, we predict the carbon price in three steps. Firstly, we construct a library of indicators that affect carbon prices, including technical, financial and commodities indicators. Secondly, we apply the s-PCA method to reduce the dimensionality of the indicators. Finally, we employ the traditional regression method and the LSTM to predict carbon prices based on the diffusion indexes. The parameters of the LSTM model are adjusted according to the R2 of the validation set. To verify the superiority of the s-PCA method in carbon price prediction, PCA and PLS are selected for comparative analysis. We take Hubei as the research subject and set both the prediction horizon and the number of diffusion indexes to 1.

In-sample results

The in-sample results of all methods of Hubei in predicting carbon prices are shown in Tables 2 and 3. Based on all results, the analysis of each method is as follows:

In the case of prediction based on the linear regression method, the prediction performance of the model based on the s-PCA method is significantly better than that based on the PCA method and the PLS method. Specifically, the R2 of the s-PCA is 71.65%, which is much larger than those of the PCA and PLS, which are 20.02% and 68.11%, respectively. In addition, the RMSE and MAE of the s-PCA are 2.69 and 2.11, respectively, which are smaller than those of the PCA (RMSE = 4.53, MAE = 3.48) and the PLS (RMSE = 2.86, MAE = 2.24).

In the case of prediction based on the LSTM method, the s-PCA method still has the highest R2 and the lowest errors, which indicates that the s-PCA method performs better than the PCA method and the PLS method in carbon price prediction. Specifically, the three evaluation metrics of the s-PCA method (R2 = 99.67%, RMSE = 0.29, MAE = 0.22) are better than those of the PCA method (R2 = 97.28%, RMSE = 0.83, MAE = 0.69) and the PLS method (R2 = 99.55%, RMSE = 0.34, MAE = 0.27), which shows that the s-PCA method can indeed improve the prediction accuracy of carbon prices.

Compared with the linear regression method, the LSTM model could improve the prediction performance of the s-PCA method. Specifically, the prediction based on the s-PCA method and the LSTM model achieves a larger R2, along with smaller RMSE and MAE (R2 = 99.67%, RMSE = 0.29, MAE = 0.22), than the prediction based on the s-PCA method and the linear regression method (R2 = 71.65%, RMSE = 2.69, MAE = 2.11).

As a consequence, in the in-sample analysis, the s-PCA method has a stronger performance than the PCA method and the PLS method in carbon price prediction. In addition, the LSTM model could improve the performance of the s-PCA method compared to the linear regression method.

Table 2. In-sample results based on linear regression.

https://doi.org/10.1371/journal.pone.0296105.t002

Table 3. In-sample results based on LSTM method.

https://doi.org/10.1371/journal.pone.0296105.t003

Out-of-sample results

Tables 4 and 5 present the out-of-sample prediction performance of Hubei carbon price by all methods. The statistical significance for RMSE and MAE is based on the Diebold and Mariano [27] test (D-M test), in which the alternative hypothesis is that the prediction accuracy of the s-PCA method is higher than that of the benchmark model. The benchmark model is based on historical average, which is a widely used out-of-sample benchmark according to Welch and Goyal [28]. The observations can be summarized as follows:

The RMSEs and MAEs of all prediction methods are significantly smaller than those of the benchmark at the 1% level, which indicates that all prediction methods outperform the historical average benchmark in terms of out-of-sample RMSE and MAE. In other words, these prediction methods show strong out-of-sample forecasting capability in carbon price prediction.

In the case of out-of-sample prediction based on the linear regression method, the s-PCA method yields a significantly larger out-of-sample R2 as well as smaller RMSE and MAE (R2 = 68.94%, RMSE = 1.95, MAE = 1.16) than the PCA method (R2 = 18.06%, RMSE = 4.94, MAE = 2.26) and the PLS method (R2 = 66.87%, RMSE = 2.05, MAE = 1.22). The results indicate that the s-PCA method is a better dimensionality reduction method when using the linear regression method to predict carbon prices.

In the case of out-of-sample prediction based on the LSTM model, the s-PCA method performs much better than the PCA method and the PLS method. As can be seen from Table 5, the out-of-sample R2 of the s-PCA method is 85.12%, which is larger than those of the PCA method and the PLS method. In addition, the RMSE and MAE of the s-PCA method are 0.83 and 0.60, respectively, which are significantly smaller than those of the other two comparative methods. The results indicate that the s-PCA method outperforms the other dimensionality reduction methods when predicting carbon prices with the LSTM model.

The LSTM model can improve the prediction accuracy of the s-PCA method in carbon price prediction. Comparing the three evaluation metrics of the s-PCA method with the linear regression method and of the s-PCA method with the LSTM model, we find that the s-PCA method with the LSTM model performs much better than the s-PCA method with the linear regression method. Therefore, using the LSTM model to predict carbon prices can improve the prediction performance of the s-PCA method.

Table 4. Out-of-sample results based on linear regression.

https://doi.org/10.1371/journal.pone.0296105.t004

Table 5. Out-of-sample results based on LSTM method.

https://doi.org/10.1371/journal.pone.0296105.t005

In sum, the results in this section show that, consistent with the in-sample results, the s-PCA method is superior to both the PCA method and the PLS method for carbon price prediction. Moreover, the LSTM model can improve the prediction performance of the s-PCA method.

Robustness test

Alternative proxies of carbon prices

The carbon trading volumes of the Hubei Carbon Emissions Exchange, Guangzhou Carbon Emissions Exchange, and Shanghai Environmental Energy Exchange account for more than half of the total market in China, which indicates that the carbon markets in Hubei, Guangdong, and Shanghai are good representatives of the Chinese carbon market. For this reason, we further use the carbon prices of Guangzhou and Shanghai to test the out-of-sample performance of the s-PCA method. Tables 6 and 7 report the out-of-sample results for predicting the carbon prices of Guangzhou and Shanghai. The results show that the s-PCA method continues to perform better (i.e., larger out-of-sample R2, smaller RMSE and MAE) than the other comparative methods. Moreover, the LSTM model can improve the prediction accuracy of the s-PCA method. These results show that our out-of-sample results are robust to other proxies of carbon prices.

Table 6. Out-of-sample results based on linear regression for Guangzhou and Shanghai.

https://doi.org/10.1371/journal.pone.0296105.t006

Table 7. Out-of-sample results based on LSTM for Guangzhou and Shanghai.

https://doi.org/10.1371/journal.pone.0296105.t007

Different prediction horizons

Huang et al. [8] argue that it is possible to achieve satisfactory prediction results by chance after data mining on prediction horizons. To alleviate this concern, this paper further chooses another four prediction horizons to test the performance of the s-PCA method. Specifically, we set the prediction horizon to 3, 6, 9 and 12.

Tables 8 and 9 report the forecasting results for different prediction horizons. It can be seen that for each of the four prediction horizons, the s-PCA method generates a larger out-of-sample R2 and smaller RMSE and MAE than the other comparative methods. In addition, the s-PCA method combined with the LSTM model achieves better performance than the s-PCA method combined with the linear regression model. These results show that the out-of-sample results are robust to different prediction horizons.

Table 8. Out-of-sample results based on Linear Regression for different prediction horizons.

https://doi.org/10.1371/journal.pone.0296105.t008

Table 9. Out-of-sample results based on LSTM for different prediction horizons.

https://doi.org/10.1371/journal.pone.0296105.t009

Alternative forecasting window size

Following Sun and Huang [4], Zhou and Wang [13], we consider another forecasting window size by dividing the data set into a training set (80%), a validation set (10%), and a test set (10%).

Tables 10 and 11 report the out-of-sample results for the alternative forecasting window size. We observe that all of the prediction methods generate significant out-of-sample R2, RMSEs and MAEs. Among these methods, the s-PCA method has stronger prediction performance (larger out-of-sample R2, smaller RMSE and MAE) than the other competing methods. Furthermore, the LSTM model can improve the prediction performance of the s-PCA method by generating a larger out-of-sample R2 and smaller RMSE and MAE. This is consistent with our results when dividing the data set into a training set (60%), a validation set (20%), and a test set (20%). Hence, the out-of-sample prediction results are robust to the alternative forecasting window size.

Table 10. Out-of-sample results based on linear regression for alternative forecasting window size.

https://doi.org/10.1371/journal.pone.0296105.t010

Table 11. Out-of-sample results based on LSTM for alternative forecasting window size.

https://doi.org/10.1371/journal.pone.0296105.t011

Different size of diffusion index

According to the empirical analysis in the previous section, we can learn that the s-PCA method is superior to other comparative methods based on the first diffusion index. In order to test the robustness of the out-of-sample prediction performance for the s-PCA method, we employ the first and second diffusion indexes to re-predict the carbon price based on all forecasting methods.

The out-of-sample prediction results based on the first and second diffusion indexes are reported in Tables 12 and 13. It can be seen that the s-PCA method outperforms the other comparative methods with a larger out-of-sample R2 and smaller RMSE and MAE. Furthermore, compared with the s-PCA method with the linear regression method, the s-PCA method with the LSTM model performs better, indicating that the LSTM model can improve the prediction performance of the s-PCA method. Overall, the out-of-sample results suggest that the prediction performance of the s-PCA method is robust when we use the first and second diffusion indexes.

Table 12. Out-of-sample results based on linear regression for different size of diffusion index.

https://doi.org/10.1371/journal.pone.0296105.t012

Table 13. Out-of-sample results based on LSTM method for different size of diffusion index.

https://doi.org/10.1371/journal.pone.0296105.t013
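To make the notion of "first and second diffusion indexes" concrete, the sketch below extracts the leading s-PCA factors in the spirit of Huang et al. [8]: each standardized predictor is scaled by its slope from a univariate predictive regression on the target, and ordinary PCA is then applied to the scaled predictors. The function name `spca_diffusion_indexes`, the default one-step `horizon`, and the standardization details are assumptions of this illustration rather than the authors' implementation; predictors are assumed to have non-zero variance.

```python
import numpy as np


def spca_diffusion_indexes(X, y, n_indexes=2, horizon=1):
    """Return the first `n_indexes` s-PCA diffusion indexes (T x n_indexes).

    X: (T, N) matrix of predictors, y: (T,) target series.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)

    # Standardize each predictor to zero mean and unit variance.
    Xs = (X - X.mean(axis=0)) / X.std(axis=0)

    # Univariate predictive regressions: y[t + horizon] on Xs[t, i].
    X_lag = Xs[:-horizon]
    y_lead = y[horizon:]
    Xc = X_lag - X_lag.mean(axis=0)
    yc = y_lead - y_lead.mean()
    betas = Xc.T @ yc / np.sum(Xc ** 2, axis=0)

    # Scale each predictor by its predictive slope, then run PCA via SVD.
    scaled = Xs * betas
    centered = scaled - scaled.mean(axis=0)
    _, _, Vt = np.linalg.svd(centered, full_matrices=False)

    # Principal component scores, ordered by explained variance.
    return centered @ Vt[:n_indexes].T
```

Setting `n_indexes=1` corresponds to using only the first diffusion index, and `n_indexes=2` to the robustness setting discussed above. Predictors with little forecasting power receive slopes close to zero and therefore contribute little to the extracted factors.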

Market timing

Beyond the statistical significance of carbon price prediction, its economic significance is more meaningful for investors, since it can inform investment decisions and generate possible profits. Following He et al. [29], we further study the economic benefits of the s-PCA method from the perspective of market timing.

In this study, we take a long position at time t if the predicted carbon price at time t+30, \hat{P}_{t+30}, is higher than the carbon price at time t, P_t. Otherwise, we take a short position. At the end of time t+30, we close the position. The market timing strategy based on the carbon price prediction for time t can be expressed as follows:

$$A(t) = \begin{cases} 1, & \hat{P}_{t+30} > P_t \\ -1, & \text{otherwise} \end{cases} \tag{5.1}$$

where A(t) is the action we take at time t, 1 denotes that we take a long position, and -1 denotes that we take a short position.

To evaluate the performance of the market timing strategy, we use the Buy-and-Hold strategy as a benchmark, which takes a long position at time t and closes the position at time t+30.
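The timing rule and the Buy-and-Hold benchmark described above can be sketched as follows. This is an illustrative implementation under stated assumptions: the function names `timing_actions` and `strategy_returns` are hypothetical, `predicted[t]` is taken to be the forecast made at time t of the price at time t+horizon, and simple (not log) returns without transaction costs are assumed.

```python
import numpy as np


def timing_actions(current_price, predicted_price):
    """Return +1 (long) where the predicted price exceeds the current
    price, and -1 (short) otherwise."""
    current_price = np.asarray(current_price, dtype=float)
    predicted_price = np.asarray(predicted_price, dtype=float)
    return np.where(predicted_price > current_price, 1, -1)


def strategy_returns(prices, predicted, horizon=30):
    """Return (timing_returns, buy_and_hold_returns) per entry date.

    prices[t] is the observed price at t; predicted[t] is the forecast
    made at t for the price at t + horizon (assumption of this sketch).
    """
    prices = np.asarray(prices, dtype=float)
    predicted = np.asarray(predicted, dtype=float)

    entry = prices[:-horizon]          # price when the position is opened
    exit_ = prices[horizon:]           # price when the position is closed
    holding_return = exit_ / entry - 1.0   # Buy-and-Hold: always long

    actions = timing_actions(entry, predicted[:-horizon])
    return actions * holding_return, holding_return
```

A short position earns the negative of the holding return, so the timing strategy outperforms Buy-and-Hold exactly on the dates where it correctly anticipates a price decline.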

Table 14 reports the market timing results for carbon prices. Here, the average return is annualized and expressed in percent. The s-PCA method has the largest average return among all methods and market timing strategies: 38.14% based on the linear regression method and 70.39% based on the LSTM model. However, the risk of the s-PCA method, measured by standard deviation, is also much higher than that of most other market timing strategies: 12.66% based on the linear regression method and 12.26% based on the LSTM model. When risk is taken into consideration, the performance of a market timing strategy can be measured by the Sharpe ratio. The Sharpe ratio of the s-PCA method is 3.93 with the linear regression method and 7.49 with the LSTM model. These Sharpe ratios are much higher than those of most other market timing strategies; the exception is the PLS method with the LSTM model, whose Sharpe ratio is 7.68.

Table 14. Market timing results for carbon prices.

https://doi.org/10.1371/journal.pone.0296105.t014
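The three quantities in Table 14 (average return, standard deviation and Sharpe ratio) can be computed from a series of strategy returns along the following lines. This is a generic sketch, not the authors' procedure: the function name `summarize_strategy`, the annualization factor `periods_per_year`, and the zero default risk-free rate are assumptions, since this section does not state the conventions used. It is not intended to reproduce the exact figures in Table 14.

```python
import numpy as np


def summarize_strategy(returns, periods_per_year, risk_free_per_period=0.0):
    """Return (annualized mean in %, annualized std in %, Sharpe ratio).

    `returns` are per-period simple returns of a strategy. The
    annualization convention is an assumption of this sketch.
    """
    r = np.asarray(returns, dtype=float)
    excess = r - risk_free_per_period

    ann_mean = r.mean() * periods_per_year
    ann_std = r.std(ddof=1) * np.sqrt(periods_per_year)
    ann_excess = excess.mean() * periods_per_year

    # Sharpe ratio: annualized excess return per unit of annualized risk.
    sharpe = ann_excess / ann_std
    return 100.0 * ann_mean, 100.0 * ann_std, sharpe
```

Because the Sharpe ratio divides return by risk, a strategy with a higher average return can still rank lower once its volatility is taken into account, which is the comparison drawn between the s-PCA and PLS strategies above.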

In short, the s-PCA method offers greater economic value than the other market timing strategies. Moreover, the LSTM model can improve the performance of the s-PCA method in terms of the market timing strategy.

Conclusions and future work

In this paper, we employ the s-PCA model proposed by Huang et al. [8] to predict the carbon price with 71 technical indicators, 13 financial indicators and 25 commodity indicators. First, we construct a factor library of indicators that are likely to have an impact on the carbon price. Second, we use the s-PCA method to reduce the dimensionality of the influencing factors. Third, after dimensionality reduction, we employ the linear regression method and the LSTM model to predict the carbon price. Fourth, we examine the economic significance of the s-PCA method from a market timing perspective.

Using the carbon price of Hubei for the empirical analysis, we compare the prediction performance of the s-PCA method with that of the PCA method and the PLS method. The empirical results show that the s-PCA method is superior to the other comparative methods from both a statistical perspective and an economic perspective. Specifically, the s-PCA method yields larger R² and smaller RMSE and MAE in both in-sample and out-of-sample analysis. In addition, the LSTM model can provide significant improvements to the s-PCA method for carbon price prediction, which may be due to its properties of long-term memory and nonlinearity. Our results are robust to a series of settings, including different carbon markets, different forecasting horizons, alternative forecasting window size, and different size of diffusion index. From the perspective of market timing, an investor can achieve a higher average return and Sharpe ratio by applying the s-PCA method than by applying the other comparative strategies.

In future work, we should (a) look into more advanced prediction models to further improve the prediction performance of the s-PCA method; (b) construct more realistic investment strategies and provide more useful advice to investors; and (c) analyze the performance of the s-PCA method in other carbon markets.

Supporting information

S1 File.

https://doi.org/10.1371/journal.pone.0296105.s001

(XLS)

Acknowledgments

We thank the Collaborative Innovation Center of Industrial Upgrading and Regional Finance (Hubei, China) for its support. We also thank Qiang Xu for suggestions on the writing.

References

1. Huang Y, Dai X, Wang Q, Zhou D. A hybrid model for carbon price forecasting using GARCH and long short-term memory network. APPL ENERG, 2021; 285: 116485.
2. Sun W, Zhang J. A novel carbon price prediction model based on optimized least square support vector machine combining characteristic-scale decomposition and phase space reconstruction. ENERGY, 2022; 253: 124167.
3. Zhou F, Huang Z, Zhang C. Carbon price forecasting based on CEEMDAN and LSTM. APPL ENERG, 2022; 311: 118601.
4. Sun W, Huang C. A carbon price prediction model based on secondary decomposition algorithm and optimized back propagation neural network. J CLEAN PROD, 2020; 243: 118671.
5. Wang J, Cui Q, He M. Hybrid intelligent framework for carbon price prediction using improved variational mode decomposition and optimal extreme learning machine. CHAOS SOLITON FRACT, 2022; 156: 111783.
6. Wang J, Sun X, Cheng Q, Cui Q. An innovative random forest-based nonlinear ensemble paradigm of improved feature extraction and deep learning for carbon price forecasting. SCI TOTAL ENVIRON, 2021; 762: 143099. pmid:33127140
7. Liu J, Wang P, Chen H, Zhu J. A combination forecasting model based on hybrid interval multi-scale decomposition: Application to interval-valued carbon price forecasting. EXPERT SYST APPL, 2022; 191: 116267.
8. Huang D, Jiang F, Li K, Tong G, Zhou G. Scaled PCA: A new approach to dimension reduction. MANAGE SCI, 2022; 68(3): 1678–1695.
9. Wang Y, Liu L, Wu C. Forecasting commodity prices out-of-sample: Can technical indicators help? INT J FORECASTING, 2020; 36(2): 666–683.
10. Tan X, Sirichand K, Vivian A, Wang X. Forecasting European carbon returns using dimension reduction techniques: Commodity versus financial fundamentals. INT J FORECASTING, 2022; 38(3): 944–969.
11. Brogaard J, Dai L, Ngo P T H, Zhang B. Global political uncertainty and asset prices. REV FINANC STUD, 2020; 33(4): 1737–1780.
12. Chen W, Xu H, Jia L, Gao Y. Machine learning model for Bitcoin exchange rate prediction using economic and technology determinants. INT J FORECASTING, 2021; 37(1): 28–43.
13. Zhou J, Wang S. A carbon price prediction model based on the secondary decomposition algorithm and influencing factors. ENERGIES, 2021; 14(5): 1328.
14. Zhu B, et al. A multiscale analysis for carbon price drivers. ENERG ECON, 2019; 78: 202–216.
15. Hotelling H. Analysis of a complex of statistical variables into principal components. J EDUC PSYCHOL, 1933; 24(6): 417.
16. Martinez A M, Kak A C. PCA versus LDA. IEEE T PATTERN ANAL, 2001; 23(2): 228–233.
17. Kelly B T, Pruitt S, Su Y. Characteristics are covariances: A unified model of risk and return. J FINANC ECON, 2019; 134(3): 501–524.
18. Pelger M. Understanding systematic risk: A high-frequency approach. J FINANC, 2020; 75(4): 2179–2220.
19. Gu S, Kelly B, Xiu D. Autoencoder asset pricing models. J ECONOMETRICS, 2021; 222(1): 429–450.
20. Lettau M, Pelger M. Estimating latent asset-pricing factors. J ECONOMETRICS, 2020; 218(1): 1–31.
21. Lettau M, Pelger M. Factors that fit the time series and cross-section of stock returns. REV FINANC STUD, 2020; 33(5): 2274–2325.
22. Garthwaite P H. An interpretation of partial least squares. J AM STAT ASSOC, 1994; 89(425): 122–127.
23. Kelly B, Pruitt S. The three-pass regression filter: A new approach to forecasting using many predictors. J ECONOMETRICS, 2015; 186(2): 294–316.
24. Moya-Clemente I, Ribes-Giner G, Pantoja-Díaz O. Identifying environmental and economic development factors in sustainable entrepreneurship over time by partial least squares (PLS). PLOS ONE, 2020; 15(9): e0238462. pmid:32886680
25. Light N, Maslov D, Rytchkov O. Aggregation of information about the cross section of stock returns: A latent variable approach. REV FINANC STUD, 2017; 30(4): 1339–1381.
26. Campbell J Y, Thompson S B. Predicting excess stock returns out of sample: Can anything beat the historical average? REV FINANC STUD, 2008; 21(4): 1509–1531.
27. Diebold F X, Mariano R S. Comparing predictive accuracy. J BUS ECON STAT, 2002; 20(1): 134–144.
28. Welch I, Goyal A. A comprehensive look at the empirical performance of equity premium prediction. REV FINANC STUD, 2008; 21(4): 1455–1508.
29. He M, Zhang Y, Wen D, Wang Y. Forecasting crude oil prices: A scaled PCA approach. ENERG ECON, 2021; 97: 105189.
