运筹与管理 ›› 2024, Vol. 33 ›› Issue (9): 134-139.DOI: 10.12005/orms.2024.0296

• 应用研究 • 上一篇    下一篇

基于GA-VMD与CNN-BiLSTM-Attention模型的区域碳排放交易价格预测研究

吴丽丽1, 邰庆瑞1, 卞洋2, 李言辉3   

  1. 1.中国石油大学(北京) 经济管理学院,北京 102249;
    2.对外经济贸易大学 金融学院,北京 100029;
    3.南京大学 计算机科学与技术系,江苏 南京 210008
  • 收稿日期:2022-11-27 出版日期:2024-09-25 发布日期:2024-12-31
  • 通讯作者: 吴丽丽(1979-),女,江苏苏州人,博士,副教授,研究方向:能源经济与政策。
  • 基金资助:
    国家自然科学基金资助项目(72374211,62172202)

Research on the Prediction of Regional Carbon Price: A GA-VMD and CNN-BiLSTM-Attention Approach

WU Lili1, TAI Qingrui1, BIAN Yang2, LI Yanhui3   

  1. 1. School of Economics and Management, China University of Petroleum-Beijing, Beijing 102249, China;
    2. School of Banking and Finance, University of International Business and Economics, Beijing 100029, China;
    3. School of Computer Science and Technology, Nanjing University, Nanjing 210008, China
  • Received:2022-11-27 Online:2024-09-25 Published:2024-12-31

摘要: 准确的碳价预测可为碳排放权交易市场监管者和投资者提供决策依据与参考。本文提出了基于GA-VMD降噪分解及CNN-BiLSTM-Attention混合模型的碳价预测方法,并选取湖北碳市场2014年4月2日到2022年6月15日1857个交易日的数据进行分析:首先通过遗传算法改进变分模态分解(GA-VMD)将原始碳价序列分解为平稳的本征模态函数(IMF)分量,降低数据噪音;随后构建CNN-BiLSTM-Attention混合模型对各IMF分量进行预测。其中,卷积神经网络(CNN)可提取影响碳价多个特征,双向长短期记忆网络(BiLSTM)可实现时间序列信息提取,注意力机制(Attention)可突出某个关键输入对输出的影响。本文将预测出的各IMF分量集合成碳价序列,并提出12个模型,分为3个组进行剥离分析,结果显示GA-VMD-CNN-BiLSTM-Attention的预测结果最好。另外,为给市场参与者提供更多信息,本文在确定性预测的基础上加入区间预测,以便提前测量碳市场的波动性。

关键词: 碳价预测, 深度学习, 变分模态分解, BiLSTM, 注意力机制

Abstract: Recently, a global consensus has been reached that carbon emissions should be reduced in response to global environmental problems. It is now widely believed that an effective carbon tax policy and the establishment of a carbon emission trading system are the key to the transition to a low-carbon economy. Carbon prices, which directly mirror the supply and demand for carbon emission rights in carbon markets, exert significant influence over both investors and regulatory authorities. Accurate forecasts of carbon prices are essential for informed decision-making. However, carbon prices are affected by internal market mechanisms and external environmental fluctuations, and are therefore non-stationary and non-linear. Therefore, carbon price prediction faces a huge challenge. This paper improves the accuracy of carbon price prediction by complementing current research on the application of decomposition-forecast-ensemble hybrid models.
Currently, carbon price prediction models are shifting from traditional models to data-driven ones, enabling deep learning algorithms to have more applications in this field. To build an effective hybrid forecasting model and reduce data noise, the paper proposes a new framework that is based on the GA-VMD-CNN-BiLSTM-Attention hybrid model. Here the genetic algorithm (GA) is adopted to search the optimal parameter combination of variational mode decomposition (VMD); convolutional neural networks (CNN) are established to discover the relationship between influencing factors and carbon prices; a bidirectional long and short-term memory network (BiLSTM) is applied to extract time series information; and an attention mechanism is used to strengthen the influence of important information on carbon prices. In addition to deterministic point prediction, this paper uses a non-parametric kernel density estimation with Gaussian kernel function (KDE-Gaussian) for interval forecasting. The interval forecasting can quantify the uncertainty of carbon prices and serve as a more practical reference for decision-makers. In our empirical analysis, this paper uses data from China's Hubei Emission Exchange dating from April 2, 2014, to June 15, 2022, for a total of 1857 trading days, to predict the daily closing price. The four main innovations and contributions of this paper are as follows. First, GA-VMD is applied to obtain multiple intrinsic mode function (IMF) components, so as to input multiple effective and smooth subsequences in the forecasting model. The optimal parameters of VMD are found through continuous iteration of GA, which avoids the uncertainty of artificial selection and effectively eliminates the noise influence in the decomposition process. Second, a hybrid CNN-BiLSTM-Attention prediction model is established. The CNN feature extraction capability is combined with the BiLSTM time series information extraction capability to improve the one-way transmission LSTM into BiLSTM with simultaneous forward and backward transmission, thus enhancing the memory of the neural network. Third, an attention mechanism is introduced into the BiLSTM side. This method trains weights on the hidden states of all BiLSTM time steps. Consequently, it outputs all forecasting information by weighted summation. Therefore, it can improve the forecasting effect by increasing the influence of important information. Fourth, on the basis of deterministic point prediction, non-parametric KDE-Gaussian is applied for interval forecasting. The prediction intervals at different confidence levels can serve as an improved practical reference for decision-makers.
To verify the superiority of the proposed model, this paper presents a comparative analysis of 12 models divided into 3 groups. The first group includes the models of LSTM, BiLSTM, BiLSTM-Attention, and CNN-BiLSTM-Attention. In the second group, VMD is added to the benchmark models to reflect the effect of noise reduction. Then the genetic algorithm is added to the third group. We then evaluate the forecasting results of different models and compare them using point prediction evaluation metrics. Compared to 11 other models, the GA-VMD-CNN-BiLSTM-Attention model is more accurate and reliable: its goodness-of-fit (R2) reaches 98.91%, while its MAE, RMSE, and MAPE values are as low as 0.1246, 0.7298, and 0.0111, respectively. In addition to deterministic point prediction, the paper performs interval forecasting for 278 data points from the Hubei Emission Exchange in the test set to quantify the uncertainty of carbon prices and provide a more practical reference for decision-makers. The result shows that the KDE (Gaussian) prediction method provides a more reliable interval prediction, with a 15.57% reduction over the KDE (Epanechnikov) method and a 34.92% reduction over normal distribution in coverage width-based criterion (CWC) index at a 95% confidence level.
By revealing the particularly challenging issue that underlies carbon price forecasting, our analysis sheds light on current low-carbon policies in China. To improve these policies, the paper proposes that China should establish a comprehensive carbon emissions data system, gradually implement paid allocation of allowances, enrich trading products, and promote a jointly developed financial market and the carbon market. This paper has not yet considered how and to what extent other factors such as policy making and changes in domestic andinternational situations affect carbon prices. This is a possible direction for future studies.

Key words: carbon price prediction, deep learning, variational mode decomposition, BiLSTM, attention mechanisms

中图分类号: