运筹与管理 ›› 2025, Vol. 34 ›› Issue (2): 38-43.DOI: 10.12005/orms.2025.0040

• 理论分析与方法探讨 • 上一篇    下一篇

新能源汽车销量预测的分解—聚类—集成方法研究

王方1,3, 赵桉坤1, 卜皓玥1, 余乐安2   

  1. 1.西安电子科技大学经济与管理学院,陕西西安 710126;
    2.四川大学商学院,四川成都 610065;
    3.陕西信息化与数字经济软科学研究基地,陕西西安 710126
  • 收稿日期:2022-11-18 出版日期:2025-02-25 发布日期:2025-06-04
  • 通讯作者: 余乐安(1976-),男,湖南常德人,博士,教授,研究方向:大数据挖掘,商务智能,经济预测,金融管理。Email: yulean@amss.ac.cn。
  • 作者简介:王方(1987-),男,陕西商洛人,博士,教授,研究方向:预测与决策分析,循环经济,数字经济
  • 基金资助:
    国家自然科学基金资助项目(72001165);滨州魏桥国科高等技术研究院“产学合作 协同育人”项目(BINTECH-KJZX-20220831-67);陕西省创新能力支撑计划资助项目(2022KJXX-112);西安市科技计划项目软科学研究重点项目(23RKYJ0006)

Sales Forecasting of New Energy Vehicles with a Decomposition-cluster-ensemble Method

WANG Fang1,3, ZHAO Ankun1, BU Haoyue1, YU Lean2   

  1. 1. School of Economics and Management, Xidian University, Xi'an 710126, China;
    2. Business School, Sichuan University, Chengdu 610065, China;
    3. Shaanxi Soft Science Institute of Informatization and Digital Economy, Xi'an 710126, China
  • Received:2022-11-18 Online:2025-02-25 Published:2025-06-04

摘要: 新能源汽车销量预测,对于政府产业布局、车企转型发展和能源部门减碳决策均具有重要意义。为提升新能源汽车月度销量预测的精度,基于“分解—集成”的建模思想,遵循“分而治之”的原则,构建了“分解—聚类—集成”预测框架。首先,通过集合经验模态分解(EEMD)算法,将新能源汽车月度销量的时间序列数据分解为多个分量序列。其次,采用样本熵和K-means聚类法对分解得到的多个分量进行集聚,得到高频、中频、低频三类不同的分量序列集。然后,使用长短期记忆网络(LSTM)、差分整合移动平均自回归模型(ARIMA)和灰色预测GM(1,1)模型,分别对三类分量序列进行预测。最后,以线性加权算法进行集成,得到新能源汽车月度销量的预测结果。基于2012年1月至2022年5月我国新能源汽车销量数据的实证分析表明,提出的“EEMD-K-LSTM/ARIMA/GM(1,1)”预测模型较传统单模型和“分解—集成”模型更优。

关键词: 新能源汽车, 销量预测, EEMD分解, K-means聚类, 分解—集成

Abstract: The monthly sales data of new energy vehicles have the phenomenon of multi-data characteristics such as nonlinear and seasonal aliasing, and the use of a classical single model for forecasting has the disadvantage of low prediction accuracy. To improve the accuracy of monthly sales forecast of new energy vehicles, based on the modeling idea of “decomposition-ensemble”, on the basis of making full use of the advantages of each single model, and following the principle of “divide and conquer”, a comprehensive prediction model of “decomposition-clustering-ensemble” is constructed to achieve high-precision prediction of monthly sales of new energy vehicles.
Firstly, the ensemble empirical mode decomposition (EEMD) model is applied to decompose the time-series data of monthly sales volume of new energy vehicles. This approach effectively handles the nonlinear and non-stationary data characteristics of the data series, and effectively suppresses the occurrence of modal aliasing phenomenon. Then, to improve the efficiency of prediction modeling and reduce the accumulation of errors, sample entropy and K-means method are used to cluster the obtained decomposition components, and three types of components are obtained: high frequency sequence, medium frequency sequence and low frequency sequence. Based on the advantages of GM(1,1) model, which is suitable for the prediction of exponential law data series, the low frequency class component series is predicted. The autoregressive integrated moving average (ARIMA) model can transform complex non-stationary sequences into stationary sequences for modeling, and use it to predict intermediate frequency component sequences. The long short-term memory (LSTM) network model selects and processes the data through the three gates inside the control, which is suitable for more complex high-frequency data series prediction modeling, and uses it to predict the high-frequency component series. Finally, the linear weighting method is used to combine the forecast results of each component, and the forecast results of monthly sales of new energy vehicles are obtained.
The monthly sales volume of new energy vehicles from January 2012 to May 2022 published by the China Association of Automobile Manufacturers is used as the data set to verify the “EEMD-K-LSTM/ARIMA/GM(1,1)” comprehensive forecasting model proposed in this study. The results show that compared with the traditional single model and the “decomposition-ensemble” model, the “decomposition-clustering-ensemble” comprehensive forecasting model achieves a good forecasting effect, and the MAPE value of the one-step forward and three-step forward forecasting of the monthly sales volume of new energy vehicles is 8.75% and 10.62%, respectively. Using the “EEMD-K-LSTM/ARIMA/GM(1,1)” comprehensive forecasting model, the sales data of new energy vehicles in China from January 2012 to October 2022 are modeled. The predicted sales for November 2022 to January 2023 are 800,000 vehicles, 830,000 vehicles, and 520,000 vehicles, respectively, consistent with the overall trend in 2020 and 2021.
It should be noted that in reality, there are multiple factors that influence the monthly sales of new energy vehicles, including national policies, seasonal factors, economic conditions, and so on. To achieve long-term trend forecasting, the next step should consider incorporating various influencing factors into the model and conducting more comprehensive predictions and discussions through methods such as scenario analysis.

Key words: new energy vehicles, sales forecast, EEMD decomposition, K-means clustering, decomposition-ensemble

中图分类号: