Three-way Clustering Model Integrated Decomposition Ensemble Learning for Forecasting Stock Price

doi:10.12005/orms.2024.0273

Operations Research and Management Science, 2024, Vol. 33, Issue (8): 213-218. DOI: 10.12005/orms.2024.0273

Application Research

BAI Juncheng¹, SUN Bingzhen¹, GUO Yuqi¹, CHEN Youwei¹, GUO Jianfeng²

1. School of Economics and Management, Xidian University, Xi’an 710071, China;
2. Research Center for Computational Finance and Risk Management, Xi’an University of Posts and Telecommunications, Xi’an 710061, China

Received: 2021-11-18; Online: 2024-08-25; Published: 2024-10-29

融合三支聚类与分解集成学习的股票价格预测模型

白军成¹, 孙秉珍¹, 郭誉齐¹, 陈有为¹, 郭建峰²

1.西安电子科技大学经济与管理学院,陕西西安 710071;
2.西安邮电大学计算金融与风险管理研究中心,陕西西安 710061

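About the authors: BAI Juncheng (1993-), male, from Zhenyuan, Gansu, doctoral candidate; research interest: machine learning. SUN Bingzhen (1979-), corresponding author, male, from Ningxian, Gansu, PhD, professor and doctoral supervisor; research interests: data science and intelligent decision-making, emergency management decision-making, and data-driven medical decision-making.
Supported by:
National Natural Science Foundation of China (72071152); Shaanxi Provincial Science Fund for Distinguished Young Scholars (2023-JC-JQ-11); Xi’an Science and Technology Project (2022RKYJ0030); Humanities and Social Sciences Research Project of the Ministry of Education (22YJA630008); Fundamental Research Funds for the Central Universities (20101236618, 20101236262)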

Abstract

Accurate trend analysis and real-time price prediction are effective ways to achieve the desired investment returns. Financial markets, however, are shaped by changes in the objective economic environment, investors’ expected returns, and other latent factors, which poses challenges for traditional forecasting methods. Finding a reliable forecasting tool for uncertain environments and improving prediction accuracy is therefore a scientific problem that merits in-depth study.
This paper combines the decomposition-ensemble idea with the theory of three-way decisions and proposes a composite forecasting model based on three-way clustering. First, Complementary Ensemble Empirical Mode Decomposition (CEEMD) decomposes the original time series into several relatively stationary sub-series, which reduces the complexity of the original series while uncovering hidden information. Next, because the sub-series have different properties, sample entropy is used to measure the complexity of each sub-series, and a probabilistic rough set based on Bayesian risk decision is constructed to classify the sub-series into core, marginal, and trivial domains. Then, to avoid both missing input information and interference from redundant information, phase space reconstruction is used to determine the optimal input structures of the Elman neural network, the extreme learning machine, and the BP neural network, which predict the core, marginal, and trivial domains, respectively. Finally, the proposed model is applied to forecasting the price of the U.S.-listed stock ANY, as well as important international and domestic stock indices and their constituent stocks.
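The complexity-measurement and three-way classification steps described above can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the sample entropy parameters, the Bayesian loss functions, or the resulting thresholds, so `m`, `r_factor`, `alpha`, and `beta` are hypothetical values, and the probabilistic rough set is replaced here by a simple min-max normalised entropy score. The assumption that higher complexity maps to the core domain is also ours. The sub-series are assumed to come from a decomposition step performed elsewhere.

```python
import numpy as np


def sample_entropy(x, m=2, r_factor=0.2):
    """Sample entropy SampEn(m, r) with tolerance r = r_factor * std(x).

    m and r_factor are conventional illustrative defaults (assumed here).
    """
    x = np.asarray(x, dtype=float)
    n = len(x)
    r = r_factor * np.std(x)

    def count_matches(length):
        # Use the same n - m starting points for both template lengths.
        templates = np.array([x[i:i + length] for i in range(n - m)])
        count = 0
        for i in range(len(templates) - 1):
            # Chebyshev distance from template i to every later template.
            dist = np.max(np.abs(templates[i + 1:] - templates[i]), axis=1)
            count += int(np.sum(dist <= r))
        return count

    b = count_matches(m)        # matching pairs of length m
    a = count_matches(m + 1)    # matching pairs of length m + 1
    if a == 0 or b == 0:
        return float("inf")     # entropy undefined when no matches exist
    return float(-np.log(a / b))


def three_way_partition(scores, alpha=0.7, beta=0.3):
    """Assign each score to 'core', 'marginal', or 'trivial'.

    Simplification (assumed): scores are min-max normalised and compared
    with hypothetical thresholds alpha > beta, standing in for the
    conditional probabilities of a probabilistic rough set.
    """
    scores = np.asarray(scores, dtype=float)
    span = scores.max() - scores.min()
    norm = (scores - scores.min()) / span if span > 0 else np.zeros_like(scores)
    labels = []
    for p in norm:
        if p >= alpha:
            labels.append("core")
        elif p <= beta:
            labels.append("trivial")
        else:
            labels.append("marginal")
    return labels


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    t = np.linspace(0, 8 * np.pi, 300)
    # Stand-ins for decomposed sub-series: noise, noisy sine, clean sine.
    sub_series = [rng.normal(size=300),
                  np.sin(t) + 0.3 * rng.normal(size=300),
                  np.sin(t)]
    entropies = [sample_entropy(s) for s in sub_series]
    print(three_way_partition(entropies))
```

In a fuller treatment the thresholds would be derived from the loss functions of the Bayesian risk decision rather than fixed by hand, and each domain would then be passed to its own predictor.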
The proposed method shows good predictive performance for stock prices, which can be attributed to three factors. First, CEEMD effectively uncovers hidden information in the time series. Second, three-way clustering makes the forecasting method more adaptable to sub-series of different complexity. Third, phase space reconstruction adaptively constructs the input structures of the neural networks. Theoretically, integrating granular computing with decomposition-ensemble methods is a useful attempt at building forecasting and decision models for complex dynamic data. By approaching the problem through time series complexity, the three-way clustering model based on Bayesian risk decision and probabilistic rough sets also offers a new perspective that enriches the theory of three-way decisions. In practice, accurate stock price predictions can help investors avoid future risks more effectively and provide scientific support and reference for practical investment decisions.
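Phase space reconstruction, as used above to build network inputs, is commonly realised as a time-delay embedding. The following is a minimal sketch under that assumption. The function name `delay_embed`, the embedding dimension `dim`, and the delay `tau` are hypothetical; the abstract does not state how the authors select these values, so they are simply passed in.

```python
import numpy as np


def delay_embed(series, dim, tau):
    """Build (inputs, targets) for one-step-ahead forecasting.

    Row t of `inputs` is [x[t], x[t + tau], ..., x[t + (dim - 1) * tau]],
    and targets[t] is the value one step after the last element of that row.
    dim and tau are assumed to be chosen beforehand.
    """
    x = np.asarray(series, dtype=float)
    # Number of delay vectors that still leave one value for the target.
    n_vectors = len(x) - (dim - 1) * tau - 1
    if n_vectors <= 0:
        raise ValueError("series too short for the given dim and tau")
    inputs = np.column_stack(
        [x[k * tau: k * tau + n_vectors] for k in range(dim)]
    )
    start = (dim - 1) * tau + 1
    targets = x[start: start + n_vectors]
    return inputs, targets


if __name__ == "__main__":
    X, y = delay_embed(np.arange(10), dim=3, tau=2)
    print(X.shape, y.shape)
```

Each row of `inputs` can then serve as the input vector of a predictor, so that `dim` fixes the number of input nodes of the network.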

Key words: three-way clustering, complementary ensemble empirical mode decomposition, stock price forecasting

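Abstract (translated from the Chinese): Accurate trend judgment and real-time price prediction are effective ways to obtain the desired investment returns. Real financial markets are affected by changes in the objective economic environment, investors’ expected returns, and other latent factors, so traditional forecasting methods face considerable challenges and pressure. How to find a reliable forecasting tool in an uncertain environment and improve prediction accuracy is a scientific problem worth exploring in depth. To obtain accurate forecasts and help investors maximize profit, this paper introduces the decomposition-ensemble idea and three-way decision theory, and proposes a composite forecasting method based on three-way clustering and decomposition ensemble. First, complementary ensemble empirical mode decomposition is used to decompose the original time series into several relatively stationary sub-series, which reduces the complexity of the original series while uncovering hidden information. Second, to handle sub-series with different properties in a targeted way, a probabilistic rough set based on Bayesian risk decision is constructed to perform three-way clustering. Next, to avoid a lack of input information or interference from redundant information, a feature selection method based on phase space reconstruction is used to determine the input structures of the different neural networks. Finally, the proposed method is applied to forecasting the price of the U.S. stock ANY and to forecasting important international and domestic stock indices and their constituent stocks, in order to verify its effectiveness and practicality. The work is also a useful attempt at combining granular computing with decomposition ensemble to build forecasting and decision models and methods for complex dynamic data. In addition, the results provide scientific support and reference for investors’ practical investment decisions.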

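关键词 (Chinese key words): 三支聚类 (three-way clustering), 互补集成经验模态分解 (complementary ensemble empirical mode decomposition), 股票价格预测 (stock price forecasting)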

CLC Number: F830.9

BAI Juncheng, SUN Bingzhen, GUO Yuqi, CHEN Youwei, GUO Jianfeng. Three-way Clustering Model Integrated Decomposition Ensemble Learning for Forecasting Stock Price[J]. Operations Research and Management Science, 2024, 33(8): 213-218.

白军成, 孙秉珍, 郭誉齐, 陈有为, 郭建峰. 融合三支聚类与分解集成学习的股票价格预测模型[J]. 运筹与管理, 2024, 33(8): 213-218.

