运筹与管理 ›› 2025, Vol. 34 ›› Issue (10): 87-92.DOI: 10.12005/orms.2025.0313

• 理论分析与方法探讨 • 上一篇    下一篇

融合情绪指标的股价波动率预测研究——基于微调大语言模型与GAT-TCN网络

吕成双1, 王彤2, 孙浩然3   

  1. 1.广西大学 经济学院 中国-东盟金融合作学院,广西 南宁 530004;
    2.武汉大学 遥感信息工程学院,湖北 武汉 430072;
    3.南开大学 商学院 中国公司治理研究院,天津 300071
  • 收稿日期:2024-01-10 出版日期:2025-10-25 发布日期:2026-02-27
  • 通讯作者: 王彤(1993-),男,新疆乌鲁木齐人,博士研究生,研究方向:机器学习,自然语言处理。Email: wangtong@whu.edu.cn。
  • 作者简介:吕成双(1994-),女,四川达州人,博士研究生,研究方向:科技金融,公司金融。
  • 基金资助:
    2023年度海南省哲学社会科学规划课题(HNSK(QN)23-99);广西大学经济学院/中国-东盟金融合作学院研究生创新计划资助项目(ZX02080034025001)

Research on Stock Price Volatility Forecast Integrating Sentiment Indicators: Based on Fine-tuned Large Language Model and GAT-TCN Network

LYU Chengshuang1, WANG Tong2, SUN Haoran3   

  1. 1. School of Economics, China-Asian Institute of Financial Cooperation, Guangxi University, Nanning 530004, China;
    2. School of Remote Sensing and Information Engineering, Wuhan University, Wuhan 430072, China;
    3. Business School, China Academy of Corporate Governance, Nankai University, Tianjin 300071, China
  • Received:2024-01-10 Online:2025-10-25 Published:2026-02-27

摘要: 股价波动率及其背后的复杂动力学因素是金融学领域研究热点之一。为有效预测股价波动率的走势,本研究运用2010年1月4日到2023年9月22日的日度数据,选取中国上证50六大上市公司股票为研究样本。分类实验发现,相较于其它五组模型,微调大语言模型的精确度与召回度最高能提高11%,说明微调大语言模型在金融文本情感分类任务上性能更优。消融实验发现,将投资者情标引入指标体系后,GAT-TCN模型的预测结果拟合优度均值提升了0.028,表明引入情绪指标能够提升股价波动率的预测精度。相较于其它三组模型,本研究采用的GAT-TCN模型的拟合优度均值最高能提升0.21,表明在股价波动率预测精度方面表现更优。本文将微调大语言模型用于金融文本情感分类,同时扩展了GAT-TCN网络在金融领域的运用,对于股票价格的精准预测以及跨学科领域研究有重要意义。

关键词: 股价波动率, 投资者情绪, ChatGPT, 空洞时间卷积, 图注意力

Abstract: Stock price volatility stands as a pivotal research focus within the financial sector, drawing scholars into the intricate dynamics that underpin stock market movements. Recognizing the complexities involved in forecasting stock price volatility, this research embarks on an extensive analysis using daily data spanning from January 4, 2010 to September 22, 2023. For empirical rigor, six prominent Chinese publicly traded companies—namely, Industrial and Commercial Bank of China, Kweichow Moutai, GD Power Development, Daqin Railway, Yangtze Power, and East China Pharmaceutical—are meticulously selected as the study’s focal samples. Leveraging the capabilities of fine-tuned large-scale language models, this study ventures into the realm of sentiment analysis, delving deep into investor commentary on stocks. Through this, an investor sentiment index is meticulously constructed. Complementing this, the research introduces an avant-garde stock price volatility prediction model that amalgamates graph attention mechanisms with dilated temporal convolutional networks. This innovative approach is designed to forecast stock price volatility, anchoring its predictions on a nonlinear and non-stationary indicator framework.
Navigating through this intricate landscape requires an integrative methodological approach. A salient feature is the employment of fine-tuned large-scale language models for a nuanced sentiment analysis of investor-generated stock commentary. The insights garnered from this analysis culminates in the development of an investor sentiment index, providing a quantifiable measure of market sentiment. In tandem, the research unveils the GAT-TCN model, an innovative fusion of graph attention mechanisms and dilated temporal convolutional networks. This model is meticulously crafted to enhance the precision of stock price volatility predictions, operating within a nonlinear and non-stationary indicator milieu.
The empirical findings of this study are significant. Firstly, the superior efficacy of fine-tuned large-scale language models in financial sentiment classification tasks is unequivocally established. Secondly, the integration of the investor sentiment index into the predictive framework markedly elevates the accuracy and reliability of stock price volatility forecasts. Notably, the GAT-TCN network emerges as a frontrunner, showcasing enhanced predictive prowess compared to established models such as BiLSTM, CNN-LSTM, and SVM. The ramifications of these findings are profound, offering actionable insights for harnessing the potential of fine-tuned language models in financial sentiment analysis and advocating for the broader adoption of the GAT-TCN framework within the financial ecosystem. Such advancements hold transformative implications, paving the way for more precise stock price predictions and fostering interdisciplinary collaborations.
Although this study has significantly advanced our understanding of how investor sentiment predicts stock price volatility, it also emphasizes that the GAT-TCN model exhibits superior performance compared to other models that have been predicted. However, several limitations also point to possible avenues for future research. This study employs the GAT-TCN model, which has demonstrated high forecasting accuracy, but future research can continue to optimize the structure of the forecasting model as technology advances. Secondly, the findings presented in this study are based on the text of six stocks, which, although representative and broad, may not capture the entire sample of stocks. Consequently, the findings may be limited and the generalisability of the findings will be enhanced in the future through the inclusion of a larger sample of stocks.

Key words: stock price volatility, investor sentiment, ChatGPT, dilated temporal convolution, graph attention

中图分类号: