运筹与管理 ›› 2025, Vol. 34 ›› Issue (3): 190-197.DOI: 10.12005/orms.2025.0095

• 应用研究 • 上一篇    下一篇

信息异质性对上市公司财务危机预警的影响

李杰, 王文华, 杨芳   

  1. 河北工业大学 经济管理学院,天津 300401
  • 收稿日期:2022-11-10 出版日期:2025-03-25 发布日期:2025-07-04
  • 作者简介:李杰(1973-),女,河北河间人,博士,教授,研究方向:金融大数据分析。
  • 基金资助:
    国家社会科学基金资助项目(16FGL014);河北省自然科学基金项目(G2019202350)

Impact of Information Heterogeneity on Early Warning of Financial Crisis of Listed Companies

LI Jie, WANG Wenhua, YANG Fang   

  1. School of Economics and Management, Hebei University of Technology, Tianjin 300401, China
  • Received:2022-11-10 Online:2025-03-25 Published:2025-07-04

摘要: 上市公司通常采用财务报表、年报或新闻文本数据构建指标体系,进行财务危机预警。由于数据发布主体、意图和渠道的不同,不同数据之间存在差异。现有财务危机预警研究往往专注于各渠道数据本身承载的信息,忽略了不同来源信息之间差异的影响。本文具体分析了信息异质性存在的原因、不同渠道数据的异质性及其与企业危机状态的关系。定义了信息异质性指标,并给出其度量方法。最后,通过实证研究证明信息异质性对于上市公司财务危机预警具有重要作用。基于XGBoost构建的财务危机预警模型在加入信息异质性指标后的模型性能有显著提升。进一步将公司样本按照财务报表表现和信息异质性高低分为4类特定子样本。同样地,信息异质性指标显著提高了4类子样本的危机预警效果,并且信息异质性是XGBoost模型的第一重要指标。

关键词: 信息异质性, 财务危机, 文本分析, 机器学习

Abstract: The financial crisis of a listed company will cause investors to suffer huge economic losses and even have a serious negative impact on the entire society. Scientific and accurate financial crisis prediction of listed companies can effectively reduce investment risks and related losses. The composition of the existing enterprise financial crisis early warning index system focuses on the use of quantitative financial statements, annual reports and news reports and other data itself carried by the information, but due to the different entities of information release, release purposes, information content and release forms, the response to the operation of listed companies may have different tendencies. This paper defines the difference in tendencies as the “information heterogeneity” between different channel data, and in order to improve the accuracy of enterprise financial crisis early warning, it is necessary to study its impact on an early warning of financial crisis of listed companies. It expands the economic value of information heterogeneity, improves the accuracy of financial crisis prediction, further supports investors for making investment decisions, and helps maintain financial system and social stability. It provides new ideas for the early warning of corporate financial crisis from a new perspective, which is of certain significance for enriching the research in this field in China.
Taking the characteristics and differences of financial crisis early warning data of listed companies from different sources as the main starting point, this paper sorts out the root causes of information heterogeneity, and deeply analyzes the relationship between information heterogeneity and enterprise crisis state. Based on this, the information heterogeneity index is defined, the measurement method of information heterogeneity is proposed, and the impact of information heterogeneity on the early warning of financial crisis of listed companies is verified. Firstly, the paper analyzes different types of data content and characteristics. It focuses on the causes and means of falsification of financial data, artificial manipulation of news texts and excessive embellishment of annual report texts, and explains the root causes of information heterogeneity. Secondly, a method to measure information heterogeneity is proposed. The performance of quantitative financial statements is scored by the power coefficient method, the sentiment value of annual reports and news texts is calculated by combining dictionary method and deep learning, and the calculation formula of information heterogeneity index is proposed to measure the information difference contained in the three data. Finally, based on the tree relationship between information heterogeneity and enterprise financial crisis status, XGBoost is selected to establish a financial crisis prediction model of listed companies, and the prediction effect of the total sample model before and after the information heterogeneity index is added and compared, and the contribution of information heterogeneity to improving the accuracy of model early warning is verified. Furthermore, the total samples are divided into four types of specific sub-samples based on the performance of financial statements and the level of information heterogeneity, the relationship between information heterogeneity and financial crisis status of specific subsamples is analyzed, the model prediction effect of information heterogeneity indicators before and after the addition of subsamples is compared, the information heterogeneity indicators that whether they could effectively distinguish crisis companies and healthy ones are verified, and the relationship between information heterogeneity indicators and financial crises is further analyzed from the perspective of feature importance.
The empirical results show that the accuracy, Recall, AUC, F1 and other indicators of the model built based on the total samples and four types of subsamples are significantly improved by using the information heterogeneity index for financial crisis prediction in the machine learning model. And information heterogeneity is the first important indicator of XGBoost model. It can be seen that information heterogeneity is a major feature to judge whether an enterprise has encountered a financial crisis, and plays an important role in distinguishing crisis companies from healthy ones.
This paper shifts the research perspective from the information contained in the data of each channel to the information heterogeneity reflected by the data from different sources, and finds that it has good value in the prediction of corporate financial crisis. In future work, it can also be applied to other research fields, such as credit risk assessment and financial fraud identification. At the same time, more other data, such as investor comments in social media, can be introduced to better optimize the financial crisis early warning model of listed companies and improve the prediction effect of the model by analyzing the heterogeneity of information.

Key words: information heterogeneity, financial crisis, text analytics, machine learning

中图分类号: