运筹与管理 ›› 2025, Vol. 34 ›› Issue (11): 81-87.DOI: 10.12005/orms.2025.0346

• 理论分析与方法探讨 • 上一篇    下一篇

基于在线评分与文本评论的电影排序方法研究

刘瑞, 徐海燕   

  1. 南京航空航天大学 经济与管理学院,江苏 南京 211106
  • 收稿日期:2024-05-30 出版日期:2025-11-25 发布日期:2026-03-30
  • 通讯作者: 徐海燕(1963-),女,江苏扬州人,教授,博士生导师,研究方向:决策理论与冲突分析。Email: xuhaiyan@nuaa.edu.cn。
  • 作者简介:刘瑞(1997-),女,安徽铜陵人,博士研究生,研究方向:智能决策。
  • 基金资助:
    国家自然科学基金资助项目(71971115);智能决策与数字化运营工业和信息化部重点实验室项目(NJ2023027);国家社会科学基金一般项目(23BXW117)

Research on Film Ranking Method Based on Online Rating and Text Reviews

LIU Rui, XU Haiyan   

  1. College of Economics and Management, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China
  • Received:2024-05-30 Online:2025-11-25 Published:2026-03-30

摘要: 通过对媒体网站在线评论进行文本挖掘和分析,可以了解观影者偏好,而现有研究只单独考虑了在线评分或文本评论信息,且在各属性信息融合时依赖于决策者提供的主观或客观权重。为此,本文综合在线评分与文本评论信息,提出基于SMAA-TFN方法的电影排序模型。首先运用Python爬虫技术获取媒体网站的影片在线评分和文本评论信息并进行预处理;其次根据LDA主题模型生成的潜在主题、高频主题词以及决策者经验,确定评价指标体系;构建基于情感词典的情感分析模型,计算文本评论消极、中性和积极情感倾向百分比,并构造三角模糊数对情感分析结果进行表征;通过集成基于文本信息的排序函数和平均在线评分结果得到综合排序函数;利用SMAA-TFN方法构建了一个新的电影评估框架,实现电影排序。最后通过豆瓣网站的真实影评数据,展示该方法的实施过程。通过对比分析,表明本文方法可以为决策者提供更多决策信息,验证了模型的可行性和优越性。

关键词: 文本评论, 在线评分, 情感分析, 三角模糊数, 随机多属性可接受度分析

Abstract: As the representative of human wisdom and creativity, film is the product of modern technology and cultural development, which enriches and improves the quality of people’s leisure life. As a rising star in the arts, film has become an indispensable spiritual nourishment in world culture. With the deep integration of the internet and the film industry, a vast amount of diverse online review data has emerged. How to scientifically and accurately mine viewer preferences and market demand information has become a current hot topic of research. This paper proposes a movie ranking method based on big data environment. The evaluation results can help both viewers choose quality movies and producers understand viewer preferences and market trends so that they can make more targeted marketing decisions. In addition, this paper also make contribution to the evaluation of high-quality films, which helps to enhance the influence and competitiveness of the Chinese film internationally and strengthen the national cultural soft power.
   Firstly, this paper establishes a film comprehensive evaluation index system based on text review data. Web crawling techniques are utilized to obtain online review data of films from media websites and preprocess them. We use the Latent Dirichlet Allocation (LDA) theme model to identify the underlying choice preference of the viewer, and the decision-maker can determine the evaluation index system according to the underlying theme and high frequency topic words generated by LDA topic model, and the experience of the decision-maker. Secondly, the paper combines sentiment analysis with triangular fuzzy number to accurately reflect the fuzziness and subjectivity of the text reviews, avoiding the loss of evaluation information. We construct a sentiment analysis model based on sentiment dictionary, calculate the percentages of negative, neutral and positive sentiment tendencies in text reviews and construct triangular fuzzy number to represent the sentiment analysis results, which make the conversion of text data closer to actual values. Finally, we combine text reviews with online rating information to make up for the lack of considering only one of them and construct a film decision model based on obtaining enough information. The comprehensive ranking function is given by integrating the ranking function based on text information and the average online rating results. A new film evaluation framework is constructed by using the Stochastic Multicriteria Acceptability Analysis - Triangular Fuzzy Number (SMAA-TFN) method to calculate the overall acceptability index of each film and achieve the ranking of different films. In addition, the proposed method can provide the decision-maker with the assistant decision information by using center weight vectors to distinguish movie advantages and disadvantages, identifying competitors through analyzing the dominant relationship between the films. The comprehensive method extends the evaluation dimensions and improves the quality of decision-making.
   In this paper, the real online rating and text reviews of five different types films from the Douban website are selected as the experimental data to illustrate the implementation process of the method. Through a comparative analysis, it is found that the results of the proposed method are more accurate than other models, verifying the effectiveness and superiority of the proposed method.
   In theory, constructing a film ranking model that fully combines online rating with text reviews reduces information bias. The use of triangular fuzzy number to represent sentiment analysis results overcomes the ambiguity and uncertainty characteristics of the text. In addition, the SMAA-TFN evaluation framework proposed in this paper does not require the pre-assignment of subjective weight information or the calculation of objective weight information, overcoming the limitations of existing weight determination methods. In practice, this study leverages a new data source (big data) and integrates it into traditional multi-attribute decision-making methods to evaluate different types of films, providing effective management insights for audiences, producers, etc.

Key words: text reviews, online rating, sentiment analysis, triangular fuzzy number, stochastic multi-criteria acceptability analysis

中图分类号: