Research on Chinese Fake Product Review Detection Considering Time Burst Characteristics

doi:10.12005/orms.2025.0064

Abstract

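Abstract: To improve the accuracy of fake review detection, this paper builds a detection model that jointly considers multidimensional features: review text, reviewer behavior, and the time burst characteristics of reviews. A suspicion degree feature is evaluated from a three-dimensional time series of review count, average score, and KL divergence, and review text and reviewer behavior features are extracted. Models are built with commonly used deep learning and machine learning algorithms, and the random forest model, which performs best in the experiments, is selected as the classifier. The SMOTE method is used to address class imbalance in the dataset and is combined with the random forest algorithm to form the SRF model. In experiments on review data for Huawei mobile phones, the proposed SRF model achieves superior performance, with a recall of 0.9693 and an F1 score of 0.9705. In addition, the SRF model is applied to classify and statistically analyze newly collected review data, and the results show that it is robust.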

Key words: fake reviews, time burst characteristics, text mining, machine learning, random forest

Abstract: In the digital economy, online reviews influence consumers' purchasing decisions and, in turn, play a critical role in an organization's revenue. This gives some businesses an incentive to post fake reviews by illegitimate means. Genuine customer reviews of products or services, however, contain a great deal of useful information that helps enterprises improve their offerings and achieve a better reputation and profitability. Consequently, extensive research has been conducted in recent years on identifying fake reviews. Most existing studies recognize fake reviews from the characteristics of review text and reviewers' behavior, with a few also considering temporal burst features. To improve the accuracy of fake review detection, this paper develops a comprehensive fake review recognition model that incorporates review text, reviewers' behavior, and time burst characteristics. This approach addresses the challenges posed by time bursts and class imbalance in online reviews.
Online user reviews can be collected from e-commerce websites such as JD.COM using a web crawler. This paper crawls 9,141 reviews of the Huawei MateX3, Nova11, and P60 mobile phones. The data are cleaned by removing system-generated default positive reviews, duplicate comments, and invalid comments, leaving 8,075 valid reviews (referred to as Dataset 1). The reviews are labeled manually, considering the authenticity of the review object, the rationality of the reviewer's behavior, overall linguistic coherence, and the consistency between image and text descriptions. Fake reviews are labeled 1 and genuine reviews 0. A sliding time window is used to group the reviews, and the Local Outlier Factor (LOF) outlier detection algorithm is applied to a three-dimensional time series to determine the suspicion degree of the reviews. The three dimensions are the mean review score, the number of reviews, and the Kullback-Leibler (KL) divergence. Combining the suspicion degree feature with the text features of the review and the behavior features of the reviewer yields a comprehensive feature set. Seven experiments are run on Dataset 1, building models with a Convolutional Neural Network, Recurrent Neural Network, Bi-directional Long Short-Term Memory, Multilayer Perceptron, Random Forest, Support Vector Classification, and AdaBoost. Random Forest, which gives the best classification performance, is selected. To address the imbalance in the training samples, an eighth experimental group combines the SMOTE oversampling method with the best-performing classifier from the control groups.
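The time-window and LOF steps described above can be sketched as follows. This is a minimal illustration under several assumptions that the abstract does not confirm: ratings are on a five-star scale, windows are non-overlapping buckets of `window_days` days rather than a true sliding window, the KL divergence compares each window's smoothed rating distribution with the overall one, and LOF is a simplified brute-force version that takes exactly `k` neighbours and works on unscaled features. All function names are hypothetical.

```python
import math
from collections import Counter, defaultdict

STARS = (1, 2, 3, 4, 5)  # assumed five-star rating scale


def rating_dist(ratings, alpha=1.0):
    """Additively smoothed distribution over STARS (alpha keeps every entry > 0)."""
    counts = Counter(ratings)
    total = len(ratings) + alpha * len(STARS)
    return [(counts[s] + alpha) / total for s in STARS]


def kl_divergence(p, q):
    """KL(p || q) for discrete distributions with strictly positive entries."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q))


def window_features(reviews, window_days=1):
    """Map each window index to (review count, mean score, KL vs. overall).

    reviews is an iterable of (day_index, score) pairs.
    """
    reviews = list(reviews)
    overall = rating_dist([score for _, score in reviews])
    buckets = defaultdict(list)
    for day, score in reviews:
        buckets[day // window_days].append(score)
    return {
        w: (len(s), sum(s) / len(s), kl_divergence(rating_dist(s), overall))
        for w, s in sorted(buckets.items())
    }


def lof_scores(points, k=2):
    """Brute-force Local Outlier Factor; values well above 1 suggest an outlier."""
    n = len(points)
    if n <= k:
        raise ValueError("need more points than neighbours")
    dist = [[math.dist(a, b) for b in points] for a in points]
    # Indices of the k nearest neighbours of each point, excluding itself.
    knn = [
        sorted((j for j in range(n) if j != i), key=lambda j: dist[i][j])[:k]
        for i in range(n)
    ]
    k_dist = [dist[i][knn[i][-1]] for i in range(n)]  # distance to k-th neighbour

    def lrd(i):
        # Local reachability density: inverse of the mean reachability distance.
        reach = [max(k_dist[j], dist[i][j]) for j in knn[i]]
        return len(reach) / max(sum(reach), 1e-12)  # guard against duplicates

    lrds = [lrd(i) for i in range(n)]
    return [sum(lrds[j] for j in knn[i]) / (k * lrds[i]) for i in range(n)]
```

Calling `lof_scores(list(window_features(reviews).values()))` gives one score per window, and a window with an unusual burst of reviews should receive a score well above 1. In practice the three dimensions would probably need standardizing first, since the raw review count can otherwise dominate the distance.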
To analyze the influence of each feature category on the final recognition performance, this paper conducts ablation experiments that combine different categories. A sensitivity analysis explores the impact of different time window sizes on the identification of fake reviews. Additionally, a dataset of 5,314 comments on Huawei Nova11 mobile phones is collected. After screening, 5,030 valid comments (referred to as Dataset 2) remain, and the proposed approach is applied to them. To verify the robustness of the model, the statistical features of genuine and fake reviews in Dataset 2 are compared with those in Dataset 1.
The model comparison shows that the SRF model, which combines the SMOTE method with the random forest algorithm, outperforms the other models, with a recall of 0.9693 and an F1 score of 0.9705. The ablation experiments indicate that reviewer behavior features are the most effective category for identifying fake reviews, and that adding the suspicion degree feature further improves recognition performance. Combining all three categories achieves the best classification performance. The sensitivity analysis shows that the performance of the fake review recognition model deteriorates as the time window grows, so the model performs best when the time window is set to one day. The robustness analysis confirms the applicability and stability of the model across different datasets.
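The oversampling idea behind the SRF model can be illustrated with a minimal sketch of SMOTE-style interpolation. It assumes numeric feature vectors and a brute-force nearest-neighbour search; the function name `smote` and its parameters are hypothetical, and the abstract does not state which implementation or settings the authors used. The balanced training set would then be passed to a random forest classifier, which is not shown here.

```python
import math
import random


def smote(minority, n_new, k=5, seed=0):
    """Create n_new synthetic minority samples.

    Each synthetic point lies on the segment between a randomly chosen
    minority sample and one of its k nearest minority neighbours.
    """
    if len(minority) < 2:
        raise ValueError("need at least two minority samples")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)
    synthetic = []
    for _ in range(n_new):
        i = rng.randrange(len(minority))
        base = minority[i]
        # k nearest minority neighbours of the base sample, excluding itself.
        neighbours = sorted(
            (j for j in range(len(minority)) if j != i),
            key=lambda j: math.dist(base, minority[j]),
        )[:k]
        other = minority[rng.choice(neighbours)]
        gap = rng.random()  # position along the segment from base to other
        synthetic.append(tuple(b + gap * (o - b) for b, o in zip(base, other)))
    return synthetic
```

Because every synthetic sample is an interpolation of existing minority samples, the new points stay inside the region the minority class already occupies, which is also why heavy oversampling can produce near-redundant data.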
The theoretical contribution of this paper is a comprehensive framework for detecting fake reviews, which extends previous research. The practical implication is that enterprises and platforms can use the proposed approach to remove fake reviews effectively, thereby enhancing consumers' trust, improving company reputation, and maintaining order in the e-commerce market.
This paper considers the multidimensional features and class imbalance commonly observed in online reviews. It provides insights that can help e-commerce platforms filter fake reviews effectively and offer consumers more reliable review data. However, the SMOTE method may introduce data redundancy and affect classification accuracy, so future research should explore alternative methods for handling data imbalance and improving model accuracy. Moreover, the proposed fake review recognition method is verified only on mobile phone reviews, and research in other domains is needed to validate its applicability. Future work should also enrich the multidimensional feature set of fake reviews to improve identification accuracy.

Key words: fake reviews, time burst characteristics, text mining, machine learning, random forest

CLC number: TP391.1

DENG Yujia, WANG Peng, FANG Xinghua, QIN Fang. Research on Chinese Fake Product Review Detection Considering Time Burst Characteristics[J]. Operations Research and Management Science, 2025, 34(2): 210-217.
