Purchasing Intention Identification Model Based on Deep Learning in E-commerce

Operations Research and Management Science, 2024, Vol. 33, Issue (1): 145-150. DOI: 10.12005/orms.2024.0022

GUO Xiaoyu, MA Jing

College of Economics and Management, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, Jiangsu, China

Received: 2021-11-23 Online: 2024-01-25 Published: 2024-03-25
Corresponding author: MA Jing (b. 1966), female, from Chongqing, Ph.D., professor. Research interests: text representation, multimodal public opinion representation, complex networks.
About the author: GUO Xiaoyu (b. 1995), female, from Weinan, Shaanxi, doctoral candidate. Research interests: text representation, multimodal public opinion representation.
Funding: National Natural Science Foundation of China (72174086); Fundamental Research Funds for the Central Universities (NW2020001)

Abstract

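Summary: Identifying users' purchase intention is one of the important ways to raise the purchase rate (PR) in e-commerce. To address the problem of unclear user purchase intention, a new model is proposed. The model feeds trained Word2Vec (WV) word vectors into a convolutional neural network (CNN) and further extracts text features through a deep structured semantic model (DSSM). An empirical analysis is carried out under the Keras framework on real search data from Home Depot, a U.S. building materials e-commerce website. The results show that, on a five-class problem, the new model reaches an F1-score of 80.6% on the test dataset. The new model uses Word2Vec and a CNN to extract text features and applies the DSSM to further extract feature representations of user queries and product description documents in a high-dimensional space. This makes maximal use of the semantic similarity between a user query and the correct product description while avoiding interference from subjective factors during feature extraction, which improves the identification of product purchase intention.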

Keywords: purchase intention identification, convolutional neural network, deep structured semantic model, deep learning

Abstract: With the rapid proliferation and intelligent development of e-commerce platforms, accurately identifying user purchase intention has become crucial to converting intent into actual purchases. Identifying purchase intention is therefore one of the main ways to raise the Purchase Rate (PR) in e-commerce. Purchase intention identification infers which product a potential customer intends to buy by analyzing the similarity between the user's query text and product description text. Because user search queries are diverse and colloquial, identifying purchase intention is challenging, and even more so in vertical e-commerce, where users may not know the names of the products they need.
To address unclear purchase intention, this paper proposes a novel model that identifies the intended product from ambiguous user queries. The model first trains word vectors with the Continuous Bag-of-Words (CBOW) model of the Word2Vec (WV) algorithm. These word vectors are then fed into a one-dimensional Convolutional Neural Network (CNN), and the Deep Structured Semantic Model (DSSM) extracts further features. Semantic similarity is computed as cosine similarity and then converted into a posterior probability, from which the loss function is constructed. During training, the model pulls the high-dimensional representations of a user query and its intended product closer together while pushing the representations of the query and non-intended products apart.
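The cosine-to-posterior step described above can be sketched as follows. This is a minimal NumPy illustration, not the paper's implementation: the function names, the smoothing factor `gamma` (borrowed from the original DSSM formulation), the vector size, and the random toy vectors are all assumptions. It only shows how a softmax over cosine similarities yields a loss that rewards a query for lying close to its intended product.

```python
import numpy as np

def cosine_similarity(q, docs):
    """Cosine similarity between one query vector and each row of docs."""
    q_unit = q / np.linalg.norm(q)
    d_unit = docs / np.linalg.norm(docs, axis=1, keepdims=True)
    return d_unit @ q_unit

def dssm_posterior(q, docs, gamma=10.0):
    """Softmax over scaled cosine similarities: P(doc_i | query)."""
    scores = gamma * cosine_similarity(q, docs)
    scores = scores - scores.max()  # subtract the max for numerical stability
    exp_scores = np.exp(scores)
    return exp_scores / exp_scores.sum()

def dssm_loss(q, docs, positive_index=0, gamma=10.0):
    """Negative log posterior of the intended product."""
    return -np.log(dssm_posterior(q, docs, gamma)[positive_index])

# Toy vectors standing in for the encoder outputs (assumed, not real data).
rng = np.random.default_rng(0)
query = rng.normal(size=8)
intended = query + 0.1 * rng.normal(size=8)   # close to the query
others = rng.normal(size=(4, 8))              # unrelated products
candidates = np.vstack([intended, others])    # row 0 is the intended product

posterior = dssm_posterior(query, candidates)
loss = dssm_loss(query, candidates)
```

Minimizing this loss raises the posterior of the intended product, which can only happen if its cosine similarity to the query grows relative to the non-intended products.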
An empirical analysis is conducted within the Keras framework on real search data from Home Depot, a U.S. building materials e-commerce website. The proposed model achieves an F1-score of 80.6% on the test dataset in a five-class classification problem. To test the model in more complex purchase scenarios, six-, seven-, and eight-class classification tasks are also designed. As the number of categories increases, the values of the evaluation metrics decrease, but the F1-scores for all three tasks remain above 70%, indicating competitive performance in multi-class tasks.
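For readers unfamiliar with the metric, a multi-class F1-score can be computed as in the sketch below. The abstract does not state which averaging scheme the authors used, so macro averaging here is an assumption, and the label lists are invented toy data.

```python
from collections import Counter

def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 = 2*TP / (2*TP + FP + FN)."""
    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1
        else:
            fp[pred] += 1   # predicted class gains a false positive
            fn[truth] += 1  # true class gains a false negative
    per_class = []
    for label in labels:
        denom = 2 * tp[label] + fp[label] + fn[label]
        per_class.append(2 * tp[label] / denom if denom else 0.0)
    return sum(per_class) / len(per_class)

# Toy three-class example (invented labels).
y_true = [0, 0, 1, 1, 2, 2]
y_pred = [0, 1, 1, 1, 2, 0]
score = macro_f1(y_true, y_pred)
```

In this toy case the per-class F1 values are 0.5, 0.8, and 2/3, so the macro average is about 0.656.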
Through the empirical research, this paper draws the following conclusions: (1) The proposed model leverages Word2Vec and CNN for text feature extraction and employs the DSSM model to further extract feature representations of user queries and product descriptions in a high-dimensional space. This maximizes the use of semantic similarity between user queries and the correct product descriptions while avoiding subjective interference during feature extraction, ultimately improving the identification of purchase intention for products. (2) Deep learning models are often too large to be practical in real-world scenarios. In contrast to typical deep learning models, the model proposed in this paper converges at a faster rate. (3) The model's F1-score is significantly higher than that of the baseline model, and the evaluation scores remain high as the number of categories increases. (4) Real training data often exhibit class imbalance. The proposed model constructs negative examples from positive data to balance the amount of data across categories, so that all categories are considered during training. A limitation is that the method can identify a user's intended product only among a small number of product descriptions. Identifying the intended product within a massive volume of product descriptions is a direction for further research.
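Conclusion (4) mentions building negative examples from positive data. One plausible way to do this is sketched below. It is an assumption, not the authors' documented procedure: the function name, the sampling strategy, and the toy query/product strings are hypothetical, and `num_negatives=4` is chosen only so that each group has five candidates.

```python
import random

def build_training_groups(positive_pairs, num_negatives=4, seed=0):
    """Pair each query with its intended product plus sampled non-intended ones."""
    rng = random.Random(seed)
    all_products = sorted({product for _, product in positive_pairs})
    groups = []
    for query, product in positive_pairs:
        # Negatives are drawn from products that are intended for other queries.
        pool = [p for p in all_products if p != product]
        negatives = rng.sample(pool, min(num_negatives, len(pool)))
        # Index 0 always holds the intended product; the rest are negatives.
        groups.append((query, [product] + negatives))
    return groups

# Invented toy pairs, not taken from the Home Depot dataset.
pairs = [
    ("thing to stop door slamming", "door stopper"),
    ("white paint for ceiling", "ceiling paint"),
    ("screws for drywall", "drywall screws"),
    ("light for porch", "porch lantern"),
    ("tape for pipes", "thread seal tape"),
    ("blade for cutting tile", "tile saw blade"),
]
groups = build_training_groups(pairs)
```

Because every query receives the same number of candidates, each candidate position is equally represented in training, which is one way to read the balancing effect described in the abstract.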

Key words: purchase intention identification, convolutional neural networks (CNN), deep structured semantic model (DSSM), deep learning

CLC number: TP391.1

Cite this article: GUO Xiaoyu, MA Jing. Purchasing Intention Identification Model Based on Deep Learning in E-commerce[J]. Operations Research and Management Science, 2024, 33(1): 145-150.
