运筹与管理 ›› 2024, Vol. 33 ›› Issue (1): 145-150.DOI: 10.12005/orms.2024.0022

• 应用研究 • 上一篇    下一篇

基于深度学习的电商商品购买意图识别模型

郭小宇, 马静   

  1. 南京航空航天大学 经济与管理学院,江苏 南京 211106
  • 收稿日期:2021-11-23 出版日期:2024-01-25 发布日期:2024-03-25
  • 通讯作者: 马静(1966-),女,重庆人,博士,教授,研究方向:文本表示,多模态舆情表示,复杂网络。
  • 作者简介:郭小宇(1995-),女,陕西渭南人,博士研究生,研究方向:文本表示,多模态舆情表示。
  • 基金资助:
    国家自然科学基金资助项目(72174086);中央高校基本科研业务费专项资金项目(NW2020001)

Purchasing Intention Identification Model Based on Deep Learning in E-commerce

GUO Xiaoyu, MA Jing   

  1. College of Economics and Management,Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China
  • Received:2021-11-23 Online:2024-01-25 Published:2024-03-25

摘要: 识别用户的购买意图是提升电子商务购买率(PR)的重要方法之一。针对用户购买意图不明确的现象,提出一种新模型。该模型将训练后的Word2Vec(WV)词向量馈入卷积神经网络(CNN),通过深层语义模型(DSSM)进一步提取文本特征。在Keras框架下结合美国建材电商网站家得宝的真实搜索数据进行实证分析。结果表明,在五分类问题中,新模型在测试数据集上的F1-score达80.6%。新模型使用了Word2Vec与CNN提取文本特征,并应用DSSM模型进一步提取了用户检索与商品描述文档在高维空间中的特征表示,最大化利用了用户检索与正确商品描述之间的语义相似度,同时避免了特征提取时主观因素的干扰,提高了商品购买意图的识别效果。

关键词: 购买意图识别, 卷积神经网络, 深层语义模型, 深度学习

Abstract: With the rapid proliferation and intelligent development of e-commerce platforms, accurate identification of user purchase intention has become a crucial influencing factor in driving users from intent toactual purchases. Therefore, identifying user purchase intention is one of the significant methods to enhance the Purchase Rate (PR) in the realm of e-commerce. Purchase intention identification aims to infer the intended purchase of potential customers or users by analyzing the similarity between user query text and product description text, ultimately increasing the PR. Due to the diversity and colloquial nature of user search queries, identifying user purchase intention becomes increasingly challenging, and even more so in vertical e-commerce where users may not even be aware of the names of the products they need.
In response to the phenomenon of unclear user purchase intention, this paper proposes a novel model aimed at identifying user purchase intention from user queries with unclear purchase intention. This model first employs the Word2Vec (WV) algorithm’s Continuous Bag-of-Words (CBOW) model to train word vectors. Subsequently, these word vectors are fed into a one-dimensional Convolutional Neural Network (CNN), followed by further feature extraction using the Deep Semantic Similarity Model (DSSM). This process calculates semantic similarity using cosine similarity, subsequently transforming semantic similarity into a posterior probability form to construct a loss function. During model training, it narrows the textual representations in a high-dimensional space between user queries and intended products while expanding the representations between user queries and non-intended products.
An empirical analysis is conducted using real search data from the U.S. building materials e-commerce website Home Depot, within the Keras framework. The results indicate that our proposed model achieves an F1-score of 80.6% on the test dataset in a five-class classification problem. To test the performance of the model proposed in this paper in more complex purchase scenarios, six, seven, and eight-class classification tasks are designed. The results also indicate that as the number of categories increases, the values of various evaluation metrics decrease. However, the F1-scores for all three classification tasks remain above 70%, demonstrating competitive performance in multi-class tasks.
Through the empirical research, this paper draws the following conclusions: (1)The proposed model leverages Word2Vec and CNN for text feature extraction and employs the DSSM model to further extract feature representations of user queries and product descriptions in a high-dimensional space. This maximizes the utilization of semantic similarity between user queries and the correct product descriptions while avoiding subjective interference during feature extraction, ultimately enhancing the identification of purchase intention for products. (2)Deep learning models are often too large to be practical in real-world scenarios. In contrast to typical deep learning models, the model proposed in this paper converges at a faster rate. (3)The model’s F1-score is significantly higher than the baseline model, and as the number of categories increases, the model’s evaluation scores still maintain a high level. (4)Real training data often exhibit class imbalance issues. The model proposed in this paper constructs negative examples based on positive data to balance the data quantity across different categories, enabling the model to consider all categories during the training process. The method proposed in this paper can only identify users’ intended products within a small number of product descriptions. How to identify users’ intended products within a massive volume of product descriptions is a further research direction.

Key words: purchase intention identification, convolutional neural networks(CNN), deep structured semantic model(DSSM), deep learning

中图分类号: