运筹与管理 ›› 2025, Vol. 34 ›› Issue (11): 74-80.DOI: 10.12005/orms.2025.0345

• 理论分析与方法探讨 • 上一篇    下一篇

基于类案生成的法条知识推荐算法研究

司林胜1, 闫妍霏1, 崔春生1, 刘俊2   

  1. 1.河南财经政法大学 数据科学与电子商务学院,河南 郑州 450046;
    2.郑州财经学院 智能会计学院,河南 郑州 450044
  • 收稿日期:2025-02-04 出版日期:2025-11-25 发布日期:2026-03-30
  • 通讯作者: 崔春生(1974-),男,河南南阳人,博士,教授,研究方向:运筹优化,推荐系统,决策理论与方法等。Email:traition@126.com。
  • 作者简介:司林胜(1967-),男,内蒙古丰镇人,博士,教授,研究方向:系统工程,系统决策。
  • 基金资助:
    教育部人文社会科学研究规划基金项目(23YJA860004,24YJA860023);河南省高等学校哲学社会科学基础研究重大项目(2024-JCZD-27);河南省科技研发计划联合基金(产业类)项目(225101610054)

Research on Legal Provisions Knowledge RecommendationAlgorithm Based on Similar Case Generation

SI Linsheng1, YAN Yanfei1, CUI Chunsheng1, LIU Jun2   

  1. 1. School of Data Science and E-commerce, Henan University of Economics and Law, Zhengzhou 450046, China;
    2. School of Intelligent Accounting, Zhengzhou College of Finance and Economics, Zhengzhou 450044, China
  • Received:2025-02-04 Online:2025-11-25 Published:2026-03-30

摘要: 快速准确地找到相似的过往案例并对新案提供断案依据,对于提高办案效率,提升断案精准性有重要的意义。本文从类案文本的多元数据特征出发,提出一种基于类案生成的法条知识推荐算法。该算法使用KeyBERT抽取案件事实的关键词序列,得到法条知识库,并使用余弦相似度计算关键词与法条知识的语义表征向量,得到法条及相似度的多维向量。之后,从罪名、法条知识和刑期三个维度计算案例之间的综合相似度,并生成案例推荐结果。实验结果表明,本文提出的方法在DCG@p指标上优于传统的文本匹配方法,验证了其在实际应用中的可行性和有效性。本文的研究成果可以辅助司法人员进行案情分析和案情诊断,对司法人工智能及其应用有一定的参考价值。

关键词: 推荐算法, 类案推荐, KeyBERT算法, 法条知识

Abstract: With the surge in judicial cases, traditional text matching methods struggle to meet the precise retrieval demands of massive legal documents due to inefficiency, weak generalization capability and insufficient interpretability. The professionalism and complexity of legal texts require models to not only capture semantic information but also incorporate structured domain knowledge. This study proposes a case recommendation algorithm integrating legal knowledge and case characteristics through constructing a legal article knowledge base and multi-dimensional similarity computation framework, aiming to enhance recommendation accuracy and interpretability. This assists judicial professionals in efficiently locating similar cases, improves transparency and consistency of legal reasoning, and provides a technically valuable pathway for judicial intelligence with both theoretical significance and practical value. Thus, effectively integrating case facts into legal provisions to build an interpretable case recommendation model becomes crucial for enhancing judicial efficiency and decision consistency.
   This paper presents a multi-dimensional feature fusion-based precedent recommendation model with the following core framework: (1) Data acquisition and preprocessing: crawling 215 criminal judgment documents on intentional injury from China Judgments Online and extracting factual descriptions as raw data. Text segmentation using THULAC with dual filtering through general and legal-domain-specific stop word lists (e.g., “public security bureau”, “review”) optimizes text representation. (2) Keyword extraction and legal knowledge base construction: KeyBERT algorithm extracts top-10 case keywords, filtered through BERT’s semantic understanding. Transforming criminal law provisions into element-based structures (e.g., decomposing “fraud crime” into elements like “defrauding public/private property” and “large amount”), stored in Elasticsearch as structured knowledge. (3) Semantic matching and similarity computation: XS-BERT (legally optimized pre-trained model) generates semantic vectors for keywords and legal elements. A weighted similarity function integrates three dimensions: charge overlap (Jaccard index), legal knowledge similarity (vector inner product) and sentence difference (normalized distance). (4) Recommendation and validation: using DCG as core metric, comparative experiments with TF-IDF and Word2Vec baseline models demonstrate superior retrieval accuracy and interpretability.
   The experimental results show significant advantages in DCG@5, DCG@10, and DCG@20 metrics over traditional methods. By integrating legal knowledge bases into deep learning, this model effectively addresses semantic gaps and logical inconsistencies in conventional legal text processing. The algorithm not only improves recommendation precision but also enhances credibility through structured legal provision matching, offering an efficient and reliable solution for judicial AI systems. Future work will extend to multi-offense joint recommendation, courtroom debate perspective integration and cross-jurisdictional adaptability optimization. The case recommendations demonstrate high consistency in charges, legal provisions and sentencing patterns with real cases, validating practical feasibility. This approach assists judicial professionals in rapidly locating similar precedents while enhancing decision interpretability, providing a technically referential framework balancing efficiency and precision. Subsequent research could incorporate external knowledge like trial arguments to further optimize multi-dimensional recommendation mechanisms.

Key words: recommendation algorithm, similar cases recommendation, KeyBERT algorithm, legal provisions knowledge

中图分类号: