运筹与管理 ›› 2025, Vol. 34 ›› Issue (12): 107-114.DOI: 10.12005/orms.2025.0382

• 应用研究 • 上一篇    下一篇

一种基于学习的模拟退火算法求解逆向物流中车辆路径-装载问题

郑永洪, 吴鹏, 陆永亮   

  1. 福州大学 经济与管理学院,福建 福州 350108
  • 收稿日期:2024-04-25 出版日期:2025-12-25 发布日期:2026-04-29
  • 通讯作者: 陆永亮 (1983-),男,河南固始人,博士,副研究员,研究方向:运筹优化,车辆路径优化。Email: luyongliang@fzu.edu.cn。
  • 作者简介:郑永洪 (2001-),男,福建永春人,硕士研究生,研究方向:运筹优化。
  • 基金资助:
    国家自然科学基金资助项目(72371076)
       

A Learning-based Simulated Annealing Algorithm for Vehicle Routing-Loading Problem in Reverse Logistics

ZHENG Yonghong, WU Peng, LU Yongliang   

  1. School of Economics and Management, Fuzhou University, Fuzhou 350108, China
  • Received:2024-04-25 Online:2025-12-25 Published:2026-04-29

摘要: 在逆向物流中,如何合理规划车辆路径以访问客户并最大化回收物品收益是一个核心的优化问题。在这一背景下,本文研究了逆向物流中车辆路径-装载问题。该问题要求在给定的时间内,合理规划车辆的行驶路线,以访问多个站点并收集物品,从而最大化车辆中收集物品的总价值。本文提出一种基于学习的模拟退火算法来求解这一NP难问题。该算法结合学习机制和随机贪心策略,以生成高质量的初始解。随后,使用模拟退火算法对初始解更新得到优化解。最后,通过比较初始解和优化解,更新概率学习矩阵,而这个矩阵则用于指导高质量初始解的生成。实验结果表明,本文提出的算法优于现有文献中的算法,为解决逆向物流中车辆路径-装载问题提供了一种新的有效方法。

关键词: 逆向物流, 路径-装载, 模拟退火, 背包问题

Abstract: Reverse logistics refers to the process of collecting products from consumers and returning them to retailers or manufacturers. In this process, each returned product has its own specific value and weight. Since truck drivers have limited working hours, the challenge lies in how to efficiently plan the vehicle route within the specified time and simultaneously optimize the loading strategy to maximize profits. This has become a critical issue that logistics companies need to address. Therefore, the Vehicle Routing-Loading Problem (VRLP) is an important optimization problem in reverse logistics.
In VRLP, multiple customer sites are involved, with each site containing several items of known profit and weight. The optimization goal of the problem is to plan vehicle route efficiently within a specified time, visit multiple sites to collect items, and at the same time, maximize the total value of collected items without exceeding the vehicle’s loading capacity. VRLP is a complex NP-hard problem. Its complexity primarily arises from the need to consider multiple interrelated factors, such as vehicle travel time constraints, loading capacity limitations, and the value and weight of the goods. These factors are intertwined, making the problem extremely challenging to solve. Traditional exact solution methods often cannot find the optimal solution within a reasonable time frame. In contrast, heuristic algorithms can provide satisfactory feasible solutions in a shorter amount of time, making them well-suited for solving such problems. VRLP originates from real-world applications in reverse logistics and can solve many practical issues in logistics operations. Therefore, developing and researching efficient algorithms to solve VRLP can not only improve the operational efficiency of logistics companies but also provide important theoretical support and practical references for the academic community.
To address the NP-hard nature of VRLP, this paper proposes an efficient learning-based simulated annealing algorithm. The algorithm consists of three important components: a learning-based random greedy initialization method, a simulated annealing optimization procedure and a learning probability update mechanism. The algorithm first initializes a probability learning matrix and then executes a series of iterations. In each iteration, the algorithm first generates a high-quality initial solution using a learning-based random greedy method, and then updates the initial solution using the simulated annealing procedure to obtain an optimized solution. Finally, the algorithm dynamically updates the probability learning matrix by comparing the initial and optimized solutions, and the probability learning matrix, in turn, guides the creation of high-quality initial solutions. Experimental results show that the proposed algorithm can efficiently solve VRLP. Specifically, the algorithm outperforms comparison algorithms in the literature in terms of solution quality in large and extremely large test cases, offering a new approach to solving the vehicle routing-loading problem in reverse logistics.
Future research can focus on several aspects. First, given the NP-hard nature of VRLP, developing efficient exact algorithms remains an important research direction, aiming to provide optimal solutions for medium- and small-scale problems. Second, future studies should consider more real-world factors, such as customer time windows, customer priorities and satisfaction and multi-vehicle coordinated scheduling, to enhance the practicality and adaptability of the problem. Additionally, integrating machine learning techniques, especially deep reinforcement learning, to solve VRLP is also an exciting direction for future research.

Key words: reverse logistics, routing-loading, simulated annealing, knapsack problem

中图分类号: