A Learning-based Simulated Annealing Algorithm for Vehicle Routing-Loading Problem in Reverse Logistics

doi:10.12005/orms.2025.0382

Abstract

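Abstract: In reverse logistics, planning vehicle routes to visit customers while maximizing the profit from recovered items is a core optimization problem. Against this background, this paper studies the vehicle routing-loading problem in reverse logistics. The problem requires planning a vehicle's route within a given time limit so that it visits multiple sites and collects items, maximizing the total value of the items collected in the vehicle. This paper proposes a learning-based simulated annealing algorithm to solve this NP-hard problem. The algorithm combines a learning mechanism with a random greedy strategy to generate a high-quality initial solution. Simulated annealing then updates the initial solution to obtain an optimized solution. Finally, the probability learning matrix is updated by comparing the initial and optimized solutions, and this matrix in turn guides the generation of high-quality initial solutions. Experimental results show that the proposed algorithm outperforms existing algorithms in the literature, providing a new and effective method for the vehicle routing-loading problem in reverse logistics.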

Key words: reverse logistics, routing-loading, simulated annealing, knapsack problem

Abstract: Reverse logistics refers to the process of collecting products from consumers and returning them to retailers or manufacturers. In this process, each returned product has its own specific value and weight. Since truck drivers have limited working hours, the challenge lies in how to efficiently plan the vehicle route within the specified time and simultaneously optimize the loading strategy to maximize profits. This has become a critical issue that logistics companies need to address. Therefore, the Vehicle Routing-Loading Problem (VRLP) is an important optimization problem in reverse logistics.
In VRLP, multiple customer sites are involved, each holding several items of known profit and weight. The goal is to plan the vehicle route within a specified time limit, visiting multiple sites to collect items, so as to maximize the total value of the collected items without exceeding the vehicle’s loading capacity. VRLP is NP-hard. Its complexity arises from several interrelated factors: the vehicle travel time constraint, the loading capacity limit, and the value and weight of the goods. Because these factors are intertwined, traditional exact methods often cannot find an optimal solution within a reasonable time, whereas heuristic algorithms can provide satisfactory feasible solutions much faster, making them well suited to such problems. VRLP originates from real-world applications in reverse logistics and models many practical issues in logistics operations. Developing efficient algorithms for VRLP can therefore improve the operational efficiency of logistics companies and provide theoretical support and practical reference for the academic community.
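The time and capacity constraints described above can be made concrete with a small feasibility check. The following is a minimal sketch and not the paper's formulation: it assumes the route starts and ends at a depot (site 0), that travel times are given as a fixed matrix, and that each item is a `(site, profit, weight)` tuple. The function name `evaluate` and all of its parameters are hypothetical.

```python
def evaluate(route, picked, items, travel_time, time_limit, capacity, depot=0):
    """Return the total profit of a candidate solution, or None if infeasible.

    route  : ordered list of visited site indices (depot excluded)
    picked : set of indices into `items` that are collected
    items  : list of (site, profit, weight) tuples
    """
    # An item can only be collected if its site is on the route.
    visited = set(route)
    if any(items[i][0] not in visited for i in picked):
        return None

    # Capacity constraint: total collected weight must fit in the vehicle.
    if sum(items[i][2] for i in picked) > capacity:
        return None

    # Time constraint (assumption): depot -> route -> depot within the limit.
    path = [depot, *route, depot]
    total_time = sum(travel_time[a][b] for a, b in zip(path, path[1:]))
    if total_time > time_limit:
        return None

    # Objective: total value of the collected items.
    return sum(items[i][1] for i in picked)
```

A candidate is rejected as soon as any constraint fails, which illustrates why routing and loading decisions cannot be optimized separately: adding a site may unlock valuable items but consume the time budget.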
To address the NP-hard nature of VRLP, this paper proposes an efficient learning-based simulated annealing algorithm. The algorithm has three main components: a learning-based random greedy initialization method, a simulated annealing optimization procedure, and a learning probability update mechanism. It first initializes a probability learning matrix and then executes a series of iterations. In each iteration, it generates a high-quality initial solution using the learning-based random greedy method, then improves that solution with the simulated annealing procedure to obtain an optimized solution. Finally, it updates the probability learning matrix by comparing the initial and optimized solutions, and the matrix in turn guides the construction of subsequent initial solutions. Experimental results show that the proposed algorithm solves VRLP efficiently. In particular, it outperforms the comparison algorithms from the literature in solution quality on large and extremely large test instances, offering a new approach to the vehicle routing-loading problem in reverse logistics.
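The three-component loop described above can be sketched as follows. This is an illustrative sketch under stated assumptions, not the authors' implementation: the abstract does not specify the matrix semantics, neighborhood moves, loading rule, or update rule, so the choices here (a site-to-site preference matrix `P`, swap/drop/insert moves, a greedy profit-to-weight loading step, and a reward/penalty update with rate `rate`) are all hypothetical, as are the function names and default parameters.

```python
import math
import random

# Assumptions: items are (site, profit, weight) tuples with positive weight,
# site 0 is the depot, and t[a][b] is a fixed travel time between sites.

def route_time(route, t):
    path = [0, *route, 0]
    return sum(t[a][b] for a, b in zip(path, path[1:]))

def load_profit(route, items, capacity):
    """Loading step: greedy knapsack over items at visited sites."""
    visited = set(route)
    pool = sorted((it for it in items if it[0] in visited),
                  key=lambda it: it[1] / it[2], reverse=True)
    profit, room = 0.0, capacity
    for _, p, w in pool:
        if w <= room:
            profit, room = profit + p, room - w
    return profit

def greedy_init(P, items, t, t_max, rng):
    """Random greedy construction biased by the learning matrix P."""
    n = len(t)
    site_profit = [0.0] * n
    for s, p, _ in items:
        site_profit[s] += p
    route, cur = [], 0
    while True:
        cand = [j for j in range(1, n)
                if j not in route and route_time(route + [j], t) <= t_max]
        if not cand:
            return route
        # Learned preference times a greedy profit-per-travel-time score.
        weights = [P[cur][j] * (site_profit[j] + 1e-9) / (t[cur][j] + 1e-9)
                   for j in cand]
        cur = rng.choices(cand, weights=weights)[0]
        route.append(cur)

def neighbor(route, n, rng):
    """Swap two sites, drop a site, or insert an unvisited site."""
    r = route[:]
    move = rng.choice(("swap", "drop", "insert"))
    unvisited = [j for j in range(1, n) if j not in r]
    if move == "swap" and len(r) >= 2:
        i, k = rng.sample(range(len(r)), 2)
        r[i], r[k] = r[k], r[i]
    elif move == "drop" and r:
        r.pop(rng.randrange(len(r)))
    elif unvisited:
        r.insert(rng.randrange(len(r) + 1), rng.choice(unvisited))
    return r

def anneal(route, items, t, t_max, capacity, rng,
           temp=10.0, cooling=0.95, steps=300):
    cur, cur_val = route, load_profit(route, items, capacity)
    best, best_val = cur, cur_val
    for _ in range(steps):
        cand = neighbor(cur, len(t), rng)
        if route_time(cand, t) <= t_max:      # skip time-infeasible moves
            val = load_profit(cand, items, capacity)
            # Metropolis rule: accept improvements, sometimes accept worse.
            if val >= cur_val or rng.random() < math.exp((val - cur_val) / temp):
                cur, cur_val = cand, val
            if cur_val > best_val:
                best, best_val = cur, cur_val
        temp *= cooling
    return best, best_val

def update_matrix(P, init_route, opt_route, rate=0.2):
    """Reward arcs kept by SA, penalise arcs it discarded, renormalise rows."""
    arcs = lambda r: set(zip([0, *r], r))
    kept = arcs(opt_route)
    for a, b in kept:
        P[a][b] += rate * (1.0 - P[a][b])
    for a, b in arcs(init_route) - kept:
        P[a][b] *= 1.0 - rate
    for row in P:
        total = sum(row)
        row[:] = [x / total for x in row]

def learning_sa(items, t, t_max, capacity, iterations=30, seed=0):
    rng = random.Random(seed)
    n = len(t)
    P = [[1.0 / n] * n for _ in range(n)]     # uniform initial preferences
    best, best_val = [], 0.0
    for _ in range(iterations):
        init = greedy_init(P, items, t, t_max, rng)
        opt, opt_val = anneal(init, items, t, t_max, capacity, rng)
        update_matrix(P, init, opt)
        if opt_val > best_val:
            best, best_val = opt, opt_val
    return best, best_val
```

Calling `learning_sa(items, t, t_max, capacity)` returns a route and its profit. The point of the sketch is the feedback loop: arcs that survive the annealing phase become more likely to be chosen in later constructions, so the initial solutions improve over the iterations.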
Future research can focus on several aspects. First, given the NP-hard nature of VRLP, developing efficient exact algorithms remains an important direction, aiming to provide optimal solutions for small- and medium-scale instances. Second, future studies should consider more real-world factors, such as customer time windows, customer priorities and satisfaction, and coordinated multi-vehicle scheduling, to enhance the practicality and adaptability of the model. Additionally, integrating machine learning techniques, especially deep reinforcement learning, into methods for solving VRLP is a promising direction.

Key words: reverse logistics, routing-loading, simulated annealing, knapsack problem

Chinese Library Classification (CLC) number:

C931

ZHENG Yonghong (郑永洪), WU Peng (吴鹏), LU Yongliang (陆永亮). A Learning-based Simulated Annealing Algorithm for Vehicle Routing-Loading Problem in Reverse Logistics[J]. Operations Research and Management Science (运筹与管理), 2025, 34(12): 107-114.
