[1] BELLO I, PHAM H, LE Q V, et al. Neural combinatorial optimization with reinforcement learning[C]//5th International Conference on Learning Representations, April 24-26, 2017, Toulon, France. Amherst: Open Review.net, 2017: 1-14. [2] ZHANG Z, WU Z, ZHANG H, et al. Meta-Learning-based deep reinforcement learning for multi objective optimization problems[J]. IEEE Transactions on Neural Networks and Learning Systems, 2022, 32(2): 2334-2342. [3] LI K, ZHANG T, WANG R. Deep reinforcement learning for multi-objective optimization[J]. IEEE Transactions on Cybernetics,2020,51(6): 3103-3114. [4] 王万良,陈浩立,李国庆,等.基于深度强化学习的多配送中心车辆路径规划[J].控制与决策,2022,37(8):2101-2109. [5] ZHENG J, HE K, ZHOU J, et al. Combining reinforcement learning with Lin-Kernighan-Helsgaun algorithm for the traveling salesman problem[C]//35th AAAI Conference on Artificial Intelligence, February 2-9, 2021, Virtual Event, Vancouver, Canada. Palo Alto: AAAI Press, 2021: 12445-12452. [6] PENG B, WANG J H, ZHANG Z Z. A deep reinforcement learning algorithm using dynamic attention model for vehicle routing problems[C]//11th International Symposium on Intelligence Computation and Applications, November 16-17, 2019, Guangzhou Yanling Hotel, Guangzhou, China. Singapore: Springer, 2020: 636-650. [7] LIN B, GHADDAR B, NATHWANI J. Deep reinforcement learning for the electric vehicle routing problem with time windows[J]. IEEE Transactions on Intelligent Transportation Systems,2022,23(8): 11528-11538. [8] ZHANG Q, LI H. MOEA/D: A multi objective evolutionary algorithm based on decomposition[J]. IEEE Transactions on Evolutionary Computation, 2007,11(6): 712-731. [9] JAMES J Q, YU W, GU J T. Online vehicle routing with neural combinatorial optimization and deep reinforcement learning[J]. IEEE Transactions on Intelligent Transportation Systems, 2019, 20(10): 3806-3817. [10] KOOL W, VAN HOOF H, WELLING M. Attention, learn to solve routing problems![C]//7th International Conference on Learning Representations, May 6-9, 2019, Ernest N. Morial Convention Center, New Orleans, USA. Amherst: OpenReview.net, 2019: 1-16. [11] BECKER S, JENTZEN A, MULLER M S, et al. Learning the random variables in monte carlo simulations with stochastic gradient descent: Machine learning for parametric PDEs and financial derivative pricing[J]. Mathematical Finance, 2024, 34(1): 90-150. |