[1] 戴伟.一种改进企业在框架协议下库存管理的方法[J].运筹与管理,2011,20(4):182-186. [2] 毛克宁.报童问题及其商业拓展的两类利润期望模型[J].数学的实践与认识,2021,51(4):87-92. [3] Zhang G, Shi J, et al. Multi-period multi-product acquisition planning with uncertain demands and supplier quantity discounts[J]. Transportation Research Part E: Logistics and Transportation Review, 2019, 132: 117-140. [4] Li B, Yang X, Zhang Y. Distribution-free solutions to the extended multi-period newsboy problem[J]. Journal of Industrial & Management Optimization, 2017, 13(2): 37-37. [5] Kartikeya Puranam David, et al. Managing blood inventory with multiple independent sources of supply[J]. European Journal of Operational Research, 2017, 259: 500-511. [6] Chen F Y, Krass D. Analysis of supply contracts with minimum total order quantity commitments and non-stationary demands[J]. European Journal of Operational Research, 2001, 131(2): 309-323. [7] Cai J, Hu X, et al. Optimal input quantity decisions considering commitment order contracts under yield uncertainty[J]. International Journal of Production Economics, 2019, 216: 398-412. [8] Wang T, Gong X, Zhou S X. Dynamic inventory management with total minimum order commitments and two supply option[J]. Operations Research, 2017, 65(5): 1285-1302. [9] 蒋国飞,吴沧浦.Q学习算法在库存控制中的应用[J].自动化学报,1999(2):96-101. [10] 郑江波,程福阳,杨柳.基于马氏决策过程的易逝品联合策略[J].计算机集成制造系统,2017,2(1):144-153. [11] Kara A, Dogan I. Reinforcement learning approaches for specifying ordering policies of perishable inventory systems[J]. Expert Systems with Applications, 2018, 91: 150-158. [12] Inderfurth K, Kelle P, Kleber R. Dual sourcing using capacity reservation and spot market: optimal procurement policy and heuristic parameter determination[J]. European Journal of Operational Research, 2013, 225(2): 298-309. [13] 杨华龙,叶迪,张倩,曾庆成.时间窗变动的车辆调度干扰管理模型与算法[J].运筹与管理,2017,(10):56-64. [14] 邰世文,商剑平.煤炭码头卸车调度问题多目标优化模型及算法[J].运筹与管理,2018,27(6):91-99. [15] 徐翔斌,李志鹏.强化学习在运筹学的应用:研究进展与展望[J].运筹与管理,2020,29(5):227-239. [16] Mortazavi A, Arshadi Khamseh A, Azimi P. Designing of an intelligent self-adaptive model for supply chain ordering management system[J]. Engineering Applications of Artificial Intelligence, 2015, 37: 207-220. [17] Paraschos P D, Koulinas G K, Koulouriotis D E. Reinforcement learning for combined production-maintenance and quality control of a manufacturing system with deterioration failures[J]. Journal of Manufacturing Systems, 2020, 56: 470-483. |