Reinforcement Learning Model of Dynamic Newsboy Problem with Framework Protocol
QI Yu-qing, ZHAO Xing-lei, ZHAO Tian-dong-jie
Operations Research and Management Science. 2022, (10): 105-112. DOI: 10.12005/orms.2022.0326