考虑劣化维护的单机调度深度强化学习模型和算法
陈勇,杜习之,姜一炜,易文超,裴植,纪祖臻

Deep reinforcement learning models and algorithms for single-machine scheduling considering deteriorated maintenance
Yong CHEN,Xizhi DU,Yiwei JIANG,Wenchao YI,Zhi PEI,Zuzhen JI
图 5 单一策略运行步数-奖励曲线
Fig.5 Single strategy running step and reward curve