Deep reinforcement learning models and algorithms for single-machine scheduling considering deteriorated maintenance
Yong CHEN, Xizhi DU, Yiwei JIANG, Wenchao YI, Zhi PEI, Zuzhen JI
Fig. 5 Reward versus number of running steps for a single strategy