Deep reinforcement learning models and algorithms for single-machine scheduling considering deteriorated maintenance
Yong CHEN, Xizhi DU, Yiwei JIANG, Wenchao YI, Zhi PEI, Zuzhen JI
Fig. 8 Influence curves of scale on the average reward per episode