考虑劣化维护的单机调度深度强化学习模型和算法
陈勇,杜习之,姜一炜,易文超,裴植,纪祖臻

Deep reinforcement learning models and algorithms for single-machine scheduling considering deteriorated maintenance
Yong CHEN,Xizhi DU,Yiwei JIANG,Wenchao YI,Zhi PEI,Zuzhen JI
表 9 DRL方法的成本优化效果
Tab.9 Cost optimization effect of DRL methods
规模A2CDQNPPO
$\Delta {\mathrm{mean}} $/%$\Delta \min $/%$\Delta {\mathrm{mean}} $/%$\Delta \min $/%$\Delta {\mathrm{mean}} $/%$\Delta \min $/%
10−0.230.00−12.310.00−0.780.00
207.251.015.210.338.010.44
309.533.257.982.7611.883.25
5010.00−0.27−46.41−9.4412.730.98
809.57−0.596.25−0.7612.381.03
1008.33−1.276.960.8712.431.53
15012.002.908.333.6112.833.45