Deep reinforcement learning models and algorithms for single-machine scheduling considering deteriorated maintenance
Yong CHEN, Xizhi DU, Yiwei JIANG, Wenchao YI, Zhi PEI, Zuzhen JI
Tab. 7 Mean and standard deviation of the optimized cost for the DRL algorithms
Scale   Baseline            A2C                 DQN                 PPO
        Mean      Std       Mean      Std       Mean      Std       Mean      Std
10      191.0     0.0       191.4     1.0       217.8     61.7      192.5     14.8
20      973.8     22.5      908.0     9.5       925.6     47.6      901.5     6.1
30      2103.9    102.9     1920.8    20.1      1948.3    109.0     1880.5    6.6
50      3310.6    172.1     3009.6    41.1      6177.7    2280.4    2936.7    19.0
80      10642.0   494.0     9712.7    65.1      10015.8   480.4     9469.7    40.3
100     15763.6   766.8     14551.0   172.2     14737.7   708.4     14020.2   119.5
150     30546.1   1166.5    27272.6   270.0     28197.4   1209.3    27073.3   275.1