Deep reinforcement learning models and algorithms for single-machine scheduling considering deteriorated maintenance
Yong CHEN, Xizhi DU, Yiwei JIANG, Wenchao YI, Zhi PEI, Zuzhen JI

Tab. 8 Best (minimum) cost obtained by the DRL methods
Scale    Min
         R-M        DQN        A2C        PPO
10       191.0      191.0      191.0      191.0
20       904.0      895.0      901.0      900.0
30       1939.0     1878.0     1887.0     1878.0
50       2940.0     2948.0     3246.4     2911.6
80       9494.0     9550.3     9566.6     9397.0
100      14030.0    14210.0    13909.6    13818.0
150      27519.0    26743.8    26560.9    26600.9