考虑劣化维护的单机调度深度强化学习模型和算法
陈勇,杜习之,姜一炜,易文超,裴植,纪祖臻

Deep reinforcement learning models and algorithms for single-machine scheduling considering deteriorated maintenance
Yong CHEN,Xizhi DU,Yiwei JIANG,Wenchao YI,Zhi PEI,Zuzhen JI
表 2 R-M集成优化策略下的成本均值和标准差
Tab.2 Mean and standard deviation of cost with R-M integrated optimization strategies
规模SPT-MLPT-MFCFS-MEDD-M
MeanStdMeanStdMeanStdMeanStd
10200.00.0258.00.0192.00.0227.00.0
201005.610.21682.0108.81058.727.11334.282.1
302160.658.13725.4110.32678.0113.13075.8143.8
503610.0157.58074.8212.74679.4291.35065.8229.9
8011198.2450.422474.8752.114871.1597.915306.1665.6
10016709.3553.833877.21188.921372.8856.823350.4886.7
15032096.01198.471634.72146.842847.31791.347919.01474.0
规模MST-MCR-MMDD-M基准
MeanStdMeanStdMeanStdMeanStd
10197.00.0226.00.0191.00.0191.00.0
201163.020.61053.527.7973.822.5973.822.5
302879.373.22269.0109.52103.9102.92103.9102.9
504858.1307.33597.3191.63310.6172.13310.6172.1
8015181.1687.811678.6519.010642.0494.010642.0494.0
10021907.0937.216753.6800.715763.6766.815763.6766.8
15043670.21579.232857.91266.630546.11166.530546.11166.5