结合领域经验的深度强化学习信号控制方法
张萌,王殿海,金盛

Deep reinforcement learning approach to signal control combined with domain experience
Meng ZHANG,Dian-hai WANG,Sheng JIN
表 4 不同方法下各进口道的平均排队长度
Tab.4 Average queue length of each approach under different methods
m
算法 Ln Ls Le Lw
Webster 18.73 33.02 37.06 10.77
Actuated 16.66 26.11 22.85 9.70
Delay-Based 21.58 68.22 62.01 14.50
3DQN 10.53 19.81 19.85 8.29
本研究 9.90 17.80 16.81 9.32