结合领域经验的深度强化学习信号控制方法 |
张萌,王殿海,金盛 |
Deep reinforcement learning approach to signal control combined with domain experience |
Meng ZHANG,Dian-hai WANG,Sheng JIN |
图 9 加入相位持续时间模块的双决斗深度Q网络(3DQN)算法与传统3DQN算法在平均旅行时间上的收敛情况 |
Fig.9 Convergence analysis of double-dueling deep Q network (3DQN) algorithms with phase duration module and traditional 3DQN algorithm on average travel time |
![]() |