Deep reinforcement learning approach to signal control combined with domain experience |
| Meng ZHANG, Dian-hai WANG, Sheng JIN |
| Fig. 9 Convergence of average travel time for the double dueling deep Q network (3DQN) algorithm with the phase duration module and for the traditional 3DQN algorithm |
|