Deep reinforcement learning approach to signal control combined with domain experience |
| Meng ZHANG, Dian-hai WANG, Sheng JIN |
| Fig. 8 Convergence speed of the double dueling deep Q-network algorithm with and without the pretraining module |
|