结合领域经验的深度强化学习信号控制方法 |
张萌,王殿海,金盛 |
Deep reinforcement learning approach to signal control combined with domain experience |
Meng ZHANG,Dian-hai WANG,Sheng JIN |
图 8 加入预训练模块与未加入预训练模块的双决斗深度Q网络算法收敛速度情况 |
Fig.8 Convergence speed comparison of double-dueling deep Q network algorithms with and without pretrained module |
![]() |