Deep reinforcement learning approach to signal control combined with domain experience
Meng ZHANG (张萌), Dian-hai WANG (王殿海), Sheng JIN (金盛)

Fig. 8 Convergence speed of the double-dueling deep Q network algorithm with and without the pretraining module