Deep reinforcement learning approach to signal control combined with domain experience
Meng ZHANG, Dian-hai WANG, Sheng JIN
Fig. 6 Variation of average vehicle waiting time in the road network under different methods