基于深度强化学习的交通信号控制方法
刘智敏,叶宝林,朱耀东,姚青,吴维敏
Traffic signal control method based on deep reinforcement learning
Zhi-min LIU,Bao-Lin YE,Yao-dong ZHU,Qing YAO,Wei-min WU
图 6
用于拟合
Q
值的卷积神经网络
Fig.6
Convolution neural network fitting
Q
−value