基于强化学习的多路口可变车道协同控制方法
徐小高,夏莹杰,朱思雨,邝砾

Cooperative control algorithm of multi-intersection variable-direction lanes based on reinforcement learning
Xiao-gao XU,Ying-jie XIA,Si-yu ZHU,Li KUANG
图 4 全局奖励分解算法
Fig.4 Global reward decomposition algorithm