基于可达集和强化学习的智能汽车决策规划 |
高洪伟,尚秉旭,张鑫康,王洪峰,何维,裴晓飞 |
Decision-making and planning of intelligent vehicle based on reachable set and reinforcement learning |
Hongwei GAO,Bingxu SHANG,Xinkang ZHANG,Hongfeng WANG,Wei HE,Xiaofei PEI |
图 7 DDQN模型平均奖励训练结果 |
Fig.7 Average reward training results of DDQN model |
![]() |