TD3 mapless navigation algorithm guided by the dynamic window approach
Jiale LIU, Yali XUE, Shan CUI, Jun HONG

Tab. 2 Average reward values of the trained models
Method          Success rate    Steps    Reward
PPO             0.76            44.29    31.92
DDPG            0.87            43.88    67.33
TD3             0.90            52.27    55.71
DWA-TD3         0.87            35.93    63.96
LSTM-TD3        0.91            44.75    60.07
DWA-LSTM TD3    0.91            36.89    70.19