基于改进深度强化学习算法的农业机器人路径规划
赵威,张万枝,侯加林,侯瑞,李玉华,赵乐俊,程进

Path planning of agricultural robots based on improved deep reinforcement learning algorithm
Wei ZHAO,Wanzhi ZHANG,Jialin HOU,Rui HOU,Yuhua LI,Lejun ZHAO,Jin Cheng
图 9 不同路径规划方法在训练环境中的平均奖励值
Fig.9 Average rewards of different path planning methods in training environment