动态环境无人机导航的安全分层强化学习框架
商益铭,杜昌平,杨睿,方天睿,杜泽安,郑耀

Safe hierarchical reinforcement learning framework for dynamic UAV navigation
Yiming SHANG,Changping DU,Rui YANG,Tianrui FANG,Ze’an DU,Yao ZHENG
表 8 不同干扰下的成功率、路径效率和平均决策时间对比
Tab.8 Comparison of success rate, path efficiency and average decision time under different interference
算法与条件Sc/%E$ {T}_{\text{avg}} $/ms
无干扰95.70.9622.57
单阵风干扰81.60.8632.61
单雷达噪声95.30.9622.57
单GPS误差95.60.9612.57
多干扰APF31.80.5931.06
多干扰RRT38.70.5122.07
多干扰DWA55.30.6382.49
多干扰,本文方法81.40.8612.86