动态环境无人机导航的安全分层强化学习框架
商益铭,杜昌平,杨睿,方天睿,杜泽安,郑耀

Safe hierarchical reinforcement learning framework for dynamic UAV navigation
Yiming SHANG,Changping DU,Rui YANG,Tianrui FANG,Ze’an DU,Yao ZHENG
表 6 消融实验中的成功率、路径效率、平均转角和平均决策时间的对比
Tab.6 Comparison of success rate, path efficiency, average turning angle, and average decision time in ablation study
序号+DWA+CBF+PDOSc/%E$ {\theta }_{\text{avg}} $/(°)$ {T}_{\text{avg}} $/ms
064.30.73921.411.87
178.80.7799.992.41
285.60.89317.722.03
375.20.75213.751.88
490.30.92212.702.54
583.40.8078.932.43
689.90.90814.822.11
795.70.9629.412.57