基于自适应课程强化学习的多无人艇对抗围捕决策
陈浪,刘增力,赵宣植

Decision-making for multi-USV adversarial encirclement based on adaptive curriculum reinforcement learning
Lang CHEN,Zengli LIU,Xuanzhi ZHAO
表 2 自适应课程学习环境的参数配置
Tab.2 Parameter configuration of adaptive curriculum learning environment
场景布局$ v_{\text{max}}^{\text{T}} $/($ \mathrm{m}\cdot {\mathrm{s}}^{-1} $)$ {d}_{\text{fin}} $/m$ {h}_{x} $/m
370225
W560275
$ {H}_{\text{1}}\text{,}W $755315
$ {H}_{1}\text{,}{H}_{2}\text{,}W $950365
$ {H}_{1}\text{,}{H}_{2}\text{,}{H}_{3}\text{,}W $1045400