Decision-making for multi-USV adversarial encirclement based on adaptive curriculum reinforcement learning
Lang CHEN, Zengli LIU, Xuanzhi ZHAO
Fig. 11 Parameter variation curves of the friendly USVs during the encirclement process