Decision-making for multi-USV adversarial encirclement based on adaptive curriculum reinforcement learning
Lang CHEN, Zengli LIU, Xuanzhi ZHAO

Tab. 1 Parameter configuration of the adaptive curriculum learning scheduler

| Parameter | Value |
| --- | --- |
| $E_{\text{p},\max}$ | 10 |
| $\rho_{g_t}$ | 0.75, 0.65, 0.55, 0.45, 0.40 |
| $\lambda$ | 0.6 |
| $E_{g_t,\min}$ | 20, 35, 60, 70, 80 |
| $W_{\text{base}}$ | 60, 70, 80, 90, 100 |
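For readers who want to reproduce the setup, the values in Tab. 1 could be held in a small configuration object along the following lines. This is only a sketch: the class name `SchedulerConfig`, the field names, and the `stage` helper are hypothetical and do not come from the paper. It also assumes that the three five-element rows are indexed by the same position (for example, one entry per curriculum stage), which the table suggests through the subscript $g_t$ but does not state. The table does not define the symbols, so the code attaches no meaning to them beyond storing the numbers.

```python
from dataclasses import dataclass
from typing import Dict, Tuple, Union


@dataclass(frozen=True)
class SchedulerConfig:
    """Values copied from Tab. 1; all field names are hypothetical."""

    e_p_max: int = 10                                          # E_{p,max}
    rho: Tuple[float, ...] = (0.75, 0.65, 0.55, 0.45, 0.40)    # rho_{g_t}
    lam: float = 0.6                                           # lambda
    e_min: Tuple[int, ...] = (20, 35, 60, 70, 80)              # E_{g_t,min}
    w_base: Tuple[int, ...] = (60, 70, 80, 90, 100)            # W_base

    def __post_init__(self) -> None:
        # Assumption: the list-valued rows share one index, so they
        # must all have the same length.
        if not (len(self.rho) == len(self.e_min) == len(self.w_base)):
            raise ValueError("list-valued parameters must have equal length")

    def stage(self, g: int) -> Dict[str, Union[int, float]]:
        """Return the entries at position g of the list-valued rows."""
        return {
            "rho": self.rho[g],
            "e_min": self.e_min[g],
            "w_base": self.w_base[g],
        }


CONFIG = SchedulerConfig()
```

Calling `CONFIG.stage(0)` returns the first entry of each list-valued row, and `CONFIG.lam` and `CONFIG.e_p_max` hold the two scalar values. Freezing the dataclass keeps the tabulated values from being modified accidentally during training.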