Multi-agent pursuit and evasion games based on improved reinforcement learning |
| Ya-li XUE, Jin-ze YE, Han-yan LI |
| Fig. 5 Mean reward curves under decoupled and non-decoupled rewards |
|