This is the source code of "Efficient training techniques for multi-agent reinforcement learning in combatant tasks".

sanjinzhi/Multiagent-reinforcement-learning-algorithms-for-multiple-UAV-confrontation


Multiagent reinforcement learning algorithms for multiple-UAV confrontation

This is the source code of "Efficient training techniques for multi-agent reinforcement learning in combatant tasks". We construct a multi-agent confrontation environment derived from a combat scenario involving multiple unmanned aerial vehicles (UAVs). We first approach this confrontation problem with two types of MARL algorithms: one extends the classical deep Q-network to multi-agent settings (MADQN); the other extends the state-of-the-art multi-agent reinforcement learning method, multi-agent deep deterministic policy gradient (MADDPG). Comparing the two methods on the initial confrontation scenario, we find that MADDPG outperforms MADQN. Then, with MADDPG as the baseline, we propose three efficient training techniques: scenario-transfer training, self-play training, and rule-coupled training.
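To make the MADQN idea above concrete, here is a minimal, hedged sketch of independent multi-agent Q-learning: two "red" learners, each with its own value table, pursue a randomly moving "blue" opponent on a toy 1-D grid. Everything here (the grid, state encoding, reward shaping, hyperparameters) is an illustrative assumption; the repository itself uses deep Q-networks and a UAV confrontation environment, not tabular learning on a line.

```python
import random
from collections import defaultdict

# Toy stand-in for the multi-agent DQN (MADQN) setup: each red agent keeps
# its OWN Q-table and learns independently from the shared global state.
# Grid, rewards, and hyperparameters are illustrative assumptions only.
GRID = 7              # positions 0 .. 6 on a line
ACTIONS = (-1, 0, 1)  # move left, stay, move right

def clamp(pos):
    """Keep a position inside the grid."""
    return min(GRID - 1, max(0, pos))

def train(episodes=500, alpha=0.3, gamma=0.9, eps=0.1):
    # One Q-table per red agent: (state, action) -> value
    qs = [defaultdict(float), defaultdict(float)]
    for _ in range(episodes):
        reds, blue = [0, GRID - 1], GRID // 2  # reds at the ends, blue center
        for _ in range(25):
            s = (tuple(reds), blue)
            acts = []
            for i in (0, 1):  # epsilon-greedy action for each red agent
                if random.random() < eps:
                    acts.append(random.choice(ACTIONS))
                else:
                    acts.append(max(ACTIONS, key=lambda a: qs[i][(s, a)]))
            reds = [clamp(r + a) for r, a in zip(reds, acts)]
            blue = clamp(blue + random.choice(ACTIONS))  # blue moves randomly
            done = blue in reds                 # capture ends the episode
            r = 1.0 if done else -0.01          # small step cost, capture bonus
            s2 = (tuple(reds), blue)
            for i in (0, 1):                    # independent Q-update per agent
                best = max(qs[i][(s2, a)] for a in ACTIONS)
                target = r if done else r + gamma * best
                qs[i][(s, acts[i])] += alpha * (target - qs[i][(s, acts[i])])
            if done:
                break
    return qs
```

The key property this sketch shares with MADQN is that learning is decentralized: each agent updates only its own table, treating the other learner as part of the environment.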


Rule-coupled red agents vs Random-move blue agents


Rule-coupled red agents vs Blue agents trained by self-play
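The self-play training mentioned in the caption above alternates which side learns while the other is held fixed. The sketch below illustrates only that training schedule, on matching pennies as an assumed stand-in game; the repository applies the same schedule to MADDPG policies in the UAV environment, not to this toy.

```python
# Hedged sketch of a self-play schedule: alternately freeze one side's
# policy and train (here: exactly best-respond) the other against it.
# The game is matching pennies -- an illustrative assumption, not the
# repository's UAV confrontation environment.

def best_response(frozen, want_match):
    """Greedy pure policy over two actions against a frozen mixed policy.

    frozen[a] is the probability the frozen opponent plays action a.
    Red (want_match=True) scores +1 on a match; blue scores +1 on a mismatch.
    """
    values = []
    for a in (0, 1):
        p_same = frozen[a]
        values.append((2 * p_same - 1) if want_match else (1 - 2 * p_same))
    greedy = 0 if values[0] >= values[1] else 1
    return [1.0, 0.0] if greedy == 0 else [0.0, 1.0]

def self_play(rounds=6):
    """Alternately train each side against the other's frozen policy."""
    red, blue = [0.5, 0.5], [0.5, 0.5]
    for _ in range(rounds):
        red = best_response(blue, want_match=True)    # train red, blue frozen
        blue = best_response(red, want_match=False)   # train blue, red frozen
    return red, blue
```

In matching pennies this alternation cycles between pure strategies rather than converging, which is why practical self-play (as in the paper's setting) trains against sampled past opponents with gradient updates instead of exact best responses.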

