Hybrid Transformer based Multi-agent Reinforcement Learning for Multiple Unmanned Aerial Vehicle Coordination in Air Corridors

Modeling

Air Corridor, Cylinder and Torus
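As a rough illustration of the two corridor shapes, the sketch below tests whether a point lies inside a cylinder or a torus arc. The local coordinate frames, the function names, and the parameters (`radius`, `length`, `major_radius`, `minor_radius`, angle limits) are assumptions made for this example and are not taken from the repository code.

```python
import math

def in_cylinder(point, radius, length):
    """Return True if `point` lies inside a cylinder centred at the origin
    whose axis is the local z-axis (assumed local frame)."""
    x, y, z = point
    return math.hypot(x, y) <= radius and abs(z) <= length / 2

def in_torus_segment(point, major_radius, minor_radius, start_angle, end_angle):
    """Return True if `point` lies inside a torus arc lying in the local x-y plane.

    Angles are in radians within (-pi, pi], measured from the local x-axis,
    with start_angle <= end_angle (assumed convention).
    """
    x, y, z = point
    angle = math.atan2(y, x)
    if not (start_angle <= angle <= end_angle):
        return False
    # Distance from the point to the circular centre line of the tube.
    tube_distance = math.hypot(math.hypot(x, y) - major_radius, z)
    return tube_distance <= minor_radius
```

A quarter-turn torus arc, for example, would use `start_angle=0.0` and `end_angle=math.pi / 2` under this convention.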

Animation

cttc, 1-transfer

4 air corridors, cylinder-torus-torus-cylinder, 12 UAVs, 4-static, and 3-mobile

cttcttcttc, 3-transfer

10 air corridors, cylinder-torus-torus-cylinder-torus-torus-cylinder-torus-torus-cylinder, 12 UAVs, 4-static, and 3-mobile
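The scenario codes above read as one letter per corridor: `c` for cylinder and `t` for torus. The short sketch below expands a code into its corridor sequence. The helper names are hypothetical, and counting one transfer per adjacent torus pair (`tt`) is an assumption that happens to match the two scenarios listed here, not a definition from the repository.

```python
CORRIDOR_NAMES = {"c": "cylinder", "t": "torus"}

def expand_scenario(code):
    """Map a scenario code such as 'cttc' to a list of corridor names."""
    return [CORRIDOR_NAMES[letter] for letter in code]

def count_transfers(code):
    """Assumption: each adjacent torus pair 'tt' counts as one transfer."""
    return code.count("tt")

for code in ("cttc", "cttcttcttc"):
    corridors = expand_scenario(code)
    print(len(corridors), "-".join(corridors), count_transfers(code))
```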

RL Training

Network Structure

The embedding network normalizes the input values and maps them to a common input dimension.
The transformer processes information from a varying number of neighbors using encoders and decoders.
The actor-critic network outputs the estimated state value and a stochastic action expressed in spherical coordinates.
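Because the action is expressed in spherical coordinates, it usually has to be converted to a Cartesian vector before it can be applied in 3D space. The sketch below shows one common convention (polar angle `theta` measured from the +z axis, azimuth `phi` measured from the +x axis). The convention and the function name are assumptions; the repository may order or define the angles differently.

```python
import math

def spherical_to_cartesian(r, theta, phi):
    """Convert (r, theta, phi) to (x, y, z).

    Assumed convention: theta is the polar angle from the +z axis and
    phi is the azimuth in the x-y plane from the +x axis, both in radians.
    """
    sin_theta = math.sin(theta)
    x = r * sin_theta * math.cos(phi)
    y = r * sin_theta * math.sin(phi)
    z = r * math.cos(theta)
    return x, y, z
```

Under this convention, `r` sets the magnitude of the resulting vector and the two angles set its direction.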

Training File

Train one set of parameters: main.py

Train a batch, parameter grid search: batched_grid_search.sh
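For readers unfamiliar with grid search, the sketch below enumerates every combination of a small parameter grid. The parameter names and values are hypothetical placeholders; they are not the options accepted by `main.py` or set in `batched_grid_search.sh`.

```python
import itertools

# Hypothetical hyperparameter names and values, for illustration only.
grid = {
    "learning_rate": [1e-4, 3e-4],
    "num_agents": [6, 12],
}

def expand_grid(grid):
    """Return one dict per combination of the grid values."""
    keys = list(grid)
    value_lists = [grid[key] for key in keys]
    return [dict(zip(keys, values)) for values in itertools.product(*value_lists)]

configs = expand_grid(grid)
for config in configs:
    print(config)
```

Each resulting dict corresponds to one training run with a single set of parameters.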

Models (actor and critic) are saved every 0.25 million steps. Training progress is reported in the terminal log and visualized with TensorBoard.
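To make the save interval concrete, the sketch below lists the steps at which a checkpoint would be written for a given training budget. The function name and the 1 million step budget are assumptions chosen for the example; only the 0.25 million step interval comes from the text above.

```python
SAVE_INTERVAL = 250_000  # 0.25 million steps

def checkpoint_steps(total_steps, interval=SAVE_INTERVAL):
    """Return the step counts at which a checkpoint would be saved."""
    return list(range(interval, total_steps + 1, interval))

# Hypothetical training budget of 1 million steps.
print(checkpoint_steps(1_000_000))  # [250000, 500000, 750000, 1000000]
```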

Test File

Serial run, generates an animation: D3MOVE_test_single_core.py

Parallel run, generates data for figures: D3MOVE_test_parallel.py

Repository: SECNetLabUNM/HTransRL
