Welcome to the Spiking Crazyflie Gym repository! This repository contains a high-performance simulator designed for training end-to-end control of the Crazyflie 2.1 drone. The simulator provides a robust environment for developing and testing control algorithms.
In addition to the simulator, this repository includes training scripts for both artificial neural networks (ANNs) and spiking neural networks (SNNs). These scripts facilitate the training process, enabling you to develop effective control strategies for the Crazyflie 2.1 drone.
The simulator is based on the Learning to Fly in Seconds project. The dynamics have been wrapped to follow the Gymnasium API.
The `l2f_gym.py` file contains the core implementation of the simulator, providing the necessary interfaces and dynamics to simulate the Crazyflie 2.1 drone within the Gymnasium framework.
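For orientation, a minimal interaction loop might look like the sketch below. The class name `CrazyflieEnv` is a placeholder (check `l2f_gym.py` for the actual exported name); the reset/step signatures simply follow the standard Gymnasium API.

```python
# Minimal usage sketch. `CrazyflieEnv` is a placeholder name; substitute the
# environment class actually exported by l2f_gym.py.
from l2f_gym import CrazyflieEnv

env = CrazyflieEnv()
obs, info = env.reset(seed=0)
for _ in range(500):
    action = env.action_space.sample()  # random motor commands
    obs, reward, terminated, truncated, info = env.step(action)
    if terminated or truncated:
        obs, info = env.reset()
env.close()
```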
NOTE: As of November 2024, the Learning to Fly package supports CUDA operations, which can greatly increase the speed of the simulator!
Multiple training methods are available in this repository.
First, the fully online RL methods can be found in the `tianshou_l2f_<method>.py` files. These are built on top of the Tianshou RL framework.
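As a rough guide to how such a script is wired together, the sketch below sets up SAC with Tianshou's off-policy components. It assumes Tianshou's pre-1.0 functional API and the placeholder environment class from above; network sizes and hyperparameters are illustrative only and are not taken from the repository.

```python
import torch
from tianshou.data import Collector, VectorReplayBuffer
from tianshou.env import DummyVectorEnv
from tianshou.policy import SACPolicy
from tianshou.trainer import offpolicy_trainer
from tianshou.utils.net.common import Net
from tianshou.utils.net.continuous import ActorProb, Critic

from l2f_gym import CrazyflieEnv  # placeholder class name, see above

train_envs = DummyVectorEnv([CrazyflieEnv for _ in range(8)])
test_envs = DummyVectorEnv([CrazyflieEnv for _ in range(2)])
env = CrazyflieEnv()
state_shape = env.observation_space.shape
action_shape = env.action_space.shape
max_action = env.action_space.high[0]

# Gaussian actor and twin Q critics, as in the standard Tianshou SAC examples.
net_a = Net(state_shape, hidden_sizes=[256, 256])
actor = ActorProb(net_a, action_shape, max_action=max_action,
                  unbounded=True, conditioned_sigma=True)
actor_optim = torch.optim.Adam(actor.parameters(), lr=3e-4)
net_c1 = Net(state_shape, action_shape, hidden_sizes=[256, 256], concat=True)
critic1 = Critic(net_c1)
critic1_optim = torch.optim.Adam(critic1.parameters(), lr=3e-4)
net_c2 = Net(state_shape, action_shape, hidden_sizes=[256, 256], concat=True)
critic2 = Critic(net_c2)
critic2_optim = torch.optim.Adam(critic2.parameters(), lr=3e-4)

# A small, fixed entropy coefficient, reflecting the observation below that a
# large entropy bonus hurts performance in this environment.
policy = SACPolicy(actor, actor_optim, critic1, critic1_optim,
                   critic2, critic2_optim, alpha=0.05)

buffer = VectorReplayBuffer(1_000_000, len(train_envs))
train_collector = Collector(policy, train_envs, buffer)
test_collector = Collector(policy, test_envs)

result = offpolicy_trainer(
    policy, train_collector, test_collector,
    max_epoch=100, step_per_epoch=10_000, step_per_collect=8,
    episode_per_test=10, batch_size=256, update_per_step=1,
)
```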
SAC was used first for training the networks. However, it was found that the entropy term, which promotes exploration, negatively influenced training performance. This is attributed to the fact that the environment is highly unstable and subject to random disturbances, which in itself already promotes exploration. Next, it was found that for SNNs, the surrogate gradient slope needed to be much shallower than in comparable supervised training runs. This can be explained by the fact that a steep surrogate slope updates more precisely along the computed gradient direction, but updates fewer weights. Due to the inherent uncertainty of the gradient direction in the early phases of RL training, a steep slope significantly slows down training of the SNN.
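To make the slope discussion concrete, the snippet below sketches a spike function whose backward pass uses a fast-sigmoid surrogate with a configurable slope. This is an illustration only; the surrogate shape actually used in the training scripts may differ.

```python
import torch

class SpikeFn(torch.autograd.Function):
    """Heaviside spike with a fast-sigmoid surrogate gradient (illustrative)."""

    @staticmethod
    def forward(ctx, v, slope):
        # v is the membrane potential minus the firing threshold.
        ctx.save_for_backward(v)
        ctx.slope = slope
        return (v > 0).float()

    @staticmethod
    def backward(ctx, grad_output):
        (v,) = ctx.saved_tensors
        # Fast-sigmoid surrogate derivative: 1 / (slope * |v| + 1)^2.
        # A steep slope concentrates the gradient on neurons close to the
        # threshold (precise but sparse updates); a shallow slope spreads the
        # gradient over many neurons, which helps in the noisy early phase of RL.
        surrogate = 1.0 / (ctx.slope * v.abs() + 1.0) ** 2
        return grad_output * surrogate, None

v = torch.randn(8, requires_grad=True)
spikes_rl = SpikeFn.apply(v, 5.0)           # shallow slope for RL
spikes_supervised = SpikeFn.apply(v, 25.0)  # steeper slope for supervised runs
```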
As controllers for the Crazyflie are readily available, a natural question is whether they can be used to kickstart the training of an efficient SNN. `TD3BC_Online.py` presents an implementation of TD3BC that uses Jump-Start RL. One can choose any of the buffers available in this repo, or start with an empty replay buffer and an existing controller.
During training, the agent initially uses the provided existing controller and softly rolls in the SNN, filling the buffer with more and more data gathered by the spiking actor. Several mechanisms are in place to avoid filling the buffer with useless data when early termination occurs.
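The roll-in idea can be summarized by the sketch below: a guide horizon determines for how many steps the existing controller acts before the spiking actor takes over, and that horizon is annealed towards zero over training. Function and variable names are illustrative and do not mirror the actual code in TD3BC_Online.py.

```python
def run_episode(env, guide_controller, snn_actor, guide_horizon, buffer):
    """Collect one episode, handing control from the guide to the SNN actor."""
    obs, _ = env.reset()
    done, t = False, 0
    while not done:
        # The guide controller acts for the first `guide_horizon` steps,
        # then the spiking actor takes over.
        if t < guide_horizon:
            action = guide_controller(obs)
        else:
            action = snn_actor(obs)
        next_obs, reward, terminated, truncated, _ = env.step(action)
        done = terminated or truncated
        buffer.add(obs, action, reward, next_obs, done)
        obs, t = next_obs, t + 1
    return t


def annealed_horizon(epoch, max_epochs, episode_length):
    """Shrink the guide horizon so the SNN gradually flies the full episode."""
    return int(episode_length * max(0.0, 1.0 - epoch / max_epochs))
```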
In the most basic sense, we can train an agent by copying the behavior available in the dataset, which boils down to supervised learning. `BC.py` contains the code for this training.
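Behavioral cloning reduces to the following kind of regression loop. This is a hedged sketch; the actual BC.py may differ in its loss, data handling, and SNN forward pass.

```python
import torch
import torch.nn as nn

def behavior_cloning(policy, dataloader, epochs=50, lr=1e-3):
    """Regress the policy's action on the dataset action (supervised learning)."""
    optimizer = torch.optim.Adam(policy.parameters(), lr=lr)
    loss_fn = nn.MSELoss()
    policy.train()
    for _ in range(epochs):
        for obs, expert_action in dataloader:
            pred_action = policy(obs)
            loss = loss_fn(pred_action, expert_action)
            optimizer.zero_grad()
            loss.backward()
            optimizer.step()
    return policy
```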
When using an actor-critic RL setup, we can leverage reward information to improve over the existing dataset. This is particularly useful when the dataset is not expert data or when a (limited) reward curriculum is being used. Refer to `TD3BC.py` for an implementation of such a method.
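The key difference from pure behavioral cloning is the actor objective, which weighs the critic's value estimate against a BC regularizer, following the TD3+BC formulation of Fujimoto & Gu (2021). The sketch below is illustrative; variable names may not match TD3BC.py.

```python
import torch
import torch.nn.functional as F

def td3bc_actor_loss(actor, critic, obs, dataset_actions, alpha=2.5):
    """TD3+BC actor objective: maximize Q while staying close to dataset actions."""
    pi = actor(obs)
    q = critic(obs, pi)
    # Normalize the Q term so it stays on a scale comparable to the BC term.
    lam = alpha / q.abs().mean().detach()
    return -lam * q.mean() + F.mse_loss(pi, dataset_actions)
```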