Lottery ticket hypothesis

This repository contains an implementation of the article Lottery Ticket Hypothesis
And an application of this hypothesis to reinforcement learning

Supervised
- Implement iterative magnitude pruning (IMP)
- Test using toy net and dataset CIFAR10
- Test using VGG19 net and dataset CIFAR10
- Make it fast
Reinforcement learning
- Implement DQN
- Test on classic gym environments (CartPole, LunarLander)
- Try IMP (layerwise/global) with DQN on classic problems
- Add IMP with reinitialization to some epoch after training
- Add early stop criterions
- Add Rescaling weight after reinit
- Add Global/Layerwise/ERK pruners
- Analyze the specifics of applying Lottery ticket to DQN (e.g. target function updates)
- Dynamic epochs
- DDPG? Dueling networks? Different RL architecures...
- Atari games?
- Compare with other articles
- Clean up this list

Name		Name	Last commit message	Last commit date
Latest commit History 146 Commits
docs		docs
rl		rl
supervised		supervised
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
results.md		results.md
setup.py		setup.py

Provide feedback