Skip to content

Latest commit

 

History

History
15 lines (12 loc) · 377 Bytes

README.md

File metadata and controls

15 lines (12 loc) · 377 Bytes

RL-algos-colab

Policy gradient based RL algorithms

Implemented RL algorithms

  • REINFORCE
  • REINFORCE with baseline
  • One-step Actor Critic
  • Advantage Actor Critic (A2C)
  • Proximal Policy Optimization (PPO)

Links

CartPole-v1 env document : https://www.gymlibrary.dev/environments/classic_control/cart_pole/

Memo

This codes(.ipynb) are executable in Google Colab.