Skip to content

Latest commit

 

History

History
7 lines (7 loc) · 574 Bytes

File metadata and controls

7 lines (7 loc) · 574 Bytes

Exercise 10

After looking at approximate prediction in exercise 09, we will now analyze the capability of function approximators in control for the MountainCar environment. For the considered setting, the state space is continuous while the action space is still discrete.

Tasks:

  1. semi-gradient Sarsa control using artificial neural networks (and its limitations)
  2. feature engineering for semi-gradient Sarsa control using radial basis functions
  3. feature engineering for least squares policy iteration (LSPI) with recursive least squares Sarsa (RLS-Sarsa)