`action` interface of exploration policies #497

johannes-fischer · 2023-05-31T01:31:53Z

The exploration policies (https://github.com/JuliaPOMDP/POMDPs.jl/blob/master/lib/POMDPTools/src/Policies/exploration_policies.jl) do not implement the `action(::Policy, x)` interface described in the documentation, so they cannot be used with the simulators directly. Instead, they have the interface `action(p::EpsGreedyPolicy, on_policy::Policy, k, s)`.

I was wondering if there is a reason for this?

zsunberg · 2023-06-01T05:20:11Z

I don't remember the details, but they are designed to change (i.e. to decay) as the total number of calls (`k`) increases. I think they are used in things like tabular TD learning.

(Since they are `Policy`s, they should probably also have the `action(p, s)` function, though it's not immediately obvious how to do that for them.)

I'm definitely open to changing the design.
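
To make the decay with `k` concrete, here is a minimal sketch of a schedule that maps the call count to an exploration probability. The name `linear_decay` and its keyword arguments are hypothetical and chosen for illustration; this is not necessarily how the schedules in POMDPTools are defined.

```julia
# Hypothetical schedule (an assumption for illustration, not the POMDPTools API):
# interpolate linearly from `start` to `stop` over the first `steps` calls,
# then stay at `stop`.
linear_decay(k; start=1.0, stop=0.01, steps=100) =
    k >= steps ? stop : start + (stop - start) * k / steps

# The exploration probability shrinks as the call count k grows.
for k in (0, 50, 100, 1000)
    println("k = ", k, "  eps = ", linear_decay(k))
end
```

An exploration policy that calls such a schedule needs `k` from somewhere, which is why the current `action` methods take it as an argument.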

johannes-fischer · 2023-06-01T17:29:33Z

I think they would need to store `k` and the policy. They could have an `update!` function for `k` and the policy. The policy field could have type `P`, where `P<:Union{Nothing,Policy}` is a type parameter (`nothing` would mean using the current `action` interface).
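
A rough sketch of what this could look like, using only the standard library. Everything here is a stand-in for illustration: `Policy`, `ConstantPolicy`, `StatefulEpsGreedy`, and this `update!` are hypothetical definitions, not the existing POMDPs.jl or POMDPTools types.

```julia
using Random

# Hypothetical stand-ins; these are NOT the POMDPs.jl / POMDPTools definitions.
abstract type Policy end

# A trivial on-policy that always returns the same action.
struct ConstantPolicy{A} <: Policy
    a::A
end
action(p::ConstantPolicy, s) = p.a

# Epsilon-greedy exploration that stores the call count k and, optionally,
# the on-policy. P === Nothing means no policy is stored, so only the
# four-argument form of `action` is available.
mutable struct StatefulEpsGreedy{P<:Union{Nothing,Policy},A,F,R<:AbstractRNG} <: Policy
    on_policy::P
    actions::Vector{A}
    eps::F      # schedule: k -> exploration probability
    k::Int
    rng::R
end

# Current-style interface: the on-policy and k are passed explicitly.
function action(p::StatefulEpsGreedy, on_policy::Policy, k, s)
    rand(p.rng) < p.eps(k) ? rand(p.rng, p.actions) : action(on_policy, s)
end

# Standard interface: defined only when a policy is stored.
action(p::StatefulEpsGreedy{<:Policy}, s) = action(p, p.on_policy, p.k, s)

# Set the counter and optionally replace the stored policy (same type P).
function update!(p::StatefulEpsGreedy{P}, k::Integer, on_policy::P=p.on_policy) where {P}
    p.k = k
    p.on_policy = on_policy
    return p
end

# Usage: with a stored policy, the two-argument form works.
greedy = StatefulEpsGreedy(ConstantPolicy(:left), [:left, :right],
                           k -> 1 / (1 + k), 0, MersenneTwister(1))
update!(greedy, 10)
a = action(greedy, :some_state)

# With `nothing`, only the explicit form is available.
bare = StatefulEpsGreedy(nothing, [:left, :right], k -> 0.0, 0, MersenneTwister(1))
b = action(bare, ConstantPolicy(:right), 3, :some_state)
```

One open design question in this sketch is who calls `update!`: a solver would have to do it after each step, since a simulator that only calls `action(p, s)` would otherwise see a fixed `k`.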

johannes-fischer mentioned this issue Jul 12, 2023

WIP: Add action(policy, s) interface to exploration policies #510

Open

dylan-asmar mentioned this issue Mar 7, 2024

ExplorationPolicies don't work with stepthrough #541

Closed
