Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Inconsistency between the models for training and testing. #3

Open
SamitHuang opened this issue Apr 24, 2019 · 1 comment
Open

Inconsistency between the models for training and testing. #3

SamitHuang opened this issue Apr 24, 2019 · 1 comment

Comments

@SamitHuang
Copy link

SamitHuang commented Apr 24, 2019

I am re-implementing your interesting work. I have some problems on the Montezuma's Revenge task. During training, in run_hybrid_atari_experiment.py, you used Hdqn(GPU) as the subgoal network, but for testing, in test_model.py, you used another network architecture Net() as the subgoal network. Why are they not consistent? Could you please upload the trained weights and the code for using Hdapn(GPU) in testing?

Also, I notice that in testing, the trained meta controller is actually not used. Instead, the subgoals are manually set and each subgoal is achieved by a simple_net, which seems not surpassing a supervised method that using imitation learning to learn to achieve each fixed subgoal under a fixed environment. Could you explain the generalizability of the method? Thanks!

@SamitHuang SamitHuang changed the title inconsistency between the models for training and testing. Inconsistency between the models for training and testing. Apr 24, 2019
@hoangminhle
Copy link
Owner

Sorry for the late reply. Not sure if I'm missing something from your question, but I'm pretty sure the trained meta controller was used using testing. I'll double check and may upload the network weights once I find it

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants