Accuracy improvement possible? #1

ruze00 · 2022-05-23T17:05:20Z

I'm running the code verbatim but not getting the results I would expect. For example, ping_pong_a2c shows barely any improvement after more than 8,000 runs, whereas I would expect a reasonable level of performance (at least a score > 0) by around 5,000 iterations, based on results other people report for RL on Atari Pong.

Is there something I'm missing? Do the hyperparameters need to be tuned rather than run as is?

Thank you for creating the code base.


allohvk · 2022-11-08T07:56:12Z

No, it does not converge. I spent days debugging this code but couldn't pin down the exact issue. Use the OpenAI Gym wrappers to manipulate the frames.
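For anyone wondering what "manipulating the frames" involves: Gym ships wrappers such as `AtariPreprocessing` and `FrameStack` for this. The block below is not the Gym API and not this repo's code. It is only a rough NumPy sketch of the kind of preprocessing such wrappers typically perform (grayscale, downsample, stack the last few frames). The names `preprocess_frame` and `FrameStacker`, the channel-average grayscale, the stride-2 downsampling, and the 4-frame stack are all assumptions made for illustration.

```python
from collections import deque

import numpy as np


def preprocess_frame(frame: np.ndarray) -> np.ndarray:
    """Turn an RGB frame (H, W, 3) of uint8 into a small float32 frame in [0, 1].

    Hypothetical helper: grayscale by plain channel averaging and
    downsampling by striding are simplifications, not what Gym does exactly.
    """
    gray = frame.astype(np.float32).mean(axis=2)  # (H, W)
    small = gray[::2, ::2]                        # keep every second row/column
    return small / 255.0


class FrameStacker:
    """Keep the most recent `num_frames` preprocessed frames as one observation."""

    def __init__(self, num_frames: int = 4):
        self.num_frames = num_frames
        self.frames = deque(maxlen=num_frames)

    def reset(self, frame: np.ndarray) -> np.ndarray:
        # At episode start, fill the whole stack with the first frame.
        processed = preprocess_frame(frame)
        self.frames.clear()
        for _ in range(self.num_frames):
            self.frames.append(processed)
        return self.observation()

    def step(self, frame: np.ndarray) -> np.ndarray:
        # Appending to a full deque drops the oldest frame automatically.
        self.frames.append(preprocess_frame(frame))
        return self.observation()

    def observation(self) -> np.ndarray:
        # Shape: (num_frames, H // 2, W // 2), oldest frame first.
        return np.stack(list(self.frames), axis=0)
```

Stacking matters for Pong because a single frame does not show which way the ball is moving. With a 210x160 RGB frame, `FrameStacker(4).reset(frame)` gives an array of shape `(4, 105, 80)` that could be fed to the network in place of a raw frame.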
