You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository has been archived by the owner on Nov 1, 2024. It is now read-only.
In the ReBeL paper it is mentioned that you use CFR-D modified to stop at a random iteration rather than averaging the strategies; and that this is crucial to not be exploitable. But I don't see the random iteration sampling in subgame_solving.cc. Also CFR-D normally uses a gadget/modified game with "opt out" actions. I can't tell from the paper if this is included in ReBeL, but it seems from the code that it's not?
Does this mean that the solver in subgame_solving.cc is not actually using safe search?
The text was updated successfully, but these errors were encountered:
Sign up for freeto subscribe to this conversation on GitHub.
Already have an account?
Sign in.
In the ReBeL paper it is mentioned that you use CFR-D modified to stop at a random iteration rather than averaging the strategies; and that this is crucial to not be exploitable. But I don't see the random iteration sampling in subgame_solving.cc. Also CFR-D normally uses a gadget/modified game with "opt out" actions. I can't tell from the paper if this is included in ReBeL, but it seems from the code that it's not?
Does this mean that the solver in subgame_solving.cc is not actually using safe search?
The text was updated successfully, but these errors were encountered: