Skip to content
This repository has been archived by the owner on Nov 1, 2024. It is now read-only.

Does rebel use random iteration sampling or subgame resolving? #41

Open
thomasahle opened this issue Oct 22, 2024 · 0 comments
Open

Does rebel use random iteration sampling or subgame resolving? #41

thomasahle opened this issue Oct 22, 2024 · 0 comments

Comments

@thomasahle
Copy link

thomasahle commented Oct 22, 2024

In the ReBeL paper it is mentioned that you use CFR-D modified to stop at a random iteration rather than averaging the strategies; and that this is crucial to not be exploitable. But I don't see the random iteration sampling in subgame_solving.cc. Also CFR-D normally uses a gadget/modified game with "opt out" actions. I can't tell from the paper if this is included in ReBeL, but it seems from the code that it's not?

Does this mean that the solver in subgame_solving.cc is not actually using safe search?

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant