Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Convert quoridor movement action IDs to relative #1232

Merged
merged 1 commit into from
May 31, 2024

Conversation

tacertain
Copy link
Contributor

The original implementation of Quoridor used absolute position numbering for pawn moves, so moving to b3 was always the same action ID, regardless of where the pawn was. This commit changes to using relative action IDs, so moving directly north is always the same action ID, regardless of what square that is moving to.

See PR #1229 for a deeper discussion of what this change does and why it's only a partial solution to optimzing the game for reinforcement learning.

Fixes #1158

The original implementation of Quoridor used absolute position numbering
for pawn moves, so moving to b3 was always the same action ID, regardless
of where the pawn was. This commit changes to using relative action IDs,
so moving directly north is always the same action ID, regardless of
what square that is moving to.
@lanctot
Copy link
Collaborator

lanctot commented May 22, 2024

Wow there's really something going on with the wheels test.

I've restarted the failed jobs, hopefully this does it.

@tacertain
Copy link
Contributor Author

Looks like they've all passed the point where they usually fail, so 🤞

@lanctot lanctot added imported This PR has been imported and awaiting internal review. Please avoid any more local changes, thanks! merged internally The code is now submitted to our internal repo and will be merged in the next github sync. labels May 27, 2024
@lanctot lanctot merged commit f3c2ce7 into google-deepmind:master May 31, 2024
12 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
imported This PR has been imported and awaiting internal review. Please avoid any more local changes, thanks! merged internally The code is now submitted to our internal repo and will be merged in the next github sync.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Quoridor Movement Action IDs keep changing
2 participants