add simultaneous node support in policy_aggregator.py and exploitability.py #1034

rezunli96 · 2023-03-13T00:25:49Z

Hi, I found that several algorithms, such as those in policy_aggregator.py and exploitability.py, do not handle simultaneous nodes. This PR adds that support.
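For context, a simultaneous node is one where all players act at once, so a traversal has to consider joint actions rather than the action of a single turn player. The sketch below illustrates that idea only; it is not the code from this PR, and `joint_action_probs` and its dict-based policies are hypothetical names chosen for illustration.

```python
import itertools


def joint_action_probs(per_player_policies):
    """Return {joint_action: probability} for independent per-player policies.

    per_player_policies is a list with one dict per player, mapping
    action -> probability. This layout is an assumption of the sketch.
    """
    players_actions = [list(p.keys()) for p in per_player_policies]
    joint = {}
    # Every joint action is one choice per player, so take the Cartesian product.
    for actions in itertools.product(*players_actions):
        prob = 1.0
        for player, action in enumerate(actions):
            prob *= per_player_policies[player][action]
        if prob > 0.0:
            joint[actions] = prob
    return joint


# Player 0 mixes over actions 0 and 1; player 1 always plays action 2.
example = joint_action_probs([{0: 0.25, 1: 0.75}, {2: 1.0}])
```

An algorithm that only branches on one current player's actions would skip this product step, which is the kind of gap the description refers to.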

lukemarris

First pass

open_spiel/python/algorithms/policy_aggregator.py

open_spiel/python/algorithms/policy_aggregator_joint.py

lukemarris · 2023-03-17T13:38:08Z

open_spiel/python/algorithms/policy_aggregator.py

@@ -228,13 +246,10 @@ def assert_type(cond, msg):
      if pid == turn_player:
        # update the current node
        # will need the observation to query the policies
-        if state not in self._policy:
+        if state_key not in self._policy:


Is this fixing a bug that existed in the code previously?

rezunli96 · Mar 17, 2023

Yes, I believe this should be state_key instead of state, because state is a pyspiel.State object, which doesn't seem right to use as a dictionary key.
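To illustrate the point, here is a small self-contained sketch of why a membership test with the object can silently miss entries in a table keyed by strings. It assumes the policy table is keyed by a string key, as the corrected line suggests. `FakeState` and its `key()` method are hypothetical stand-ins, not the pyspiel API, and the sketch relies on Python's default identity-based hashing, which may not match how pyspiel.State behaves.

```python
class FakeState:
    """Hypothetical stand-in for a game state object (not pyspiel.State)."""

    def __init__(self, history):
        self.history = tuple(history)

    def key(self):
        # Hypothetical string key; any stable string derived from the
        # position would serve the same purpose.
        return ",".join(str(a) for a in self.history)


policy = {}
first_visit = FakeState([0, 1])
policy[first_visit.key()] = {0: 0.5, 1: 0.5}

# A second object describing the same position, e.g. from a later traversal.
second_visit = FakeState([0, 1])

# The table holds string keys, so looking up the object itself never matches.
found_by_object = second_visit in policy
# Looking up the derived key finds the existing entry.
found_by_key = second_visit.key() in policy
```

With the object lookup, a check such as `state not in self._policy` would be true on every visit, so the entry would be treated as missing each time.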

open_spiel/python/algorithms/policy_aggregator_joint.py

rezunli96 added 5 commits March 12, 2023 20:21

add simultaneous node support in policy_aggregator.py and exploitability.py

784af15

remove unnecessary comments

e089b2d

fix policy_aggregator_joint.py

5e4bb99

fix state_key bug; fix decimal number in policy_aggregator_test

5b903cb

fix places argument

1bb84fe

lukemarris reviewed Mar 13, 2023

open_spiel/python/algorithms/policy_aggregator.py (outdated, resolved)

open_spiel/python/algorithms/policy_aggregator_joint.py (outdated, resolved)

use legal_actions for used_move

7b9ffa0

lukemarris suggested changes Mar 17, 2023

remove comments, add newline

b0dc735

lanctot added the "imported" and "merged internally" labels Mar 24, 2023

imported: This PR has been imported and awaiting internal review. Please avoid any more local changes, thanks!

merged internally: The code is now submitted to our internal repo and will be merged in the next github sync.

lanctot merged commit 181aca5 into google-deepmind:master Mar 27, 2023
