You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When running PAIRED and REPAIRED on BipedalWalker env. After a few thousands of updates, there will be an “nan” error in self.actor_critic.evaluate_actions() method (check this line
When running PAIRED and REPAIRED on BipedalWalker env. After a few thousands of updates, there will be an “nan” error in self.actor_critic.evaluate_actions() method (check this line
dcd/algos/ppo.py
Line 91 in 0411f1f
I assume this is caused by some gradient explosion or something. Could you guys check this bug?
The text was updated successfully, but these errors were encountered: