[Algo] Update offpolicy examples #1206
LGTM
Would it make sense to have configs a bit more "structured" like
collector:
  frames_per_batch: 1000
replay_buffer:
  size: 1_000_000
then access the fields through
cfg.collector.frames_per_batch
?
Also, how deep do we want the config to be? Maybe we could let the users control the number of cells etc?
Don't we need to update the examples workflow in the CI?
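To make the structured-config suggestion concrete, here is a minimal sketch of nested config access. It uses only the standard library; `to_namespace` and `raw_cfg` are hypothetical names for illustration, and the examples in this PR may load their configs differently (for instance from a YAML file).

```python
from types import SimpleNamespace


def to_namespace(obj):
    """Recursively convert nested dicts into attribute-accessible namespaces."""
    if isinstance(obj, dict):
        return SimpleNamespace(**{k: to_namespace(v) for k, v in obj.items()})
    return obj


# Mirrors the nested layout proposed in the comment (values taken from it).
raw_cfg = {
    "collector": {"frames_per_batch": 1000},
    "replay_buffer": {"size": 1_000_000},
}
cfg = to_namespace(raw_cfg)

# Fields are grouped by component and read with dotted access.
frames_per_batch = cfg.collector.frames_per_batch
buffer_size = cfg.replay_buffer.size
```

Grouping fields by component keeps each script's options discoverable without flattening everything into one namespace.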
alpha=0.7,
beta=0.5,
we could make that parametrizable too
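One way this could look, as a sketch only: the hard-coded `alpha` and `beta` become config fields whose defaults match the values in the diff. `ReplayBufferConfig` and `replay_buffer_kwargs` are hypothetical names, not part of the PR or of any library, and the resulting keyword arguments would still need to be passed to whichever buffer class the example actually constructs.

```python
from dataclasses import dataclass


@dataclass
class ReplayBufferConfig:
    # Hypothetical config group; defaults mirror the values hard-coded in the diff.
    size: int = 1_000_000
    alpha: float = 0.7
    beta: float = 0.5


def replay_buffer_kwargs(cfg: ReplayBufferConfig) -> dict:
    """Collect the prioritization parameters from the config instead of hard-coding them."""
    return {"size": cfg.size, "alpha": cfg.alpha, "beta": cfg.beta}


# Defaults reproduce the current behaviour; a user can override a single field.
default_kwargs = replay_buffer_kwargs(ReplayBufferConfig())
custom_kwargs = replay_buffer_kwargs(ReplayBufferConfig(alpha=0.6))
```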
… into update_offpolicy_examples
… into update_offpolicy_examples
…pdate_offpolicy_examples
Final review
Description
Update the off-policy examples for DDPG, TD3, and SAC to simple scripts.