-
Notifications
You must be signed in to change notification settings - Fork 328
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Feature] Implicit Q-Learning (IQL) #933
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can you add the class to the docs?
We should also add the example to the tests (there is one workflow dedicated to these)
Otherwise LGTM, just a few minor comments
The examples are failing |
It seems like the test can't handle the situation when wandb asks you to select wandb setup options: I changed now the logging to tensorboard in the tests. This should fix it for now but not sure if it's a permanent solution. |
Can't you put it in local mode? I thing this can be done via an env variable. Also there's an error remaining in TD3
|
fixed the td3 issues and added the mode parameter so that for the tests wandb can be run in offline mode. |
Sorry for dropping the ball |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
Description
Adding the Implicit Q-Learning (IQL) objective and an online rl example.
Motivation and Context
Adds the first Offline RL Algorithm to TorchRL. However, currently only an online learning example. Converged for the Pendulum-v1 gym environment:
With the merge of the offline datasets #928 an extra offline example will be added.
Types of changes
What types of changes does your code introduce? Remove all that do not apply:
Checklist
Go over all the following points, and put an
x
in all the boxes that apply.If you are unsure about any of these, don't hesitate to ask. We are here to help!