-
Notifications
You must be signed in to change notification settings - Fork 328
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Algorithm] CrossQ #2033
Merged
Merged
[Algorithm] CrossQ #2033
Changes from 1 commit
Commits
Show all changes
49 commits
Select commit
Hold shift + click to select a range
0a23ae8
add crossQ examples
BY571 9bdee71
add loss
BY571 570a20e
Update naming experiment
BY571 5086249
update
BY571 c3a927f
update add tests
BY571 d1c9c34
detach
BY571 e879b7c
update tests
BY571 75255e7
update run_test.sh
BY571 a7b79c3
move crossq to sota-implementations
BY571 be84f3f
update loss
BY571 2170ad8
update cat prediction
BY571 75d4cee
Merge branch 'main' into crossQ
vmoens 7711a4e
Merge branch 'main' into crossQ
BY571 f0ac167
add batchrenorm to crossq
BY571 37abb14
Merge branch 'crossQ' of github.com:BY571/rl into crossQ
BY571 bc7675a
small fixes
BY571 9543f2e
update docs and sota checks
BY571 53e35f7
hyperparam fix
BY571 172e1c0
test
BY571 fdb7e8b
update batch norm tests
BY571 5501d43
tests
BY571 c47ac84
cleanup
BY571 e718c3f
Merge branch 'main' into crossQ
BY571 f94165e
update
BY571 02c94ff
update lr param
BY571 93b6a7b
Merge branch 'crossQ' of https://github.com/BY571/rl into crossQ
BY571 4b914e6
Apply suggestions from code review
vmoens af8c64a
Merge remote-tracking branch 'origin/main' into crossQ
vmoens 845c8a9
Merge branch 'crossQ' of https://github.com/BY571/rl into crossQ
vmoens 7b4a69d
set qnet eval in actor loss
BY571 77de044
Merge branch 'crossQ' of https://github.com/BY571/rl into crossQ
BY571 35c7a98
take off comment
BY571 68a1a9f
amend
vmoens c04eb3b
Merge branch 'crossQ' of https://github.com/BY571/rl into crossQ
vmoens 12672ee
Merge remote-tracking branch 'origin/main' into crossQ
vmoens 7fbb27d
amend
vmoens ff80481
amend
vmoens caf702e
amend
vmoens 70e2882
amend
vmoens ccd1b7f
amend
vmoens d3c8b0e
Merge remote-tracking branch 'origin/main' into crossQ
vmoens d3e0bb1
Apply suggestions from code review
vmoens 349cb28
amend
vmoens 75a43e7
amend
vmoens abada6c
fix device error
BY571 c878b81
Update objective delay actor
BY571 f222b11
Update tests not expecting target update
BY571 067b560
update example utils
BY571 c010e39
amend
vmoens File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
hyperparam fix
- Loading branch information
commit 53e35f7c71f2ba685dc2ab9add1b4004ef223d53
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Keep?