Skip to content

Actions: August-murr/trl

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
123 workflow runs
123 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Cleanup Cache
Cleanup Cache #51: Scheduled
February 2, 2025 01:45 13s main
February 2, 2025 01:45 13s
Cleanup Cache
Cleanup Cache #50: Scheduled
February 1, 2025 01:46 17s main
February 1, 2025 01:46 17s
Cleanup Cache
Cleanup Cache #49: Scheduled
January 31, 2025 01:42 14s main
January 31, 2025 01:42 14s
Cleanup Cache
Cleanup Cache #48: Scheduled
January 30, 2025 01:40 16s main
January 30, 2025 01:40 16s
📉 Use num_logits_to_keep to reduce memory usage in GRPO (#2683)
Secret Leaks #45: Commit 801582e pushed by August-murr
January 29, 2025 17:15 17s main
January 29, 2025 17:15 17s
Cleanup Cache
Cleanup Cache #47: Scheduled
January 29, 2025 01:41 12s main
January 29, 2025 01:41 12s
🏷️ Add model tags to model trained with GRPO (#2663)
Secret Leaks #44: Commit 1123bd0 pushed by August-murr
January 28, 2025 06:12 16s main
January 28, 2025 06:12 16s
Cleanup Cache
Cleanup Cache #46: Scheduled
January 28, 2025 01:41 17s main
January 28, 2025 01:41 17s
Cleanup Cache
Cleanup Cache #45: Scheduled
January 27, 2025 01:43 12s main
January 27, 2025 01:43 12s
Cleanup Cache
Cleanup Cache #44: Scheduled
January 26, 2025 01:44 16s main
January 26, 2025 01:44 16s
🔎 Finegrained reward logging for GRPO (#2651)
Secret Leaks #43: Commit 317d2d4 pushed by August-murr
January 25, 2025 10:54 21s main
January 25, 2025 10:54 21s
Cleanup Cache
Cleanup Cache #43: Scheduled
January 25, 2025 01:37 12s main
January 25, 2025 01:37 12s
🌯 Fix context manager runtime error when gather is disabled (#2639)
Secret Leaks #42: Commit f34b70a pushed by August-murr
January 24, 2025 07:41 15s main
January 24, 2025 07:41 15s
Cleanup Cache
Cleanup Cache #42: Scheduled
January 24, 2025 01:41 11s main
January 24, 2025 01:41 11s
fix test
Secret Leaks #41: Commit 1eb21e6 pushed by August-murr
January 23, 2025 18:50 22s RLOO_custom_reward_trainer
January 23, 2025 18:50 22s
end this mysery already
Secret Leaks #40: Commit a67b701 pushed by August-murr
January 23, 2025 17:46 17s RLOO_custom_reward_trainer
January 23, 2025 17:46 17s
adding test
Secret Leaks #39: Commit bfb0c5c pushed by August-murr
January 23, 2025 17:40 21s RLOO_custom_reward_trainer
January 23, 2025 17:40 21s
remove get_reward_custom test
Secret Leaks #38: Commit 9b54e56 pushed by August-murr
January 23, 2025 13:54 21s RLOO_custom_reward_trainer
January 23, 2025 13:54 21s
removing get_reward_custom
Secret Leaks #37: Commit 47f2c47 pushed by August-murr
January 23, 2025 13:43 23s RLOO_custom_reward_trainer
January 23, 2025 13:43 23s
rloo custom reward function and test
Secret Leaks #36: Commit b0ba01a pushed by August-murr
January 23, 2025 10:28 18s RLOO_custom_reward_trainer
January 23, 2025 10:28 18s
Cleanup Cache
Cleanup Cache #41: Scheduled
January 23, 2025 01:41 17s main
January 23, 2025 01:41 17s
Cleanup Cache
Cleanup Cache #40: Scheduled
January 22, 2025 01:43 15s main
January 22, 2025 01:43 15s
Cleanup Cache
Cleanup Cache #39: Scheduled
January 21, 2025 01:41 14s main
January 21, 2025 01:41 14s
Merge branch 'main' into tool_fine_tuning_support
Secret Leaks #35: Commit 5deec98 pushed by qgallouedec
January 20, 2025 22:21 16s tool_fine_tuning_support
January 20, 2025 22:21 16s
a comment
Secret Leaks #34: Commit d7caae5 pushed by qgallouedec
January 20, 2025 20:49 15s tool_fine_tuning_support
January 20, 2025 20:49 15s