Pulse · huggingface/trl

January 4, 2025 – January 7, 2025

Sometimes conversations continue on older items that aren't yet closed. The issues and pull requests below have unresolved conversations.

add xpu support for DPO
#2533 commented on Jan 8, 2025 • 6 new comments
🕊️ DPO padding free
#2520 commented on Jan 7, 2025 • 5 new comments
add "_prepare_fsdp" for DPOTrainer
#2539 commented on Jan 8, 2025 • 2 new comments
plz make GPOTrainer! (Generalized Preference Optimization)
#2028 commented on Jan 7, 2025 • 0 new comments
[GRPO] initial GRPO trainer
#1954 commented on Jan 5, 2025 • 0 new comments
Add VAS to TRL
#2195 commented on Jan 7, 2025 • 0 new comments
[Liger] Integrate Liger CPO & SimPO
#2506 commented on Jan 7, 2025 • 0 new comments
RLOO trainer: fix calculations of steps, episodes and epochs
#2516 commented on Jan 7, 2025 • 0 new comments
[ORPO] revert orpo changes
#2527 commented on Jan 6, 2025 • 0 new comments
Issues Auto-Labeller
#2542 commented on Jan 7, 2025 • 0 new comments