Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
-
Updated
Jan 24, 2025 - Python
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
Official Implementation of VideoDPO
code for paper Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning
ZYN: Zero-Shot Reward Models with Yes-No Questions
Code and data for "Timo: Towards Better Temporal Reasoning for Language Models" (COLM 2024)
distilled Self-Critique refines the outputs of a LLM with only synthetic data
Add a description, image, and links to the rlaif topic page so that developers can more easily learn about it.
To associate your repository with the rlaif topic, visit your repo's landing page and select "manage topics."