Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
TREC DL and LLM AggreFact experiments for relevance benchmark + promp…
…ts comparisons and groundedness vs Bespoke Minicheck 7B (#1660) * save * improve trubasicapp setup for competitive experiments, add mlflow instrumentation * save * save progress * save * save * add agreement analysis with scoreddocs * notebook updates * cleanup * dataset preprocessing script update * cleanup competitive analysis * add llm aggrefact nb * add llm-aggrefact experiment notebook * move notebooks * edits * pr comments * add back e2e/data nb
- Loading branch information