Skip to content

Commit

Permalink
TREC DL and LLM AggreFact experiments for relevance benchmark + promp…
Browse files Browse the repository at this point in the history
…ts comparisons and groundedness vs Bespoke Minicheck 7B (#1660)

* save

* improve trubasicapp setup for competitive experiments, add mlflow instrumentation

* save

* save progress

* save

* save

* add agreement analysis with scoreddocs

* notebook updates

* cleanup

* dataset preprocessing script update

* cleanup competitive analysis

* add llm aggrefact nb

* add llm-aggrefact experiment notebook

* move notebooks

* edits

* pr comments

* add back e2e/data nb
  • Loading branch information
sfc-gh-dhuang authored Dec 10, 2024
1 parent e25b561 commit f549d91
Show file tree
Hide file tree
Showing 7 changed files with 1,941 additions and 303 deletions.
3 changes: 3 additions & 0 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -52,6 +52,9 @@ credentials.toml
# test files generated by benchmark experiments
**/tests/*.csv
**/results/*.csv
**/experiments/mlruns/
**/experiments/*.zip
**/experiments/*/*.csv

# test files generated by pytest
test-output.xml
Expand Down
Loading

0 comments on commit f549d91

Please sign in to comment.