Commit

update preprocess readme
gzqaq committed Nov 13, 2024
1 parent 75567eb commit 208ceb2
Showing 1 changed file with 2 additions and 1 deletion.
3 changes: 2 additions & 1 deletion preprocess/README.org
@@ -10,7 +10,7 @@ usage: python cli.py [-h] [-t STEP_TAG] [-s SUFFIX]
DATASET_TYPE FILE

positional arguments:
- DATASET_TYPE Which dataset FILE is (prm800k, math-aps, or math-shepherd)
+ DATASET_TYPE Which dataset FILE is (prm800k, math-aps, math-aps-tree, or math-shepherd)
FILE Path to the original dataset file

options:
@@ -27,4 +27,5 @@ newlines. Several solutions are suggested in [[https://stackoverflow.com/a/50642
** Datasets
- PRM800K: [[https://github.com/openai/prm800k][GitHub repo]]
- Math-APS: collected by ~data/gen_data.py~
- Math-APS-tree: collected by ~data/omegaPRM_v2/~ with ~save_data_tree=True~
- Math-Shepherd: [[https://huggingface.co/datasets/peiyi9979/Math-Shepherd][Hugging Face dataset]]
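
For orientation, the usage text in this diff could be produced by an argument parser along the following lines. This is only a sketch reconstructed from the help output shown above, not the actual contents of ~cli.py~: the function name ~build_parser~, the ~dest~ names, and the use of ~choices~ to validate DATASET_TYPE are all assumptions, and the help strings for ~-t~ and ~-s~ are omitted because the diff does not show them.

```python
import argparse

# Dataset types listed in the updated usage text (includes the new math-aps-tree).
DATASET_TYPES = ("prm800k", "math-aps", "math-aps-tree", "math-shepherd")


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser mirroring the usage line; not the real cli.py."""
    parser = argparse.ArgumentParser(prog="python cli.py")
    # Only the short flags appear in the usage line; dest names are assumed.
    parser.add_argument("-t", dest="step_tag")
    parser.add_argument("-s", dest="suffix")
    parser.add_argument(
        "dataset_type",
        metavar="DATASET_TYPE",
        choices=DATASET_TYPES,  # assumption: the real script may validate differently
        help="Which dataset FILE is (prm800k, math-aps, math-aps-tree, or math-shepherd)",
    )
    parser.add_argument("file", metavar="FILE", help="Path to the original dataset file")
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args()
    print(args.dataset_type, args.file)
```

Under these assumptions, an invocation such as ~python cli.py math-aps-tree FILE~ would be accepted, while an unlisted dataset type would be rejected by argparse before any preprocessing runs.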
