Commit fc0454e
committed
deleted: result_heatmap/gsm_checklist_Meta-Llama-3.1-70B-Instruct_zeroshot.png
deleted: result_heatmap/gsm_checklist_Meta-Llama-3.1-8B-Instruct_zeroshot.png
deleted: result_heatmap/gsm_checklist_claude-3-5-sonnet-20240620_zeroshot.png
deleted: result_heatmap/gsm_checklist_gpt-4o-mini_zeroshot.png
deleted: result_heatmap/gsm_checklist_o1-mini_zeroshot.png
deleted: result_heatmap/gsm_checklist_o1-preview_zeroshot.png
deleted: results/gsm_checklist_Meta-Llama-3-8B-Instruct_task_all_question_all_zeroshot_prediction.jsonl
deleted: "results/gsm_checklist_o1-mini-\346\265\213\350\257\225\344\270\213_task_all_question_all_zeroshot_prediction.json"
deleted: "results/gsm_checklist_o1-mini-\347\254\254\344\270\200\346\254\241\350\267\221_task_all_question_all_zeroshot_prediction.json"
deleted: scripts/draw.py
modified: scripts/llama3-8b-instruct_inference.sh
deleted: scripts/result_matrix.json
deleted: scripts/result_matrix/gsm_checklist_Meta-Llama-3.1-70B-Instruct_zeroshot.npy
deleted: scripts/result_matrix/gsm_checklist_Meta-Llama-3.1-70B-Instruct_zeroshot.png
deleted: scripts/result_matrix/gsm_checklist_Meta-Llama-3.1-8B-Instruct_zeroshot.npy
deleted: scripts/result_matrix/gsm_checklist_Meta-Llama-3.1-8B-Instruct_zeroshot.png
deleted: scripts/result_matrix/gsm_checklist_claude-3-5-sonnet-20240620_zeroshot.npy
deleted: scripts/result_matrix/gsm_checklist_claude-3-5-sonnet-20240620_zeroshot.png
deleted: scripts/result_matrix/gsm_checklist_gpt-4o-mini_zeroshot.npy
deleted: scripts/result_matrix/gsm_checklist_gpt-4o-mini_zeroshot.png
deleted: scripts/result_matrix/gsm_checklist_o1-mini_zeroshot.npy
deleted: scripts/result_matrix/gsm_checklist_o1-mini_zeroshot.png
deleted: scripts/result_matrix/gsm_checklist_o1-preview_zeroshot.npy
deleted: scripts/run.sh
deleted: scripts/utils/__pycache__/extract_ans.cpython-310.pyc
deleted: scripts/utils/__pycache__/extract_ans.cpython-38.pyc
deleted: scripts/utils/__pycache__/prompt_template.cpython-310.pyc
modified: scripts/utils/extract_ans.py1 parent 8fc66a0 commit fc0454e
29 files changed
Lines changed: 4 additions & 56031 deletions
File tree
- result_heatmap
- results
- scripts
- result_matrix
- utils
- __pycache__
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Lines changed: 0 additions & 27866 deletions
This file was deleted.
Lines changed: 0 additions & 1 deletion
This file was deleted.
Lines changed: 0 additions & 27866 deletions
This file was deleted.
0 commit comments