We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
1 parent e9f75c0 commit b86a0f2Copy full SHA for b86a0f2
eval/EVALUATION.md
@@ -59,7 +59,7 @@ pip install -e human-eval
59
python evaluate_humaneval.py -f HumanEval.jsonl -o HumanEval_res.jsonl
60
evaluate_functional_correctness HumanEval_res.jsonl
61
# Qwen-7B-Chat
62
-python evaluate_chat_mmlu.py -f HumanEval.jsonl -o HumanEval_res_chat.jsonl
+python evaluate_chat_humaneval.py -f HumanEval.jsonl -o HumanEval_res_chat.jsonl
63
evaluate_functional_correctness HumanEval_res_chat.jsonl
64
```
65
0 commit comments