Skip to content

Commit

Permalink
add simple human_eval
Browse files Browse the repository at this point in the history
  • Loading branch information
bmosaicml committed Oct 3, 2023
1 parent 0f8b160 commit a84dda0
Show file tree
Hide file tree
Showing 7 changed files with 83 additions and 35 deletions.
34 changes: 34 additions & 0 deletions scripts/eval/yamls/coding_tasks.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -20,3 +20,37 @@ icl_tasks:
pass_at_k: 1
num_beams: 20
icl_task_type: code_evaluation
-
label: human_eval_return_simple
dataset_uri: eval/local_data/programming/human_eval_return_simple.jsonl # ADD YOUR OWN DATASET URI
num_fewshot: [0]
pass_at_k : 1

Check failure on line 27 in scripts/eval/yamls/coding_tasks.yaml

View workflow job for this annotation

GitHub Actions / code-quality (3.9, [dev])

27:12 [colons] too many spaces before colon

Check failure on line 27 in scripts/eval/yamls/coding_tasks.yaml

View workflow job for this annotation

GitHub Actions / code-quality (3.10, [dev])

27:12 [colons] too many spaces before colon
num_beams: 20
-
label: human_eval_return_complex
dataset_uri: eval/local_data/programming/human_eval_return_complex.jsonl # ADD YOUR OWN DATASET URI
num_fewshot: [0]
pass_at_k : 1

Check failure on line 33 in scripts/eval/yamls/coding_tasks.yaml

View workflow job for this annotation

GitHub Actions / code-quality (3.9, [dev])

33:12 [colons] too many spaces before colon

Check failure on line 33 in scripts/eval/yamls/coding_tasks.yaml

View workflow job for this annotation

GitHub Actions / code-quality (3.10, [dev])

33:12 [colons] too many spaces before colon
num_beams: 20
icl_task_type: code_evaluation
-
label: human_eval_25
dataset_uri: eval/local_data/programming/split-0.25.jsonl # ADD YOUR OWN DATASET URI
num_fewshot: [0]
pass_at_k : 1

Check failure on line 40 in scripts/eval/yamls/coding_tasks.yaml

View workflow job for this annotation

GitHub Actions / code-quality (3.9, [dev])

40:12 [colons] too many spaces before colon

Check failure on line 40 in scripts/eval/yamls/coding_tasks.yaml

View workflow job for this annotation

GitHub Actions / code-quality (3.10, [dev])

40:12 [colons] too many spaces before colon
num_beams: 20
icl_task_type: code_evaluation
-
label: human_eval_50
dataset_uri: eval/local_data/programming/split-0.5.jsonl # ADD YOUR OWN DATASET URI
num_fewshot: [0]
pass_at_k : 1

Check failure on line 47 in scripts/eval/yamls/coding_tasks.yaml

View workflow job for this annotation

GitHub Actions / code-quality (3.9, [dev])

47:12 [colons] too many spaces before colon

Check failure on line 47 in scripts/eval/yamls/coding_tasks.yaml

View workflow job for this annotation

GitHub Actions / code-quality (3.10, [dev])

47:12 [colons] too many spaces before colon
num_beams: 20
icl_task_type: code_evaluation
-
label: human_eval_75
dataset_uri: eval/local_data/programming/split-0.75.jsonl # ADD YOUR OWN DATASET URI
num_fewshot: [0]
pass_at_k : 1

Check failure on line 54 in scripts/eval/yamls/coding_tasks.yaml

View workflow job for this annotation

GitHub Actions / code-quality (3.9, [dev])

54:12 [colons] too many spaces before colon

Check failure on line 54 in scripts/eval/yamls/coding_tasks.yaml

View workflow job for this annotation

GitHub Actions / code-quality (3.10, [dev])

54:12 [colons] too many spaces before colon
num_beams: 20
icl_task_type: code_evaluation
35 changes: 0 additions & 35 deletions scripts/eval/yamls/coding_tasks_simple.yaml

This file was deleted.

15 changes: 15 additions & 0 deletions scripts/eval/yamls/eval_gauntlet.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -123,6 +123,21 @@ eval_gauntlet:
- name: human_eval_js
num_fewshot: 0
random_baseline: 0.0
- name: human_eval_return_simple
num_fewshot: 0
random_baseline: 0.0
- name: human_eval_return_complex
num_fewshot: 0
random_baseline: 0.0
- name: human_eval_25
num_fewshot: 0
random_baseline: 0.0
- name: human_eval_50
num_fewshot: 0
random_baseline: 0.0
- name: human_eval_75
num_fewshot: 0
random_baseline: 0.0
- name: world_knowledge_lm_task_subscore
benchmarks:
- name: jeopardy
Expand Down
34 changes: 34 additions & 0 deletions scripts/eval/yamls/tasks.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -194,3 +194,37 @@ icl_tasks:
pass_at_k: 1
num_beams: 20
icl_task_type: code_evaluation
-
label: human_eval_return_simple
dataset_uri: eval/local_data/programming/human_eval_return_simple.jsonl # ADD YOUR OWN DATASET URI
num_fewshot: [0]
pass_at_k : 1

Check failure on line 201 in scripts/eval/yamls/tasks.yaml

View workflow job for this annotation

GitHub Actions / code-quality (3.9, [dev])

201:12 [colons] too many spaces before colon

Check failure on line 201 in scripts/eval/yamls/tasks.yaml

View workflow job for this annotation

GitHub Actions / code-quality (3.10, [dev])

201:12 [colons] too many spaces before colon
num_beams: 20
-
label: human_eval_return_complex
dataset_uri: eval/local_data/programming/human_eval_return_complex.jsonl # ADD YOUR OWN DATASET URI
num_fewshot: [0]
pass_at_k : 1

Check failure on line 207 in scripts/eval/yamls/tasks.yaml

View workflow job for this annotation

GitHub Actions / code-quality (3.9, [dev])

207:12 [colons] too many spaces before colon

Check failure on line 207 in scripts/eval/yamls/tasks.yaml

View workflow job for this annotation

GitHub Actions / code-quality (3.10, [dev])

207:12 [colons] too many spaces before colon
num_beams: 20
icl_task_type: code_evaluation
-
label: human_eval_25
dataset_uri: eval/local_data/programming/human_eval-0.25.jsonl # ADD YOUR OWN DATASET URI
num_fewshot: [0]
pass_at_k : 1

Check failure on line 214 in scripts/eval/yamls/tasks.yaml

View workflow job for this annotation

GitHub Actions / code-quality (3.9, [dev])

214:12 [colons] too many spaces before colon

Check failure on line 214 in scripts/eval/yamls/tasks.yaml

View workflow job for this annotation

GitHub Actions / code-quality (3.10, [dev])

214:12 [colons] too many spaces before colon
num_beams: 20
icl_task_type: code_evaluation
-
label: human_eval_50
dataset_uri: eval/local_data/programming/human_eval-0.5.jsonl # ADD YOUR OWN DATASET URI
num_fewshot: [0]
pass_at_k : 1

Check failure on line 221 in scripts/eval/yamls/tasks.yaml

View workflow job for this annotation

GitHub Actions / code-quality (3.9, [dev])

221:12 [colons] too many spaces before colon

Check failure on line 221 in scripts/eval/yamls/tasks.yaml

View workflow job for this annotation

GitHub Actions / code-quality (3.10, [dev])

221:12 [colons] too many spaces before colon
num_beams: 20
icl_task_type: code_evaluation
-
label: human_eval_75
dataset_uri: eval/local_data/programming/human_eval-0.75.jsonl # ADD YOUR OWN DATASET URI
num_fewshot: [0]
pass_at_k : 1

Check failure on line 228 in scripts/eval/yamls/tasks.yaml

View workflow job for this annotation

GitHub Actions / code-quality (3.9, [dev])

228:12 [colons] too many spaces before colon

Check failure on line 228 in scripts/eval/yamls/tasks.yaml

View workflow job for this annotation

GitHub Actions / code-quality (3.10, [dev])

228:12 [colons] too many spaces before colon
num_beams: 20
icl_task_type: code_evaluation

0 comments on commit a84dda0

Please sign in to comment.