Evaluating contamination effect over `evol-codealpaca-v1` #4

ganler · 2023-11-24T21:18:24Z

Thanks for the great work!

I am curious how this decontaminator would perform over https://huggingface.co/datasets/theblackcat102/evol-codealpaca-v1 which seems to help some SOTA models to achieve 78% pass@1 on HumanEval. Would this work out of the box if one just follows the README examples? Thanks!

andy-yang-1 · 2023-12-01T23:03:25Z

Would this work out of the box if one just follows the README examples

@ganler Feel free to follow the steps, and I am really curious about the results too!

wyt2000 · 2024-03-22T12:44:02Z

Is there any result about this? I also found a lot of duplicated data between evol-codealpaca-v1 and HumanEval benchmark.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Evaluating contamination effect over `evol-codealpaca-v1` #4

Evaluating contamination effect over `evol-codealpaca-v1` #4

ganler commented Nov 24, 2023 •

edited

Loading

andy-yang-1 commented Dec 1, 2023 •

edited

Loading

wyt2000 commented Mar 22, 2024

Evaluating contamination effect over evol-codealpaca-v1 #4

Evaluating contamination effect over evol-codealpaca-v1 #4

Comments

ganler commented Nov 24, 2023 • edited Loading

andy-yang-1 commented Dec 1, 2023 • edited Loading

wyt2000 commented Mar 22, 2024

Evaluating contamination effect over `evol-codealpaca-v1` #4

Evaluating contamination effect over `evol-codealpaca-v1` #4

ganler commented Nov 24, 2023 •

edited

Loading

andy-yang-1 commented Dec 1, 2023 •

edited

Loading