Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Evaluating contamination effect over evol-codealpaca-v1 #4

Open
ganler opened this issue Nov 24, 2023 · 2 comments
Open

Evaluating contamination effect over evol-codealpaca-v1 #4

ganler opened this issue Nov 24, 2023 · 2 comments

Comments

@ganler
Copy link

ganler commented Nov 24, 2023

Thanks for the great work!

I am curious how this decontaminator would perform over https://huggingface.co/datasets/theblackcat102/evol-codealpaca-v1 which seems to help some SOTA models to achieve 78% pass@1 on HumanEval. Would this work out of the box if one just follows the README examples? Thanks!

@andy-yang-1
Copy link
Collaborator

andy-yang-1 commented Dec 1, 2023

Would this work out of the box if one just follows the README examples

@ganler Feel free to follow the steps, and I am really curious about the results too!

@wyt2000
Copy link

wyt2000 commented Mar 22, 2024

Is there any result about this? I also found a lot of duplicated data between evol-codealpaca-v1 and HumanEval benchmark.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants