Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

About the update of environment.yml #70

Open
lucasliunju opened this issue Oct 17, 2024 · 3 comments
Open

About the update of environment.yml #70

lucasliunju opened this issue Oct 17, 2024 · 3 comments

Comments

@lucasliunju
Copy link

Hi,

I try to follow the configuration of environment.yml and cannot run the code successfully. I noticed that the simpo code has some update and I would like to ask whether we need to update the environment.yml

Thank you very much in advance.

@yumeng5
Copy link
Collaborator

yumeng5 commented Oct 20, 2024

Hi @lucasliunju

The environment file should be good for Mistral and Llama3 models. If you are running Gemma2 models, you probably need to upgrade the versions of torch and trl to their latest version. We're working on compiling an upgraded environment file that ensures compatibility for all the models and will provide an update soon.

Best,
Yu

@lucasliunju
Copy link
Author

lucasliunju commented Oct 20, 2024

Hi @yumeng5 ,

Thank you very much for you reply! I can run the code now. By the way, I try to use alpaca-eval to evaluate the performance of SimPO and I find the openai API is very expensive for students. So I would like to ask whether you have some suggestions on that.

Thank you very much!

@xiamengzhou
Copy link
Contributor

@lucasliunju

We've updated the environment info! For AlpacaEval, we find it necessary to use close source models for judging (i.e., GPT, Claude), as there are significant inconsistencies in preferences between open-source models like llama3 and GPT 😢.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants