-
Notifications
You must be signed in to change notification settings - Fork 142
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
llama训练时,best状态存储导致训练卡顿,建议删除存储best文件部分代码,望记得更新 #52
Comments
大佬好,我在pretrain的时候也碰到了训练卡顿的情况,但不知道啥原因。请问是如何分析确定是存储best的部分代码造成卡顿呢? |
你看一下你是多少step存储,然后刚好那个步骤日志显示 saving best 以后 就不训练了,就是这个问题,欢迎加微信沟通,437461219 |
你好,你训练完后文件有多大,我的很小,这是我的执行代码 |
你的训练代码中出现了这个参数 |
No description provided.
The text was updated successfully, but these errors were encountered: