-
Notifications
You must be signed in to change notification settings - Fork 478
Issues: deepseek-ai/DeepSeek-Coder
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
多卡执行微调脚本报错The server socket has failed to bind to [::]:29500 (errno: 98 - Address already in use)
#180
opened Sep 5, 2024 by
zhangyaoyue01
What is the correct padding side for train/eval of base model for FIM?
#173
opened Jul 6, 2024 by
zhzhangcc
Deepseekcoder 6 spitting out corrupt output for code generation question
#166
opened Jun 11, 2024 by
kodergeek
疑惑:为什么 base 模型的 tokenizer 词表中也有类似 <|Assistant|> 这样多用于 chat 模型的 special tokens?
#165
opened Jun 6, 2024 by
yucc-leon
Are EOS tokens masked during pre-training? If so, how does FIM mode know how to connect to the
text_after
?
#163
opened May 29, 2024 by
zhzhangcc
使用vllm加载33b-base或33b-instruct后,使用DS-1000、Program-Aided Math Reasoning (PAL)评估集进行评估,得分很低,与论文上的数据不符
#159
opened Apr 26, 2024 by
aigc001
Why generate "GGGGG...." ,when the input string is longer than a certain length in GGUF model?
#151
opened Mar 26, 2024 by
hzgdeerHo
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.