Skip to content

Issues: GanjinZero/RRHF

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

RRHF with Online Sampling
#57 opened Sep 14, 2024 by sqqiao
Runtime error:数据类型报错
#55 opened Jul 23, 2024 by sqqiao
关于ppl的方差
#54 opened Feb 1, 2024 by skepsun
数据构造问题
#50 opened Dec 11, 2023 by lylcst
bug 计算sft损失的时候
#48 opened Nov 24, 2023 by shyoulala
关于alpaca-7B和LLaMA-7B
#47 opened Oct 22, 2023 by NEUBuffett
有关IMDB数据集的问题
#46 opened Oct 12, 2023 by stevie1023
加载模型的问题
#43 opened Sep 6, 2023 by LiangZhuuu
训练过程OOM的问题
#41 opened Aug 21, 2023 by Guochry
Wombat与RRHF
#40 opened Aug 2, 2023 by Guochry
期待LoRA或ptuning
#31 opened May 31, 2023 by Noyce765103
training with my own gpt2
#22 opened May 3, 2023 by dyyzhmm
PPO implementation
#19 opened Apr 25, 2023 by yuzc19
ProTip! Updated in the last three days: updated:>2024-11-24.