You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
What part(s) of the article would you like to see updated?
Loading the datasets part mainly assumes that prompt, chosen and rejected fields are strings but recently there are a lot of popular datasets like argilla/dpo-mix-7k which contain dicts/JSON, which then needs to be tokenized.
I'm assuming we have to run tokenizer.apply_chat_template on these fields and then use a custom dataset type to train the model.
What piece of documentation is affected?
https://github.com/OpenAccess-AI-Collective/axolotl/blob/main/docs/rlhf.md
What part(s) of the article would you like to see updated?
Loading the datasets part mainly assumes that
prompt
,chosen
andrejected
fields are strings but recently there are a lot of popular datasets like argilla/dpo-mix-7k which contain dicts/JSON, which then needs to be tokenized.I'm assuming we have to run
tokenizer.apply_chat_template
on these fields and then use a custom dataset type to train the model.The training is running currently, but I am not sure if this is the correct way to do this. Can anyone confirm?
Additional Information
No response
Acknowledgements
The text was updated successfully, but these errors were encountered: