Skip to content

Commit

Permalink
Merge pull request #124 from Knight7561/fix-module-2
Browse files Browse the repository at this point in the history
[Module 2 Bug] Fixing wrong args passed to DPOTrainer
  • Loading branch information
burtenshaw authored Jan 8, 2025
2 parents dcde502 + f19444f commit dce7afb
Showing 1 changed file with 7 additions and 0 deletions.
7 changes: 7 additions & 0 deletions 2_preference_alignment/notebooks/dpo_finetuning_example.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -276,6 +276,13 @@
" use_mps_device=device == \"mps\",\n",
" # Model ID for HuggingFace Hub uploads\n",
" hub_model_id=finetune_name,\n",
" # DPO-specific temperature parameter that controls the strength of the preference model\n",
" # Lower values (like 0.1) make the model more conservative in following preferences\n",
" beta=0.1,\n",
" # Maximum length of the input prompt in tokens\n",
" max_prompt_length=1024,\n",
" # Maximum combined length of prompt + response in tokens\n",
" max_length=1536\n",
")"
]
},
Expand Down

0 comments on commit dce7afb

Please sign in to comment.