Implement Vision Language Model chapter #49
Comments
Hi, could the issue be assigned?
Of course, I expect others will contribute too, but open a draft PR with the outline and LGTM.
Hi there 👋🏻
@jungnerd I have been working on SFT for VLMs and just finished it. The DPO notebook is still open; it could be based on the Hugging Face blog post "Preference Optimization for VLMs". There are probably also lots of fixes and optimizations to make in the completed markdown and notebooks for VLMs.
@duydl Am I understanding correctly that you’re suggesting creating a notebook to fine-tune SmolVLM using DPO? If so, that sounds great! I think it would be really exciting to work on a fine-tuning notebook with DPO!
We need to implement the section on VLMs. It should be based on existing content from the Hugging Face ecosystem, adapted for SmolVLM and to the course structure, and it should offer exercises.
Material
Steps to do