[MODULE] Implement Chapter 5: Vision Language Model #59
Thanks for the PR!
This looks like a great structure. I've reviewed the text but not the notebook. Let's get the text in place then move on to the notebook last.
Co-authored-by: burtenshaw <[email protected]>
@burtenshaw I think it is ready for a review.
Nice work! Let's get the notebook work from here in, and then I'll get a reviewer on it.
@burtenshaw I got the notebook working, though the training would take some time on my hardware.
@burtenshaw Seems like this should be merged by tomorrow. Sorry, I got unexpectedly busy and could not work on this. Let's see what I can add before the deadline...
No worries. I am currently merging modules on their release day, so I'll do this tomorrow.
PR for Issue #49: Implement Vision Language Model chapter
Description
This PR adds the Vision Language Model (VLM) chapter.
Changes Introduced
Overview of VLMs
Fine-Tuning Guide
Exercise Notebooks
vlm_usage_sample.ipynb: Demonstrates pre-trained VLM usage for tasks such as image and video processing.
vlm_finetune_sample.ipynb: Guides fine-tuning a VLM for various datasets and advanced methods.