[Model] Add TP and BNB quantization support to LlavaMultiModalProjector #10834

Isotr0py · 2024-12-02T16:01:00Z

I just realized that we still can't load unsloth/llava-1.5-7b-hf-bnb-4bit mentioned in #10813, because it has llava multimodal projector quantized as well. 😅

Add TP and quantization support to llava multimodal projector
Enable uninitialized weights tracking for BNB model loader

Signed-off-by: Isotr0py <[email protected]>

github-actions · 2024-12-02T16:01:15Z

👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can do one of these:

Add ready label to the PR
Enable auto-merge.

🚀

vllm/model_executor/models/llava.py

DarkLight1337

Otherwise looks good

Co-authored-by: Cyrus Leung <[email protected]>

jeejeelee

LGTM， thanks for your fixing.

…or (vllm-project#10834) Signed-off-by: Isotr0py <[email protected]> Co-authored-by: Cyrus Leung <[email protected]> Signed-off-by: Dahai Tang <[email protected]>

Isotr0py added 5 commits December 2, 2024 13:44

add quant support to llava projector

21d1a50

Signed-off-by: Isotr0py <[email protected]>

add weight loading tracker to bnb loader

9685973

Signed-off-by: Isotr0py <[email protected]>

fix quant_param_name not in param_dict location

8cd3d89

Signed-off-by: Isotr0py <[email protected]>

address todo

40c44fc

Signed-off-by: Isotr0py <[email protected]>

code format

f904d32

Signed-off-by: Isotr0py <[email protected]>

Isotr0py requested a review from jeejeelee December 2, 2024 16:02

DarkLight1337 reviewed Dec 2, 2024

View reviewed changes

vllm/model_executor/models/llava.py Outdated Show resolved Hide resolved

DarkLight1337 approved these changes Dec 2, 2024

View reviewed changes

Update vllm/model_executor/models/llava.py

ae080c8

Co-authored-by: Cyrus Leung <[email protected]>

jeejeelee approved these changes Dec 2, 2024

View reviewed changes

jeejeelee enabled auto-merge (squash) December 2, 2024 16:12

github-actions bot added the ready ONLY add when PR is ready to merge/full CI is needed label Dec 2, 2024

jeejeelee merged commit 4c05edb into vllm-project:main Dec 2, 2024
60 checks passed

Isotr0py deleted the llava-bnb branch December 3, 2024 00:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Model] Add TP and BNB quantization support to LlavaMultiModalProjector #10834

[Model] Add TP and BNB quantization support to LlavaMultiModalProjector #10834

Isotr0py commented Dec 2, 2024 •

edited by github-actions bot

Loading

github-actions bot commented Dec 2, 2024

DarkLight1337 left a comment

jeejeelee left a comment

[Model] Add TP and BNB quantization support to LlavaMultiModalProjector #10834

[Model] Add TP and BNB quantization support to LlavaMultiModalProjector #10834

Conversation

Isotr0py commented Dec 2, 2024 • edited by github-actions bot Loading

github-actions bot commented Dec 2, 2024

DarkLight1337 left a comment

Choose a reason for hiding this comment

jeejeelee left a comment

Choose a reason for hiding this comment

Isotr0py commented Dec 2, 2024 •

edited by github-actions bot

Loading