
Update SGLang example to use Qwen2-VL #1030

Merged: 6 commits into main on Jan 10, 2025

Conversation

advay-modal
Contributor

@advay-modal advay-modal commented Jan 2, 2025

Type of Change

  • Example updates (Bug fixes, new features, etc.)

Checklist

  • Example is testable in synthetic monitoring system, or lambda-test: false is added to example frontmatter (---)
    • Example is tested by executing with modal run or an alternative cmd is provided in the example frontmatter (e.g. cmd: ["modal", "deploy"])
    • Example is tested by running with no arguments or the args are provided in the example frontmatter (e.g. args: ["--prompt", "Formula for room temperature superconductor:"])
  • Example is documented with comments throughout, in a Literate Programming style.
  • Example does not require third-party dependencies to be installed locally
  • Example pins its dependencies
    • Example pins container images to a stable tag, not a dynamic tag like latest
    • Example specifies a python_version for the base image, if it is used
    • Example pins all dependencies to at least minor version, ~=x.y.z or ==x.y
    • Example dependencies with version < 1 are pinned to patch version, ==0.y.z
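The pinning rules above can be expressed as a small check. A minimal sketch (the `pin_ok` helper is hypothetical, not part of the examples repo):

```python
import re

def pin_ok(requirement: str) -> bool:
    """Hypothetical checker for the checklist's pinning rules: dependencies
    must be pinned with ~= or == to at least minor version, and packages
    still below 1.0 must be pinned exactly to a patch version (==0.y.z)."""
    m = re.fullmatch(r"[\w.-]+(~=|==)(\d+(?:\.\d+)+)", requirement)
    if not m:
        return False  # unpinned, or pinned only to a major version
    op, version = m.groups()
    parts = version.split(".")
    if parts[0] == "0":
        # sub-1.0 packages: require an exact ==0.y.z pin
        return op == "==" and len(parts) >= 3
    return True  # the regex already guarantees at least x.y
```

For example, `numpy~=1.26.4` and `hf_transfer==0.1.6` pass, while a bare `torch` or a loose `hf_transfer~=0.1.6` (sub-1.0 but not exact) fail.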

Outside contributors

You're great! Thanks for your contribution.

@advay-modal
Contributor Author

advay-modal commented Jan 2, 2025

@charlesfrye I spent a while trying to get this working with Llama 3.2 11B Vision, but the download speed was very slow for some reason.

EDIT: I think I needed to set .env({"HF_HUB_ENABLE_HF_TRANSFER": "1"}). I can retry with that if we have a strong preference for Llama 3.2.
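For context, the env var mentioned in the edit turns on hf_transfer, a Rust-based parallel downloader for Hugging Face Hub weights. A minimal sketch of setting it on the container image, assuming the Modal client library (the version pins here are illustrative):

```python
import modal

# Sketch: install hf_transfer and enable it via HF_HUB_ENABLE_HF_TRANSFER,
# so model weights download in parallel inside the container.
image = (
    modal.Image.debian_slim(python_version="3.11")
    .pip_install("hf_transfer==0.1.6", "huggingface_hub~=0.26.0")
    .env({"HF_HUB_ENABLE_HF_TRANSFER": "1"})
)
```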

@charlesfrye
Collaborator

🚀 The docs preview is ready! Check it out here: https://modal-labs-examples--frontend-preview-75112f8.modal.run

@charlesfrye
Collaborator

🚀 The docs preview is ready! Check it out here: https://modal-labs-examples--frontend-preview-bafc963.modal.run

Collaborator

@charlesfrye charlesfrye left a comment


Nice! Don't forget to change the title now that the model has changed.

Three review threads on 06_gpu_and_ml/llm-serving/sgl_vlm.py (outdated, resolved)
@charlesfrye
Collaborator

🚀 The docs preview is ready! Check it out here: https://modal-labs-examples--frontend-preview-05070e3.modal.run

@charlesfrye
Collaborator

🚀 The docs preview is ready! Check it out here: https://modal-labs-examples--frontend-preview-97ae146.modal.run

@charlesfrye
Collaborator

🚀 The docs preview is ready! Check it out here: https://modal-labs-examples--frontend-preview-9be71ea.modal.run

@charlesfrye
Collaborator

🚀 The docs preview is ready! Check it out here: https://modal-labs-examples--frontend-preview-487f7b6.modal.run

@charlesfrye charlesfrye merged commit a8e9d14 into main Jan 10, 2025
7 checks passed
@charlesfrye charlesfrye deleted the advay/update-sglang-example branch January 10, 2025 22:06
2 participants