Commit

Usage for vllm_generic.py
gongy committed Jan 12, 2024
1 parent 72533c3 commit ffa5c43
Showing 2 changed files with 5 additions and 2 deletions.
2 changes: 1 addition & 1 deletion 06_gpu_and_ml/vllm_generic.py
@@ -1,7 +1,7 @@
# # Generic model inference with vLLM
# Usage: modal deploy vllm_generic.py --public


import os
import time

from modal import Image, Stub, method, Secret
5 changes: 4 additions & 1 deletion 06_gpu_and_ml/vllm_generic_client.py
@@ -1,6 +1,9 @@
# Usage:
# 1. `modal volume create my-hf-cache`
# 2. `python vllm_generic.py`

import modal

# First run `modal volume create my-hf-cache` in the CLI.
model_vol = modal.Volume.lookup("my-hf-cache")
hf_secret = modal.Secret.lookup("huggingface")
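
The two lookups above only succeed if the volume and secret already exist, which is why the usage comment asks for `modal volume create my-hf-cache` first. As a minimal sketch (not part of this commit), a small wrapper can attach that setup hint to a failed lookup. The helper name `lookup_with_hint` is hypothetical, and catching a bare `Exception` is an assumption, since the exact error Modal raises for a missing object is not shown in the diff.

```python
def lookup_with_hint(lookup, name, create_hint):
    """Call lookup(name); if it fails, re-raise with a setup hint attached.

    `lookup` is any callable taking a name, e.g. modal.Volume.lookup.
    Catching Exception is an assumption: the specific error type raised
    for a missing object is not confirmed by the source.
    """
    try:
        return lookup(name)
    except Exception as exc:
        raise RuntimeError(
            f"Could not look up {name!r}. Try running: {create_hint}"
        ) from exc


# Hypothetical usage in the client script:
# model_vol = lookup_with_hint(
#     modal.Volume.lookup, "my-hf-cache", "modal volume create my-hf-cache"
# )
```

Passing the lookup function as an argument keeps the helper independent of Modal itself, so it can be exercised with a stand-in callable.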


0 comments on commit ffa5c43