Reproduce the given benchmark results #500
-
Hello, I would like to know which model and which test tool were used to produce the performance data shown on the homepage. Is the model available on https://huggingface.co/?
Replies: 5 comments
-
Is it lmsys/vicuna-13b-v1.3, young-geng/koala, openlm-research/open_llama_13b, or another model?
-
@zhuohan123 Could you please help me with this question? Thank you.
-
Hi, we use the official LLaMA model (e.g.,
-
We use this script to get the benchmark results: https://github.com/vllm-project/vllm/blob/main/benchmarks/benchmark_serving.py
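For readers who want a feel for what a serving benchmark of this kind measures, here is a minimal Python sketch. It is not the code of benchmark_serving.py: `fake_request` and `run_benchmark` are hypothetical names, the server call is simulated with a short sleep, and the random (Poisson) request arrivals are an assumption chosen because they are a common way to model load.

```python
import asyncio
import random
import time
from typing import Awaitable, Callable, List, Tuple


async def fake_request(prompt: str) -> str:
    """Hypothetical stand-in for an HTTP call to a model server."""
    await asyncio.sleep(0.01)  # pretend the server needs about 10 ms
    return prompt.upper()


async def run_benchmark(
    prompts: List[str],
    request_rate: float,
    send: Callable[[str], Awaitable[str]] = fake_request,
    seed: int = 0,
) -> Tuple[float, float]:
    """Return (requests per second, mean latency in seconds)."""
    if not prompts:
        raise ValueError("prompts must not be empty")

    rng = random.Random(seed)
    latencies: List[float] = []

    async def timed(prompt: str) -> None:
        # Measure the wall-clock time of a single request.
        start = time.perf_counter()
        await send(prompt)
        latencies.append(time.perf_counter() - start)

    bench_start = time.perf_counter()
    tasks = []
    for prompt in prompts:
        tasks.append(asyncio.create_task(timed(prompt)))
        # Exponentially distributed gaps between sends give Poisson arrivals
        # (an assumption of this sketch).
        await asyncio.sleep(rng.expovariate(request_rate))
    await asyncio.gather(*tasks)
    elapsed = time.perf_counter() - bench_start

    return len(prompts) / elapsed, sum(latencies) / len(latencies)


if __name__ == "__main__":
    throughput, mean_latency = asyncio.run(
        run_benchmark([f"prompt {i}" for i in range(20)], request_rate=200.0)
    )
    print(f"{throughput:.1f} req/s, mean latency {mean_latency * 1000:.1f} ms")
```

To measure a real server, `send` would be replaced by a coroutine that posts the prompt to the server's endpoint; the actual arguments and endpoints are defined by the linked script, not by this sketch.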
-
https://github.com/vllm-project/vllm/blob/main/benchmarks/launch_tgi_server.sh