florence2-benchmark

Benchmark Florence2 batch performance using ray[serve].

The Ray Serve engine (https://docs.ray.io/en/latest/serve/getting_started.html) is used to serve the Florence2 model.

Start the Ray Serve server with:

serve run serve_config.yaml

Run the benchmark against the running server with:

python benchmark_serving.py --images-dir PATH_OF_DATA_DIR --request-rate 7.22 --num-prompts 500
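
To give a feel for what a fractional `--request-rate` such as 7.22 might mean, here is a minimal sketch of one common way load generators pace requests: drawing exponentially distributed gaps so that arrivals form a Poisson process. This is an assumption for illustration only; this README does not say how `benchmark_serving.py` schedules requests, and the function name `arrival_offsets` and the `seed` parameter are hypothetical.

```python
import random


def arrival_offsets(num_requests, request_rate, seed=0):
    """Return send times in seconds, measured from the start of the run.

    Assumption: requests follow a Poisson process, so the gaps between
    consecutive requests are exponential with mean 1 / request_rate.
    """
    rng = random.Random(seed)
    offsets = []
    t = 0.0
    for _ in range(num_requests):
        offsets.append(t)
        # expovariate(lambd) samples a gap whose mean is 1 / lambd seconds.
        t += rng.expovariate(request_rate)
    return offsets


# Values taken from the example command: 500 requests at 7.22 requests/second.
offsets = arrival_offsets(num_requests=500, request_rate=7.22)
print(f"last request scheduled at about {offsets[-1]:.1f}s")
```

Under this assumption, 500 requests at 7.22 requests per second would be spread over roughly 69 seconds on average (500 / 7.22), with individual runs varying by a few seconds.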
