In autoscalers.py within serve:
skypilot/sky/serve/autoscalers.py
Lines 258 to 269 in 3f62588
When a single qps check comes in below or above the threshold, `downscale_counter` or `upscale_counter` is reset to 0.
This means a single jitter in qps can disrupt scaling.
I propose sampling over a period, so that scaling is triggered when a given percentage of checks in that period cross the threshold, instead of resetting the counter to 0.
This could be set in the scaling policy.
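A minimal sketch of what the proposed sampling could look like. `WindowedScaleDecider`, `window_size`, and `trigger_fraction` are hypothetical names for illustration only; none of them exist in SkyPilot today, and the real autoscaler tracks more state than this.

```python
from collections import deque


class WindowedScaleDecider:
    """Hypothetical helper (not part of SkyPilot): decide by vote over a window."""

    def __init__(self, window_size: int, trigger_fraction: float) -> None:
        if window_size <= 0:
            raise ValueError("window_size must be positive")
        if not 0.0 < trigger_fraction <= 1.0:
            raise ValueError("trigger_fraction must be in (0, 1]")
        self.trigger_fraction = trigger_fraction
        # Only the most recent `window_size` checks are kept.
        self.samples = deque(maxlen=window_size)

    def record(self, desired: int, current: int) -> int:
        """Record one qps check; return +1 (upscale), -1 (downscale) or 0 (hold)."""
        if desired > current:
            vote = 1
        elif desired < current:
            vote = -1
        else:
            vote = 0
        self.samples.append(vote)

        # Hold until the window is full, so early checks cannot trigger scaling.
        if len(self.samples) < self.samples.maxlen:
            return 0

        total = len(self.samples)
        # Upscale is checked first, so it wins if both fractions qualify
        # (only possible when trigger_fraction <= 0.5).
        if self.samples.count(1) / total >= self.trigger_fraction:
            return 1
        if self.samples.count(-1) / total >= self.trigger_fraction:
            return -1
        return 0


decider = WindowedScaleDecider(window_size=5, trigger_fraction=0.8)
# Four checks ask for more replicas; the third is a single downward jitter.
decisions = [decider.record(desired, current=2) for desired in [3, 3, 1, 3, 3]]
print(decisions)
```

With a plain counter, the jitter on the third check would restart the count. Here it only costs one vote out of five, so the upscale still goes through once the window is full.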
Also, since scaling uses `math.ceil`, it errs on the side of scaling up and keeps qps below the value, treating it as a maximum rather than a target.
skypilot/sky/serve/autoscalers.py
Line 192 in 3f62588
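To illustrate the max-bar vs target difference, here is a small sketch. It assumes the replica count is computed roughly as `ceil(qps / target_qps_per_replica)`. The function names are hypothetical, and the rounding variant is just one possible alternative, not something SkyPilot implements.

```python
import math


def replicas_ceil(qps: float, target_qps_per_replica: float) -> int:
    # Rounds up: per-replica qps never exceeds the configured value (a max bar).
    return math.ceil(qps / target_qps_per_replica)


def replicas_round(qps: float, target_qps_per_replica: float) -> int:
    # Rounds to nearest: per-replica qps hovers around the value (a target).
    # Always keeps at least one replica.
    return max(1, round(qps / target_qps_per_replica))


qps, target = 10.5, 10.0
print(replicas_ceil(qps, target))   # 2 replicas, 5.25 qps each
print(replicas_round(qps, target))  # 1 replica, 10.5 qps
```

With `math.ceil`, a load just 5% over the configured value already adds a second replica, so each replica ends up serving roughly half the configured qps.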
Version & Commit info:
- `sky -v`: 0.7.0
- `sky -c`: 3f62588