Skip to content

Commit

Permalink
chore(trtllm): ensure max throughput scheduling policy is selected
Browse files Browse the repository at this point in the history
  • Loading branch information
mfuntowicz committed Oct 21, 2024
1 parent 1b56a33 commit 4a0f05e
Showing 1 changed file with 1 addition and 0 deletions.
1 change: 1 addition & 0 deletions backends/trtllm/lib/backend.cpp
Original file line number Diff line number Diff line change
Expand Up @@ -53,6 +53,7 @@ tle::ExecutorConfig huggingface::tgi::backends::GetExecutorConfig(const json &co
// Define some configuration variables
execConfig.setKvCacheConfig(tle::KvCacheConfig(true));
execConfig.setEnableChunkedContext(computeCapabilities.isPostAmpere());
execConfig.setSchedulerConfig(tle::SchedulerConfig(tle::CapacitySchedulerPolicy::kMAX_UTILIZATION));
return execConfig;
}

Expand Down

0 comments on commit 4a0f05e

Please sign in to comment.