Actions: vllm-project/vllm

Add label on auto-merge enabled

902 workflow runs

[Core] Allow specifying custom Executor
Add label on auto-merge enabled #52: Pull request #6557 auto_merge_enabled by Yard1
July 19, 2024 23:34 14s
[Core] Allow specifying custom Executor
Add label on auto-merge enabled #51: Pull request #6557 auto_merge_enabled by Yard1
July 19, 2024 22:27 10s
[Bugfix] [SpecDecode] AsyncMetricsCollector: update time since last collection
Add label on auto-merge enabled #50: Pull request #6578 auto_merge_enabled by cadedaniel
July 19, 2024 20:53 11s
[ Kernel ] Enable Dynamic Per Token fp8
Add label on auto-merge enabled #49: Pull request #6547 auto_merge_enabled by robertgshaw2-neuralmagic
July 19, 2024 18:34 13s
[Misc] Fix input_scale typing in w8a8_utils.py
Add label on auto-merge enabled #48: Pull request #6579 auto_merge_enabled by mgoin
July 19, 2024 14:31 11s
[Bugfix][Frontend] remove duplicate init logger
Add label on auto-merge enabled #47: Pull request #6581 auto_merge_enabled by DarkLight1337
July 19, 2024 14:19 13s
[BUGFIX] Raise an error for no draft token case when draft_tp>1
Add label on auto-merge enabled #46: Pull request #6369 auto_merge_enabled by cadedaniel
July 19, 2024 08:21 9s
[Core] Allow specifying custom Executor
Add label on auto-merge enabled #45: Pull request #6557 auto_merge_enabled by Yard1
July 19, 2024 05:06 11s
[Core] Allow specifying custom Executor
Add label on auto-merge enabled #44: Pull request #6557 auto_merge_enabled by comaniac
July 19, 2024 04:57 12s
[Bugfix][Frontend] Fix missing /metrics endpoint
Add label on auto-merge enabled #43: Pull request #6463 auto_merge_enabled by simon-mo
July 19, 2024 02:00 11s
Add support for a rope extension method
Add label on auto-merge enabled #42: Pull request #6553 auto_merge_enabled by simon-mo
July 19, 2024 01:16 1m 17s
[Kernel] Implement fallback for FP8 channelwise using torch._scaled_mm
Add label on auto-merge enabled #41: Pull request #6552 auto_merge_enabled by robertgshaw2-neuralmagic
July 18, 2024 22:00 13s
[ci][test] add correctness test for cpu offloading
Add label on auto-merge enabled #40: Pull request #6549 auto_merge_enabled by youkaichao
July 18, 2024 21:49 12s
[Misc] Small perf improvements
Add label on auto-merge enabled #39: Pull request #6520 auto_merge_enabled by Yard1
July 18, 2024 20:13 15s
[Model] Support Mistral-Nemo
Add label on auto-merge enabled #38: Pull request #6548 auto_merge_enabled by mgoin
July 18, 2024 19:35 15s
[Bugfix] Update flashinfer.py with PagedAttention forwards - Fixes Gemma2 OpenAI Server Crash
Add label on auto-merge enabled #37: Pull request #6501 auto_merge_enabled by comaniac
July 18, 2024 06:10 12s
[Misc] Minor patch for draft model runner
Add label on auto-merge enabled #36: Pull request #6523 auto_merge_enabled by cadedaniel
July 18, 2024 05:39 10s
[BugFix] Avoid secondary error in ShmRingBuffer destructor
Add label on auto-merge enabled #35: Pull request #6530 auto_merge_enabled by youkaichao
July 18, 2024 04:04 15s
[BugFix][Frontend] Use LoRA tokenizer in OpenAI APIs
Add label on auto-merge enabled #34: Pull request #6227 auto_merge_enabled by DarkLight1337
July 18, 2024 00:55 14s
[ Kernel ] FP8 Dynamic-Per-Token Quant Kernel
Add label on auto-merge enabled #33: Pull request #6511 auto_merge_enabled by robertgshaw2-neuralmagic
July 17, 2024 23:08 11s
[ Misc ] Improve Min Capability Checking in compressed-tensors
Add label on auto-merge enabled #32: Pull request #6522 auto_merge_enabled by robertgshaw2-neuralmagic
July 17, 2024 22:05 11s
[Core] Introduce SPMD worker execution using Ray accelerated DAG
Add label on auto-merge enabled #31: Pull request #6032 auto_merge_enabled by rkooo567
July 17, 2024 21:55 11s
[Misc] Small perf improvements
Add label on auto-merge enabled #30: Pull request #6520 auto_merge_enabled by Yard1
July 17, 2024 21:52 11s
[ Kernel ] Fp8 Channelwise Weight Support
Add label on auto-merge enabled #29: Pull request #6487 auto_merge_enabled by robertgshaw2-neuralmagic
July 17, 2024 19:05 11s
[Misc] Use torch.Tensor for type annotation
Add label on auto-merge enabled #28: Pull request #6505 auto_merge_enabled by DarkLight1337
July 17, 2024 12:51 32s