Actions: vllm-project/vllm

Add label on auto-merge enabled

902 workflow runs

[Bug Fix] Illegal memory access, FP8 Llama 3.1 405b
Add label on auto-merge enabled #102: Pull request #6852 auto_merge_enabled by comaniac
July 27, 2024 02:18 9s
Update README.md
Add label on auto-merge enabled #101: Pull request #6847 auto_merge_enabled by zhuohan123
July 27, 2024 00:25 12s
[TPU] Support collective communications in XLA devices
Add label on auto-merge enabled #100: Pull request #6813 auto_merge_enabled by WoosukKwon
July 26, 2024 23:01 14s
enforce eager mode with bnb quantization temporarily
Add label on auto-merge enabled #99: Pull request #6846 auto_merge_enabled by mgoin
July 26, 2024 21:57 10s
[Bugfix][Kernel] Promote another index to int64_t
Add label on auto-merge enabled #98: Pull request #6838 auto_merge_enabled by WoosukKwon
July 26, 2024 17:43 13s
[Misc][TPU] Support TPU in initialize_ray_cluster
Add label on auto-merge enabled #97: Pull request #6812 auto_merge_enabled by WoosukKwon
July 26, 2024 17:28 11s
[Doc] Add missing mock import to docs conf.py
Add label on auto-merge enabled #96: Pull request #6834 auto_merge_enabled by DarkLight1337
July 26, 2024 15:57 11s
[Doc] Add missing mock import to docs conf.py
Add label on auto-merge enabled #95: Pull request #6834 auto_merge_enabled by DarkLight1337
July 26, 2024 13:29 16s
[doc][debugging] add known issues for hangs
Add label on auto-merge enabled #94: Pull request #6816 auto_merge_enabled by DarkLight1337
July 26, 2024 04:45 9s
[Frontend] New allowed_token_ids decoding request parameter
Add label on auto-merge enabled #93: Pull request #6753 auto_merge_enabled by DarkLight1337
July 26, 2024 02:26 11s
[Docs] Publish 5th meetup slides
Add label on auto-merge enabled #92: Pull request #6799 auto_merge_enabled by zhuohan123
July 25, 2024 23:46 12s
[Bugfix] Fix empty (nullptr) channelwise scales when loading wNa16 using compressed tensors
Add label on auto-merge enabled #91: Pull request #6798 auto_merge_enabled by mgoin
July 25, 2024 21:28 13s
Fix ReplicatedLinear weight loading
Add label on auto-merge enabled #90: Pull request #6793 auto_merge_enabled by comaniac
July 25, 2024 20:08 11s
[Core] Fix ray forward_dag error mssg
Add label on auto-merge enabled #89: Pull request #6792 auto_merge_enabled by simon-mo
July 25, 2024 17:04 12s
[Bugfix] Add image placeholder for OpenAI Compatible Server of MiniCPM-V
Add label on auto-merge enabled #88: Pull request #6787 auto_merge_enabled by DarkLight1337
July 25, 2024 16:30 17s
[Bugfix] Fix kv_cache_dtype=fp8 without scales for FP8 checkpoints
Add label on auto-merge enabled #87: Pull request #6761 auto_merge_enabled by mgoin
July 25, 2024 14:51 12s
[Bugfix] Fix encoding_format in examples/openai_embedding_client.py
Add label on auto-merge enabled #86: Pull request #6755 auto_merge_enabled by DarkLight1337
July 25, 2024 02:21 11s
[ Misc ] fp8-marlin channelwise via compressed-tensors
Add label on auto-merge enabled #85: Pull request #6524 auto_merge_enabled by mgoin
July 25, 2024 00:45 11s
[Bugfix] Fix decode tokens w. CUDA graph
Add label on auto-merge enabled #84: Pull request #6757 auto_merge_enabled by comaniac
July 24, 2024 20:40 13s
[Bugfix] Bump transformers to 4.43.2
Add label on auto-merge enabled #83: Pull request #6752 auto_merge_enabled by mgoin
July 24, 2024 18:55 13s
[Bugfix] Fix awq_marlin and gptq_marlin flags
Add label on auto-merge enabled #82: Pull request #6745 auto_merge_enabled by mgoin
July 24, 2024 18:48 15s
[Frontend] split run_server into build_server and run_server
Add label on auto-merge enabled #81: Pull request #6740 auto_merge_enabled by simon-mo
July 24, 2024 15:54 18s
Adding f-string to validation error which is missing
Add label on auto-merge enabled #80: Pull request #6748 auto_merge_enabled by comaniac
July 24, 2024 15:52 14s
[Bugfix] Miscalculated latency lead to time_to_first_token_seconds inaccurate.
Add label on auto-merge enabled #79: Pull request #6686 auto_merge_enabled by comaniac
July 24, 2024 15:44 11s
[Kernel] Tuned FP8 Kernels for Ada Lovelace
Add label on auto-merge enabled #78: Pull request #6677 auto_merge_enabled by mgoin
July 24, 2024 14:39 12s