Skip to content

Actions: neuralmagic/vllm

yapf

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
256 workflow runs
256 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[torch.compile] Dynamic fp8 + rms_norm fusion
yapf #207: Pull request #31 synchronize by ProExpertProg
December 4, 2024 22:45 1m 47s luka/rms-norm-fusion-refactor
December 4, 2024 22:45 1m 47s
December 3, 2024 14:50 2m 9s
[misc] use out argument for flash attention (#10822)
yapf #205: Commit a4c4daf pushed by tlrmchlsmth
December 2, 2024 14:49 2m 3s main
December 2, 2024 14:49 2m 3s
[Bugfix] Ignore lm_head when loading embedding models (#10719)
yapf #204: Commit 9b4b150 pushed by tlrmchlsmth
November 27, 2024 21:13 1m 49s main
November 27, 2024 21:13 1m 49s
[Model] Enable optional prefix when loading embedding models (#10639)
yapf #203: Commit cf73f0c pushed by tlrmchlsmth
November 25, 2024 18:15 1m 58s main
November 25, 2024 18:15 1m 58s
November 23, 2024 01:12 1m 53s
[bugfix] fix full graph tests (#10581)
yapf #201: Commit db100c5 pushed by tlrmchlsmth
November 22, 2024 19:21 2m 47s main
November 22, 2024 19:21 2m 47s
[Minor] Revert change in offline inference example (#10545)
yapf #200: Commit 46fe9b4 pushed by ProExpertProg
November 21, 2024 23:35 1m 54s main
November 21, 2024 23:35 1m 54s
November 21, 2024 14:14 1m 50s
[ci][bugfix] fix kernel tests (#10431)
yapf #198: Commit 2298e69 pushed by robertgshaw2-neuralmagic
November 18, 2024 23:32 1m 45s main
November 18, 2024 23:32 1m 45s
[Model] Remove redundant softmax when using PoolingType.STEP (#10415)
yapf #197: Commit 01aae1c pushed by ElizaWszola
November 18, 2024 12:23 1m 53s main
November 18, 2024 12:23 1m 53s
November 18, 2024 06:24 1m 44s
Semi structured v2
yapf #195: Pull request #32 synchronize by ilmarkov
November 15, 2024 16:53 1m 51s semi_structured_v2
November 15, 2024 16:53 1m 51s
November 15, 2024 14:30 1m 46s
[Bugfix] bitsandbytes models fail to run pipeline parallel (#10200)
yapf #193: Commit ac49b59 pushed by tlrmchlsmth
November 13, 2024 19:27 1m 47s main
November 13, 2024 19:27 1m 47s
Semi structured v2
yapf #192: Pull request #32 opened by ilmarkov
November 13, 2024 11:55 1m 46s semi_structured_v2
November 13, 2024 11:55 1m 46s
[V1] Enable Inductor when using piecewise CUDA graphs (#10268)
yapf #191: Commit 1f55e05 pushed by SageMoore
November 12, 2024 21:44 2m 11s main
November 12, 2024 21:44 2m 11s
[1/N] torch.compile user interface design (#10237)
yapf #190: Commit eea55cc pushed by varun-sundar-rabindranath
November 12, 2024 03:09 1m 53s main
November 12, 2024 03:09 1m 53s
[V1] Fix non-cudagraph op name (#10166)
yapf #189: Commit b5815c8 pushed by SageMoore
November 8, 2024 18:32 1m 46s main
November 8, 2024 18:32 1m 46s
Add hf_transfer to testing image
yapf #188: Pull request #29 synchronize by mgoin
November 7, 2024 15:19 1m 45s test-image-hf-transfer
November 7, 2024 15:19 1m 45s
November 7, 2024 05:42 1m 47s
[CI/Build] Always run the ruff workflow (#10092)
yapf #186: Commit 74f2f8a pushed by tlrmchlsmth
November 7, 2024 00:23 1m 49s main
November 7, 2024 00:23 1m 49s
Emulated dynamic MX quantization
yapf #185: Pull request #27 opened by mgoin
November 6, 2024 18:21 1m 43s mx-quant
November 6, 2024 18:21 1m 43s
[CI/Build] Drop Python 3.8 support (#10038)
yapf #184: Commit 098f94d pushed by tlrmchlsmth
November 6, 2024 14:54 1m 38s main
November 6, 2024 14:54 1m 38s
[Model] Add Idefics3 support (#9767)
yapf #183: Commit a5bba7d pushed by ElizaWszola
November 6, 2024 11:46 1m 47s main
November 6, 2024 11:46 1m 47s