Skip to content

Actions: charlifu/vllm

yapf

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
64 workflow runs
64 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[misc] use out argument for flash attention (#10822)
yapf #64: Commit a4c4daf pushed by charlifu
December 2, 2024 15:33 1m 50s main
December 2, 2024 15:33 1m 50s
[Doc] Fix typos in docs (#10636)
yapf #63: Commit 1b583cf pushed by charlifu
November 25, 2024 19:17 2m 6s main
November 25, 2024 19:17 2m 6s
[Bugfix] Allow token ID-only inputs in Qwen2-Audio (#10536)
yapf #62: Commit e7a8341 pushed by charlifu
November 21, 2024 19:38 1m 50s main
November 21, 2024 19:38 1m 50s
November 21, 2024 18:03 1m 46s
[6/N] torch.compile rollout to users (#10437)
yapf #60: Commit 803f37e pushed by charlifu
November 19, 2024 20:51 1m 52s main
November 19, 2024 20:51 1m 52s
[5/N][torch.compile] torch.jit.script --> torch.compile (#10406)
yapf #59: Commit 7851b45 pushed by charlifu
November 18, 2024 16:14 1m 50s main
November 18, 2024 16:14 1m 50s
[1/N] Initial prototype for multi-modal processor (#10044)
yapf #58: Commit 0b8bb86 pushed by charlifu
November 13, 2024 16:25 1m 47s main
November 13, 2024 16:25 1m 47s
November 4, 2024 16:45 2m 38s
[Frontend] Use a proper chat template for VLM2Vec (#9912)
yapf #56: Commit ba0d892 pushed by charlifu
November 1, 2024 16:07 2m 39s main
November 1, 2024 16:07 2m 39s
[Doc] Update Qwen documentation (#9869)
yapf #55: Commit 5608e61 pushed by charlifu
October 31, 2024 14:12 2m 35s main
October 31, 2024 14:12 2m 35s
[Model][VLM] Add multi-video support for LLaVA-Onevision (#8905)
yapf #54: Commit 5f8d807 pushed by charlifu
October 28, 2024 19:29 2m 48s main
October 28, 2024 19:29 2m 48s
[V1] Support sliding window attention (#9679)
yapf #53: Commit 9645b9f pushed by charlifu
October 25, 2024 15:32 2m 39s main
October 25, 2024 15:32 2m 39s
October 24, 2024 15:25 2m 32s
[Model][Bugfix] Fix batching with multi-image in PixtralHF (#9518)
yapf #51: Commit 5241aa1 pushed by charlifu
October 21, 2024 20:18 2m 44s main
October 21, 2024 20:18 2m 44s
[Model] Molmo vLLM Integration (#9016)
yapf #50: Commit dfe43a2 pushed by charlifu
October 14, 2024 16:36 2m 27s main
October 14, 2024 16:36 2m 27s
[Doc] Improve contributing and installation documentation (#9132)
yapf #49: Commit de24046 pushed by charlifu
October 8, 2024 21:07 2m 31s main
October 8, 2024 21:07 2m 31s
[Build/CI] Upgrade to gcc 10 in the base build Docker image (#8814)
yapf #48: Commit f70bcca pushed by charlifu
September 26, 2024 17:47 2m 25s main
September 26, 2024 17:47 2m 25s
[CI/Build] Re-enabling Entrypoints tests on ROCm, excluding ones that…
yapf #47: Commit 6cb748e pushed by charlifu
September 19, 2024 20:38 2m 19s main
September 19, 2024 20:38 2m 19s
[Feature][kernel] tensor parallelism with bitsandbytes quantization (…
yapf #46: Commit 9855b99 pushed by charlifu
September 17, 2024 16:51 2m 15s main
September 17, 2024 16:51 2m 15s
[Model] support minicpm3 (#8297)
yapf #45: Commit 8a0cf1d pushed by charlifu
September 14, 2024 16:06 2m 15s main
September 14, 2024 16:06 2m 15s
September 9, 2024 15:08 2m 11s
Move verify_marlin_supported to GPTQMarlinLinearMethod (#8165)
yapf #43: Commit 2ee4528 pushed by charlifu
September 5, 2024 16:17 2m 10s main
September 5, 2024 16:17 2m 10s
August 26, 2024 18:17 1m 57s
[ci][test] fix RemoteOpenAIServer (#7838)
yapf #41: Commit aab0fcd pushed by charlifu
August 25, 2024 00:15 1m 47s main
August 25, 2024 00:15 1m 47s
[BugFix] Fix server crash on empty prompt (#7746)
yapf #40: Commit e25fee5 pushed by charlifu
August 23, 2024 15:23 1m 55s main
August 23, 2024 15:23 1m 55s