Skip to content

Actions: SJTU-IPADS/PowerInfer

Publish Docker image

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
135 workflow runs
135 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Fix compiling issue under git worktrees
Publish Docker image #94: Pull request #146 opened by hodlen
February 20, 2024 04:38 1m 6s fix-compile-worktree
February 20, 2024 04:38 1m 6s
Docs: add detailed instruction on downloading HF models
Publish Docker image #90: Pull request #136 synchronize by hodlen
January 29, 2024 08:34 49s docs/download-hf
January 29, 2024 08:34 49s
Docs: add detailed instruction on downloading HF models
Publish Docker image #89: Pull request #136 opened by hodlen
January 29, 2024 08:25 52s docs/download-hf
January 29, 2024 08:25 52s
Fix activation file detection
Publish Docker image #88: Pull request #134 opened by hodlen
January 25, 2024 15:18 1m 5s fix/detect-activation-files
January 25, 2024 15:18 1m 5s
Update README: add Windows-specific commands
Publish Docker image #87: Pull request #133 opened by hodlen
January 25, 2024 14:55 1m 0s docs/windows-command
January 25, 2024 14:55 1m 0s
Remove unused toolchain files
Publish Docker image #86: Pull request #132 opened by hodlen
January 24, 2024 15:19 52s chore/clean-old-deps
January 24, 2024 15:19 52s
Fix CUDA performance regression due to auto offloading
Publish Docker image #85: Pull request #127 synchronize by hodlen
January 24, 2024 14:54 3m 43s fix/gpu-offload-perf
January 24, 2024 14:54 3m 43s
Fix CUDA performance regression due to auto offloading
Publish Docker image #84: Pull request #127 synchronize by hodlen
January 24, 2024 14:13 1m 2s fix/gpu-offload-perf
January 24, 2024 14:13 1m 2s
Fix CUDA performance regression due to auto offloading
Publish Docker image #83: Pull request #127 synchronize by hodlen
January 24, 2024 03:36 56s fix/gpu-offload-perf
January 24, 2024 03:36 56s
Fix CUDA performance regression due to auto offloading
Publish Docker image #82: Pull request #127 synchronize by hodlen
January 23, 2024 07:38 56s fix/gpu-offload-perf
January 23, 2024 07:38 56s
Support CPU/GPU inference on Windows
Publish Docker image #81: Pull request #114 synchronize by hodlen
January 11, 2024 05:10 3s build-windows
January 11, 2024 05:10 3s
Support CPU/GPU inference on Windows
Publish Docker image #80: Pull request #114 synchronize by hodlen
January 11, 2024 05:03 3s build-windows
January 11, 2024 05:03 3s
Support CPU/GPU inference on Windows
Publish Docker image #79: Pull request #114 synchronize by hodlen
January 11, 2024 04:55 3s build-windows
January 11, 2024 04:55 3s
Support CPU/GPU inference on Windows
Publish Docker image #78: Pull request #114 synchronize by bobozi-cmd
January 10, 2024 04:47 2s build-windows
January 10, 2024 04:47 2s
Convert HF models with sparse threshold specified
Publish Docker image #77: Pull request #76 synchronize by hodlen
January 4, 2024 15:20 1m 4s Szy0127:main
January 4, 2024 15:20 1m 4s
Support CPU/GPU inference on Windows
Publish Docker image #76: Pull request #114 synchronize by hodlen
January 4, 2024 13:01 3s build-windows
January 4, 2024 13:01 3s
Support CPU/GPU inference on Windows
Publish Docker image #75: Pull request #114 opened by hodlen
January 3, 2024 13:31 1m 1s build-windows
January 3, 2024 13:31 1m 1s
Support broader python and pip versions
Publish Docker image #74: Pull request #112 synchronize by hodlen
January 3, 2024 10:42 1m 8s fix/python-dep
January 3, 2024 10:42 1m 8s
Fix argument parsing in examples/batched
Publish Docker image #73: Pull request #113 opened by hodlen
January 3, 2024 10:33 1m 5s fix/batched-opts
January 3, 2024 10:33 1m 5s
Support broader python and pip versions
Publish Docker image #72: Pull request #112 opened by hodlen
January 3, 2024 10:29 58s fix/python-dep
January 3, 2024 10:29 58s
Support setting VRAM budget for examples/server
Publish Docker image #71: Pull request #106 opened by hodlen
December 29, 2023 08:10 1m 23s server-vram-budget
December 29, 2023 08:10 1m 23s
Fix generation error under INT4 quantization and batched prompting
Publish Docker image #70: Pull request #99 synchronize by hodlen
December 27, 2023 13:51 1m 9s fix/q4-batch
December 27, 2023 13:51 1m 9s
Fix generation error under INT4 quantization and batched prompting
Publish Docker image #69: Pull request #99 opened by hodlen
December 27, 2023 13:35 57s fix/q4-batch
December 27, 2023 13:35 57s
Update issue templates of PowerInfer
Publish Docker image #68: Pull request #90 synchronize by hodlen
December 26, 2023 17:26 55s issue-templates
December 26, 2023 17:26 55s