Actions: huggingface/text-generation-inference

Showing runs from all workflows
13,707 workflow run results

Add support for FP8 KV cache scales
CI build #1585: Pull request #2628 synchronize by danieldk
October 21, 2024 17:21 4m 12s feature/fp8-kv-cache-scale
Add support for FP8 KV cache scales
Server Tests #3252: Pull request #2628 synchronize by danieldk
October 21, 2024 17:21 4m 16s feature/fp8-kv-cache-scale
Add support for FP8 KV cache scales
Nix Tests #380: Pull request #2628 synchronize by danieldk
October 21, 2024 17:21 4m 16s feature/fp8-kv-cache-scale
Add support for FP8 KV cache scales
Automatic Documentation for Launcher #1554: Pull request #2628 synchronize by danieldk
October 21, 2024 17:21 6m 54s feature/fp8-kv-cache-scale
feat(trtllm): add stop words handling
Secret Leaks #1934: Commit f631742 pushed by mfuntowicz
October 21, 2024 15:08 17s trtllm-stop-words
[TENSORRT-LLM] - Implement new looper thread based backend
Nix Tests #379: Pull request #2357 synchronize by mfuntowicz
October 21, 2024 15:03 6m 22s trtllm-executor-thread
[TENSORRT-LLM] - Implement new looper thread based backend
Server Tests #3251: Pull request #2357 synchronize by mfuntowicz
October 21, 2024 15:03 8m 23s trtllm-executor-thread
[TENSORRT-LLM] - Implement new looper thread based backend
Automatic Documentation for Launcher #1553: Pull request #2357 synchronize by mfuntowicz
October 21, 2024 15:03 7m 1s trtllm-executor-thread
[TENSORRT-LLM] - Implement new looper thread based backend
CI build #1584: Pull request #2357 synchronize by mfuntowicz
October 21, 2024 15:03 55m 47s trtllm-executor-thread
Revert "chore(trtllm): remove unused method"
Secret Leaks #1933: Commit f5b9ee3 pushed by mfuntowicz
October 21, 2024 15:03 17s trtllm-executor-thread
feat(trtllm): add stop words handling
Secret Leaks #1932: Commit 9cf43a7 pushed by mfuntowicz
October 21, 2024 15:00 20s trtllm-stop-words
Upload PR Documentation
Upload PR Documentation #97: completed by Narsil
October 21, 2024 13:25 29s
Choosing input/total tokens automatically based on available VRAM?
Build PR Documentation #208: Pull request #2673 synchronize by Narsil
October 21, 2024 13:24 43s auto_length
Choosing input/total tokens automatically based on available VRAM?
Nix Tests #378: Pull request #2673 synchronize by Narsil
October 21, 2024 13:24 8m 4s auto_length
Choosing input/total tokens automatically based on available VRAM?
Automatic Documentation for Launcher #1552: Pull request #2673 synchronize by Narsil
October 21, 2024 13:24 7m 5s auto_length
Choosing input/total tokens automatically based on available VRAM?
Server Tests #3250: Pull request #2673 synchronize by Narsil
October 21, 2024 13:24 7m 22s auto_length
Choosing input/total tokens automatically based on available VRAM?
CI build #1583: Pull request #2673 synchronize by Narsil
October 21, 2024 13:24 49m 22s auto_length
Remove generated files.
Secret Leaks #1931: Commit a31db04 pushed by Narsil
October 21, 2024 13:24 18s auto_length
break when there's nothing to read (#2582)
Secret Leaks #1930: Commit 058d306 pushed by Narsil
October 21, 2024 13:22 16s main
break when there's nothing to read (#2582)
CI build #1582: Commit 058d306 pushed by Narsil
October 21, 2024 13:22 51m 35s main
pages build and deployment
pages-build-deployment #1014: by Narsil
October 21, 2024 13:22 58s main
Upload PR Documentation
Upload PR Documentation #96: completed by Narsil
October 21, 2024 13:08 32s
Choosing input/total tokens automatically based on available VRAM?
Automatic Documentation for Launcher #1551: Pull request #2673 synchronize by Narsil
October 21, 2024 13:07 7m 26s auto_length