Skip to content
@gpustack

GPUStack

Open-source GPU cluster manager for running large language models(LLMs)

Pinned Loading

  1. gpustack gpustack Public

    Manage GPU clusters for running AI models

    Python 1.2k 113

  2. gguf-parser-go gguf-parser-go Public

    Review/Check GGUF files and estimate the memory usage and maximum tokens per second.

    Go 75 7

  3. vox-box vox-box Public

    A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.

    Python 25 4

  4. llama-box llama-box Public

    LM inference server implementation based on *.cpp.

    C++ 67 9

Repositories

Showing 9 of 9 repositories
  • gguf-parser-go Public

    Review/Check GGUF files and estimate the memory usage and maximum tokens per second.

    gpustack/gguf-parser-go’s past year of commit activity
    Go 75 MIT 7 0 0 Updated Jan 29, 2025
  • llama-box Public

    LM inference server implementation based on *.cpp.

    gpustack/llama-box’s past year of commit activity
    C++ 67 MIT 9 5 0 Updated Jan 29, 2025
  • gpustack Public

    Manage GPU clusters for running AI models

    gpustack/gpustack’s past year of commit activity
    Python 1,153 Apache-2.0 113 162 (1 issue needs help) 4 Updated Jan 27, 2025
  • gpustack-ui Public
    gpustack/gpustack-ui’s past year of commit activity
    TypeScript 4 Apache-2.0 9 0 0 Updated Jan 26, 2025
  • gpustack/gpustack.github.io’s past year of commit activity
    HTML 0 1 0 0 Updated Jan 22, 2025
  • .github Public

    Meta-Github repository for all GPUStack repositories.

    gpustack/.github’s past year of commit activity
    Dockerfile 0 Apache-2.0 0 0 0 Updated Jan 21, 2025
  • vox-box Public

    A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.

    gpustack/vox-box’s past year of commit activity
    Python 25 Apache-2.0 4 5 0 Updated Jan 14, 2025
  • fastfetch Public Forked from fastfetch-cli/fastfetch

    Like neofetch, but much faster because written mostly in C.

    gpustack/fastfetch’s past year of commit activity
    C 0 MIT 457 0 0 Updated Oct 24, 2024
  • gguf-packer-go Public

    Deliver LLMs of GGUF format via Dockerfile.

    gpustack/gguf-packer-go’s past year of commit activity
    Go 8 MIT 2 0 0 Updated Oct 24, 2024

People

This organization has no public members. You must be a member to see who’s a part of this organization.