feat: support quantization for local LLMs #189

Open
kdziedzic68 opened this issue Nov 17, 2024 · 0 comments
Labels
feature New feature or request
Milestone
Ragbits 0.6

Comments

@kdziedzic68
Collaborator

Feature description

The workflow should be planned roughly along the lines of this blog post: https://huggingface.co/blog/4bit-transformers-bitsandbytes

Motivation

Hugging Face provides an easy-to-use interface for configurable model quantization. We should support it so that local LLMs can be run on GPUs with less memory available.
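
For reference, a minimal sketch of the Hugging Face interface in question (not a proposed ragbits API; it assumes the `transformers`, `accelerate`, and `bitsandbytes` packages are installed, and the model id is only an example), following the 4-bit NF4 setup described in the linked blog post:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

model_id = "mistralai/Mistral-7B-Instruct-v0.2"  # example model; any causal LM works

# 4-bit NF4 quantization with bf16 compute and nested quantization,
# as described in the 4bit-transformers-bitsandbytes blog post.
quantization_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
    bnb_4bit_use_double_quant=True,
)

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=quantization_config,  # quantize weights at load time
    device_map="auto",                        # place layers across available devices
)
```

A ragbits-side implementation could presumably expose these `BitsAndBytesConfig` fields as options on the local LLM configuration.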

Additional context

No response

@kdziedzic68 kdziedzic68 added the feature New feature or request label Nov 17, 2024
@mhordynski mhordynski moved this to Backlog in ragbits Nov 18, 2024
@kdziedzic68 kdziedzic68 moved this from Backlog to Ready in ragbits Nov 18, 2024
@mhordynski mhordynski added this to the Ragbits 0.6 milestone Nov 18, 2024