feat(llama.cpp): expose cache_type_k and cache_type_v for quant of kv cache #981
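For context, a minimal sketch of how options like these typically map onto llama.cpp's context parameters, assuming the backend forwards the `cache_type_k` / `cache_type_v` strings to `llama_context_params.type_k` / `type_v` (fields in llama.cpp's current API); the string-to-type helper below is illustrative, not the PR's exact code:

```cpp
#include <string>
#include "llama.h"

// Map a user-facing cache type string to a ggml type for the KV cache.
// Illustrative helper; the PR's actual parsing and supported values may differ.
static ggml_type kv_cache_type_from_str(const std::string & s) {
    if (s == "f16")  return GGML_TYPE_F16;
    if (s == "q8_0") return GGML_TYPE_Q8_0;
    if (s == "q4_0") return GGML_TYPE_Q4_0;
    return GGML_TYPE_F16; // default: unquantized 16-bit KV cache
}

llama_context_params make_ctx_params(const std::string & cache_type_k,
                                     const std::string & cache_type_v) {
    llama_context_params params = llama_context_default_params();
    // Keys and values can be quantized independently, trading accuracy for memory.
    params.type_k = kv_cache_type_from_str(cache_type_k);
    params.type_v = kv_cache_type_from_str(cache_type_v);
    return params;
}
```

Quantizing the KV cache (e.g. to q8_0 or q4_0) shrinks per-token memory for long contexts at some cost in accuracy, which is why the two types are exposed as separate, user-settable options.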
Checks — notify-models.yaml (on: pull_request): notify-discord 0s, notify-twitter 0s