Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Models speak with themselves #605

Open
sha367 opened this issue Aug 7, 2024 · 5 comments
Open

Models speak with themselves #605

sha367 opened this issue Aug 7, 2024 · 5 comments

Comments

@sha367
Copy link

sha367 commented Aug 7, 2024

We are testing MacOS version of the Maid with gguf models from huggingface (https://huggingface.co/bartowski and https://huggingface.co/QuantFactory) and they start to speak with themselves. With anythingLLM these models work fine. I assume there is some issue with end of response token initialization.
IMG_1078
IMG_1077

@danemadsen
Copy link
Member

This is a known issue caused by using the wrong template with a model. Templates are ment to be stored in the gguf but often isn't leading to it falling back to chatml

@danemadsen
Copy link
Member

try settings the template parameter to "gemma" or "llama2"

The available template strings are "chatml", "llama2", "phi3", "zephyr", "monarch", "gemma", "gemma2", "orion", "vicuna", "vicuna-orca", "deepseek", "command-r", "llama3", "minicpm" and "deepseek2"

These are the templates llama.cpp handles out of the box and if they arnt set in gguf file theres currently no way of detecting which template should be used.

@sha367
Copy link
Author

sha367 commented Aug 8, 2024

gemma_screen

Hi @danemadsen, thanks for the previous comment!
I tried to set template parameter to gemma or gemma2, but still model doesn't work.

@sha367
Copy link
Author

sha367 commented Aug 8, 2024

Also after that I tried bunch of other models and I got wrong answers or maid quit unexpectedly.
finance_llama_3_screen

@danemadsen danemadsen reopened this Aug 8, 2024
@Bondigame-personal
Copy link

Hello. I have same issues.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants