some models fail to prepare tokens: No chat template #150

davidkoski · 2024-11-01T21:53:20Z

Model loaded -> id("mlx-community/phi-2-hf-4bit-mlx")
Error: chatTemplate("No chat template was specified")

For models that have a chat template this is fine, but for those that do not:

I don't see a way to probe for the presence of a chat template
nor a way to provide a default

I think we need to do the config-mutate path and inject a template where needed.

The text was updated successfully, but these errors were encountered:

awni · 2024-11-01T23:10:22Z

Base models don't usually have a chat template. Is there something like https://github.com/ml-explore/mlx-examples/blob/main/llms/mlx_lm/generate.py#L213C1-L214C1 in Swift Transformers?

davidkoski · 2024-11-01T23:21:53Z

Base models don't usually have a chat template. Is there something like https://github.com/ml-explore/mlx-examples/blob/main/llms/mlx_lm/generate.py#L213C1-L214C1 in Swift Transformers?

No, nothing like that: https://github.com/huggingface/swift-transformers/blob/main/Sources/Tokenizers/Tokenizer.swift#L359

Perhaps there should be? @pcuenca @maiqingqiang

johnmai-dev · 2024-11-02T05:21:29Z

Base models don't usually have a chat template. Is there something like https://github.com/ml-explore/mlx-examples/blob/main/llms/mlx_lm/generate.py#L213C1-L214C1 in Swift Transformers?

No, nothing like that: https://github.com/huggingface/swift-transformers/blob/main/Sources/Tokenizers/Tokenizer.swift#L359

Perhaps there should be? @pcuenca @maiqingqiang

I agree. Similar to https://github.com/huggingface/transformers/blob/33868a057c02f0368ba63bd1edb746be38fe3d90/src/transformers/tokenization_utils_base.py#L1628-L1629

Alternatively, it could capture throw TokenizerError.chatTemplate and use tokenizer.encode instead.

davidkoski · 2024-12-16T16:39:22Z

Per above I added a workaround -- it will try to apply the template, but if that throws an error it will fall back to a built in template and tokenization.

davidkoski self-assigned this Nov 1, 2024

davidkoski mentioned this issue Nov 1, 2024

add VLM support, refactor common LM code into MLXLMCommon. breaking API changes #151

Merged

davidkoski added a commit that referenced this issue Dec 10, 2024

workaround for #150 -- important as this model is out new default

f18213d

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

some models fail to prepare tokens: No chat template #150

some models fail to prepare tokens: No chat template #150

davidkoski commented Nov 1, 2024

awni commented Nov 1, 2024

davidkoski commented Nov 1, 2024 •

edited

Loading

johnmai-dev commented Nov 2, 2024

davidkoski commented Dec 16, 2024

some models fail to prepare tokens: No chat template #150

some models fail to prepare tokens: No chat template #150

Comments

davidkoski commented Nov 1, 2024

awni commented Nov 1, 2024

davidkoski commented Nov 1, 2024 • edited Loading

johnmai-dev commented Nov 2, 2024

davidkoski commented Dec 16, 2024

davidkoski commented Nov 1, 2024 •

edited

Loading