
Missing Transformers initializer for Falcon models #1988

Open
martin-gorner opened this issue Nov 19, 2024 · 3 comments

@martin-gorner
Contributor

martin-gorner commented Nov 19, 2024

Repro code:
model5 = keras_hub.models.CausalLM.from_preset("hf://tiiuae/falcon-7b-instruct", dtype="bfloat16")
Result:
ValueError: KerasHub has no converter for huggingface/transformers models with model type 'falcon'

Now that the Falcon model family exists in KerasHub, this should work.
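
For comparison, the same call path already succeeds for architectures that have a Transformers converter; for example (the Mistral preset name below is just an illustration):

model_mistral = keras_hub.models.CausalLM.from_preset("hf://mistralai/Mistral-7B-v0.1", dtype="bfloat16")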

@mehtamansi29 mehtamansi29 self-assigned this Dec 4, 2024
@mehtamansi29
Collaborator

Hi @martin-gorner -

Thanks for reporting the issue. You can initialize the falcon-7b-instruct model using the transformers AutoTokenizer and AutoModelForCausalLM classes.

from transformers import AutoModelForCausalLM, AutoTokenizer
import torch

model_name = "tiiuae/falcon-7b-instruct"

# Load the tokenizer and model directly from the Hugging Face Hub.
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype=torch.bfloat16,
    trust_remote_code=True,
    device_map="auto",
)
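
As a quick check that the objects above work, a minimal generation call (the prompt and token count are illustrative):

inputs = tokenizer("Write a haiku about falcons.", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=50)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))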

And to load a model from the Falcon family that is already available as a KerasHub preset (falcon_refinedweb_1b_en), you can do this:
model5 = keras_hub.models.CausalLM.from_preset("hf://keras/falcon_refinedweb_1b_en", dtype="bfloat16")
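
And a quick smoke test of the loaded model (prompt and max_length are illustrative):

print(model5.generate("What is machine learning?", max_length=64))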

Attached a gist here for reference.

github-actions bot commented Dec 19, 2024

This issue is stale because it has been open for 14 days with no activity. It will be closed if no further activity occurs. Thank you.

@github-actions github-actions bot added the stale label Dec 19, 2024
@martin-gorner
Contributor Author

Thanks @mehtamansi29, but this issue is filed against keras-hub. The problem is initializing a KerasHub model from the safetensors checkpoint, as is already possible for Llama, Gemma, etc. I'm logging this because cross-compatibility between KerasHub and Transformers checkpoints is not guaranteed even when the model architecture exists on both sides: a checkpoint translation module is also necessary. It is important to keep track of which model architectures have this translation module implemented and which do not.
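
For context on what such a translation module involves, here is a heavily simplified sketch, not KerasHub's actual converter API (the real converters live under keras_hub/src/utils/transformers/, and every weight name below is an assumption for illustration):

# Illustrative sketch only -- weight names on both sides are assumptions.
from safetensors import safe_open

def convert_falcon_weights(safetensors_path, backbone):
    with safe_open(safetensors_path, framework="np") as hf:
        # Token embeddings: Transformers tensor name -> KerasHub layer.
        backbone.token_embedding.embeddings.assign(
            hf.get_tensor("transformer.word_embeddings.weight")
        )
        # ...and similarly for every attention, MLP, and norm weight,
        # including any transposes or splits (e.g. fused QKV) where
        # the two layouts disagree.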
