Question: Is the model available instruction-tuned? #3

CHesketh76 · 2024-02-29T17:38:34Z

Hello,

Just wondering if the model that you provided on Hugging Face was instruction-tuned to perform the needle-in-a-haystack test.

Also (hypothetically speaking), would some of the practices used to reduce GPU requirements also apply to SSM models? For example, Unsloth reduces GPU demand so that consumer GPUs can train Llama 2 7B and Mistral 7B models. My 8 GB GPU was able to fine-tune Mistral for a small use case of mine. It would be absolutely amazing to see a Mamba-7B model train with half the resources that Unsloth Mistral 7B needs.
