
Check for supported file formats when displaying available inference engines #106

Open
dadmobile opened this issue Jun 10, 2024 · 1 comment


dadmobile commented Jun 10, 2024

Specifically, MLX only supports some weight file formats (safetensors and npz, I think?). We currently only check the architecture, which means you sometimes get a "No safetensors for..." error when trying to run a model with MLX.

There are several possible ways to address this:

  • check weight file formats (there is currently a formats array in the model gallery)
  • use allow_patterns to see if there's a supported file type (lots of reasons this might not work)
  • take advantage of an MLX field on models that specifies the minimum version required for support, like transformers, if we add one (separate issue)
  • create some other way for a plugin to take a model and return whether it's supported (kind of ugly but more flexible)
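The first option above could be sketched roughly as follows. This is a hypothetical illustration, not the project's actual code: the engine names and their format lists are assumptions (MLX is believed to read safetensors and npz), and `available_engines` is an invented helper.

```python
# Hypothetical sketch: map each engine to the weight file formats it supports,
# then filter the engine list by the formats actually present among a model's
# files. Engine names and format sets below are assumptions for illustration.

from pathlib import Path

# Assumed format support per engine (not authoritative).
ENGINE_FORMATS = {
    "mlx": {".safetensors", ".npz"},
    "llama-cpp": {".gguf"},
    "transformers": {".safetensors", ".bin"},
}

def available_engines(model_files):
    """Return engines whose supported formats overlap the model's files."""
    extensions = {Path(f).suffix for f in model_files}
    return [
        engine
        for engine, formats in ENGINE_FORMATS.items()
        if formats & extensions
    ]

print(available_engines(["model.safetensors", "config.json"]))
# -> ['mlx', 'transformers']
print(available_engines(["model-q4.gguf"]))
# -> ['llama-cpp']
```

A check like this would run against the gallery's formats array today, but the same function could take a file listing from a local directory or a Hugging Face repo.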
@dadmobile (Member Author)

More users have hit this: #187

We tried options that added a field to gallery models, which sort of worked, but most users import models either from their drive or from Hugging Face. So we will need to figure out how to detect this automatically in the code that generates the list of possible engines (and then possibly also show somewhere why, for example, MLX isn't available if it is a standard Hugging Face model and you are on a Mac).
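The "show why an engine isn't available" part suggests each engine check should return a reason along with a yes/no, along the lines of the plugin-hook option above. A minimal sketch, assuming a hypothetical `supports` hook per engine (none of these names are the project's actual API):

```python
# Hypothetical sketch: an engine exposes a check that returns whether it
# supports a model and, if not, a reason string the UI can surface. The
# ModelInfo fields and mlx_supports signature are invented for illustration.

import sys
from dataclasses import dataclass, field

@dataclass
class ModelInfo:
    architecture: str
    files: list = field(default_factory=list)  # filenames in repo or local dir

def mlx_supports(model, platform=sys.platform):
    """Return (supported, reason); reason is None when supported."""
    if platform != "darwin":
        return False, "MLX is only available on macOS"
    # Assumed: MLX needs safetensors or npz weights.
    if not any(f.endswith((".safetensors", ".npz")) for f in model.files):
        return False, "No safetensors or npz weights found for this model"
    return True, None

model = ModelInfo(architecture="LlamaForCausalLM", files=["pytorch_model.bin"])
supported, reason = mlx_supports(model, platform="darwin")
print(supported, reason)
# -> False No safetensors or npz weights found for this model
```

With this shape, the engine list code could call every plugin's check and render unsupported engines greyed out with the returned reason, instead of failing later with a "No safetensors for..." error.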
