Webui prototype #13

Merged
merged 8 commits into SkunkworksAI:v1 on Sep 14, 2023

Conversation

fearnworks
Contributor

#7

  • Adds a Gradio interface to run the Base Model and the MoE model side by side
  • Includes an expert-selection table showing the weights and IDs of the selected experts
  • Includes parameter selection for max tokens, expertsK, and method
  • Adds a container build/run option via Docker Compose
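For the container build/run option, a compose file along these lines would suffice. This is an illustrative sketch, not taken from the repository: the service name, build context, and port are assumptions (7860 is Gradio's default port), and the `deploy` stanza is the standard Compose syntax for exposing NVIDIA GPUs to a service.

```yaml
services:
  webui:
    build: .                 # assumes a Dockerfile at the repo root
    ports:
      - "7860:7860"          # Gradio's default listen port
    deploy:
      resources:
        reservations:
          devices:
            - driver: nvidia
              count: all
              capabilities: [gpu]
```

With a file like this, `docker compose up --build` builds the image and starts the web UI in one step.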

@fearnworks fearnworks changed the base branch from main to v1 September 13, 2023 23:37
Contributor Author


I was getting a lot of warnings about `max_tokens` being set but not passed in explicitly. I saw an improvement in runtime performance once it was passed in.
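The fix described above can be sketched as follows. The helper and default names here are hypothetical, not from the repository; the commented `model.generate` call assumes the usual Hugging Face transformers API:

```python
# Hedged sketch: collect generation parameters in one place so that
# max_new_tokens is always forwarded to generate() explicitly, instead of
# being set somewhere but never passed in (the source of the warnings).

DEFAULT_GEN_KWARGS = {
    "max_new_tokens": 256,   # illustrative default
    "do_sample": True,
    "temperature": 0.7,
}

def build_gen_kwargs(**overrides):
    """Merge caller overrides over the defaults, dropping None values."""
    cleaned = {k: v for k, v in overrides.items() if v is not None}
    return {**DEFAULT_GEN_KWARGS, **cleaned}

# Usage (assuming a transformers-style model and tokenized inputs):
# output_ids = model.generate(**inputs, **build_gen_kwargs(max_new_tokens=512))
```

Funneling every call through one kwargs builder also makes the UI's parameter-selection values (max tokens, etc.) trivial to wire in as overrides.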

@pharaouk
Contributor

pharaouk commented Sep 14, 2023

I am facing this error when building the Docker container with the web UI:

```
Traceback (most recent call last):
  File "/hydra-moe/server.py", line 127, in <module>
    moe.initialize_model()
  File "/hydra-moe/moe.py", line 66, in initialize_model
    model, tokenizer = get_inference_model(args, checkpoint_dirs)
  File "/hydra-moe/moe_utils.py", line 52, in get_inference_model
    model = AutoModelForCausalLM.from_pretrained(
  File "/usr/local/lib/python3.10/dist-packages/transformers/models/auto/auto_factory.py", line 493, in from_pretrained
    return model_class.from_pretrained(
  File "/usr/local/lib/python3.10/dist-packages/transformers/modeling_utils.py", line 2903, in from_pretrained
    ) = cls._load_pretrained_model(
  File "/usr/local/lib/python3.10/dist-packages/transformers/modeling_utils.py", line 3260, in _load_pretrained_model
    new_error_msgs, offload_index, state_dict_index = _load_state_dict_into_meta_model(
  File "/usr/local/lib/python3.10/dist-packages/transformers/modeling_utils.py", line 725, in _load_state_dict_into_meta_model
    set_module_quantized_tensor_to_device(
  File "/usr/local/lib/python3.10/dist-packages/transformers/utils/bitsandbytes.py", line 99, in set_module_quantized_tensor_to_device
    new_value = bnb.nn.Params4bit(new_value, requires_grad=False, **kwargs).to(device)
  File "/usr/local/lib/python3.10/dist-packages/bitsandbytes/nn/modules.py", line 178, in to
    return self.cuda(device)
  File "/usr/local/lib/python3.10/dist-packages/bitsandbytes/nn/modules.py", line 156, in cuda
    w_4bit, quant_state = bnb.functional.quantize_4bit(w, blocksize=self.blocksize, compress_statistics=self.compress_statistics, quant_type=self.quant_type)
  File "/usr/local/lib/python3.10/dist-packages/bitsandbytes/functional.py", line 799, in quantize_4bit
    absmax = torch.zeros((blocks,), device=A.device)
RuntimeError: CUDA error: no kernel image is available for execution on the device
CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect.
For debugging consider passing CUDA_LAUNCH_BLOCKING=1.
Compile with TORCH_USE_CUDA_DSA to enable device-side assertions.
```
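For reference, "no kernel image is available for execution on the device" usually means the torch/bitsandbytes binaries inside the image were not compiled for the host GPU's compute capability. A minimal, hedged diagnostic sketch one could run on the failing machine; the helper names are ours, and only `torch.cuda.get_device_capability` and `torch.cuda.get_arch_list` (shown in the trailing comment) are real torch calls:

```python
# A cubin built for arch (major, minor) runs natively on devices with the
# same major version and an equal-or-higher minor. This check ignores PTX
# forward-JIT ("compute_XX" entries), so it is deliberately conservative.

def parse_arch(name):
    """Parse a torch arch string like 'sm_75' or 'compute_80' into (7, 5)."""
    digits = name.split("_")[1]
    return int(digits[:-1]), int(digits[-1])

def kernel_image_compatible(device_capability, compiled_archs):
    """True if any compiled arch can run natively on the device."""
    dev_major, dev_minor = device_capability
    return any(
        major == dev_major and minor <= dev_minor
        for major, minor in map(parse_arch, compiled_archs)
    )

# On the failing machine:
# import torch
# kernel_image_compatible(torch.cuda.get_device_capability(0),
#                         torch.cuda.get_arch_list())
```

If the check fails, the usual fixes are rebuilding the image against torch/bitsandbytes wheels that include the GPU's arch, or targeting the correct `TORCH_CUDA_ARCH_LIST` in the build.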

@pharaouk pharaouk merged commit 0994e5c into SkunkworksAI:v1 Sep 14, 2023