qwen2_moe support w multipack #1455

Merged: 1 commit merged into main on Mar 29, 2024
Conversation

winglian (Collaborator) commented on Mar 29, 2024

Description

resolves #1453

✅ multipack
✅ qwen2_moe 4-bit QLoRA (see the sketch below)
✅ qwen2_moe 16-bit LoRA
❓ qwen2_moe 8-bit LoRA (still failing, see the comment below)
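
For reference, a minimal sketch of what the supported 4-bit QLoRA path looks like using the transformers / peft / bitsandbytes APIs directly, outside axolotl. The rank, alpha, dropout, and target module list are illustrative assumptions, not values from this PR:

```python
# Minimal 4-bit QLoRA setup for Qwen1.5-MoE, sketched with plain
# transformers + peft + bitsandbytes. Hyperparameters are illustrative.
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

model_id = "Qwen/Qwen1.5-MoE-A2.7B"

# NF4 4-bit quantization with bf16 compute, the usual QLoRA recipe.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)

model = AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=bnb_config,
    device_map="auto",
)
model = prepare_model_for_kbit_training(model)

# Illustrative LoRA config; target modules are an assumption, adjust to taste.
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()
```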

winglian (Collaborator, Author) commented:
8-bit LoRA still doesn't work at the moment; the backward pass fails inside bitsandbytes with a shape mismatch:

  File "/root/miniconda3/envs/py3.10/lib/python3.10/site-packages/bitsandbytes/autograd/_functions.py", line 487, in backward
    grad_A = torch.matmul(grad_output, CB).view(ctx.grad_shape).to(ctx.dtype_A)
RuntimeError: mat1 and mat2 shapes cannot be multiplied (2048x1 and 32x2048)
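
For anyone debugging this, a hypothetical minimal repro of this failure class: load the model 8-bit, attach LoRA adapters, and run one forward/backward step. This is an untested sketch; the traceback above came from a full training run, and the LoRA hyperparameters here are assumptions:

```python
# Hypothetical minimal repro for the 8-bit backward failure above:
# int8-quantized load, LoRA adapters, single forward/backward step.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model

model_id = "Qwen/Qwen1.5-MoE-A2.7B"

model = AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=BitsAndBytesConfig(load_in_8bit=True),
    device_map="auto",
)
model = get_peft_model(
    model,
    LoraConfig(r=16, lora_alpha=32,
               target_modules=["q_proj", "v_proj"],  # assumed modules
               task_type="CAUSAL_LM"),
)

tokenizer = AutoTokenizer.from_pretrained(model_id)
batch = tokenizer("hello world", return_tensors="pt").to(model.device)

# If the bug reproduces, the mat1/mat2 shape error surfaces here, in the
# int8 matmul backward (bitsandbytes.autograd._functions, per the traceback).
out = model(**batch, labels=batch["input_ids"])
out.loss.backward()
```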

@winglian winglian merged commit 6086be8 into main Mar 29, 2024
7 checks passed
@winglian winglian deleted the qwen2-moe branch March 29, 2024 15:04
djsaunde pushed a commit that referenced this pull request Dec 17, 2024
Development

Successfully merging this pull request may close these issues:

support fine tuning of Qwen/Qwen1.5-MoE-A2.7B-Chat (#1453)