Add bias support for sparse layers #25

mgoin · 2024-02-16T20:38:20Z

Tested both unstructured and semi_structured - opt-125m has a bias which previous would assert:

  File "neuralmagic-vllm/vllm/model_executor/layers/sparsity/sparse_w16a16_linear_method.py", line 73, in apply_weights
    assert bias is None

from vllm import LLM, SamplingParams

model = LLM(
    "nm-testing/opt-125m-pruned2.4",
    sparsity="semi_structured_sparse_w16a16",
    enforce_eager=True,
)

sampling_params = SamplingParams(max_tokens=10, temperature=0)
outputs = model.generate("Hi", sampling_params=sampling_params)
print(outputs[0].outputs[0].text)

from vllm import LLM, SamplingParams

model = LLM(
    "nm-testing/opt-125m-pruned2.4",
    sparsity="sparse_w16a16",
    enforce_eager=True,
)

sampling_params = SamplingParams(max_tokens=10, temperature=0)
outputs = model.generate("Hi", sampling_params=sampling_params)
print(outputs[0].outputs[0].text)

mgoin added 4 commits February 16, 2024 15:30

Add support for bias for sparse linear methods

9782f9d

Fused bias for semi-structured

887a848

Format

e9587c1

Format

85535a8

mgoin merged commit ab469e5 into main Feb 16, 2024
2 checks passed

mgoin deleted the add-bias-support-for-sparse-layers branch February 16, 2024 22:02

mgoin mentioned this pull request Feb 16, 2024

Bias semi structured sparse #24

Closed

robertgshaw2-redhat pushed a commit that referenced this pull request Feb 20, 2024

Add bias support for sparse layers (#25)

fb1dc1a

robertgshaw2-redhat pushed a commit that referenced this pull request Feb 20, 2024

Add bias support for sparse layers (#25)

fc5ef64

robertgshaw2-redhat pushed a commit that referenced this pull request Feb 21, 2024

Add bias support for sparse layers (#25)

18bd0a9

tlrmchlsmth pushed a commit that referenced this pull request Feb 21, 2024

Add bias support for sparse layers (#25)

ae45b23

robertgshaw2-redhat pushed a commit that referenced this pull request Feb 21, 2024

Add bias support for sparse layers (#25)

1bc4e53

robertgshaw2-redhat pushed a commit that referenced this pull request Feb 22, 2024

Add bias support for sparse layers (#25)

8a4d025

robertgshaw2-redhat pushed a commit that referenced this pull request Feb 22, 2024

Add bias support for sparse layers (#25)

e802bc2

andy-neuma mentioned this pull request Feb 23, 2024

andy/bump main to v0.3.2 #49

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add bias support for sparse layers #25

Add bias support for sparse layers #25

mgoin commented Feb 16, 2024 •

edited

Loading

Add bias support for sparse layers #25

Add bias support for sparse layers #25

Conversation

mgoin commented Feb 16, 2024 • edited Loading

mgoin commented Feb 16, 2024 •

edited

Loading