Allow GPTQModel to auto select Marlin or faster kernels for inference only ops #6908

Annotations

1 error

build (4.36.*, ubuntu-20.04)

failed Jan 7, 2025 in 4m 14s