Allow GPTQModel to auto select Marlin or faster kernels for inference only ops #6208

build (3.9, macos-13)

succeeded Jan 7, 2025 in 1m 21s