Allow GPTQModel to auto select Marlin or faster kernels for inference only ops #3776

This job was skipped