ONNX Runtime / Test GPU

Allow GPTQModel to auto select Marlin or faster kernels for inference only ops #3776

Sign in to view logs

Re-run triggered January 7, 2025 10:25

IlyasMoutawwakil

#2138

LRL-ModelCloud:select-quant-linear-with-pack

Status Skipped

Total duration 7s

Artifacts –

test_onnxruntime_gpu.yml

on: pull_request

Start self-hosted EC2 runner