Remove assert on zero_point dtype of quantize/dequantize op #6475

lsy323 · 2024-02-06T00:33:05Z

Per the current PT2E implementation, the dtype of zp will always be in torch.int64 ref. From the math calculating the zp, it will always be within the bound ref.

Remove assertion on zero_point to have the same dtype as the quantized integer dtype. Assert the dtype has to be integer point.
Per StableHLO spec on the quantized tensor ref (C8) storage_min <= zero_points <= storage_max, add the assertion on the range of zero_point.

cc @sdasgup3 @paulinesho

…6475)

do not assert on zero point dtype

830a9d8

lsy323 requested a review from qihqi February 6, 2024 00:33

assert on dtype, zp range

e8567a9

lsy323 requested a review from sdasgup3 February 6, 2024 21:04

qihqi approved these changes Feb 6, 2024

View reviewed changes

lsy323 merged commit 8128480 into master Feb 6, 2024
18 checks passed

amithrm pushed a commit to amithrm/xla that referenced this pull request Mar 1, 2024

Remove assert on zero_point dtype of quantize/dequantize op (pytorch#…

6e4adaf

…6475)

lsy323 deleted the lsiyuan/remove-dq-dtype-assertion branch March 4, 2024 19:13

bhavya01 pushed a commit that referenced this pull request Apr 22, 2024

Remove assert on zero_point dtype of quantize/dequantize op (#6475)

35af9c3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove assert on zero_point dtype of quantize/dequantize op #6475

Remove assert on zero_point dtype of quantize/dequantize op #6475

lsy323 commented Feb 6, 2024 •

edited

Loading

Remove assert on zero_point dtype of quantize/dequantize op #6475

Remove assert on zero_point dtype of quantize/dequantize op #6475

Conversation

lsy323 commented Feb 6, 2024 • edited Loading

lsy323 commented Feb 6, 2024 •

edited

Loading