I noticed that the quantization mechanism is quantize(w, ak, c) = round(clamp(w, c)/s) × s. This looks like the "Training with simulated quantization" scheme from the paper "Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference". Is that correct?
If so, simulated quantization is only used for training. Does the model also use an integer-arithmetic-only process at inference time, i.e. does it compute on integer values such as round(clamp(w, c)/s)?
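To make the question concrete, here is a minimal sketch of the two modes I mean. It is not taken from this repository: the helper names are hypothetical, clamp(w, c) is assumed to clip to [-c, c], and the scale s is assumed to be c / (2^(bits-1) - 1) for a symmetric signed grid, since the formula above does not define s.

```python
def clamp(w, c):
    # Assumption: symmetric clipping to the range [-c, c].
    return max(-c, min(c, w))


def scale(c, bits):
    # Assumption: symmetric signed grid with 2^(bits-1) - 1 positive levels.
    return c / (2 ** (bits - 1) - 1)


def to_int(w, c, bits):
    # Integer code round(clamp(w, c) / s); this is what an
    # integer-only inference path would store and compute on.
    return round(clamp(w, c) / scale(c, bits))


def fake_quantize(w, c, bits):
    # Simulated quantization: round(clamp(w, c) / s) * s.
    # The result is still a float, only snapped to the quantization grid.
    return to_int(w, c, bits) * scale(c, bits)


def int_dot(q_a, q_b):
    # Dot product on integer codes only; the float scales are applied once afterwards.
    return sum(x * y for x, y in zip(q_a, q_b))


weights = [0.31, -0.8, 1.7]
inputs = [0.9, 0.1, -0.25]
c, bits = 1.0, 8
s = scale(c, bits)

# Training-style path: float arithmetic on fake-quantized values.
sim = sum(fake_quantize(w, c, bits) * fake_quantize(x, c, bits)
          for w, x in zip(weights, inputs))

# Inference-style path: integer accumulation, then a single rescale by s * s.
acc = int_dot([to_int(w, c, bits) for w in weights],
              [to_int(x, c, bits) for x in inputs])
integer_path = acc * s * s
```

Under these assumptions both paths give the same result up to floating-point rounding, so my question is whether the released inference code takes the second path or keeps using floats as in the first.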