
Add smooth quant #1398

Draft · mht-sharma wants to merge 5 commits into base: main
Conversation

mht-sharma (Contributor) commented:

What does this PR do?

Integrates SmoothQuant into the Optimum ONNXRuntime Quantizer.

  1. The implementation uses Intel Neural Compressor (INC) behind the scenes for the quantization.
  2. To apply SmoothQuant, call quantizer.apply_smooth_quant before the calibration step of the regular quantization process.

Usage

# Load calibration dataset
calibration_dataset = quantizer.get_calibration_dataset(...)

# Apply smooth quantization to the model
quantizer.apply_smooth_quant(
    dataset=calibration_dataset,
    save_dir=save_dir,
    quantization_config=qconfig,
)

# Perform calibration
quantizer.fit(...)
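
For orientation, here is a fuller end-to-end sketch of where apply_smooth_quant would sit in the static quantization workflow. Only the apply_smooth_quant call follows the snippet above; the model name, dataset, preprocessing, and the surrounding ORTQuantizer calls (get_calibration_dataset, fit, quantize) are illustrative assumptions based on the existing Optimum static quantization flow and may differ from the final API:

from functools import partial

from transformers import AutoTokenizer
from optimum.onnxruntime import ORTModelForSequenceClassification, ORTQuantizer
from optimum.onnxruntime.configuration import AutoCalibrationConfig, AutoQuantizationConfig

model_id = "distilbert-base-uncased-finetuned-sst-2-english"  # illustrative model
save_dir = "smooth_quantized_model"

# Export the model to ONNX and build the quantizer
model = ORTModelForSequenceClassification.from_pretrained(model_id, export=True)
tokenizer = AutoTokenizer.from_pretrained(model_id)
quantizer = ORTQuantizer.from_pretrained(model)

# Static quantization config
qconfig = AutoQuantizationConfig.avx512_vnni(is_static=True, per_channel=False)

# Load calibration dataset
def preprocess_fn(examples, tokenizer):
    return tokenizer(examples["sentence"], padding="max_length", truncation=True)

calibration_dataset = quantizer.get_calibration_dataset(
    "glue",
    dataset_config_name="sst2",
    preprocess_function=partial(preprocess_fn, tokenizer=tokenizer),
    num_samples=64,
    dataset_split="train",
)

# Apply SmoothQuant to the model before calibration (the method added by this PR)
quantizer.apply_smooth_quant(
    dataset=calibration_dataset,
    save_dir=save_dir,
    quantization_config=qconfig,
)

# Perform calibration, then quantize as usual
calibration_config = AutoCalibrationConfig.minmax(calibration_dataset)
ranges = quantizer.fit(
    dataset=calibration_dataset,
    calibration_config=calibration_config,
    operators_to_quantize=qconfig.operators_to_quantize,
)
quantizer.quantize(
    save_dir=save_dir,
    calibration_tensors_range=ranges,
    quantization_config=qconfig,
)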

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you make sure to update the documentation with your changes?
  • Did you write any new necessary tests?

Comment on lines +229 to +240
import importlib

try:
    importlib.import_module("neural_compressor.adaptor.ox_utils.smooth_quant")
except Exception as e:
    logging.error(f"{e}.")
    raise RuntimeError("Neural-compressor is required for SmoothQuant. Please install the library") from e

import copy

import onnx
from neural_compressor.adaptor.ox_utils.smooth_quant import ORTSmoothQuant
Contributor (reviewer) commented:
This is not recommended, reference: https://peps.python.org/pep-0008/#imports

mht-sharma (Contributor, Author) replied:

This is something done by the ONNXRuntime source code too! Are you suggesting importing it at the top level? https://github.com/microsoft/onnxruntime/blob/0f72739b6db129373d221483d61d6637ec11fb28/onnxruntime/python/tools/quantization/quantize.py#L421
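
(For illustration only, not this PR's code: a common middle ground between PEP 8 top-level imports and the in-function import shown above is to probe for the optional dependency once at module import time and defer both the hard failure and the heavy import to the call site. apply_smooth_quant here is a hypothetical stand-in.)

import importlib.util

# Cheap availability probe at module import time; does not import the heavy package
_NEURAL_COMPRESSOR_AVAILABLE = importlib.util.find_spec("neural_compressor") is not None


def apply_smooth_quant(*args, **kwargs):
    if not _NEURAL_COMPRESSOR_AVAILABLE:
        raise ImportError(
            "SmoothQuant requires neural-compressor. Install it with `pip install neural-compressor`."
        )
    # Heavy import stays local so the rest of the module loads without INC installed
    from neural_compressor.adaptor.ox_utils.smooth_quant import ORTSmoothQuant
    ...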

Labels: none yet
Projects: none yet
2 participants