ONNX Runtime is a cross-platform inference and training machine-learning accelerator. Learn more about it here.
This repo demonstrates how converting a model saved in H5
format to .onnx
for a free speed gain. The model used is a simple neural network trained on fashion_mnist dataset.
- Python 3.8+
pip3 install -r requirements.txt
Run all cells in onnx.ipynb