Skip to content

v1.24.0: SD3 & Flux, DinoV2, Modernbert, GPTQModel, Transformers v4.48...

Latest
Compare
Choose a tag to compare
@IlyasMoutawwakil IlyasMoutawwakil released this 30 Jan 16:26
· 2 commits to main since this release

Release Notes: Optimum v1.24.0

We’re excited to announce the release of Optimum v1.24.0. This update expands ONNX-based model capabilities and includes several improvements, bug fixes, and new contributions from the community.

🚀 New Features & Enhancements

  • ORTQuantizer now supports models with ONNX subfolders.
  • ONNX Runtime IO Binding support for all supported Transformers models (no models left behind).
  • SD3 and Flux model support added to ORTDiffusionPipeline enabling latest diffusion-based models.
  • Transformers v4.47 and v4.48 compatibility, ensuring seamless integration with the latest advancements in Hugging Face's ecosystem.
  • ONNX export support extended to various models, including Decision Transformer, ModernBERT, Megatron-BERT, Dinov2, OLMo, and many more (see details).

🔧 Key Fixes & Optimizations

  • Dropped support for Python 3.8
  • Bug fixes in ModelPatcher, SDXL refiner export, and device checks for improved reliability.

👥 New Contributors

A huge thank you to our first-time contributors:

Your contributions make Optimum better! 🎉

For a detailed list of all changes, please check out the full changelog.

🚀 Happy optimizing!

What's Changed