0.1.0b1 - Initial Release

@mfuntowicz released this 18 Dec 15:05
4aa44f6

This is the first release of optimum-nvidia. It focuses on bringing the latest performance improvements, such as float8, to Llama-based models on the latest generation of Nvidia Tensor Core GPUs.