0.1.0b1 - Initial Release
This is the first release of optimum-nvidia.
It focuses on bringing the latest performance improvements, such as float8, to Llama-based models
on the latest generation of NVIDIA Tensor Core GPUs.