Skip to content

v0.1.0b4

Compare
Choose a tag to compare
@mfuntowicz mfuntowicz released this 21 Mar 14:29
· 70 commits to main since this release
5ee2ff0

#Highlights

  • Update to TensorRT-LLM version 03-19-2024
  • pip installation
  • Float8 quantization workflow updated on more robust
  • Save and restore prebuild engine from the Hugging Face Hub or locally on the machine

What's Changed

New Contributors

Full Changelog: v0.1.0b3...v0.1.0b4