What's Changed
- Post-release setup for 0.5.5.dev1 by @Andrei-Aksionov in #1885
- Falcon3 by @ysjprojects in #1881
- ChatML prompt template by @ysjprojects in #1882
- Small fixes and refactoring by @mseeger in #1861
- Drop interleave placement in QKV matrix by @Andrei-Aksionov in #1013
- Bump PyTorch, PyTorch-Lightning and BnB versions by @Andrei-Aksionov in #1893
- Pin version of mistune in check links workflow by @Andrei-Aksionov in #1895
- Skip converting .safetensors to .bin by @ysjprojects in #1853
- Some improvements for KV caching by @mseeger in #1891
- added query-key norm to accomodate OLMo2 by @ysjprojects in #1894
- Improve HF download speed by @rasbt in #1899
- Bump version for 0.5.5 release by @rasbt in #1901
New Contributors
Full Changelog: v0.5.4...v0.5.5