Flash decoding kernel adding and prefill-chunking and prefix caching enabling in intel cpu/xpu#2815
Open
sywangyi wants to merge 7 commits intohuggingface:main from sywangyi:flash_decoding
+100-41
Commits
Commits on Nov 25, 2024
- committed
- committed
Commits on Dec 2, 2024
Commits on Dec 10, 2024
Commits on Dec 19, 2024
Commits on Dec 20, 2024
- committed