Feature/npu #5310
Commits on Nov 30, 2023
[accelerator] init the accelerator module (#5129)
* [accelerator] init the accelerator module
* polish code
* polish code
* polish code
* polish code
Commit f4e72c9
[npu] support triangle attention for llama (#5130)
* update fused attn
* update spda
* tri attn
* update triangle
* import
* fix
* fix
Commit d6df19b
Commits on Jan 8, 2024
[npu] use extension for op builder (#5172)
* update extension
* update cpu adam
* update is
* add doc for cpu adam
* update kernel
* update commit
* update flash
* update memory efficient
* update flash attn
* update flash attention loader
* update api
* fix
* update doc
* update example time limit
* reverse change
* fix doc
* remove useless kernel
* fix
* not use warning
* update
* update
Commit dd2c28a
Commits on Jan 9, 2024
[npu] change device to accelerator api (#5239)
* update accelerator
* fix timer
* fix amp
* update
* fix
* update bug
* add error raise
* fix autocast
* fix set device
* remove doc accelerator
* update doc
* update doc
* update doc
* use nullcontext
* update cpu
* update null context
* change time limit for example
* udpate
* update
* update
* update
* [npu] polish accelerator code
---------
Co-authored-by: Xuanlei Zhao <[email protected]>
Co-authored-by: zxl <[email protected]>
Commit d202cc2
Commit 9102d65
Commits on Jan 18, 2024
Commit 1484693
Merge pull request #5278 from ver217/sync/npu
[sync] sync npu branch with main
Commit d66e698
Commits on Jan 22, 2024
Commit d7f8db8
Commits on Jan 25, 2024
[feat] refactored extension module (#5298)
* [feat] refactored extension module
* polish
* polish
* polish
* polish
* polish
* polish
* polish
* polish
* polish
* polish
Commit 7cfed5f
Commits on Jan 29, 2024
Commit 73f4dc5