-
-
Notifications
You must be signed in to change notification settings - Fork 2.1k
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
feat: auto select llama-cpp cuda runtime (#2306)
* auto select cpu variant Signed-off-by: Sertac Ozercan <[email protected]> * remove cuda target for now Signed-off-by: Sertac Ozercan <[email protected]> * fix metal Signed-off-by: Sertac Ozercan <[email protected]> * fix path Signed-off-by: Sertac Ozercan <[email protected]> * cuda Signed-off-by: Sertac Ozercan <[email protected]> * auto select cuda Signed-off-by: Sertac Ozercan <[email protected]> * update test Signed-off-by: Sertac Ozercan <[email protected]> * select CUDA backend only if present Signed-off-by: mudler <[email protected]> * ci: keep cuda bin in path Signed-off-by: mudler <[email protected]> * Makefile: make dist now builds also cuda Signed-off-by: mudler <[email protected]> * Keep pushing fallback in case auto-flagset/nvidia fails There could be other reasons for which the default binary may fail. For example we might have detected an Nvidia GPU, however the user might not have the drivers/cuda libraries installed in the system, and so it would fail to start. We keep the fallback of llama.cpp at the end of the llama.cpp backends to try to fallback loading in case things go wrong Signed-off-by: mudler <[email protected]> * Do not build cuda on MacOS Signed-off-by: mudler <[email protected]> * cleanup Signed-off-by: Sertac Ozercan <[email protected]> * Apply suggestions from code review Signed-off-by: Ettore Di Giacinto <[email protected]> --------- Signed-off-by: Sertac Ozercan <[email protected]> Signed-off-by: Ettore Di Giacinto <[email protected]> Signed-off-by: mudler <[email protected]> Co-authored-by: Ettore Di Giacinto <[email protected]> Co-authored-by: mudler <[email protected]>
- Loading branch information
1 parent
84e2407
commit a670318
Showing
5 changed files
with
62 additions
and
58 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters