-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[refactor] Make llm.c modular #1
Conversation
after spending some time messing around with llm.c, i feel like the author doesn't intend to make this good for engineering, but more like an example for demonstrating how large and complex modern llm works under the hood. thus embedding llm.c into WasmEdge is not as easy as ggml. There are at least 4 problems i can think of:
|
Let's create a CMakefile for it. So we can reduce several dependencies searching issue.
Since there is an issue talked about the CMake support, I don't think the upstream would like to merge our changes.
IMO, we should have a flag in the CMakefile, so users can choose which backend they really need. I expect that there will be two plugins, one for CPU and another for GPU(cuda). This should be easier for the installer, and let users select which version they want.
Oh, this is a big issue. However, we can have a tutorial to tell users how to get the first step done. We can also provide an existing checkpoint in the example and share it with users. |
There's a bug in cudnn-frontend (NVIDIA/cudnn-frontend#90) that blocks us from supporting cuda training. maybe we can just land the CPU support in this pr? @hydai |
Hi @junaire |
4016420
to
b52413a
Compare
I did some local testing and it works, I feel good to land this now. @hydai |
@junaire |
Hi @junaire |
can you approve the workflow? |
146d90a
to
d709e38
Compare
i may need some help here... any ideas about why the windows shit keeps failing? @hydai |
33c932f
to
399e56c
Compare
Hi @junaire |
gpu tests failing seem to be unrelated, as they are required to be run on self hosted runners. |
Let's remove these self-hosted workflow jobs for now. Since we are not able to deploy such runners inside this repo. |
Signed-off-by: Jun Zhang <[email protected]>
Done. |
Related WasmEdge/WasmEdge#3523