[Bug] KoboldCpp - Version 1.47.1/1.47.2.yr0-ROCm programme crash and close #495
Comments
I'm having the same issue. System: Operating System: Windows 11 Professional 22H2; KoboldCpp - Version 1.47.2.yr0-ROCm |
@Richard7656 @jamesalster |
Please tell us about these workarounds! I have a 6600 xt (gfx1032) and I get the same error. |
@Richard7656 @jamesalster @brknsoul After having done these steps you can compile KoboldCpp from source by following the instructions at https://github.com/YellowRoseCx/koboldcpp-rocm/releases/tag/Windows-v1.43-ROCm (they are listed under the 1.43 release, but they also apply to higher versions of the ROCm branch). Then you can run This is at least what I did on my RX 5700 XT (gfx1010). If you don't want to do these steps, you could wait until YellowRoseCx themself manages to compile the EXE for your LLVM arches, or I could try helping you if you have issues (though I am pretty clueless, so I don't know everything). |
Why remove gfx908? Why stop compiling for it? |
@ccbadd What do you mean? |
@Kazaflow, running python rmake.py (in native tools cmd prompt as admin) results in ;
Full error; |
@brknsoul But it seems it doesn't find something it tries to link. Your cmakefail file also says,
I think it would tell us what exactly it failed at.
|
I guess I'll wait until YellowRose makes an exe for gfx1032 |
Well, that could take some time, since gfx1032 support in the HIP SDK has to come first (and it's not like AMD seems to be picking up the pace, as far as I know). Kinda sucks it doesn't work like 1010. The only way for any gfx1032 card seems to be Linux, with that HSA override, for the time being. |
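The "HSA override" mentioned above presumably refers to ROCm's `HSA_OVERRIDE_GFX_VERSION` environment variable on Linux. A minimal sketch of setting it for a child process, assuming the commonly used value `10.3.0` (which makes the runtime treat the card as gfx1030); the helper name is hypothetical and the value is not confirmed anywhere in this thread:

```python
import os


def env_with_gfx_override(version="10.3.0", base_env=None):
    """Return a copy of the environment with HSA_OVERRIDE_GFX_VERSION set.

    The default "10.3.0" is an assumption (the value usually quoted for
    RDNA2 cards such as gfx1032); adjust it for other hardware.
    """
    env = dict(os.environ if base_env is None else base_env)
    env["HSA_OVERRIDE_GFX_VERSION"] = version
    return env


env = env_with_gfx_override()
print(env["HSA_OVERRIDE_GFX_VERSION"])
```

The returned dictionary can be passed as `env=` to `subprocess.run` when launching the program, so the override only affects that one process.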
Sorry for the delayed response, but my point is that gfx908 (MI100) is one of the included, fully supported libraries. When I read your earlier response, it sounded like it was just dropped from the default compile list. For whatever reason, I am having problems compiling it myself, as anything multi-GPU outputs gibberish. I have come to believe that the problem is actually with recent changes to llama.cpp and the memory pool changes they are trying to make. |
@ccbadd |
@Kazaflow I'm not running Windows, so that explains the difference. MI100s are only supported in Linux. As for llama.cpp, the last release this weekend rolled a bunch of stuff back and I was able to compile and get multi-GPU working again. Hopefully this works its way back into Koboldcpp soon. One more thing: when you launch koboldcpp from the command line you need to add --usecublas for it to work, as there is no usehipblas option. I didn't see that documented anywhere. |
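The launch tip above can be sketched as a small command builder. This is only an illustration: the script name `koboldcpp.py`, the model file name, and the helper function are assumptions, and the only flag taken from the comment itself is `--usecublas`.

```python
import sys


def build_launch_command(model_path, script="koboldcpp.py"):
    """Build an argv list that starts koboldcpp with GPU offload enabled.

    Per the comment above, the ROCm build is enabled through --usecublas;
    there is no separate usehipblas option.
    """
    return [sys.executable, script, "--model", model_path, "--usecublas"]


# "model.gguf" is a hypothetical file name used only for illustration.
command = build_launch_command("model.gguf")
print(" ".join(command))
```

The list form can be handed directly to `subprocess.run(command)`, which avoids shell quoting problems with model paths that contain spaces.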
As others pointed out, people running into this issue can blame AMD for not caring about their GPU and not shipping it as part of their SDK. This is not a bug; this is AMD not shipping what we need. Those affected can use the regular Koboldcpp edition with CLBlast, which does support your GPU. |
My Computer:
Operating System: Windows 11 Professional 22H2 (Chinese Traditional)
CPU: AMD Ryzen 5 5600 6-Core Processor 3.50 GHz
GPU: GIGABYTE AMD Radeon RX 6650 XT (Driver Version 23.10.2)
RAM: 64GB
KoboldCpp - Version 1.47.1/1.47.2.yr0-ROCm
After pressing “Launch” to run the programme, the error message
“rocBLAS error: Cannot read C:\Users\richa\AppData\Local\Temp_MEI60562/library/TensileLibrary.dat: No such file or directory”
appears at the bottom, and the programme crashes and closes.
In KoboldCpp - Version 1.46.1.yr0-ROCm, the programme can still be launched, although it replies with garbage characters under certain conditions.
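The error above points at a missing file in the rocBLAS `library` directory. As a rough diagnostic, the following sketch lists which gfx targets appear in the file names of such a directory. It assumes that the bundled files carry the target in their name (for example something like `TensileLibrary_lazy_gfx1030.dat`); that naming scheme and both helper names are assumptions, not something confirmed in this report.

```python
import re
from pathlib import Path

# Matches target names such as gfx906, gfx90a, gfx1010 or gfx1032.
GFX_PATTERN = re.compile(r"gfx[0-9a-f]+")


def arches_in_library(library_dir):
    """Return the sorted gfx targets found in file names under library_dir."""
    directory = Path(library_dir)
    if not directory.is_dir():
        return []
    found = set()
    for path in directory.iterdir():
        found.update(GFX_PATTERN.findall(path.name))
    return sorted(found)


def has_arch(library_dir, arch):
    """Check whether any file in library_dir mentions the given gfx target."""
    return arch in arches_in_library(library_dir)
```

If `has_arch(path_to_library, "gfx1032")` returns `False` for the directory that ships with the build, that would be consistent with the replies in this thread saying the card's target is not included.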