[Request] Add LongWriter model(s) #2883
Comments
Related: The first linked issue should now technically be resolved ( Do you have any updates on that? I've checked the linked issue again and there has not been another response by now. It is entirely possible that there is a bug somewhere, of course, or that the model itself is not as capable as advertised. Edit:
Maybe the "GGG" problem can be solved by updating llama.cpp: ggerganov/llama.cpp#8412
I've seen that, but haven't tried it yet. I'm planning to look into what's wrong with the available 'glm-4-9b-chat' models first.
It looks like 'glm-4-9b-chat' models themselves are not quite usable here in GPT4All, so I don't have much confidence in the It's probably better to look at the Llama based variant once more. What problem(s) did you have with that again?
The glm4 weights worked fine with the Python bindings except for the GGG problem (which might have been solved by updating, but I don't know how to manually update llama.cpp in gpt4all). The LongWriter llama weights did not work for me at all, so I moved on to glm4, but I didn't take notes, so I don't have the console output right now. On HF, the LongWriter Space has been featured as the number 2 Space, so a lot of people are taking an interest in LW.
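For anyone reproducing this through the Python bindings, the "GGG problem" (output collapsing into a run of one repeated character) can be flagged automatically instead of by eye. This is only a rough sketch: the function name `looks_degenerate` and both thresholds are made up for illustration and are not part of gpt4all or llama.cpp.

```python
from collections import Counter


def looks_degenerate(text: str, threshold: float = 0.9, min_length: int = 20) -> bool:
    """Return True if a single character dominates the non-whitespace output.

    Hypothetical helper: `threshold` and `min_length` are arbitrary defaults,
    chosen only to catch outputs such as "GGGGGGGG...".
    """
    chars = [c for c in text if not c.isspace()]
    if len(chars) < min_length:
        # Too short to judge; do not flag brief answers.
        return False
    _, count = Counter(chars).most_common(1)[0]
    return count / len(chars) >= threshold
```

The string returned by a generation call (for example `GPT4All.generate(...)` in the Python bindings) could be passed to this check, and a flagged response logged together with the model file name, so the console output is on hand when reporting the problem.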
The "GGG problem" seems to have recently been fixed in llama.cpp: ggerganov/llama.cpp#9130 but I wouldn't recommend trying to update that manually. That was not the only problem with the original In the meantime, I've tested the Llama version with GPT4All and didn't run into any problems. I got a decent response, too.
Alright, maybe talk to me on Discord then, so we can have a look together at what's going on?
LongWriter: Unleashing 10,000+ Word Generation From Long Context LLMs
https://github.com/THUDM/LongWriter
LongWriter_demo.mp4
HF Space: https://huggingface.co/spaces/THUDM/LongWriter
It comes in two flavors:
https://huggingface.co/THUDM/LongWriter-glm4-9b
https://huggingface.co/THUDM/LongWriter-llama3.1-8b
Several weights as GGUF are up:
https://huggingface.co/models?search=LongWriter
With the help of cosmic-snow, I have been experimenting a bit using this as a gpt4all template (I couldn't get the Llama weights to work, so this is the glm4):