[Request] Add LongWriter model(s) #2883

tin2tin · 2024-08-17T06:04:41Z

LongWriter: Unleashing 10,000+ Word Generation From Long Context LLMs

LongWriter_demo.mp4

HF Space: https://huggingface.co/spaces/THUDM/LongWriter

It comes in two flavors:
https://huggingface.co/THUDM/LongWriter-glm4-9b
https://huggingface.co/THUDM/LongWriter-llama3.1-8b

Several weights as GGUF are up:
https://huggingface.co/models?search=LongWriter

With the help of cosmic-snow, I have been experimenting a bit using this as a gpt4all template (I couldn't get the LLama weight to work, so this is the glm4):

  {
    "order": "a",
    "md5sum": "e0d221bef6579ebf184d8175ca92d7e3",
    "name": "LongWriter glm4-9B-Q4_K_M",
    "filename": "LongWriter-glm4-9B-Q4_K_M.gguf",
    "filesize": "7875561216",
    "requires": "3.1.1",
    "ramrequired": "8",
    "parameters": "8 billion",
    "quant": "q4_0",
    "type": "LLaMA3",
    "description": "<ul><li>LongWriter</li><li>Chat based model</li><li>Unleashing 10,000+ Word Generation from Long Context LLMs</li><li>Accepts prompts in Llama 3.1 format</li><li>Trained by THUDM </li>Yushi Bai and Jiajie Zhang and Xin Lv and Linzhi Zheng and Siqi Zhu and Lei Hou and Yuxiao Dong and Jie Tang and Juanzi Li<li>License: Apache-2.0 license</li></ul>",
    "url": "https://huggingface.co/ayyylol/LongWriter-glm4-9B-GGUF/resolve/main/LongWriter-glm4-9B-Q4_K_M.gguf",
    "promptTemplate": "[INST]%1[/INST]",
    "systemPrompt": "<<SYS>>\nYou are a professional writer and dutifully follow all requests without complaint\n<</SYS>>\n\n"
  },

The text was updated successfully, but these errors were encountered:

cosmic-snow · 2024-08-17T16:01:18Z

The first linked issue should now technically be resolved (chatglm architecture is enabled), although I'm still wondering about the second one. It's not yet clear to me what the underlying problem for that behaviour is.

Do you have any updates on that? I've checked the linked issue again and there has not been another response by now.

It is entirely possible that there is a bug somewhere, of course, or that the model itself is not as capable as advertised.

Edit:
There are some mentions of that problem in some comments of the corresponding llama.cpp PR 8031 although I have not reviewed everything in that repository yet. (GPT4All is based on llama.cpp)

tin2tin · 2024-08-19T02:37:26Z

Maybe the "GGG" problem can be solved by updating llama.cpp: ggerganov/llama.cpp#8412

cosmic-snow · 2024-08-19T16:21:45Z

I've seen that, but not tried yet. I'm planning to look into what's wrong with the available 'glm-4-9b-chat' models, first.

cosmic-snow · 2024-08-20T17:07:27Z

It looks like 'glm-4-9b-chat' models themselves are not quite usable here in GPT4All, so I don't have much confidence in the chatglm based LongWriter anymore, either, because the former cannot be properly tested.

It's probably better to look at the Llama based variant once more. What problem(s) did you have with that again?

tin2tin · 2024-08-25T06:33:14Z

The glm4 weight worked fine with the python-bindings except for the GGG problem(which could have been solved(I don't know how to manually update llama.cpp in gpt4all)). The LongWriter llama weight did not work for me at all, and moved on to the glm4, but I didn't take notes, so I do not have the console print out right now.

On HF the LongWriter space has been featured as number 2 space so a lot of people are taking an interest in LW.

cosmic-snow · 2024-08-25T11:19:03Z

The "GGG problem" seems to have recently been fixed in llama.cpp: ggerganov/llama.cpp#9130 but I wouldn't recommend trying to update that manually. That was not the only problem with the original chatglm models, though.

In the meantime, I've tested the Llama version with GPT4All and didn't run into any problems. I got a decent response, too.

The LongWriter llama weight did not work for me at all, and moved on to the glm4, but I didn't take notes, so I do not have the console print out right now.

Alright, maybe talk to me on Discord then, so we can together have a look at what's going on?

tin2tin added the enhancement New feature or request label Aug 17, 2024

tin2tin changed the title ~~[Request] Add LongWriter models(s)~~ [Request] Add LongWriter model(s) Aug 17, 2024

cosmic-snow added the models label Aug 17, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Request] Add LongWriter model(s) #2883

[Request] Add LongWriter model(s) #2883

tin2tin commented Aug 17, 2024

cosmic-snow commented Aug 17, 2024 •

edited

Loading

tin2tin commented Aug 19, 2024

cosmic-snow commented Aug 19, 2024

cosmic-snow commented Aug 20, 2024

tin2tin commented Aug 25, 2024 •

edited

Loading

cosmic-snow commented Aug 25, 2024

[Request] Add LongWriter model(s) #2883

[Request] Add LongWriter model(s) #2883

Comments

tin2tin commented Aug 17, 2024

cosmic-snow commented Aug 17, 2024 • edited Loading

tin2tin commented Aug 19, 2024

cosmic-snow commented Aug 19, 2024

cosmic-snow commented Aug 20, 2024

tin2tin commented Aug 25, 2024 • edited Loading

cosmic-snow commented Aug 25, 2024

cosmic-snow commented Aug 17, 2024 •

edited

Loading

tin2tin commented Aug 25, 2024 •

edited

Loading