
[Feature Request] Token Banning/Bias #9

Closed
neCo2 opened this issue Jan 6, 2024 · 9 comments

Comments

@neCo2
Contributor

neCo2 commented Jan 6, 2024

I'm not too familiar with what options the various backends expose, but if at all possible, a field to ban or bias certain tokens and phrases would be great to have.

@neCo2
Contributor Author

neCo2 commented Jan 13, 2024

Read some stuff, and basic bias/banning on a token basis seems simple enough to implement. llama.cpp, koboldcpp, and the OpenAI-compatible oobabooga endpoints all take a logit_bias field in their request: a { key: value } dictionary where the key is a token ID and the value is an integer between -100 and 100, with -100 acting as a ban.
It'd probably be best to take input as a string and then send it to the backend's tokenizer for ease of use.
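To make the request shape concrete, here's a minimal sketch of building such a logit_bias dictionary from user-entered strings. The field names follow the llama.cpp-style completion request as described above; `tokenize` is a stand-in for a call to the backend's tokenizer, and the fake tokenizer below is purely a placeholder:

```python
# Sketch: build a logit_bias payload from user-entered strings.
# `tokenize` is a stand-in for a call to the backend's tokenizer
# endpoint; it just needs to map a string to a list of token ids.

def build_logit_bias(entries, tokenize):
    """entries: list of (text, bias) pairs; bias is clamped to
    [-100, 100], where -100 acts as a ban. Returns a
    {token_id: bias} dict as used by llama.cpp / koboldcpp-style
    completion requests."""
    bias_map = {}
    for text, bias in entries:
        bias = max(-100, min(100, bias))  # clamp to the accepted range
        for token_id in tokenize(text):
            bias_map[token_id] = bias
    return bias_map

# Example with a fake tokenizer (real code would query the backend):
fake_tokenize = lambda s: [ord(c) for c in s]  # placeholder, not a real BPE
payload = {
    "prompt": "Once upon a time",
    "n_predict": 64,
    "logit_bias": build_logit_bias([("\n", -100), ("!", -50)], fake_tokenize),
}
```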

The problem is, this only works for individual tokens, not phrases, and I've found no concrete answers on how this is implemented in the few places I know of that have phrase bias, like NovelAI and AI21.
I found this, which could be a solution. Essentially:

  1. Take first token of banned phrase and add it to stop tokens
  2. If the token is generated, generation will stop
  3. Generate as many tokens as the banned phrase is long
  4. If it generates the entire banned phrase, remove it from the prompt
  5. Bias the first token of the phrase (or possibly all tokens of the phrase to achieve the total phrase bias that was initially set)
  6. Generate number of tokens of the banned phrase again
  7. Unbias tokens
  8. Resume normal generation
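Roughly, the steps above could be sketched like this. This is an illustration of the backtrack-and-rebias idea, not working client code: `generate` is a stand-in for the backend call (taking prompt tokens, a token count, and a logit_bias dict), and everything operates on token IDs:

```python
# Sketch of the backtrack-and-rebias steps above. `generate` stands in
# for the backend: generate(prompt_tokens, n_tokens, logit_bias)
# returns that many newly generated token ids.

def generate_with_phrase_ban(prompt, banned, generate, bias=-100, max_tokens=20):
    out = list(prompt)
    while len(out) - len(prompt) < max_tokens:
        tok = generate(out, 1, {})[0]
        if tok == banned[0]:
            # Steps 2-3: first banned token appeared; speculatively
            # generate the rest of the phrase's length.
            tail = generate(out + [tok], len(banned) - 1, {})
            candidate = [tok] + tail
            if candidate == banned:
                # Steps 4-6: drop the phrase, bias its first token,
                # and regenerate the same number of tokens.
                redo = generate(out, len(banned), {banned[0]: bias})
                out.extend(redo)
                # Steps 7-8 are implicit: the bias dict is only passed
                # for the redo call, so normal generation resumes.
                continue
            out.extend(candidate)
        else:
            out.append(tok)
    return out[len(prompt):]
```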

What's your opinion on this? I feel like it'd be really inefficient to implement it this way. Is there any way to mess with the parameters without stopping and restarting the generation?

I mostly use bias for punctuation and newlines which are all single tokens more often than not, so I might try to implement the single-token bias when I find some time, but phrase bias would be incredibly useful to have to stop some weird habits models tend to have.

@lmg-anon
Owner

I think there's no need to include the token in the stop tokens, given that we're streaming the tokens; that lets us determine on the fly the point at which the banned sequence must be omitted from the prompt. But I don't think there's any way around stopping the generation process to remove the sequence from the context and add the token bias.
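One way to spot the banned sequence on the fly while streaming is a rolling prefix match over incoming tokens: hold back tokens that might start the phrase, and release them on a mismatch. This is a sketch of that idea, with a known limitation noted in the comments:

```python
# Sketch: detect a banned token sequence in a token stream. Tokens
# that might be the start of the banned phrase are held back; once
# the full phrase matches, the caller knows where to cut the context.
# Limitation: a full implementation needs KMP-style fallback for
# phrases that overlap themselves (e.g. [1, 1, 2]); this simple
# version assumes the phrase doesn't overlap itself.

class BannedSequenceDetector:
    def __init__(self, banned):
        self.banned = banned
        self.matched = 0  # how many tokens of `banned` are held back

    def push(self, token):
        """Feed one streamed token. Returns ("hit", None) when the
        whole banned phrase has just completed, else ("flush", toks)
        with any tokens now known to be safe to emit."""
        if token == self.banned[self.matched]:
            self.matched += 1
            if self.matched == len(self.banned):
                self.matched = 0
                return ("hit", None)
            return ("flush", [])
        # Mismatch: release the held prefix plus this token, unless
        # this token itself restarts a match.
        held = self.banned[:self.matched] + [token]
        self.matched = 1 if token == self.banned[0] else 0
        if self.matched:
            held = held[:-1]
        return ("flush", held)

    def flush(self):
        """Release any still-held prefix (e.g. at end of stream)."""
        held, self.matched = self.banned[:self.matched], 0
        return held
```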

@neCo2
Contributor Author

neCo2 commented Jan 15, 2024

Ah, right, we can just manually send a stop command.
I really wonder if that's how it's implemented in NAI. I remember it sometimes getting stuck for a bit when a biased phrase was (probably) generated, so maybe?

@neCo2
Contributor Author

neCo2 commented Jan 17, 2024

@lmg-anon I'm bad at finding API documentation; does KoboldCPP actually not have a detokenize endpoint?

@lmg-anon
Owner

@neCo2 No, I don't think so.

@neCo2
Contributor Author

neCo2 commented Jan 20, 2024

The maintainer said he'll consider adding one in the future.

@culturedniichan
Contributor

This would be a very nice feature. This is what I found for the OpenAI API, which I think ooba also implements now: https://platform.openai.com/docs/api-reference/chat/create#chat-create-logit_bias

But I have fairly little knowledge about the inner workings of these backends. For example, since the user would be typing words, like in NovelAI, I suppose they would need to be translated into token IDs by querying the API first? I think SillyTavern does this, so I will take a look and see how. It would be great to be able to make certain sequences less likely.
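That words-to-token-IDs step could look roughly like this. The endpoint shape in the comment follows the llama.cpp server's tokenize route as I recall it, so treat the path and field names as assumptions; the HTTP call is injected as `post_json` so the sketch stays backend-agnostic and testable:

```python
# Sketch: translate user-typed words into token ids by querying the
# backend's tokenizer, then build a logit_bias dict from them.
# `post_json(path, body)` is an injected HTTP helper. For the
# llama.cpp server, the route would be POST /tokenize with
# {"content": ...} returning {"tokens": [...]} (an assumption here).

def words_to_logit_bias(words, bias, post_json):
    bias_map = {}
    for word in words:
        reply = post_json("/tokenize", {"content": word})
        for tok in reply["tokens"]:
            bias_map[tok] = bias
    return bias_map
```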

@lmg-anon
Owner

@culturedniichan That's right. Also, in case you didn't notice, OP did a PR for this issue (but it's still not complete), you can take a look at it here: #25

@neCo2
Contributor Author

neCo2 commented May 3, 2024

Closing this since regular token bias has been implemented now. Opened up a separate issue for phrase bias here: #58

@neCo2 neCo2 closed this as completed May 3, 2024