Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Generating and Parsing requirements strictness clarification #2

Open
leighklotz opened this issue Apr 16, 2024 · 1 comment
Open

Generating and Parsing requirements strictness clarification #2

leighklotz opened this issue Apr 16, 2024 · 1 comment

Comments

@leighklotz
Copy link

leighklotz commented Apr 16, 2024

in re "When generating OpenChatML, the same structure and rules should be followed to ensure compatibility and consistency."

Is this to be different from Postel's law, which would encourage parsers to be more forgiving?

If strictness is the requirement, there are some areas that may need clarification including escaping, nested tags, character encoding, and a machine verifiable definition of some sort.

@WolframRavenwolf
Copy link

I think the spec should be strict, i. e. clear and unambiguous. Implementation, i. e. when finetuning the model based on data that's formatted accordingly, it would probably make sense to add a percentage of differently-formatted examples to get the model to be able to deal with that, too.

I'm convinced that LLMs understand even terribly misspelled and grammatically incorrect input because of them being pretrained on the whole Internet where there's all kinds of weird data, but still the majority is proper spelling and grammar, so the model knows what's right and at the same time understands what you mean. I'd expect the same to hold true regarding the finetuning data.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants