[Misc] Validate grammar and fail early #11119

comaniac · 2024-12-11T23:03:19Z

If the grammar is invalid, xGrammar now throws a RuntimeError when compiling it. However, compilation happens in the logits processor, so the exception is raised from the model executor. Since the model executor is not expected to throw any exception, this crashes the engine and kills the worker process.

This PR validates the grammar when constructing the GrammarConfig, so an invalid grammar is rejected before it reaches the model executor.
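The fail-early idea can be sketched as follows. This is a minimal illustration, not the code in this PR: `GrammarConfigSketch`, `InvalidGrammarError`, and the toy `compile_grammar` are hypothetical stand-ins, and the toy compiler only checks that every rule contains `::=`.

```python
from __future__ import annotations

from dataclasses import dataclass


class InvalidGrammarError(ValueError):
    """Hypothetical error type raised while the request is still being validated."""


def compile_grammar(grammar: str) -> list[str]:
    # Toy stand-in for the real grammar compiler: it raises RuntimeError
    # for an empty grammar or for any rule that lacks "::=".
    rules = [line.strip() for line in grammar.splitlines() if line.strip()]
    if not rules:
        raise RuntimeError("empty grammar")
    for rule in rules:
        if "::=" not in rule:
            raise RuntimeError(f"malformed rule: {rule!r}")
    return rules


@dataclass(frozen=True)
class GrammarConfigSketch:
    grammar: str

    @classmethod
    def from_request(cls, grammar: str) -> "GrammarConfigSketch":
        # Compile once up front so a bad grammar surfaces as a request
        # error instead of an exception inside the model executor.
        try:
            compile_grammar(grammar)
        except RuntimeError as err:
            raise InvalidGrammarError(f"Invalid grammar: {err}") from err
        return cls(grammar=grammar)
```

With this shape, the caller that builds the config can turn `InvalidGrammarError` into a bad-request response, and the worker process never sees the invalid grammar.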

~~Note that there is another issue with the xgrammar backend that isn't addressed by this PR #11118~~

cc @mgoin

github-actions · 2024-12-11T23:03:30Z

👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs do not trigger a full CI run by default. Instead, only the fastcheck CI runs, which covers a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblocking them. If you do not have permission to unblock, ping simon-mo or khluu to add you to our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can do one of these:

- Add ready label to the PR
- Enable auto-merge.

🚀

Signed-off-by: Cody Yu <[email protected]>

kouroshHakha · 2024-12-11T23:16:39Z

Hey @comaniac,

1/ Let's add unit tests for this, so the behavior doesn't regress later on.
2/ We should do the same for JSON schema as well. The following extra_body still kills the engine:
extra_body={"guided_json": {"type": "str"}}
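The schema above is invalid because "str" is not a JSON Schema type name (the primitive types are null, boolean, object, array, number, string, and integer). A minimal sketch of an early check for this case is below. The helper `check_schema_types` is hypothetical, it is not how the backend validates schemas, and it only looks at `type`, `properties`, and `items`.

```python
# The seven primitive type names defined by JSON Schema.
JSON_SCHEMA_TYPES = frozenset(
    {"null", "boolean", "object", "array", "number", "string", "integer"}
)


def check_schema_types(schema: dict) -> None:
    """Raise ValueError if a "type" keyword names an unknown type."""
    declared = schema.get("type")
    if declared is None:
        names = []
    elif isinstance(declared, str):
        names = [declared]
    elif isinstance(declared, list):
        names = declared
    else:
        raise ValueError(f"'type' must be a string or a list, got {declared!r}")

    for name in names:
        if not isinstance(name, str) or name not in JSON_SCHEMA_TYPES:
            raise ValueError(f"unknown JSON Schema type: {name!r}")

    # Recurse only into the sub-schemas this sketch knows about.
    props = schema.get("properties")
    if isinstance(props, dict):
        for sub in props.values():
            if isinstance(sub, dict):
                check_schema_types(sub)
    items = schema.get("items")
    if isinstance(items, dict):
        check_schema_types(items)
```

Calling `check_schema_types({"type": "str"})` raises `ValueError`, while `{"type": "string"}` passes.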

comaniac · 2024-12-12T00:36:53Z

I don't know where to add unit tests for the xgrammar backend. It seems not many unit tests have been added for this component. @mgoin do you have any pointers, or should we merge this PR first and add unit tests later?

Per offline discussion with @mgoin, this PR also fixes the Lark-like issue. I also found another issue that crashes the engine with my fix:

1. Send an invalid grammar. It fails with a bad request. However, at this point the tokenizer data is already cached.
2. Send a valid grammar. Since the tokenizer data is cached, we have encoded_vocab = None, which results in a crash in get_compiler. This is because there is a hidden assumption that when encoded_vocab = None, the compiler must already be initialized, but this is no longer guaranteed.

The fix in this PR is to make sure encoded_vocab is never None. This shouldn't hurt performance because the tokenizer data is cached anyway.
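The two-request sequence above can be reproduced with a toy model. The names below (`tokenizer_cache`, `compilers`, `vocab_old`, `vocab_new`) are hypothetical and only mirror the shape of the logic described here, not the actual vLLM code.

```python
from __future__ import annotations

# Two independent caches: tokenizer data and compiled grammar compilers.
tokenizer_cache: dict[int, list[str]] = {}
compilers: dict[int, tuple[str, ...]] = {}


def get_tokenizer_data(tokenizer_hash: int, vocab: list[str]) -> list[str]:
    # Cached lookup: the vocab is computed at most once per tokenizer.
    return tokenizer_cache.setdefault(tokenizer_hash, list(vocab))


def get_compiler(tokenizer_hash: int, encoded_vocab: list[str] | None):
    if tokenizer_hash not in compilers:
        # Hidden assumption: a None vocab implies the compiler already exists.
        if encoded_vocab is None:
            raise RuntimeError("compiler missing and no vocab to build it")
        compilers[tokenizer_hash] = tuple(encoded_vocab)
    return compilers[tokenizer_hash]


def vocab_old(tokenizer_hash: int, vocab: list[str]) -> list[str] | None:
    # Old behavior: skip the vocab whenever the tokenizer data is cached.
    if tokenizer_hash in tokenizer_cache:
        return None
    return get_tokenizer_data(tokenizer_hash, vocab)


def vocab_new(tokenizer_hash: int, vocab: list[str]) -> list[str]:
    # New behavior: always return the (cached) vocab, never None.
    return get_tokenizer_data(tokenizer_hash, vocab)
```

If a first request caches the tokenizer data but is rejected before `get_compiler` runs, a second request using `vocab_old` passes `None` and hits the `RuntimeError`, while `vocab_new` lets the compiler be built from the cached vocab.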

vllm/model_executor/guided_decoding/xgrammar_utils.py

comaniac · 2024-12-12T00:39:16Z

vllm/model_executor/guided_decoding/xgrammar_decoding.py

-        if tokenizer_hash in TokenizerDataCache._cache:
-            encoded_vocab = None
-            stop_token_ids = None
-            backend_str = None
-        else:
-            tokenizer_data = TokenizerDataCache.get_tokenizer_data(tokenizer)
-            encoded_vocab = tokenizer_data.encoded_vocab
-            stop_token_ids = tokenizer_data.stop_token_ids
-            backend_str = tokenizer_data.backend_str


encoded_vocab can no longer be None: if the grammar is invalid, the compiler may not be initialized even when the tokenizer data is cached. This change shouldn't hurt performance because the tokenizer data is cached anyway.

Signed-off-by: Cody Yu <[email protected]>

[Misc] Validate grammar and fail early

4baf2cb

Signed-off-by: Cody Yu <[email protected]>

comaniac force-pushed the validate_grammar branch from c5c5aa8 to 4baf2cb Compare December 11, 2024 23:07

mgoin self-requested a review December 12, 2024 00:19

comaniac linked an issue Dec 12, 2024 that may be closed by this pull request

[Bug]: grammar_is_likely_lark doesn't work correctly #11118

Open

1 task

mgoin reviewed Dec 12, 2024

vllm/model_executor/guided_decoding/xgrammar_utils.py (Outdated)

comaniac commented Dec 12, 2024

comaniac added 2 commits December 12, 2024 00:43

fix

0633583

Signed-off-by: Cody Yu <[email protected]>

comment

4f003b3

Signed-off-by: Cody Yu <[email protected]>

comaniac force-pushed the validate_grammar branch from 83a32cb to 4f003b3 Compare December 12, 2024 00:43

comaniac added 4 commits December 12, 2024 00:45

fix lark

06b63ea

Signed-off-by: Cody Yu <[email protected]>

fix lark

ef80b01

Signed-off-by: Cody Yu <[email protected]>

fix lark

5848882

Signed-off-by: Cody Yu <[email protected]>

fix typo

61c1d49

Signed-off-by: Cody Yu <[email protected]>
