
[test] fix transformers neuronx integration test failure #2539

Merged
sindhuvahinis merged 2 commits into deepjavalibrary:master from the fix branch on Nov 18, 2024

Conversation

@sindhuvahinis (Contributor) commented Nov 8, 2024

Description

EDIT: Even tiny models ship with a large context length, so we previously trained the model and tokenizer at runtime to use a smaller 4k context length. I have now trained the model offline and uploaded it to S3, so the previous fix is no longer needed.


This should fix the transformers-neuronx (tnx) unit test failures in the integration tests. The failures are caused by transformers being upgraded to 4.45.2. This does not affect our handlers; it only affects train_new_from_iterator in transformers.

Raised a PR in transformers to fix it: huggingface/transformers#34661
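
For context, here is a minimal sketch of the kind of runtime tokenizer training the tests relied on; the model name, corpus, and vocab size are illustrative, not the actual test code. It shows the train_new_from_iterator call affected by the 4.45.2 upgrade and the 4k context-length cap mentioned above.

from transformers import AutoTokenizer

# Illustrative only: any fast tokenizer works; the tests use a tiny llama model.
base = AutoTokenizer.from_pretrained("meta-llama/Llama-3.2-1B")

def text_iterator():
    # A tiny corpus, just enough to exercise train_new_from_iterator.
    yield "hello world"
    yield "transformers neuronx integration test"

# train_new_from_iterator is the transformers API affected by the 4.45.2 upgrade.
small_tokenizer = base.train_new_from_iterator(text_iterator(), vocab_size=1000)
small_tokenizer.model_max_length = 4096  # cap the context length at 4k
small_tokenizer.save_pretrained("tiny-llama-tokenizer")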

@sindhuvahinis sindhuvahinis requested review from zachgk and a team as code owners November 8, 2024 23:35
@siddvenk (Contributor):
Do we need 4.45.2 for neuron at the moment? (I think yes for multimodal?)

"""
underlying_tokenizer = tokenizer.backend_tokenizer
if underlying_tokenizer.pre_tokenizer is None:
split_pattern = r"(?i:'s|'t|'re|'ve|'m|'ll|'d)|[^\r\n\p{L}\p{N}]?\p{L}+|\p{N}{1,3}| ?[^\s\p{L}\p{N}]+[\r\n]*|\s*[\r\n]+|\s+(?!\S)|\s+"
Contributor:

where does this come from?

Contributor Author:

This comes from the Llama 3.2 1B tokenizer.json: https://huggingface.co/meta-llama/Llama-3.2-1B/blob/main/tokenizer.json.
Here is the raw JSON version; in the code I just converted it into an object.

  "pre_tokenizer": {
    "type": "Sequence",
    "pretokenizers": [
      {
        "type": "Split",
        "pattern": {
          "Regex": "(?i:'s|'t|'re|'ve|'m|'ll|'d)|[^\\r\\n\\p{L}\\p{N}]?\\p{L}+|\\p{N}{1,3}| ?[^\\s\\p{L}\\p{N}]+[\\r\\n]*|\\s*[\\r\\n]+|\\s+(?!\\S)|\\s+"
        },
        "behavior": "Isolated",
        "invert": false
      },
      {
        "type": "ByteLevel",
        "add_prefix_space": false,
        "trim_offsets": true,
        "use_regex": false
      }
    ]
  }
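
For completeness, a sketch (not part of this PR) of how that pre_tokenizer could be rebuilt with the tokenizers library when backend_tokenizer.pre_tokenizer is missing, assuming tokenizer is the fast tokenizer from the snippet above; it mirrors the Split + ByteLevel sequence in the JSON.

from tokenizers import Regex, pre_tokenizers

split_pattern = r"(?i:'s|'t|'re|'ve|'m|'ll|'d)|[^\r\n\p{L}\p{N}]?\p{L}+|\p{N}{1,3}| ?[^\s\p{L}\p{N}]+[\r\n]*|\s*[\r\n]+|\s+(?!\S)|\s+"

if tokenizer.backend_tokenizer.pre_tokenizer is None:
    tokenizer.backend_tokenizer.pre_tokenizer = pre_tokenizers.Sequence([
        # Split on the Llama 3.2 regex, matching the "Isolated" behavior in tokenizer.json.
        pre_tokenizers.Split(Regex(split_pattern), behavior="isolated", invert=False),
        # ByteLevel without a prefix space or its own regex, matching the second entry
        # (trim_offsets is left at its default here).
        pre_tokenizers.ByteLevel(add_prefix_space=False, use_regex=False),
    ])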

@sindhuvahinis (Contributor Author):
Do we need 4.45.2 for neuron at the moment? (I think yes for multimodal?)

Yes. It is needed for vLLM 0.6.2. The multimodal support in #2518 is also based on vLLM 0.6.2.

@siddvenk (Contributor):
Spoke offline: we're going to look into whether we still need the tokenizer training we currently do. If we can use the tiny llama tokenizer directly, that simplifies things here.

@sindhuvahinis sindhuvahinis force-pushed the fix branch 2 times, most recently from a47657f to d00d6c1 Compare November 18, 2024 17:15
@sindhuvahinis sindhuvahinis merged commit 746fdd5 into deepjavalibrary:master Nov 18, 2024
9 checks passed
sindhuvahinis added a commit to sindhuvahinis/djl-serving that referenced this pull request Nov 18, 2024
@sindhuvahinis sindhuvahinis deleted the fix branch December 2, 2024 18:34