Read EOS token from model runtime information for speculative_decoding_lm #353
Conversation
Made changes to accommodate the dynamic EOS token
Read the rt_info from the tokenizer instead of the LLM
Made changes according to the review; this is the latest commit
Fixed the error related to the tokenizer read_model call
Added comments and organised the code better
Changed the beam search header to take the EOS token as a parameter (sketched below) and removed the comment saying the EOS token was not implemented
Added the EOS token to the parameters
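A minimal sketch of what that header change could look like, assuming the sample's group beam searcher keeps its configuration in a `Parameters` struct; apart from `eos_token`, the field names are illustrative guesses rather than the PR's exact code:

```cpp
#include <cstdint>
#include <vector>

// Beam-search configuration: the EOS token id is now a regular field supplied
// by the caller (read from the model's runtime information) instead of a
// hard-coded constant.
struct Parameters {
    std::vector<int64_t> prompt;   // tokenized input
    int64_t eos_token;             // EOS token id taken from rt_info
    size_t n_groups = 3;
    size_t group_size = 5;
    float diversity_penalty = 1.0f;
};
```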
@ilya-lavrenov This is the PR corresponding to the change we discussed.
@ilya-lavrenov The README does not contain proper instructions for running speculative_decoding_lm; in the example, it passes the "Llama-2-7b-chat-hf" model as the main model directory.
Thanks for noticing that. I will update the README in a follow-up PR.
Extension of issue #277: added the functionality to read the EOS token from the model's runtime information in speculative_decoding_lm, as sketched below.
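A minimal sketch of reading the EOS token from the tokenizer model's runtime information with the OpenVINO C++ API; the `openvino_tokenizer.xml` path and the `eos_token_id` key follow the openvino_tokenizers convention, but treat them as assumptions here:

```cpp
#include <openvino/openvino.hpp>

#include <cstdint>
#include <iostream>
#include <stdexcept>

int main() {
    ov::Core core;
    // Reading the converted tokenizer model may additionally require the
    // openvino_tokenizers extension to be registered via core.add_extension().
    std::shared_ptr<ov::Model> tokenizer_model = core.read_model("openvino_tokenizer.xml");

    // Runtime information is exposed on the model as an ov::AnyMap.
    ov::AnyMap rt_info = tokenizer_model->get_rt_info();

    int64_t eos_token_id;
    if (rt_info.count("eos_token_id") > 0) {
        eos_token_id = rt_info.at("eos_token_id").as<int64_t>();
    } else {
        throw std::runtime_error("EOS token ID was not found in the model's runtime information");
    }
    std::cout << "EOS token id: " << eos_token_id << '\n';
    return 0;
}
```

Reading the id from rt_info instead of hard-coding it lets the same sample work across models with different special tokens.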