Support inputs_embeds #687

samhavens · 2023-10-20T19:57:34Z

This allows users to pass in embeddings directly instead of looking them up based on input_ids. This is useful for PEFT (prompt/prefix-tuning) and multimodal models.

llmfoundry/models/mpt/modeling_mpt.py

vchiley

Should input_ids default to None with this change?

input_ids: Optional[torch.LongTensor] = None,

llmfoundry/models/mpt/modeling_mpt.py

in MPTForCausalLM. It is checked in the base model, and it is actually a common practice to pass both during autoregressive generation. Embeds are used first, then once the kvcache is nonempty, iids are used instead

llmfoundry/models/mpt/modeling_mpt.py

dakinggg · 2023-10-24T17:16:09Z

@samhavens here is what HF expects wrt these two args: https://github.com/huggingface/transformers/blob/9333bf0769561c048700377c2e0813221ab9d2c9/src/transformers/models/llama/modeling_llama.py#L955-L963

samhavens · 2023-10-27T17:07:34Z

@samhavens here is what HF expects wrt these two args: https://github.com/huggingface/transformers/blob/9333bf0769561c048700377c2e0813221ab9d2c9/src/transformers/models/llama/modeling_llama.py#L955-L963

@dakinggg That validation happens in the case model, but it the CausalLM model they can both be there https://github.com/huggingface/transformers/blob/9333bf0769561c048700377c2e0813221ab9d2c9/src/transformers/models/llama/modeling_llama.py#L1140-L1150

I think this is because in prepare_inputs_for_generation they use them on the first decoding step then ignore them https://github.com/huggingface/transformers/blob/9333bf0769561c048700377c2e0813221ab9d2c9/src/transformers/models/llama/modeling_llama.py#L1209-L1213

dakinggg

Approving cause LGTM, but please add a test for the edge cases:
(1) both input_ids and input_embeds are None
(2a) both input_ids and input_embeds are specified without kv cache
(2b) both input_ids and input_embeds are specified with kv cache

samhavens · 2023-11-30T19:13:46Z

Approving cause LGTM, but please add a test for the edge cases: (1) both input_ids and input_embeds are None (2a) both input_ids and input_embeds are specified without kv cache (2b) both input_ids and input_embeds are specified with kv cache

@dakinggg do you mean new tests or add these edge cases to the 2 tests that have input embeds

dakinggg · 2023-11-30T19:22:40Z

@samhavens add these edge cases to existing tests is fine (assuming that fits the tests). Anything that tests the right thing happens for those combinations is sufficient.

* support inputs_embeds * update tests to test inputs_embeds * make iids optional inputs to fwd * remove check for both iids and inputs_embeds in MPTForCausalLM. It is checked in the base model, and it is actually a common practice to pass both during autoregressive generation. Embeds are used first, then once the kvcache is nonempty, iids are used instead * reorder kwargs * add more tests * fix device merge artifact in test_model.oy * fix generate test * yapf

* Add eval loader to eval script * small input tests * updates * fix typing and formatting * fixes, add tests * remove circular dependency * tests pass * nits + small fixes * add metrics at the end, refactor to put icl/gauntlet as helpers * NOT * metrics instead of models, add unit tests * Move tests into directories * add copyright to inits * fix relative paths * fixes * revert gauntlet test change * Support inputs_embeds (#687) * support inputs_embeds * update tests to test inputs_embeds * make iids optional inputs to fwd * remove check for both iids and inputs_embeds in MPTForCausalLM. It is checked in the base model, and it is actually a common practice to pass both during autoregressive generation. Embeds are used first, then once the kvcache is nonempty, iids are used instead * reorder kwargs * add more tests * fix device merge artifact in test_model.oy * fix generate test * yapf * Better error message when test does not complete (#769) * run script tests first * comment out * ascripts -> scripts * bad dirs * try this * hacks * add a note about a_scripts --------- Co-authored-by: Sam Havens <[email protected]>

* Add eval loader to eval script * small input tests * updates * fix typing and formatting * fixes, add tests * remove circular dependency * tests pass * nits + small fixes * add metrics at the end, refactor to put icl/gauntlet as helpers * NOT * metrics instead of models, add unit tests * Move tests into directories * add copyright to inits * fix relative paths * fixes * revert gauntlet test change * Support inputs_embeds (mosaicml#687) * support inputs_embeds * update tests to test inputs_embeds * make iids optional inputs to fwd * remove check for both iids and inputs_embeds in MPTForCausalLM. It is checked in the base model, and it is actually a common practice to pass both during autoregressive generation. Embeds are used first, then once the kvcache is nonempty, iids are used instead * reorder kwargs * add more tests * fix device merge artifact in test_model.oy * fix generate test * yapf * Better error message when test does not complete (mosaicml#769) * run script tests first * comment out * ascripts -> scripts * bad dirs * try this * hacks * add a note about a_scripts --------- Co-authored-by: Sam Havens <[email protected]>

samhavens added 2 commits October 20, 2023 12:52

support inputs_embeds

c6a2a09

update tests to test inputs_embeds

2a62e6d

samhavens requested review from dakinggg and vchiley October 20, 2023 20:37

vchiley reviewed Oct 20, 2023

View reviewed changes

llmfoundry/models/mpt/modeling_mpt.py Show resolved Hide resolved

vchiley reviewed Oct 20, 2023

View reviewed changes

llmfoundry/models/mpt/modeling_mpt.py Outdated Show resolved Hide resolved

vchiley reviewed Oct 20, 2023

View reviewed changes

samhavens added 2 commits October 20, 2023 15:05

make iids optional inputs to fwd

cdfbe83

Merge branch 'main' into support-inp-emb

591e060

samhavens requested a review from vchiley October 23, 2023 16:46

vchiley reviewed Oct 23, 2023

View reviewed changes

llmfoundry/models/mpt/modeling_mpt.py Outdated Show resolved Hide resolved

samhavens commented Oct 23, 2023

View reviewed changes

llmfoundry/models/mpt/modeling_mpt.py Outdated Show resolved Hide resolved

remove check for both iids and inputs_embeds

c4fac61

in MPTForCausalLM. It is checked in the base model, and it is actually a common practice to pass both during autoregressive generation. Embeds are used first, then once the kvcache is nonempty, iids are used instead

vchiley reviewed Oct 23, 2023

View reviewed changes

llmfoundry/models/mpt/modeling_mpt.py Outdated Show resolved Hide resolved

samhavens and others added 2 commits October 23, 2023 14:21

reorder kwargs

9705e3a

Merge branch 'main' into support-inp-emb

3749877

dakinggg approved these changes Oct 27, 2023

View reviewed changes

samhavens added 6 commits November 30, 2023 13:48

add more tests

5dfc436

merge main and fix conflict with rope

b10b106

fix device merge artifact in test_model.oy

4d80bbc

fix generate test

1765822

yapf

ea57c8c

Merge branch 'main' into support-inp-emb

8dbe307

samhavens merged commit 22ae919 into main Dec 1, 2023
10 checks passed

samhavens deleted the support-inp-emb branch December 1, 2023 01:47

ShashankMosaicML mentioned this pull request Dec 2, 2023

Reorganize tests to make them easier to find (#768) ShashankMosaicML/llm-foundry#16

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support inputs_embeds #687

Support inputs_embeds #687

samhavens commented Oct 20, 2023

vchiley left a comment

dakinggg commented Oct 24, 2023

samhavens commented Oct 27, 2023

dakinggg left a comment

samhavens commented Nov 30, 2023

dakinggg commented Nov 30, 2023 •

edited

Loading

Support inputs_embeds #687

Support inputs_embeds #687

Conversation

samhavens commented Oct 20, 2023

vchiley left a comment

Choose a reason for hiding this comment

dakinggg commented Oct 24, 2023

samhavens commented Oct 27, 2023

dakinggg left a comment

Choose a reason for hiding this comment

samhavens commented Nov 30, 2023

dakinggg commented Nov 30, 2023 • edited Loading

dakinggg commented Nov 30, 2023 •

edited

Loading