-
-
Notifications
You must be signed in to change notification settings - Fork 4.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Model] Add support for embedding model GritLM #10816
base: main
Are you sure you want to change the base?
Conversation
This model is a fork of GritLM/GritLM-7B. The main change in the fork wrt the original repo is the name of the architecture to make vLLM adoption easier. Signed-off-by: Pooya Davoodi <[email protected]>
👋 Hi! Thank you for contributing to the vLLM project. Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging. To run CI, PR reviewers can do one of these:
🚀 |
Signed-off-by: Pooya Davoodi <[email protected]>
the pooler Signed-off-by: Pooya Davoodi <[email protected]>
Signed-off-by: Pooya Davoodi <[email protected]>
Signed-off-by: Pooya Davoodi <[email protected]>
5f07c31
to
4941376
Compare
Signed-off-by: Pooya Davoodi <[email protected]>
f610673
to
5f32d7c
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Otherwise LGTM!
Signed-off-by: Pooya Davoodi <[email protected]>
In case of PP, some layers don't have self_attn attribute Signed-off-by: Pooya Davoodi <[email protected]>
Signed-off-by: Pooya Davoodi <[email protected]>
Signed-off-by: Pooya Davoodi <[email protected]>
Signed-off-by: Pooya Davoodi <[email protected]>
Head branch was pushed to by a user without write access
Looks like we have two failures in ToT that are unrelated to this PR:
|
Signed-off-by: Pooya Davoodi <[email protected]>
Signed-off-by: Pooya Davoodi <[email protected]>
Signed-off-by: Pooya Davoodi <[email protected]>
And make test prompt shorter Signed-off-by: Pooya Davoodi <[email protected]>
Signed-off-by: Pooya Davoodi <[email protected]>
I ended up adding generation support because some model tests expect it due to GritLM using LLamaForCausalLM. |
Signed-off-by: Pooya Davoodi <[email protected]>
Signed-off-by: Pooya Davoodi <[email protected]>
Signed-off-by: Pooya Davoodi <[email protected]>
Signed-off-by: Pooya Davoodi <[email protected]>
You might have to merge from main to fix the CI failures. |
Signed-off-by: Pooya Davoodi <[email protected]>
This model is a fork of GritLM/GritLM-7B. The main change in the fork wrt the original repo is the name of the architecture to make vLLM adoption easier.