[Misc][LoRA] Move the implementation of lora bias to punica.py #10829

jeejeelee · 2024-12-02T10:46:13Z

Motivation

Moving the lora bias calculation code from layers.py and fully_sharded_layers.py to punica.py.

The purpose is to consolidate all lora-related calculations in punica.py, making it easier to support lora on different hardware platforms in the future.

cc @SanjuCSudhakaran

Signed-off-by: Jee Jee Li <[email protected]>

github-actions · 2024-12-02T10:46:25Z

👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can do one of these:

Add ready label to the PR
Enable auto-merge.

🚀

jeejeelee · 2024-12-02T15:54:13Z

@DarkLight1337 If you have bandwith, plz look at this PR, thanks.

DarkLight1337

Thanks for the code cleanup!

DarkLight1337 · 2024-12-02T15:58:44Z

Please fix the lint errors.

Signed-off-by: Jee Jee Li <[email protected]>

jeejeelee added 5 commits November 30, 2024 01:31

Init

f6fa261

Signed-off-by: Jee Jee Li <[email protected]>

Merge remote-tracking branch 'origin/main' into move-lora-bias

6523be3

Done 1/2

aff0182

Signed-off-by: Jee Jee Li <[email protected]>

Done

20f8018

Signed-off-by: Jee Jee Li <[email protected]>

Add lora bias test

0a5aa73

jeejeelee changed the title ~~Move lora bias~~ [Misc][LoRA] Move the implementation of lora bias to punica.py Dec 2, 2024

jeejeelee requested a review from DarkLight1337 December 2, 2024 10:46

jeejeelee marked this pull request as draft December 2, 2024 10:47

jeejeelee added 2 commits December 2, 2024 15:39

Format code

ec9400d

Modify code

0e1613e

jeejeelee marked this pull request as ready for review December 2, 2024 15:52

DarkLight1337 approved these changes Dec 2, 2024

View reviewed changes

DarkLight1337 added the ready ONLY add when PR is ready to merge/full CI is needed label Dec 2, 2024

format code

db99ede

Signed-off-by: Jee Jee Li <[email protected]>

DarkLight1337 enabled auto-merge (squash) December 2, 2024 16:06

DarkLight1337 merged commit b45f0d7 into vllm-project:main Dec 2, 2024
49 checks passed

jeejeelee deleted the move-lora-bias branch December 2, 2024 23:21

jeejeelee mentioned this pull request Dec 3, 2024

[Bugfix] Fix QKVParallelLinearWithShardedLora bias bug #10844

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Misc][LoRA] Move the implementation of lora bias to punica.py #10829

[Misc][LoRA] Move the implementation of lora bias to punica.py #10829

jeejeelee commented Dec 2, 2024 •

edited by github-actions bot

Loading

github-actions bot commented Dec 2, 2024

jeejeelee commented Dec 2, 2024

DarkLight1337 left a comment

DarkLight1337 commented Dec 2, 2024

[Misc][LoRA] Move the implementation of lora bias to punica.py #10829

[Misc][LoRA] Move the implementation of lora bias to punica.py #10829

Conversation

jeejeelee commented Dec 2, 2024 • edited by github-actions bot Loading

Motivation

github-actions bot commented Dec 2, 2024

jeejeelee commented Dec 2, 2024

DarkLight1337 left a comment

Choose a reason for hiding this comment

DarkLight1337 commented Dec 2, 2024

jeejeelee commented Dec 2, 2024 •

edited by github-actions bot

Loading