feat: Try experimental commit making tensorizer less intrusive #3

sangstar · 2024-12-20T18:08:25Z

I feel my current feature branch proposes fairly "ambitious" changes for exllamav2, or at least ones that would ask the maintainer(s) to add fairly significant properties to their inference engine (such as a public state_dict attribute on their config class).

In this commit I tried to integrate tensorizer in a way that minimizes modifications to the source code of exllamav2's model-loading machinery, by applying the tensorizer_context decorator to the functions where tensorizer hooks are needed.

This is unfinished and rough. I honestly don't need to go this far to make my PR "less intrusive", but felt it might be interesting to consider. Some of the motivations for this weren't necessarily addressed any "better" either (for instance, I still create a private _state_dict attribute on their config class, though only if the decorator is called).
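To make the idea concrete, here is a minimal sketch of the pattern described above: a decorator that wraps a loader function, lazily creates a private `_state_dict` attribute on the config object, and marks the wrapped function with an attribute. Everything in it is an assumption for illustration: `Config`, `tensorizer_context_sketch`, `load_weights`, and `uses_tensorizer` are hypothetical names, and this is not the actual `tensorizer_context` from the branch, nor does it call tensorizer or exllamav2.

```python
import functools


class Config:
    """Hypothetical stand-in for the engine's config class."""

    def __init__(self, model_dir):
        self.model_dir = model_dir


def tensorizer_context_sketch(func):
    """Wrap a loader so hooks run around it (illustrative only)."""

    @functools.wraps(func)
    def wrapper(config, *args, **kwargs):
        # Create the private attribute lazily, so config objects that never
        # pass through a decorated function are left untouched.
        if not hasattr(config, "_state_dict"):
            config._state_dict = {}
        result = func(config, *args, **kwargs)
        # Post-hook: record whatever the loader produced, keyed by function name.
        config._state_dict[func.__name__] = result
        return result

    # Attribute on the function object, used here as a simple marker.
    wrapper.uses_tensorizer = True
    return wrapper


@tensorizer_context_sketch
def load_weights(config):
    # Placeholder for real loading logic (hypothetical tensor name and values).
    return {"layer.0.weight": [0.0, 1.0]}


plain = Config("/models/a")   # never passed to a decorated function
hooked = Config("/models/b")
load_weights(hooked)          # triggers creation of hooked._state_dict
```

In this sketch the core loading function stays unchanged apart from the decorator line, which is the appeal of the approach; the cost is that state is attached to objects implicitly, which can be harder to follow than an explicit public attribute.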

I figured I'd get initial feedback before going further in this direction, and scrap it otherwise, while this change is still unfinished.

This change should (hopefully) make for a smaller line count in their core logic. The diff here is against my feature branch, but the reduction should be clearer when comparing against this fork's master branch. It likely also makes the integration less readable, as it plays with things like attributes on function objects. It also tries to modularize certain parts of exl2's loading logic into functions so that hooks can be applied to them, which works against the goal of being unintrusive.

feat: Try experimental commit making tensorizer less intrusive

2767d92
