Commit
Allows interweaving of arbitrary kinds of 'attention' layers, like sliding window, reuse prev layer kv cache etc. (#1299)

* [WIP] Allows interweaving of arbitrary kinds of 'attention' layers, like RNN, sliding window etc.
* lint
* applying overrides to blocks rather than just attentions
* add docstring
* minor
* changing yaml specification style
* ..
* fixes
* fix
* fix
* fix
* refactoring
* add warning
* compute only query vector when reusing kv
* refactor
* fixing
* adding test for reusing previous layer kv cache
* adding error messages
* ..
* adding test
* add logging
* adding logging
* minor
* bug fix, adding test
* minor
* addressing some comments
* addressing some comments
* setting absolute value for reuse_kv_layer_idx
* lint
* adding tests for override_block_args
* adding error if reusing kv cache from a mismatch layer
* fixing test
* fixing code, test
* fix
* ..
* refactoring
* fix
* ..
* ..
* ..
* refactoring
* ..
* ..
* ..
* adding test for _get_modules_order_expanded
* fixing test
* fixing test
* lint
* lint
* adding test
* addressing comment
* ..
* fixing test
* changing yaml format
* fix configuration
* fixing test
* allowing repeat at top level
* allowing overriding error
* addressing comments
* lint
* addressing comments
* fix
* ..
* ..
* ..
* ..
* ..
* addressing comment
* fixing test
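
The commit message mentions setting an absolute value for reuse_kv_layer_idx and raising an error when a layer tries to reuse a KV cache from an unsuitable layer. The following is a minimal sketch of what such index resolution could look like, not the code from this commit. The function name `resolve_reuse_kv_layer_idx`, the convention that a negative offset means "N layers back", and the specific error conditions are all assumptions made for illustration.

```python
from typing import List, Optional


def resolve_reuse_kv_layer_idx(
    relative: List[Optional[int]],
) -> List[Optional[int]]:
    """Hypothetical helper: turn relative reuse offsets into absolute indices.

    relative[i] is None when layer i computes its own keys and values, or a
    negative offset (assumed convention: -1 means "the previous layer") when
    layer i reuses another layer's KV cache.
    """
    absolute: List[Optional[int]] = []
    for layer_idx, offset in enumerate(relative):
        if offset is None:
            # This layer owns its KV cache.
            absolute.append(None)
            continue
        if offset >= 0:
            raise ValueError(
                f"layer {layer_idx}: offset must be negative, got {offset}"
            )
        source = layer_idx + offset
        if source < 0:
            raise ValueError(
                f"layer {layer_idx}: offset {offset} points before layer 0"
            )
        # If the source layer itself reuses a cache, point at the layer that
        # actually owns it. Earlier entries are already fully resolved, so a
        # single hop is enough.
        if absolute[source] is not None:
            source = absolute[source]
        absolute.append(source)
    return absolute


# Layers 1 and 2 reuse the previous layer; layer 4 reuses two layers back.
print(resolve_reuse_kv_layer_idx([None, -1, -1, None, -2]))
# -> [None, 0, 0, None, 0]
```

Resolving to absolute indices once, up front, means each reusing layer can look up its source cache directly and only needs to compute its own query vector at run time.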
1 parent 88511f7, commit 8604bba
Showing 6 changed files with 849 additions and 25 deletions.