Vectorize KV-Cache by using `Vec4` #222

officialcjunior · 2024-06-21T09:07:51Z

Currently, the KV-Cache operation is scalar, this PR attempts to vectorize it.

Fixes #210

github-actions · 2024-06-21T09:08:52Z

Code Metrics Report

  ===============================================================================
 Language            Files        Lines         Code     Comments       Blanks
===============================================================================
 TOML                    1           75           63            2           10
-------------------------------------------------------------------------------
 Rust                   62        13276        11417          185         1674
 |- Markdown            34          311            0          244           67
 (Total)                          13587        11417          429         1741
===============================================================================
 Total                  63        13351        11480          187         1684
===============================================================================

FL33TW00D · 2024-06-21T09:23:21Z

crates/ratchet-core/src/ops/cache.rs

-        builder.register_storage("C", BindingMode::ReadWrite, Array::<P>::default());
-        builder.register_storage("S", BindingMode::ReadOnly, Array::<P>::default());
-        builder.register_storage("D", BindingMode::ReadWrite, Array::<P>::default());
+        builder.register_storage("C", BindingMode::ReadWrite, Array::<vec4<f32>>::default());


This will only work for vec4<f32>!

Okay, just learnt that Vec4<T> implements WgslPrimitive for any T. Making it Array::<P> itself, from what I understand.

FL33TW00D · 2024-06-21T09:33:11Z

crates/ratchet-core/src/ops/cache.rs


            let dim = metadata.dim;
            if (dst_index[dim] < metadata.cum0) {
                //Inside cache, just copy from cache to DST
-                let src_offset = ndIndexToOffset(dst_index, metadata.cache_stride);
+                let src_offset = ndIndexToOffset(dst_index, metadata.cache_stride) / 4u;
                D[dst_offset] = C[src_offset];
                return;
            }

            if (dst_index[dim] < metadata.cum1) {


What will happen to cum1 here if all lengths are / 4

Hmm 🤔 . I'm still trying to understand this one and what needs to be done about it.

I suppose I need to divide cum1 by 4 in write_metadata too?

crates/ratchet-core/src/ops/cache.rs

…or `KernelElement`

FL33TW00D · 2024-07-02T15:47:23Z

@officialcjunior Haven't forgotten about this, just working on the refactor 👍🏻

Vectorize KV-Cache by using Vec4

5bd5740

FL33TW00D reviewed Jun 21, 2024

View reviewed changes

crates/ratchet-core/src/ops/cache.rs Show resolved Hide resolved

Use Array::<P>> while registering storage and add more conditions f…

18f24fb

…or `KernelElement`

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Vectorize KV-Cache by using `Vec4` #222

Vectorize KV-Cache by using `Vec4` #222

officialcjunior commented Jun 21, 2024

github-actions bot commented Jun 21, 2024 •

edited

Loading

FL33TW00D Jun 21, 2024

officialcjunior Jun 23, 2024

FL33TW00D Jun 21, 2024

officialcjunior Jun 23, 2024

FL33TW00D commented Jul 2, 2024

Vectorize KV-Cache by using Vec4 #222

Are you sure you want to change the base?

Vectorize KV-Cache by using Vec4 #222

Conversation

officialcjunior commented Jun 21, 2024

github-actions bot commented Jun 21, 2024 • edited Loading

FL33TW00D Jun 21, 2024

Choose a reason for hiding this comment

officialcjunior Jun 23, 2024

Choose a reason for hiding this comment

FL33TW00D Jun 21, 2024

Choose a reason for hiding this comment

officialcjunior Jun 23, 2024

Choose a reason for hiding this comment

FL33TW00D commented Jul 2, 2024

Vectorize KV-Cache by using `Vec4` #222

Vectorize KV-Cache by using `Vec4` #222

github-actions bot commented Jun 21, 2024 •

edited

Loading