
Feat/Split Operator #2490

Merged
merged 35 commits on Nov 21, 2024

Conversation

@agelas (Contributor) commented Nov 14, 2024

Pull Request Template

Checklist

  • Confirmed that the `run-checks all` script has been executed.
  • Made sure the book is up to date with changes in this PR.

Related Issues/PRs

#2440

Changes

Adds support for the split operation. The ONNX -> Burn conversion will be part of a separate PR.
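As a rough illustration of the semantics (a minimal pure-Rust sketch, not the Burn API): along the chosen dimension, `split` produces sections of `split_size` elements, with the final section shorter when the dimension is not evenly divisible. On a 1-D slice, std's `slice::chunks` behaves the same way:

```rust
fn main() {
    // Hypothetical 1-D "tensor" of 10 elements, split with split_size = 3.
    let data: Vec<i32> = (0..10).collect();
    let parts: Vec<&[i32]> = data.chunks(3).collect();

    // Sections of 3, 3, 3, and a final shorter section of 1.
    assert_eq!(parts.len(), 4);
    assert_eq!(parts[3], &[9]);
    println!("{:?}", parts);
}
```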

Testing

Added tests under burn-tensor.


codecov bot commented Nov 15, 2024

Codecov Report

Attention: Patch coverage is 66.31356% with 159 lines in your changes missing coverage. Please review.

Project coverage is 82.86%. Comparing base (6d105ea) to head (86f5d31).
Report is 1 commit behind head on main.

Files with missing lines Patch % Lines
crates/burn-tensor/src/tensor/ops/qtensor.rs 0.00% 30 Missing ⚠️
crates/burn-tensor/src/tensor/api/base.rs 58.82% 28 Missing ⚠️
crates/burn-tch/src/ops/base.rs 0.00% 21 Missing ⚠️
crates/burn-autodiff/src/ops/int_tensor.rs 0.00% 14 Missing ⚠️
crates/burn-autodiff/src/ops/bool_tensor.rs 0.00% 10 Missing ⚠️
crates/burn-tch/src/ops/bool_tensor.rs 0.00% 10 Missing ⚠️
crates/burn-tch/src/ops/int_tensor.rs 0.00% 10 Missing ⚠️
crates/burn-tch/src/ops/tensor.rs 0.00% 10 Missing ⚠️
crates/burn-tensor/src/tensor/ops/bool_tensor.rs 0.00% 10 Missing ⚠️
crates/burn-tensor/src/tensor/ops/int_tensor.rs 0.00% 10 Missing ⚠️
... and 1 more
Additional details and impacted files
@@            Coverage Diff             @@
##             main    #2490      +/-   ##
==========================================
- Coverage   82.93%   82.86%   -0.08%     
==========================================
  Files         815      817       +2     
  Lines      105344   105853     +509     
==========================================
+ Hits        87371    87714     +343     
- Misses      17973    18139     +166     

@agelas marked this pull request as ready for review on November 15, 2024 at 20:40
@@ -4,6 +4,7 @@ mod tests {
use alloc::vec::Vec;
use burn_tensor::{Int, Shape, Tensor, TensorData};

#[test]
Contributor Author:

I think this was accidentally left out in #998

@@ -107,6 +107,7 @@ macro_rules! testgen_quantization {
burn_tensor::testgen_q_sin!();
burn_tensor::testgen_q_slice!();
burn_tensor::testgen_q_sort_argsort!();
// burn_tensor::testgen_q_split!();
Contributor Author:

@antimora @louisfd It looks like there's some sort of nuance here that I'm a bit out of the loop on: do I need to do something extra for the quantized set of tests, or do I even need quantized tests here?

Member:

Adding quantized tests is not well documented at the moment, so you're not expected to do it.
@laggui will work on making it more straightforward and he'll be able to adapt your non-quantized tests.

Comment on lines +25 to +41
pub fn split<B: Backend, K: TensorKind<B> + BasicOps<B>>(
    tensor: K::Primitive,
    split_size: usize,
    dim: usize,
) -> Vec<K::Primitive> {
    let size = K::shape(&tensor).dims[dim];
    let mut tensors = Vec::new();

    let mut start = 0;
    while start < size {
        let length = usize::min(split_size, size - start);
        tensors.push(narrow::<B, K>(tensor.clone(), dim, start, length));
        start += length;
    }

    tensors
}
Member:

I think this function will likely be the fastest in most cases, since there is no kernel to execute, only metadata to update.

@louisfd (Member) left a comment:

LGTM

///
/// To split a tensor, users should prefer the [Tensor::split](Tensor::split) function,
/// which is higher-level and designed for public use.
fn split(tensor: Self::Primitive, split_size: usize, dim: usize) -> Vec<Self::Primitive>;
Member:

What's the difference between this version of split and chunk? Is it only that split takes the size of the new chunks while chunk takes the number of chunks?

Contributor Author:

Correct, and then split_with_sizes goes a step further and lets you specify the size of each chunk.
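The distinction can be sketched in plain Rust (illustrative only, not the Burn API; `chunk` here assumes PyTorch-style ceiling division of the dimension size by the requested number of chunks):

```rust
// split: the argument is the size of each section; the last may be smaller.
fn split(dim_size: usize, split_size: usize) -> Vec<usize> {
    (0..dim_size)
        .step_by(split_size)
        .map(|start| usize::min(split_size, dim_size - start))
        .collect()
}

// chunk: the argument is the number of sections; each section has
// ceil(dim_size / chunks) elements except possibly the last.
fn chunk(dim_size: usize, chunks: usize) -> Vec<usize> {
    let chunk_size = (dim_size + chunks - 1) / chunks;
    split(dim_size, chunk_size)
}

fn main() {
    assert_eq!(split(10, 4), vec![4, 4, 2]);    // sections of size 4
    assert_eq!(chunk(10, 4), vec![3, 3, 3, 1]); // 4 sections
    // split_with_sizes would let the caller pass e.g. [2, 5, 3] directly,
    // provided the sizes sum to the dimension size.
}
```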


@nathanielsimard merged commit d1398d6 into tracel-ai:main on Nov 21, 2024
9 checks passed
@Luni-4 (Collaborator) commented Nov 21, 2024

Thanks a lot @agelas for your implementation! And thank you, @louisfd and @nathanielsimard for your reviews!
