Performance regression in `arrow_batch_points/query` benchmark #3233

teh-cmc · 2023-09-06T14:20:18Z

Somehow it seems that #3162 caused a serious performance regression for this specific benchmark:

(See it live here)

emilk · 2023-09-25T08:21:36Z

For 0.9: let's see if this has a significant impact on a real point cloud example, or if we can punt this to 0.10.

emilk · 2023-09-28T22:31:48Z

I wonder if some optimizations got lost in #3162, e.g. the ones from #2970.

emilk · 2023-10-06T06:31:14Z

Just as a note: real-life point cloud performance has improved by 3x from 0.8.2 to 0.9: #1136 (comment)

teh-cmc · 2023-10-09T12:43:23Z

The regression only impacts the legacy query APIs, which do not exist anymore.

$ git co 905fcaf56  # pre-offending PR

$ cargo bench --all-features -p re_query -- 'arrow_batch_points/query'  # legacy APIs
arrow_batch_points/query
                        time:   [5.4049 µs 5.4188 µs 5.4373 µs]
                        thrpt:  [183.92 Melem/s 184.54 Melem/s 185.02 Melem/s]

$ cargo bench --all-features -p re_query -- 'arrow_batch_points2/query' # new APIs
arrow_batch_points2/query
                        time:   [7.4022 µs 7.4485 µs 7.4947 µs]
                        thrpt:  [133.43 Melem/s 134.26 Melem/s 135.09 Melem/s]

$ git co 0a2258a  # offending PR

$ cargo bench --all-features -p re_query -- 'arrow_batch_points/query'  # legacy APIs
arrow_batch_points/query
                        time:   [14.340 µs 14.359 µs 14.382 µs]
                        thrpt:  [69.531 Melem/s 69.642 Melem/s 69.734 Melem/s]
                 change:
                        time:   [+164.89% +165.74% +166.47%] (p = 0.00 < 0.05)
                        thrpt:  [-62.472% -62.369% -62.248%]
                        Performance has regressed.

$ cargo bench --all-features -p re_query -- 'arrow_batch_points2/query'  # new APIs
arrow_batch_points2/query
                        time:   [7.4420 µs 7.4699 µs 7.4966 µs]
                        thrpt:  [133.39 Melem/s 133.87 Melem/s 134.37 Melem/s]
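A quick sanity check on how the `time` and `thrpt` lines relate, as a rough sketch: throughput in Melem/s multiplied by time per iteration in µs gives the number of elements processed per iteration. The helper name `implied_elements` is made up for illustration, and the inputs are the middle values of each bracketed triple above. That the result comes out near 1000 for every run is an inference from these numbers, not something stated in the benchmark output.

```rust
/// Number of elements processed per iteration, implied by a throughput
/// (in Melem/s) and a time per iteration (in µs).
/// Melem/s * µs = (1e6 elem/s) * (1e-6 s) = elements, so the units cancel.
/// Hypothetical helper, not part of `re_query` or Criterion.
fn implied_elements(thrpt_melem_per_s: f64, time_us: f64) -> f64 {
    thrpt_melem_per_s * time_us
}

fn main() {
    // (label, time in µs, throughput in Melem/s): middle values copied
    // from the benchmark output quoted above.
    let runs = [
        ("905fcaf56 legacy", 5.4188, 184.54),
        ("905fcaf56 new", 7.4485, 134.26),
        ("0a2258a legacy", 14.359, 69.642),
        ("0a2258a new", 7.4699, 133.87),
    ];

    for (label, time_us, thrpt) in runs {
        println!(
            "{label}: ~{:.1} elements per iteration",
            implied_elements(thrpt, time_us)
        );
    }
}
```

Since the implied element count is the same across all four runs, the throughput differences come entirely from the time per iteration.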

teh-cmc · 2023-10-09T12:48:09Z

Oh, and for completeness, here's main:

$ git co main

$ cargo bench --all-features -p re_query -- 'arrow_batch_points2/query'
    Finished bench [optimized + debuginfo] target(s) in 0.16s
     Running benches/query_benchmark.rs (target/release/deps/query_benchmark-78136916f96ddd02)
arrow_batch_points2/query
                        time:   [2.6338 µs 2.6503 µs 2.6766 µs]
                        thrpt:  [373.60 Melem/s 377.32 Melem/s 379.67 Melem/s]
                 change:
                        time:   [-64.660% -64.496% -64.320%] (p = 0.00 < 0.05)
                        thrpt:  [+180.27% +181.66% +182.96%]
                        Performance has improved.

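Which is indeed about 3x faster: roughly 2.8x relative to the previous `arrow_batch_points2` run (7.47 µs → 2.65 µs), and roughly 5.4x relative to the regressed legacy APIs (14.36 µs).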
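For anyone who wants to re-derive the ratios from the point estimates quoted in this thread, here is a minimal sketch. The helper names (`speedup`, `relative_change_pct`) are made up for illustration and are not part of `re_query` or Criterion; the inputs are the middle values of the `time:` lines. The results will not match Criterion's `change:` lines to the last digit, presumably because Criterion estimates the change from its sample data rather than from rounded point estimates.

```rust
/// How many times faster `new_time` is compared to `old_time`
/// (both in the same unit). Hypothetical helper for illustration.
fn speedup(old_time: f64, new_time: f64) -> f64 {
    old_time / new_time
}

/// Relative change in percent when going from `old` to `new`.
/// Positive means the value grew (for a time: a regression).
fn relative_change_pct(old: f64, new: f64) -> f64 {
    (new - old) / old * 100.0
}

fn main() {
    // Point estimates in µs, copied from the benchmark output in this thread.
    let legacy_before = 5.4188; // 905fcaf56, arrow_batch_points/query
    let legacy_after = 14.359; // 0a2258a, arrow_batch_points/query
    let new_after = 7.4699; // 0a2258a, arrow_batch_points2/query
    let new_main = 2.6503; // main, arrow_batch_points2/query

    println!(
        "legacy regression: {:+.1}%",
        relative_change_pct(legacy_before, legacy_after)
    );
    println!(
        "main vs new APIs at 0a2258a: {:.2}x faster",
        speedup(new_after, new_main)
    );
    println!(
        "main vs legacy before regression: {:.2}x faster",
        speedup(legacy_before, new_main)
    );
    println!(
        "main vs legacy after regression: {:.2}x faster",
        speedup(legacy_after, new_main)
    );
}
```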

teh-cmc added the 🚀 performance (Optimization, memory use, etc) and 🦟 regression (A thing that used to work in an earlier release) labels Sep 6, 2023

teh-cmc added this to the 0.9 - Codegen, Type Oriented Api milestone Sep 6, 2023

emilk assigned emilk and unassigned emilk Sep 29, 2023

teh-cmc self-assigned this Oct 9, 2023

teh-cmc closed this as completed Oct 9, 2023
