Add benchmarks using iai-callgrind #447

tgross35 · 2025-01-16T02:16:14Z

Running walltime benchmarks in CI is notoriously unstable, Introduce benchmarks that instead use instruction count and other more reproducible metrics, using iai-callgrind 1, which we are able to run in CI with a high degree of reproducibility.

Inputs to this benchmark are a logspace sweep, which gives an approximation for real-world use, but may fail to indicate outlier cases.

Benchmarks need a way to limit how many iterations get run. Introuce a way to inject this information here.

Running walltime benchmarks in CI is notoriously unstable, Introduce benchmarks that instead use instruction count and other more reproducible metrics, using `iai-callgrind` [1], which we are able to run in CI with a high degree of reproducibility. Inputs to this benchmark are a logspace sweep, which gives an approximation for real-world use, but may fail to indicate outlier cases. [1]: https://github.com/iai-callgrind/iai-callgrind

Add support in `ci-util.py` for finding the most recent baseline and downloading it, which new tests can then be compared against. Arbitrarily select nightly-2025-01-16 as the rustc version to pin to in benchmarks.

The icount benchmarks are what we will be relying on in CI more than the existing benchmarks. There isn't much reason to keep these around, but there isn't much point in dropping them either. So, just reduce the runtime.

This failed a couple of times recently in CI, once on i686 and once on aarch64-apple: thread 'main' panicked at crates/libm-test/benches/random.rs:76:65: called `Result::unwrap()` on an `Err` value: ynf Caused by: 0: input: (681, 509.90924) (0x000002a9, 0x43fef462) expected: -3.2161271e38 0xff71f45b actual: -inf 0xff800000 1: mismatched infinities thread 'main' panicked at crates/libm-test/benches/random.rs:76:65: called `Result::unwrap()` on an `Err` value: ynf Caused by: 0: input: (132, 50.46604) (0x00000084, 0x4249dd3a) expected: -3.3364996e38 0xff7b02a5 actual: -inf 0xff800000 1: mismatched infinities Add a new override to account for this.

tgross35 mentioned this pull request Jan 16, 2025

WIP: iai-callgrind #444

Closed

tgross35 force-pushed the icount-benchmarks branch from 806b471 to 78d1f58 Compare January 16, 2025 08:49

Provide a way to override iteration count

342600b

Benchmarks need a way to limit how many iterations get run. Introuce a way to inject this information here.

tgross35 force-pushed the icount-benchmarks branch 2 times, most recently from 56e19b4 to 26aaf84 Compare January 16, 2025 09:06

tgross35 changed the title ~~WIP: iai-callgrind~~ Add benchmarks using iai-callgrind Jan 16, 2025

tgross35 added 3 commits January 16, 2025 09:07

Run iai-callgrind benchmarks in CI

6e6ab78

Add support in `ci-util.py` for finding the most recent baseline and downloading it, which new tests can then be compared against. Arbitrarily select nightly-2025-01-16 as the rustc version to pin to in benchmarks.

Reduce the warm up and measurement time for short-benchmarks

31b4115

The icount benchmarks are what we will be relying on in CI more than the existing benchmarks. There isn't much reason to keep these around, but there isn't much point in dropping them either. So, just reduce the runtime.

tgross35 force-pushed the icount-benchmarks branch from 26aaf84 to 31b4115 Compare January 16, 2025 09:07

tgross35 enabled auto-merge January 16, 2025 09:53

tgross35 merged commit a298d92 into rust-lang:master Jan 16, 2025
35 checks passed

tgross35 deleted the icount-benchmarks branch January 16, 2025 10:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add benchmarks using iai-callgrind #447

Add benchmarks using iai-callgrind #447

tgross35 commented Jan 16, 2025 •

edited

Loading

Add benchmarks using iai-callgrind #447

Add benchmarks using iai-callgrind #447

Conversation

tgross35 commented Jan 16, 2025 • edited Loading

tgross35 commented Jan 16, 2025 •

edited

Loading