PHPLIB-1187: Run benchmark on Evergreen #1185

Merged
5 commits merged into mongodb:master on Oct 30, 2023

Conversation

@alcaeus (Member) commented Oct 20, 2023

PHPLIB-1187

This pull request adds a task to run benchmarks on Evergreen. It also uses perf.send to store benchmark metrics in CI so we can analyse the performance.

I've also removed a couple of benchmarks in order to bring the benchmark time down. Most recent tests show a 33 minute runtime on Evergreen. I also had to skip the AMP subjects as I ran into issues with socket creation. Since AMP does not perform significantly different from forking, I think it's safe to exclude them from the metrics we collect on Evergreen.

@alcaeus alcaeus requested a review from GromNaN October 20, 2023 08:09
@alcaeus alcaeus self-assigned this Oct 20, 2023
@@ -6,6 +6,5 @@
"runner.file_pattern": "*Bench.php",
"runner.path": "src",
"runner.php_config": { "memory_limit": "1G" },
"runner.iterations": 3,
"runner.revs": 10
alcaeus (Member Author):
Removed this in favour of increasing revs only for benchmarks that run in the microsecond range. For anything that takes milliseconds, the precision is usually good enough with a single rev.
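As a sketch of what per-subject revs look like in phpbench (the class and subject names below are hypothetical, not taken from the actual suite):

```php
<?php

use PhpBench\Attributes\Revs;

// Hypothetical benchmark class illustrating the approach described above:
// only microsecond-range subjects request extra revolutions, while
// millisecond-range subjects rely on the default single rev.
final class DecodingBench
{
    // Microsecond-range subject: many revs keep per-rev timings above timer noise.
    #[Revs(1000)]
    public function benchDecodeSmallDocument(): void
    {
        // ... decode a small document ...
    }

    // Millisecond-range subject: a single rev is precise enough.
    public function benchDecodeLargeDocument(): void
    {
        // ... decode a large document ...
    }
}
```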

@@ -74,15 +72,15 @@ public function afterIteration(): void
* Using a single thread to export multiple files.
* By executing a single Find command for multiple files, we can reduce the number of roundtrips to the server.
*
* @param array{chunk:int} $params
* @param array{chunkSize:int} $params
alcaeus (Member Author):
Decided to rename this since I got confused as to what chunk meant: I thought chunk: 1 meant a single chunk of files while it actually meant 100 chunks of 1 file each.
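To illustrate what the renamed parameter conveys (the provider shape here is assumed for illustration, not copied from the suite): chunkSize is the number of files per chunk, so ['chunkSize' => 1] splits 100 files into 100 chunks of one file each.

```php
<?php

// Hypothetical param provider sketch for the renamed parameter.
function provideChunkParams(): Generator
{
    // chunkSize: files per chunk. 1 => 100 chunks of one file each.
    yield 'one file per chunk' => ['chunkSize' => 1];
    // 100 => a single chunk containing all 100 files.
    yield 'all files in one chunk' => ['chunkSize' => 100];
}

foreach (provideChunkParams() as $label => $params) {
    echo $label, ': chunkSize=', $params['chunkSize'], "\n";
}
```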

* @param array{chunk:int} $params
*/
#[ParamProviders(['provideChunkParams'])]
public function benchBulkWrite(array $params): void
alcaeus (Member Author):
This subject essentially tests the same as benchInsertMany, so I decided to skip it in order to shave ~6 minutes off the run time.

GromNaN (Member):
We could keep this one and remove the other, which is a little less efficient.

alcaeus (Member Author):
The reason I removed bulkWrite and kept insertMany is that benchmarking the latter also exposes performance regressions introduced in the library, while the former only exercises the extension.

Comment on lines +61 to +63
'created_at' => date(DATE_ATOM),
'completed_at' => date(DATE_ATOM),
alcaeus (Member Author):
I don't think this information is relevant to us, so I didn't bother trying to extract the exact dates. The documentation also didn't mark this as optional, so I decided to just include the current date instead.

GromNaN (Member):
Well, created_at could take $_SERVER['REQUEST_TIME'].
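A minimal sketch contrasting the two options in plain PHP (variable names are illustrative; this is not the actual perf.send payload schema):

```php
<?php

// What the PR does: stamp both fields with the current time.
$createdAt   = date(DATE_ATOM);
$completedAt = date(DATE_ATOM);

// GromNaN's suggestion: derive created_at from the process start time,
// which $_SERVER exposes on the CLI SAPI too (falling back to time()
// defensively in case the key is missing).
$createdAtFromRequest = date(DATE_ATOM, $_SERVER['REQUEST_TIME'] ?? time());

echo $createdAtFromRequest, "\n";
```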

@alcaeus force-pushed the phplib-1187-evg-benchmark branch from 40f013a to bbcc0f9 on October 20, 2023 08:14
@GromNaN (Member) left a comment:
LGTM.

@@ -1 +1,2 @@
extension=mongodb.so
memory_limit=-1
GromNaN (Member):
You should increase the limit rather than remove it entirely. Otherwise, if too much memory is used, we won't know how much. What is the memory limit of the job runner?

alcaeus (Member Author):
For some reason it aborted when it hit a previous 128M limit, apparently ignoring the 1G limit defined in the runner config. We can also report memory usage from the benchmarks using perf.send, which would be a better indicator than failing the build when it hits some limit. Want me to add those numbers?

GromNaN (Member):
perf.send is too late if the process crashes, no?

alcaeus (Member Author):
Correct, perf.send would not be executed in that case. With -1 the memory would be unlimited, so the only case in which it would crash is if it used up all memory including the page file, which I'd consider highly unlikely.
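A sketch of what reporting peak memory as a metric could look like (the field names and output file are illustrative assumptions, not the real perf.send payload schema):

```php
<?php

// Collect peak memory after the benchmark run and emit it alongside the
// timing metrics, instead of relying on a hard memory_limit crash.
$metrics = [
    [
        'name'  => 'peak_memory_bytes',
        // true: report real (OS-allocated) memory rather than emalloc usage.
        'value' => memory_get_peak_usage(true),
    ],
];

file_put_contents('benchmark-memory.json', json_encode($metrics, JSON_PRETTY_PRINT));
```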

@alcaeus alcaeus merged commit 567cfe1 into mongodb:master Oct 30, 2023
13 checks passed
@alcaeus alcaeus deleted the phplib-1187-evg-benchmark branch October 30, 2023 08:06