
Benchmarks #200

Open · wants to merge 4 commits into master
Conversation

paualarco (Member) commented Jun 1, 2020

WIP
This PR aims to add benchmarking for monix-kafka. #116
The approach is to spin up a Kafka cluster using Docker containers; for the moment only one broker, but more might be added later.
The plan/strategy for this benchmark is explained in more detail in the readme.md within the benchmarks subproject.

I guess it would also be cool to have benchmarks for previousVersion and nextVersion in order to compare them, but maybe that should go in a different PR?

Avasil (Collaborator) commented Jun 1, 2020

Awesome, I'm very happy to see it!
Not sure how reliable these benchmarks will be, but hopefully they are good enough to spot any noticeable regressions or areas for improvement (when compared to other libraries).

@paualarco paualarco marked this pull request as draft June 2, 2020 13:58
@paualarco paualarco marked this pull request as ready for review September 27, 2020 10:04
paualarco (Member, Author) commented

@Avasil I have made some progress on this one ☝️

Finally, the benchmarks are structured as follows:

Consumer Observable

  • Topic with partitioning of 1 and 2.
  • Manual Commit
  • AutoCommit (Async and Sync)

Single Producer

  • Synchronous, for topics with partitioning of 1 and 2

Sink Producer

  • Topic with partitioning of 1 and 2.
  • Parallelism of 100
  • Single threaded with no parallelism

If you think other scenarios should be contemplated, please suggest :)
It would also be good in the future to compare with Kafka integrations from other reactive stream libraries.
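For reference, a single-producer scenario of the kind listed above could be sketched with JMH roughly as follows. This is a minimal sketch, not the code in this PR; the topic name, broker address, and benchmark parameters are assumptions, and it needs a running single-broker cluster (e.g. the Docker one mentioned earlier):

```scala
import java.util.concurrent.TimeUnit

import monix.execution.Scheduler.Implicits.global
import monix.kafka.{KafkaProducer, KafkaProducerConfig}
import org.openjdk.jmh.annotations._

@State(Scope.Thread)
@BenchmarkMode(Array(Mode.Throughput))
@OutputTimeUnit(TimeUnit.SECONDS)
class SingleProducerBenchmark {

  // Assumes a broker reachable on localhost; adjust to the Docker setup
  val producerCfg: KafkaProducerConfig = KafkaProducerConfig.default.copy(
    bootstrapServers = List("127.0.0.1:9092")
  )

  // Hypothetical topic, created beforehand with 1 (or 2) partitions
  val topic = "benchmark-topic-1p"

  val producer: KafkaProducer[String, String] =
    KafkaProducer[String, String](producerCfg, global)

  @Benchmark
  def syncProduce(): Unit =
    // Sends one record and blocks until the broker acknowledges it
    producer.send(topic, "message").runSyncUnsafe()

  @TearDown
  def tearDown(): Unit =
    producer.close().runSyncUnsafe()
}
```

With sbt-jmh this would be run via `jmh:run`, and JMH takes care of warmup and measurement iterations.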

Avasil (Collaborator) commented Sep 27, 2020

Fantastic, I hope I can finally find some time to update the library 😅

> If you think other scenarios should be contemplated, please suggest :)
> It would also be good in the future to compare with Kafka integrations from other reactive stream libraries.

I remember this blog post from a while ago. We could do something similar, that is, compare against the plain Kafka producer/consumer to see what kind of overhead we introduce.

paualarco (Member, Author) commented

Writing a blog post would be nice; we could even include it in the web docs once they get merged.
But probably most important now is to keep the library updated. On that side, is there something I could do to help with ongoing maintenance?

Avasil (Collaborator) commented Sep 27, 2020

> Writing a blog post would be nice; we could even include it in the web docs once they get merged.

I meant it more as a benchmark scenario. I'm hesitant to advertise benchmarks here because the inaccuracy / error is very high (e.g. 49.599 ± 12.737) and, well, the library really needs an update :D

> On that side, is there something I could do to help with ongoing maintenance?

It would be awesome to pick up and fix the issue described in #104.
I've been procrastinating on it for a year now. :)

Some stuff is on me, like checking out your docs PR.

Other than that, I would probably release a version (1.0.0?) that is similar to the current master, then remove all version-specific modules, maybe do some refactoring around Serializers, returning Consumer as a Resource, etc., and release 2.0.0 soon after. But that's more "mid-term".

paualarco (Member, Author) commented

> I'm hesitant to advertise benchmarks here because the inaccuracy / error is very high (e.g. 49.599 ± 12.737) and, well, the library really needs an update :D

Yup, the error is quite high. Do you know whether it is possible to replicate the benchmarks the blog post exposes using JMH?

They seem quite general, so in order to know how many elements were produced/consumed from the Kafka topics, we could maybe just run the same scenarios as in the tests, but with an unlimited number of elements and a timeout limit (each test in a separate topic), and then count them.
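The count-in-a-time-window idea could look roughly like this with monix-kafka. This is a sketch under assumed names: the config values, topic name, and 30-second window are illustrative, and a running broker is required:

```scala
import scala.concurrent.duration._

import monix.execution.Scheduler.Implicits.global
import monix.kafka.{KafkaConsumerConfig, KafkaConsumerObservable}

// Hypothetical consumer settings; assumes a broker on 127.0.0.1:9092
val consumerCfg: KafkaConsumerConfig = KafkaConsumerConfig.default.copy(
  bootstrapServers = List("127.0.0.1:9092"),
  groupId = "throughput-count"
)

// Consume from one topic for a fixed time window, then count the records
val consumedInWindow: Long =
  KafkaConsumerObservable[String, String](consumerCfg, List("benchmark-topic"))
    .takeByTimespan(30.seconds)
    .countL
    .runSyncUnsafe()
```

Running the same window against each library's consumer (and against the plain Kafka client) would give comparable per-window counts without needing JMH for this particular measurement.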

Avasil (Collaborator) commented Sep 27, 2020

> Yup, the error is quite high. Do you know whether it is possible to replicate the benchmarks the blog post exposes using JMH?

They probably have (or had) the benchmark there: https://github.com/akka/alpakka-kafka/tree/master/benchmarks/src

We don't have to do it exactly the same way.

> They seem quite general, so in order to know how many elements were produced/consumed from the Kafka topics, we could maybe just run the same scenarios as in the tests, but with an unlimited number of elements and a timeout limit (each test in a separate topic), and then count them.

I don't think we need to provide a msg/s result; I think it's more important to see what kind of overhead we introduce.
If our result is 80% of the plain Kafka result, then that's valuable information. And if, let's say, another library gets 90%, then we know there is room for improvement.
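Concretely, the comparison boils down to a simple ratio against the plain-client baseline. The numbers below are made up purely to illustrate the 80%/90% example from the comment:

```scala
// Hypothetical throughput figures from the same scenario, in msg/s
val plainKafka = 100000.0 // baseline: plain Kafka producer/consumer
val monixKafka = 80000.0  // monix-kafka wrapper
val otherLib   = 90000.0  // some other reactive Kafka integration

// Relative throughput: monix-kafka keeps 80% of the baseline while the
// other library keeps 90%, so there is measurable room for improvement
val monixRatio = monixKafka / plainKafka // 0.8
val otherRatio = otherLib / plainKafka   // 0.9
```

A ratio like this is also less sensitive to the absolute noise of the benchmark environment than raw msg/s figures, since both measurements run on the same hardware.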

Commits

  • Block
  • Fix
  • Binded connection
  • First benchmark for kafka producer
  • Added kafka benchmark strategy plan
  • Added sink and consumer benchmarks
  • Producer results
  • Akka
  • Removed references to akka
  • a
  • Final