Tracking: setup micro benchmark for stream executors with in-memory state #5678

Open · 18 tasks
lmatz opened this issue Oct 1, 2022 · 6 comments
Labels: component/streaming, type/feature, type/perf, type/tracking

lmatz (Contributor) commented Oct 1, 2022:

As mentioned in #5227, the purpose is to measure pure computing performance, and a prerequisite for that is an efficient in-memory store.

As mentioned in #5227 (comment) by @BugenZhao, we don't need to wait for a new in-memory store for now, unless the current in-memory store implementation turns out to be a performance bottleneck.

  • stream: Setup benchmark for HashAggExecutor #5683
  • stream: Setup benchmark for HashJoinExecutor
  • stream: Setup benchmark for LocalSimpleAggExecutor
  • stream: Setup benchmark for GlobalSimpleAggExecutor
  • stream: Setup benchmark for DynamicFilterExecutor
  • stream: Setup benchmark for DispatchExecutor
  • stream: Setup benchmark for FilterExecutor
  • stream: Setup benchmark for ProjectSetExecutor
  • stream: Setup benchmark for ExpandExecutor
  • stream: Setup benchmark for BatchQueryExecutor
  • stream: Setup benchmark for ChainExecutor
  • stream: Setup benchmark for RearrangedChainExecutor
  • stream: Setup benchmark for HopWindowExecutor
  • stream: Setup benchmark for LookupExecutor
  • stream: Setup benchmark for LookupUnionExecutor
  • stream: Setup benchmark for UnionExecutor
  • stream: Setup benchmark for SourceExecutor
  • stream: Setup benchmark for MaterializeExecutor

Depends on #6285
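
For a sense of what each task above entails, here is a minimal sketch of such a micro benchmark using criterion and a current-thread tokio runtime. `run_executor_once` and the commented-out `build_hash_agg` / `MemoryStateStore` wiring are placeholders for whatever each task actually sets up, not the real harness:

```rust
use criterion::{criterion_group, criterion_main, Criterion};

// Placeholder for the real setup: build the executor under test
// (HashAggExecutor, FilterExecutor, ...) on an in-memory state store
// and drain its output stream.
async fn run_executor_once() {
    // let store = MemoryStateStore::new();               // fresh in-memory store
    // let mut stream = build_hash_agg(store).execute();  // executor under test
    // while let Some(msg) = stream.next().await { criterion::black_box(msg); }
}

fn bench_hash_agg(c: &mut Criterion) {
    // A current-thread runtime keeps the measurement single-threaded,
    // so the store's lock is never contended.
    let rt = tokio::runtime::Builder::new_current_thread()
        .build()
        .unwrap();
    c.bench_function("hash_agg_in_memory", |b| {
        b.iter(|| rt.block_on(run_executor_once()));
    });
}

criterion_group!(benches, bench_hash_agg);
criterion_main!(benches);
```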

lmatz added the type/feature, type/tracking, and component/streaming labels on Oct 1, 2022
The github-actions bot added this issue to the release-0.1.14 milestone on Oct 1, 2022
lmatz added the type/perf label on Oct 1, 2022
jon-chuang (Contributor) commented Oct 10, 2022:

To my understanding, the current impl is a per-task store:

```rust
inner: Arc<RwLock<BTreeMap<KeyWithEpoch, Option<Bytes>>>>,
```

which is probably as good as one can do. We won't be able to scale up/down or test recovery with this impl, but that's not our objective.

Or are we using the shared version of MemoryStateStore? That would be bad, as it would then be a single lock.

So, to confirm: we are using a per-task store for benchmarking purposes, correct?

BugenZhao (Member) commented:

The shared one is a singleton, which can be used to simulate shared storage when running multiple compute nodes in a single process with risedev p.

For other cases, we're using the one constructed here, whose lifetime is the same as compute_node_serve and which is shared by all executors in this compute node:

```rust
let state_store = StateStoreImpl::new(
```
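
To make the per-task vs. shared distinction concrete, here is a toy sketch with illustrative types only, not RisingWave's actual MemoryStateStore API: the per-task constructor hands out an independent map, while the shared one is a process-wide singleton, so every clone points at the same map behind the same RwLock:

```rust
use std::collections::BTreeMap;
use std::sync::{Arc, OnceLock, RwLock};

// Toy stand-ins for the real key/value types.
type KeyWithEpoch = Vec<u8>;
type Bytes = Vec<u8>;

#[derive(Clone, Default)]
struct MemoryStateStore {
    inner: Arc<RwLock<BTreeMap<KeyWithEpoch, Option<Bytes>>>>,
}

impl MemoryStateStore {
    // Per-task: each call gets an independent map, so benchmarks on
    // different tasks never contend on the same lock.
    fn new() -> Self {
        Self::default()
    }

    // Shared: a process-wide singleton; every clone shares one map (and
    // one RwLock), which is useful for simulating shared storage across
    // compute nodes, but bad for contention-free benchmarking.
    fn shared() -> Self {
        static SHARED: OnceLock<MemoryStateStore> = OnceLock::new();
        SHARED.get_or_init(MemoryStateStore::new).clone()
    }
}
```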

jon-chuang (Contributor) commented:

😥 I see. It would be nice if we could create one that is spawned on a per-thread or per-task basis, so we don't have to worry about contention at all... To my understanding, that is the objective of these in-memory benchmarks, i.e. to test purely the compute performance, up to data serialization/deserialization?

BugenZhao (Member) commented:

If we bench a single operator in the style of an integration test, in a single thread, there will also be no contention. IIRC, the storage team is working on a refactoring of the local state store; we can check whether this can be improved then.
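
A toy illustration of the no-contention point, assuming the benchmark is the only thread touching the store: an uncontended RwLock acquisition never blocks, so a single-threaded bench pays only its small constant cost:

```rust
use std::sync::{Arc, RwLock};

fn main() {
    // Stand-in for a shared store: one lock, but only one thread using it.
    let store = Arc::new(RwLock::new(0u64));
    for _ in 0..1_000_000 {
        // Sole accessor: every write() succeeds immediately, never blocks.
        *store.write().unwrap() += 1;
    }
    assert_eq!(*store.read().unwrap(), 1_000_000);
}
```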

fuyufjh removed this issue from the release-0.1.16 milestone on Jan 30, 2023
fuyufjh (Member) commented Jan 30, 2023:

Removed from the milestone. Do it later.

kwannoel (Contributor) commented Feb 8, 2023:

Will this be a priority again, since we are looking at the performance of the stream engine?
The performance dashboard runs daily, whereas these benchmarks can easily be run ad hoc to see whether certain optimizations work or not.
Additionally, we can generate flamegraphs and inspect memory and CPU cost centres.
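
On the flamegraph point: one common way to get flamegraphs out of criterion benches is the pprof crate's criterion integration. A sketch, assuming pprof (with its criterion and flamegraph features) is a dev-dependency and the bench is run with --profile-time:

```rust
use criterion::{criterion_group, criterion_main, Criterion};
use pprof::criterion::{Output, PProfProfiler};

fn bench_noop(c: &mut Criterion) {
    c.bench_function("noop", |b| b.iter(|| criterion::black_box(1 + 1)));
}

criterion_group! {
    name = benches;
    // Sample at 100 Hz; when run with `--profile-time <secs>`, each bench
    // emits a flamegraph.svg under target/criterion/<name>/profile/.
    config = Criterion::default()
        .with_profiler(PProfProfiler::new(100, Output::Flamegraph(None)));
    targets = bench_noop
}
criterion_main!(benches);
```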
