Distributed performance of Nexmark q15: 3CN equals to 1CN #11866

Closed
lmatz opened this issue Aug 24, 2023 · 3 comments

@lmatz
Contributor

lmatz commented Aug 24, 2023

The source throughput of q15 under the 1CN setting is almost the same as under the 3CN setting (in both settings the compute nodes are colocated with compactors).

1 CN: [screenshot SCR-20230824-j0p]
3 CN: [screenshot SCR-20230824-j21]

The two settings use the same total amount of CPU.

    SELECT
        TO_CHAR(date_time, 'yyyy-MM-dd') as day,
        count(*) AS total_bids,
        count(*) filter (where price < 10000) AS rank1_bids,
        count(*) filter (where price >= 10000 and price < 1000000) AS rank2_bids,
        count(*) filter (where price >= 1000000) AS rank3_bids,
        count(distinct bidder) AS total_bidders,
        count(distinct bidder) filter (where price < 10000) AS rank1_bidders,
        count(distinct bidder) filter (where price >= 10000 and price < 1000000) AS rank2_bidders,
        count(distinct bidder) filter (where price >= 1000000) AS rank3_bidders,
        count(distinct auction) AS total_auctions,
        count(distinct auction) filter (where price < 10000) AS rank1_auctions,
        count(distinct auction) filter (where price >= 10000 and price < 1000000) AS rank2_auctions,
        count(distinct auction) filter (where price >= 1000000) AS rank3_auctions
    FROM bid
    GROUP BY to_char(date_time, 'yyyy-MM-dd');

We notice that the query groups by to_char(date_time, 'yyyy-MM-dd'). It is likely that, during any given period of time, all the source input data belongs to the same day, so we are observing data skew: almost every row hashes to a single aggregation group, which explains why adding compute nodes does not help.
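One way to sanity-check the skew hypothesis is to look at the cardinality of the group key itself: if a benchmark run only covers one or two calendar days, almost every row hashes to the same aggregation group no matter how many compute nodes there are. A minimal sketch against the same bid relation:

    SELECT
        to_char(date_time, 'yyyy-MM-dd') AS day,
        count(*) AS bids
    FROM bid
    GROUP BY to_char(date_time, 'yyyy-MM-dd');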

The plan:

 StreamMaterialize { columns: [day, total_bids, rank1_bids, rank2_bids, rank3_bids, total_bidders, rank1_bidders, rank2_bidders, rank3_bidders, total_auctions, rank1_auctions, rank2_auctions, rank3_auctions], stream_key: [day], pk_columns: [day], pk_conflict: NoCheck }
 └─StreamHashAgg [append_only] { group_key: [$expr3], aggs: [count, count filter(($expr4 < 10000:Int32)), count filter(($expr4 >= 10000:Int32) AND ($expr4 < 1000000:Int32)), count filter(($expr4 >= 1000000:Int32)), count(distinct $expr5), count(distinct $expr5) filter(($expr4 < 10000:Int32)), count(distinct $expr5) filter(($expr4 >= 10000:Int32) AND ($expr4 < 1000000:Int32)), count(distinct $expr5) filter(($expr4 >= 1000000:Int32)), count(distinct $expr6), count(distinct $expr6) filter(($expr4 < 10000:Int32)), count(distinct $expr6) filter(($expr4 >= 10000:Int32) AND ($expr4 < 1000000:Int32)), count(distinct $expr6) filter(($expr4 >= 1000000:Int32))] }
   └─StreamExchange { dist: HashShard($expr3) }
     └─StreamProject { exprs: [ToChar($expr2, 'yyyy-MM-dd':Varchar) as $expr3, Field(bid, 2:Int32) as $expr4, Field(bid, 1:Int32) as $expr5, Field(bid, 0:Int32) as $expr6, _row_id] }
       └─StreamFilter { predicate: (event_type = 2:Int32) }
         └─StreamRowIdGen { row_id_index: 6 }
           └─StreamWatermarkFilter { watermark_descs: [Desc { column: $expr2, expr: ($expr2 - '00:00:05':Interval) }], output_watermarks: [$expr1, $expr2] }
             └─StreamProject { exprs: [event_type, person, auction, bid, Proctime as $expr1, Case((event_type = 0:Int32), Field(person, 6:Int32), (event_type = 1:Int32), Field(auction, 5:Int32), Field(bid, 5:Int32)) as $expr2, _row_id], output_watermarks: [$expr1] }
               └─StreamSource { source: nexmark, columns: [event_type, person, auction, bid, _row_id] }
(9 rows)

So we wonder whether two-phase aggregation can help in this case.
However, setting rw_force_two_phase_agg to true does not change the plan.
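For reference, a minimal sketch of what was tried (assuming the same bid relation as above; SET and EXPLAIN are shown here with just one distinct aggregate, which is enough to keep the plan single-phase):

    -- rw_force_two_phase_agg is the session variable mentioned above.
    SET rw_force_two_phase_agg = true;

    EXPLAIN SELECT
        to_char(date_time, 'yyyy-MM-dd') AS day,
        count(*) AS total_bids,
        count(DISTINCT bidder) AS total_bidders
    FROM bid
    GROUP BY to_char(date_time, 'yyyy-MM-dd');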

Any other ideas for improvement?

Let's also wait for the numbers of other systems.

github-actions bot added this to the release-1.2 milestone Aug 24, 2023
@BugenZhao
Member

Since there's distinct in the aggregate calls, the current two-phase optimization cannot be applied:

    // Two-phase aggregation is only eligible when every call is either
    // non-distinct or has a result unaffected by DISTINCT.
    let distinct_ok =
        matches!(call.agg_kind, agg_kinds::result_unaffected_by_distinct!())
            || !call.distinct;
    agg_kind_ok && order_ok && distinct_ok

However, we may apply the Split Distinct Aggregation optimization, which is specifically designed for these data-skew cases.
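For intuition, here is a hand-written approximation of that idea for a subset of the measures (a conceptual sketch only; the optimization rewrites the plan internally rather than requiring the query to change, and the sketch assumes bidder is non-null). The inner aggregation distributes on (day, bidder), spreading the hot day across parallel units, while the outer aggregation on day merges only one row per distinct bidder:

    SELECT
        day,
        sum(cnt)      AS total_bids,
        count(bidder) AS total_bidders  -- one row per (day, bidder) below, so this is a distinct count
    FROM (
        SELECT
            to_char(date_time, 'yyyy-MM-dd') AS day,
            bidder,
            count(*) AS cnt
        FROM bid
        GROUP BY to_char(date_time, 'yyyy-MM-dd'), bidder
    ) t
    GROUP BY day;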

@lmatz
Contributor Author

lmatz commented Aug 28, 2023

True, I forgot this option had already been added. Let me try it.

lmatz changed the title Distributed performance of Nexmark q15 → Distributed performance of Nexmark q15: 3CN equals to 1CN Aug 29, 2023
lmatz removed this from the release-1.2 milestone Sep 11, 2023
@lmatz
Contributor Author

lmatz commented Sep 14, 2023

Duplicate of #11964, closing.

lmatz closed this as not planned (duplicate) Sep 14, 2023