Restore Sink Metric Emission Behaviour: Emit them per-Sink instead of per-FireHydrant #17170

findingrish · 2024-09-26T17:12:41Z

This change #15757 to merge FireHydrants flatly for realtime queries for optimising memory usage led to metrics like query/segment/time getting emitted per-FireHydrant instead of per-Sink.

This change restores the metric emission behaviour while keeping the optimisation intact.

It introduces a new SinkMetricsEmittingQueryRunner which accumulates the FireHydrants metrics per-Sink and emits them in the end.
The emitted metrics are query/segment/time, query/segmentAndCacheTime & query/wait/time.
query/wait/time is the time taken to start processing the first FireHydrant for the sink.

Testing

Added UT to verify that each of the above metrics are emitted once per Sink.
Locally verified the changes for realtime ingestion. Sample emitted metric,

{"type":"scan","version":"32.0.0-SNAPSHOT","duration":"PT86400S","feed":"metrics","metric":"query/segment/time","hasFilters":"false","service":"druid/middleManager","segment":"kttm_2019-08-25T04:00:00.000Z_2019-08-25T05:00:00.000Z_2024-09-26T15:28:40.091Z","host":"localhost:8100","context":{"defaultTimeout":300000,"finalize":false,"maxQueuedBytes":5242880,"maxScatterGatherBytes":9223372036854775807,"queryFailTime":1727364858115,"queryId":"8dab0179-ec39-4f5f-a9e2-9039da416a5a","queryResourceId":"14c72b21-2dbb-471b-91c0-d7fbe11414b5","scanOutermost":false,"sqlOuterLimit":1001,"sqlQueryId":"8dab0179-ec39-4f5f-a9e2-9039da416a5a","sqlStringifyArrays":false,"timeout":299997},"interval":["2019-08-25T00:00:00.000Z/2019-08-26T00:00:00.000Z"],"id":"8dab0179-ec39-4f5f-a9e2-9039da416a5a","value":0,"dataSource":"kttm","timestamp":"2024-09-26T15:29:18.215Z"}

…rant

gianm

In addition to checking out the comments, please check the effect on query performance after applying this patch in a scenario where there are many (like 100) FireHydrants per Sink.

The original patch that refactored this stuff (#15757) caused a noticeable performance regression in cases where there were many FireHydrants per Sink and where individual queries were quite fast (10s of milliseconds), but made at a high rate. This was due to the additional overhead from the additional metrics. So, hopefully, reducing the number of metrics improves performance as well.

gianm · 2024-11-07T16:55:24Z

processing/src/main/java/org/apache/druid/java/util/emitter/service/ServiceMetricEvent.java

@@ -197,7 +198,7 @@ public ServiceMetricEvent build(ImmutableMap<String, String> serviceDimensions)
      return new ServiceMetricEvent(
          createdTime,
          serviceDimensions,
-          userDims,
+          new HashMap<>(userDims),


why does this need to be copied?

I see that StreamAppenderatorTest#testQueryByIntervals fails if we directly pass userDims.

My best rationale so far is that this change is only needed because the test uses StubServiceEmitter, hence the previously stored metrics don't actually get emitted anywhere, and the previously stored metrics end up getting mutated because they use the same userDims.

@findingrish Can you confirm if this is the reason, and this is done only for test purposes, and directly passing userDims should work for any real world scenarios?

Please let me know if I'm missing some other aspect of this change. Appreciate your inputs, thanks!

Can you confirm if this is the reason, and this is done only for test purposes

Yes, this was only done for test purpose and directly passing userDims should work for real world scenarios.

@findingrish Thanks for the confirmation!

It doesn't seem ideal to be copying the userDims just for test purposes. One possible option that would allow us to keep passing userDims directly while not making any compromises with the test quality is to change StubServiceEmitter.

The following can be changed to also store the userDims:

druid/processing/src/test/java/org/apache/druid/java/util/metrics/StubServiceEmitter.java

Line 41 in 7705694

private final ConcurrentHashMap<String, List<ServiceMetricEvent>> metricEvents = new ConcurrentHashMap<>();

So it can look like ConcurrentHashMap<String, List<Map<ServiceMetricEvent, Map<String, Object>>>> where the innermost Map<String, Object> represents the userDims. This would work since ServiceMetricEvent#getUserDims returns a copy of userDims:

druid/processing/src/main/java/org/apache/druid/java/util/emitter/service/ServiceMetricEvent.java

Line 95 in 7705694

return ImmutableMap.copyOf(userDims);

(Instead of ConcurrentHashMap<String, List<Map<ServiceMetricEvent, Map<String, Object>>>>, we can have a separate class to encapsulate List<Map<ServiceMetricEvent, Map<String, Object>>> for readability, but the basic idea remains the same)

@gianm @findingrish Thoughts on this approach?

Yes, let's change the test rather than do a needless copy in production code.

@gianm Thanks for the input!
Have made the change by adding a ServiceMetricEventSnapshot helper class in StubServiceEmitter.

gianm · 2024-11-07T16:58:10Z

processing/src/test/java/org/apache/druid/query/DefaultQueryMetricsTest.java

@@ -66,11 +66,11 @@ public void testDefaultQueryMetricsQuery()
        .context(ImmutableMap.of("testKey", "testValue"))
        .build();
    queryMetrics.query(query);
+    queryMetrics.sqlQueryId("dummy");
+    queryMetrics.queryId("dummy");


why did these need to move?

This is related to the userDims -> new HashMap<>(userDims) change.

This test fails if these lines aren't moved before queryMetrics.reportQueryTime(0).emit(serviceEmitter).

Reason: Previously, doing queryMetrics.queryId("dummy") after emit() was updating userDims in serviceEmitter.getEvents(), as we had the same userDims being referenced. But with new HashMap<>(userDims) change, that isn't the case anymore.

Let's wait for Rishabh's inputs on the other comment.

gianm · 2024-11-26T17:08:30Z