Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: OTel metrics & metering #1011

Open
wants to merge 2 commits into
base: jjaakola-aiven-fastapi
Choose a base branch
from

Conversation

nosahama
Copy link
Contributor

@nosahama nosahama commented Dec 19, 2024

About this change - What it does

  • we add OTel metering and HTTP metrics to the service.
  • the metrics are exported to prometheus via the opentelemetry-collector as shown below
Screenshot 2024-12-19 at 10 47 58
  • no changes to the grafana dashboard shows that the prometheus source still works as expected
Screenshot 2024-12-19 at 10 49 00

Follow up

  • update the schema_reader metrics to use the OTel meter
  • maybe deprecate statsd
  • close the prometheus PR instrumentation: fastapi prometheus middleware #1007 as we now use the OTel metering and exporting to prometheus, thus native prometheus instrumentation is not needed

@nosahama nosahama requested review from a team as code owners December 19, 2024 09:51
@nosahama nosahama force-pushed the nosahama/EC-681/otel-metrics branch 3 times, most recently from e06b32e to 7c9d0c6 Compare December 19, 2024 11:27
Copy link

github-actions bot commented Dec 19, 2024

Coverage report

Click to see where and how coverage changed

FileStatementsMissingCoverageCoverage
(new stmts)
Lines missing
  src/karapace
  config.py 32-33
  src/schema_registry
  __main__.py 21
  factory.py 21, 63
  src/schema_registry/telemetry
  container.py
  meter.py
  middleware.py
  setup.py 35-36
Project Total  

This report was generated by python-coverage-comment-action

@nosahama nosahama force-pushed the nosahama/EC-681/otel-metrics branch 2 times, most recently from 1b21afc to 944a4f1 Compare December 20, 2024 11:37
@nosahama nosahama force-pushed the nosahama/EC-681/otel-metrics branch from 944a4f1 to efe2e90 Compare January 8, 2025 14:42
@nosahama nosahama force-pushed the nosahama/EC-681/otel-metrics branch from efe2e90 to df31d0e Compare January 8, 2025 14:53
) -> Response:
resource = request.url.path.split("/")[1]
with tracer.get_tracer().start_as_current_span(name=f"{request.method}: /{resource}", kind=SpanKind.SERVER) as span:
span.add_event("Creating metering resources")
karapace_http_requests_in_progress: UpDownCounter = meter.get_meter().create_up_down_counter(
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is this creating the counter or reusing the existing one? Similar question for the duration histogram and requests counter.

tracer.update_span_with_request(request=request, span=span)
span.add_event("Calling request handler")
response: Response = await call_next(request)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If call to next fails, are the subsequent calls to tracer and meters called?

setattr(request.state, meter.START_TIME_KEY, time.monotonic())

# Extract request labels
path = request.url.path
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This will create quite large cardinality when Karapace handles large number of topics and versions.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants