Skip to content

Commit

Permalink
Update steps for setting up metrics on openshift, focusing on single … (
Browse files Browse the repository at this point in the history
#953)

* Update steps for setting up metrics on openshift, focusing on single cluster

Signed-off-by: David Martin <[email protected]>

* Address feedback

Signed-off-by: David Martin <[email protected]>

---------

Signed-off-by: David Martin <[email protected]>
  • Loading branch information
david-martin authored Oct 24, 2024
1 parent b771a77 commit 28ba551
Show file tree
Hide file tree
Showing 8 changed files with 165 additions and 98 deletions.
6 changes: 1 addition & 5 deletions config/observability/kustomization.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -12,11 +12,7 @@ resources:
# scrape config.
# See https://github.com/prometheus-operator/prometheus-operator/issues/3071#issuecomment-763746836
- prometheus/additional-scrape-configs.yaml
#https://istio.io/latest/docs/reference/config/telemetry/#MetricSelector-IstioMetric
- prometheus/monitors/service-monitor-limitador-operator.yaml
- prometheus/monitors/service-monitor-kuadrant-operator.yaml
- prometheus/monitors/service-monitor-authorino-operator.yaml
- prometheus/monitors/service-monitor-dns-operator.yaml
- prometheus/monitors/operators.yaml


patchesStrategicMerge:
Expand Down
73 changes: 73 additions & 0 deletions config/observability/prometheus/monitors/operators.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,73 @@

apiVersion: monitoring.coreos.com/v1
kind: ServiceMonitor
metadata:
labels:
control-plane: controller-manager
name: authorino-operator-metrics
namespace: kuadrant-system
spec:
endpoints:
- path: /metrics
port: metrics
scheme: http
selector:
matchLabels:
control-plane: authorino-operator
---
apiVersion: monitoring.coreos.com/v1
kind: ServiceMonitor
metadata:
labels:
control-plane: controller-manager
app.kubernetes.io/name: servicemonitor
app.kubernetes.io/instance: controller-manager-metrics-monitor
app.kubernetes.io/component: metrics
app.kubernetes.io/created-by: dns-operator
app.kubernetes.io/part-of: dns-operator
app.kubernetes.io/managed-by: kustomize
name: dns-operator-metrics-monitor
namespace: kuadrant-system
spec:
endpoints:
- path: /metrics
port: metrics
scheme: http
selector:
matchLabels:
control-plane: dns-operator-controller-manager
---
apiVersion: monitoring.coreos.com/v1
kind: ServiceMonitor
metadata:
labels:
control-plane: controller-manager
name: kuadrant-operator-metrics
namespace: kuadrant-system
spec:
endpoints:
- path: /metrics
port: metrics
scheme: http
selector:
matchLabels:
control-plane: controller-manager
app: kuadrant
---
apiVersion: monitoring.coreos.com/v1
kind: ServiceMonitor
metadata:
labels:
control-plane: controller-manager
name: limitador-operator-metrics
namespace: kuadrant-system
spec:
endpoints:
- path: /metrics
port: metrics
scheme: http
selector:
matchLabels:
control-plane: controller-manager


This file was deleted.

This file was deleted.

This file was deleted.

This file was deleted.

48 changes: 34 additions & 14 deletions doc/install/install-openshift.md
Original file line number Diff line number Diff line change
Expand Up @@ -172,36 +172,56 @@ Wait for Envoy Gateway to become available::
kubectl wait --timeout=5m -n envoy-gateway-system deployment/envoy-gateway --for=condition=Available
```

### Step 6 - Optional: Configure observability and metrics

Kuadrant provides a set of example dashboards that use known metrics exported by Kuadrant and Gateway components to provide insight into different components of your APIs and Gateways. While not essential, it is best to set up an OpenShift monitoring stack. This section provides links to OpenShift and Thanos documentation on configuring monitoring and metrics storage.

You can set up user-facing monitoring by following the steps in the OpenShift documentation on [configuring the monitoring stack](https://docs.openshift.com/container-platform/latest/observability/monitoring/configuring-the-monitoring-stack.html).

If you have user workload monitoring enabled, it is best to configure remote writes to a central storage system such as Thanos:

- [OpenShift remote write configuration](https://docs.openshift.com/container-platform/latest/observability/monitoring/configuring-the-monitoring-stack.html#configuring_remote_write_storage_configuring-the-monitoring-stack)
- [Kube Thanos](https://github.com/thanos-io/kube-thanos)
### Step 6 - Optional: Configure observability and metrics (Istio only)

Kuadrant provides a set of example dashboards that use known metrics exported by Kuadrant and Gateway components to provide insight into different components of your APIs and Gateways. While not essential, it is recommended to set these up.
First, enable [monitoring for user-defined projects](https://docs.openshift.com/container-platform/4.17/observability/monitoring/enabling-monitoring-for-user-defined-projects.html#enabling-monitoring-for-user-defined-projects_enabling-monitoring-for-user-defined-projects).
This will allow the scraping of metrics from the gateway and Kuadrant components.
The [example dashboards and alerts](https://docs.kuadrant.io/latest/kuadrant-operator/doc/observability/examples/) for observing Kuadrant functionality use low-level CPU metrics and network metrics available from the user monitoring stack in OpenShift. They also use resource state metrics from Gateway API and Kuadrant resources.

To scrape these additional metrics, you can install a `kube-state-metrics instance`, with a custom resource configuration as follows:
To scrape these additional metrics, you can install a `kube-state-metrics` instance, with a custom resource configuration as follows:

```bash
kubectl apply -f https://raw.githubusercontent.com/Kuadrant/kuadrant-operator/main/config/observability/openshift/kube-state-metrics.yaml
kubectl apply -k https://github.com/Kuadrant/gateway-api-state-metrics?ref=main
```

To enable request metrics in Istio, you must create a `telemetry` resource as follows:
To enable request metrics in Istio and scrape them, create the following resource:

```bash
kubectl apply -f https://raw.githubusercontent.com/Kuadrant/kuadrant-operator/refs/heads/main/config/observability/prometheus/monitors/istio/service-monitor-istiod.yaml
```

Some example dashboards show aggregations based on the path of requests.
By default, Istio metrics don't include labels for request paths.
However, you can enable them with the below Telemetry resource.
Note that this may lead to a [high cardinality](https://www.robustperception.io/cardinality-is-key/) label where multiple time-series are generated,
which can have an impact on performance and resource usage.

```bash
kubectl apply -f https://raw.githubusercontent.com/Kuadrant/kuadrant-operator/main/config/observability/openshift/telemetry.yaml
```

If you have Grafana installed in your cluster, you can import the [example dashboards and alerts](https://docs.kuadrant.io/latest/kuadrant-operator/doc/observability/examples).
You can configure scraping of metrics from the various Kuadrant operators with the below resources.

For example installation details, see [installing Grafana on OpenShift](https://cloud.redhat.com/experts/o11y/ocp-grafana/). When installed, you must add your Thanos instance as a data source to Grafana. Alternatively, if you are using only the user workload monitoring stack in your OpenShift cluster, and not writing metrics to an external Thanos instance, you can [set up a data source to the thanos-querier route in the OpenShift cluster](https://docs.openshift.com/container-platform/4.15/observability/monitoring/accessing-third-party-monitoring-apis.html#accessing-metrics-from-outside-cluster_accessing-monitoring-apis-by-using-the-cli).
```bash
kubectl apply -f https://raw.githubusercontent.com/Kuadrant/kuadrant-operator/refs/heads/main/config/observability/prometheus/monitors/operators.yaml
```

!!! note

There is 1 more metrics configuration that needs to be applied so that all relevant metrics are being scraped.
That configuration depends on where you deploy your Gateway later.
The steps to configure that are detailed in the follow on [Secure, protect, and connect](../user-guides/secure-protect-connect-single-multi-cluster.md) guide.

The [example Grafana dashboards and alerts](https://docs.kuadrant.io/latest/kuadrant-operator/doc/observability/examples/) for observing Kuadrant functionality use low-level CPU metrics and network metrics available from the user monitoring stack in OpenShift. They also use resource state metrics from Gateway API and Kuadrant resources.

For Grafana installation details, see [installing Grafana on OpenShift](https://cloud.redhat.com/experts/o11y/ocp-grafana/). That guide also explains how to set up a data source for the Thanos Query instance in OpenShift. For more detailed information about accessing the Thanos Query endpoint, see the [OpenShift documentation](https://docs.openshift.com/container-platform/4.17/observability/monitoring/accessing-third-party-monitoring-apis.html#accessing-metrics-from-outside-cluster_accessing-monitoring-apis-by-using-the-cli).

!!! note

For some dashboard panels to work correctly, HTTPRoutes must include a "service" and "deployment" label with a value that matches the name of the service & deployment being routed to. eg. "service=myapp, deployment=myapp".
This allows low level Istio & envoy metrics to be joined with Gateway API state metrics.

### Step 7 - Setup the catalogsource

Expand Down
57 changes: 57 additions & 0 deletions doc/user-guides/secure-protect-connect-single-multi-cluster.md
Original file line number Diff line number Diff line change
Expand Up @@ -162,6 +162,63 @@ kubectl get gateway ${gatewayName} -n ${gatewayNS} -o=jsonpath='{.status.listene

Kuadrant can help with this by using a TLSPolicy.

### Step 4a - (Optional) Configure metrics to be scraped from the Gateway instance

If you have prometheus in your cluster, set up a PodMonitor to configure it to scrape metrics directly from the Gateway pod.
This must be done in the namespace where the Gateway is running.
This configuration is required for metrics such as `istio_requests_total`.

```bash
kubectl apply -f - <<EOF
apiVersion: monitoring.coreos.com/v1
kind: PodMonitor
metadata:
name: istio-proxies-monitor
namespace: ${gatewayNS}
spec:
selector:
matchExpressions:
- key: istio-prometheus-ignore
operator: DoesNotExist
podMetricsEndpoints:
- path: /stats/prometheus
interval: 30s
relabelings:
- action: keep
sourceLabels: ["__meta_kubernetes_pod_container_name"]
regex: "istio-proxy"
- action: keep
sourceLabels:
["__meta_kubernetes_pod_annotationpresent_prometheus_io_scrape"]
- action: replace
regex: (\d+);(([A-Fa-f0-9]{1,4}::?){1,7}[A-Fa-f0-9]{1,4})
replacement: "[$2]:$1"
sourceLabels:
[
"__meta_kubernetes_pod_annotation_prometheus_io_port",
"__meta_kubernetes_pod_ip",
]
targetLabel: "__address__"
- action: replace
regex: (\d+);((([0-9]+?)(\.|$)){4})
replacement: "$2:$1"
sourceLabels:
[
"__meta_kubernetes_pod_annotation_prometheus_io_port",
"__meta_kubernetes_pod_ip",
]
targetLabel: "__address__"
- action: labeldrop
regex: "__meta_kubernetes_pod_label_(.+)"
- sourceLabels: ["__meta_kubernetes_namespace"]
action: replace
targetLabel: namespace
- sourceLabels: ["__meta_kubernetes_pod_name"]
action: replace
targetLabel: pod_name
EOF
```

### Step 5 - Secure and protect the Gateway with auth, TLS, rate limit, and DNS policies

While your Gateway is now deployed, it has no exposed endpoints and your listener is not programmed. Next, you can set up a `TLSPolicy` that leverages your CertificateIssuer to set up your listener certificates.
Expand Down

0 comments on commit 28ba551

Please sign in to comment.