
Integrate runtime updates for etcd #71

Merged
bschimke95 merged 11 commits into main from KU-530/updates-for-external-datastore on Apr 22, 2024

Conversation

bschimke95 (Contributor)

Overview

With canonical/k8s-snap#334 the snap allows configuring the external datastore at runtime, so the etcd configuration can be changed after the cluster is bootstrapped.
The snap only performs steps (restarting services, etc.) if the configuration changed. Hence, we can simply send the charm's current datastore configuration to the endpoint on every event; if there is no change, nothing happens.

Changes

  • Add UserFacingDatastoreConfig type as defined in the k8s-snap API
  • Update Datastore status type
  • Add datastore to the UpdateClusterConfigRequest
  • Add new reconcile step ensure_cluster_config
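To illustrate the flow described above, here is a minimal sketch of what the new ensure_cluster_config reconcile step could look like. Only UserFacingDatastoreConfig, UpdateClusterConfigRequest, and update_cluster_config are named in this PR; the function signature, the field names on UserFacingDatastoreConfig, and the etcd_config dictionary keys are assumptions for illustration, not the charm's actual implementation.

    # Hedged sketch: only the two imported types and update_cluster_config are
    # named in this PR; all other names and fields are illustrative assumptions.
    from charms.k8s.v0.k8sd_api_manager import (
        UpdateClusterConfigRequest,
        UserFacingDatastoreConfig,
    )

    def ensure_cluster_config(api_manager, etcd_servers: list, etcd_config: dict) -> None:
        """Send the charm's current datastore configuration to k8sd.

        k8sd only restarts services when the submitted configuration differs
        from what is already applied, so calling this on every event is a
        cheap no-op when nothing has changed.
        """
        datastore = UserFacingDatastoreConfig(
            type="external",                                # assumed field name
            servers=etcd_servers,                           # assumed field name
            ca_crt=etcd_config.get("client_ca", ""),        # assumed key
            client_crt=etcd_config.get("client_cert", ""),  # assumed key
            client_key=etcd_config.get("client_key", ""),   # matches the review snippet below
        )
        update_request = UpdateClusterConfigRequest(datastore=datastore)
        api_manager.update_cluster_config(update_request)

Because k8sd de-duplicates the configuration server-side, the charm does not need to track whether the etcd endpoints or certificates actually changed between events.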

@addyess (Contributor) left a comment:

Hopefully these suggestions help the unit tests pass.

charms/worker/k8s/lib/charms/k8s/v0/k8sd_api_manager.py (review thread; outdated, resolved)
@addyess (Contributor) left a comment:

Looking really good -- one suggested change

tests/integration/test_etcd.py (review thread; outdated, resolved)
Inline review thread on the code that submits the cluster config update:

        client_key=etcd_config.get("client_key", ""),
    )

    self.api_manager.update_cluster_config(update_request)
addyess (Contributor) commented:

I continue to encounter this error when the third etcd unit tries to post its update:

charms.k8s.v0.k8sd_api_manager.InvalidResponseError: Error status 500
	method=PUT
	endpoint=/1.0/k8sd/cluster/config
	reason=Internal Server Error
	body={"type":"error","status":"","status_code":0,"operation":"","error_code":500,"error":"failed to reconcile network: failed to refresh network: failed to refresh network component: failed to upgrade component 'network': another operation (install/upgrade/rollback) is in progress","metadata":null}

@addyess (Contributor) commented on Apr 20, 2024:

Here are the k8sd and charm logs around the same moment of this failure:
https://pastebin.canonical.com/p/njZJqx55yX/

addyess (Contributor) added:

API server logs around the same moment:
https://pastebin.canonical.com/p/QzMpFCmQCV/

Is it possible that adding extra etcd units in rapid succession caused the API server to restart when the first datastore came online, so that when we published the second datastore URL the config wouldn't take?

bschimke95 (Contributor, Author) replied:

Hey Adam,
Thanks for the review. I think the issue you were facing was before canonical/k8s-snap#356 was merged.

I cannot reproduce this issue and the CI also seems to be happy.

@bschimke95 marked this pull request as ready for review on April 22, 2024, 09:41
@bschimke95 requested a review from a team as a code owner on April 22, 2024, 09:41
@bschimke95 merged commit 653378b into main on April 22, 2024
34 checks passed
@bschimke95 deleted the KU-530/updates-for-external-datastore branch on April 22, 2024, 10:17