RSP test update 3/10/24 #6

Merged
merged 303 commits into from
Oct 3, 2024
5133211
Summit: Add LSSTCam butler directories to nublado.
mareuter Sep 6, 2024
5563bea
Update LATISS Prompt Processing to 4.4.0.
erinleighh Sep 6, 2024
058081f
Merge pull request #3666 from erinleighh/main
kfindeisen Sep 6, 2024
ba1f9bc
BTS: Update to Kafka Cycle 2.
mareuter Sep 6, 2024
a260743
Merge pull request #3667 from lsst-sqre/tickets/DM-45250-2
mareuter Sep 6, 2024
3bfe48e
Add new IMAGE_SERVER_URL environment variable for all the ScriptQueue…
tribeiro Sep 7, 2024
eed2de2
Merge pull request #3664 from lsst-sqre/tickets/DM-46156
mareuter Sep 9, 2024
c1200cf
Summit: Fix nublado LSSTCam mount.
mareuter Sep 9, 2024
c1218e4
Merge pull request #3670 from lsst-sqre/tickets/DM-46156-1
mareuter Sep 9, 2024
fec8641
Merge pull request #3669 from lsst-sqre/tickets/DM-46142
tribeiro Sep 9, 2024
2ccfbfa
Enable prompt processing for BLOCK-T17
hsinfang Aug 12, 2024
92e47ac
Merge pull request #3613 from lsst-sqre/tickets/DM-44605
hsinfang Sep 9, 2024
d194f56
TTS: Update nublado to Cycle 39.
mareuter Sep 7, 2024
695f1b7
Merge pull request #3671 from lsst-sqre/tickets/DM-46113
mareuter Sep 9, 2024
fc22c27
Update USDF TAP authentication to qserv.
ctslater Aug 15, 2024
598a5d2
Point USDF TAP int/dev to USDF qserv int. Use FQDNs.
ctslater Aug 15, 2024
602e51c
Merge pull request #3620 from lsst-sqre/tickets/DM-45807
ctslater Sep 9, 2024
d3143f4
vbecker argocd
pav511 Sep 10, 2024
dd7a694
Merge pull request #3672 from lsst-sqre/u/pav/INC0462691
pav511 Sep 10, 2024
f1ff416
Enable collection of memory statistics
afausti Sep 6, 2024
9288ad4
Add an extra tag to identify the Telegraf instance
afausti Sep 6, 2024
872d85b
Route Telegraf internal metrics to its own database
afausti Sep 6, 2024
acba3ca
Merge pull request #3668 from lsst-sqre/tickets/DM-46176
afausti Sep 10, 2024
3b1c8f5
Upgrade Butler server
dhirving Sep 11, 2024
dbc7e1d
Merge pull request #3674 from lsst-sqre/tickets/DM-46129
dhirving Sep 11, 2024
9224f18
Update dependencies
rra Sep 11, 2024
d9e60d1
Update secrets.yaml schema
rra Sep 11, 2024
f42c1c4
Merge pull request #3675 from lsst-sqre/u/rra/upgrades
rra Sep 11, 2024
72ad6b0
exposurelog: add butler secrets
sebastian-aranda Sep 10, 2024
f394510
exposurelog: add init container for secrets permission fix
sebastian-aranda Sep 11, 2024
137807b
exposurelog: add environment variable to point to `aws-credentials.in…
sebastian-aranda Sep 11, 2024
8e2c278
Merge pull request #3676 from lsst-sqre/tickets/DM-46251
sebastian-aranda Sep 12, 2024
7d3a98f
Upgrade version of ssotap to 1.18.6
stvoutsin Sep 12, 2024
35fd94b
Deploy Times Square 0.13.0
jonathansick Sep 6, 2024
ce778b0
Deploy Noteburst 0.13.0
jonathansick Sep 9, 2024
b333b95
Add timeout configurations for Times Square
jonathansick Sep 11, 2024
ada629f
Increase default Times Square timeouts
jonathansick Sep 12, 2024
3d2bd25
Change noteburst maxConcurrentJobs to 1
jonathansick Sep 12, 2024
eeabcd8
Merge pull request #3665 from lsst-sqre/tickets/DM-46137
jonathansick Sep 12, 2024
8246fdb
Enable Cruise Control
afausti Sep 3, 2024
7db8b42
Create new KafkaNodePool resource for local storage
afausti Sep 3, 2024
7d49867
Add resources requests and limits configuration
afausti Sep 3, 2024
c0e54c7
Allocate IPs from the LHN pool
afausti Sep 4, 2024
6f6e85d
Rollback to Kafka version 3.7.1 temporarily
afausti Sep 4, 2024
428c653
Start Kafka cluster rebalancing
afausti Sep 11, 2024
285ce6c
Complete migration
afausti Sep 13, 2024
7d3c94c
Remove old brokers
afausti Sep 13, 2024
c352d3f
Update LATISS Prompt Processing to 4.5.0.
erinleighh Sep 13, 2024
3600a30
Merge pull request #3658 from lsst-sqre/tickets/DM-46047
afausti Sep 13, 2024
9fbedee
Merge pull request #3678 from erinleighh/patch-1
erinleighh Sep 13, 2024
58fbf92
rubintv: update app version for summit and usdf production deployments
sebastian-aranda Sep 13, 2024
0f24f95
rubintv: add worker pods for summit production deployment
sebastian-aranda Sep 13, 2024
ab97e72
Merge pull request #3679 from lsst-sqre/tickets/DM-46321
sebastian-aranda Sep 13, 2024
ba5ea83
Summit: Update nublado to Cycle 39.
mareuter Sep 13, 2024
9810bf8
Remove the interval setting
afausti Sep 10, 2024
4c1e229
Tune configuration for high throughput
afausti Sep 10, 2024
6df4c81
Enable message compression
afausti Sep 10, 2024
d4ce565
Increase default flush interval
afausti Sep 10, 2024
b41a6e6
Enable debug mode
afausti Sep 10, 2024
11c5743
Run one connector replica
afausti Sep 11, 2024
d78783d
Merge pull request #3683 from lsst-sqre/tickets/DM-46115
mareuter Sep 16, 2024
ed049c3
chore(deps): update helm release argo-workflows to v0.42.2
renovate[bot] Sep 16, 2024
022e1f6
chore(deps): update helm release kubernetes-replicator to v2.10.2
renovate[bot] Sep 16, 2024
2131821
Merge pull request #3685 from lsst-sqre/renovate/kubernetes-replicato…
rra Sep 16, 2024
04e313e
Merge pull request #3684 from lsst-sqre/renovate/argo-workflows-0.x
rra Sep 16, 2024
3a108e4
Update vo-cutouts, add schema migration support
rra Sep 12, 2024
fa7b66a
Enable schema updates for all environments
rra Sep 16, 2024
744f781
Drop neophile support
rra Sep 16, 2024
2764c4f
Merge pull request #3680 from lsst-sqre/tickets/DM-46034
rra Sep 16, 2024
6ae518f
Merge pull request #3686 from lsst-sqre/u/rra/upgrades
rra Sep 16, 2024
da531a9
Fix spelling of updateSchema vo-cutouts setting
rra Sep 17, 2024
382214c
Merge pull request #3687 from lsst-sqre/tickets/DM-46034
rra Sep 17, 2024
c44a5aa
Increase memory for Times Square redis
jonathansick Sep 17, 2024
663b6bc
Merge pull request #3688 from lsst-sqre/u/jonathansick/ts-redis-memory
jonathansick Sep 17, 2024
0690c25
DM-45522: Enable strimzi-access-operator in idfdev
fajpunk Sep 17, 2024
902b1fd
Merge pull request #3689 from lsst-sqre/tickets/DM-45522/enable-strim…
fajpunk Sep 17, 2024
8342bd2
Enable OpenID Connect on idfprod
rra Sep 18, 2024
f69a09e
Merge pull request #3691 from lsst-sqre/tickets/DM-46380
rra Sep 18, 2024
57cb3da
Remove float support from Prompt Processing timeouts.
kfindeisen Aug 9, 2024
c368d22
Support multiple workers per Prompt Processing container.
kfindeisen Aug 9, 2024
1a7f9c3
Merge pull request #3611 from lsst-sqre/tickets/DM-45199
kfindeisen Sep 18, 2024
923ed93
Merge pull request #5 from lsst-uk/rsptest
gpfrancis Sep 18, 2024
d506dc2
afausti dev alert-stream-broker rbac
pav511 Sep 18, 2024
f73df82
Merge pull request #3692 from lsst-sqre/u/pav/alert-stream-broker-arg…
pav511 Sep 18, 2024
2a588fb
Merge pull request #3677 from lsst-sqre/tickets/DM-46074
stvoutsin Sep 18, 2024
9102bc2
Create KafkaUser for templatebot
jonathansick Sep 17, 2024
f55be49
Add Phalanx app for templatebot
jonathansick Sep 17, 2024
51a8f5d
Merge pull request #3690 from lsst-sqre/tickets/DM-43699
jonathansick Sep 18, 2024
df29eab
Stop using fullname macro in docs
rra Sep 18, 2024
39346c2
Merge pull request #3693 from lsst-sqre/tickets/DM-46396
rra Sep 18, 2024
2268b71
Merge pull request #3673 from lsst-sqre/tickets/DM-46252
afausti Sep 19, 2024
d26aff3
Add a COmanage link to environment pages
rra Sep 19, 2024
04ae69b
Merge pull request #3694 from lsst-sqre/tickets/DM-46396
rra Sep 19, 2024
bd394e7
Document restoring user with same UID/GID
rra Sep 19, 2024
a744f41
Merge pull request #3695 from lsst-sqre/tickets/DM-46396
rra Sep 19, 2024
d88d93a
Update Kafka version to 3.8.0
afausti Sep 19, 2024
5882561
Merge pull request #3696 from lsst-sqre/u/afuasti/patch-kafka-version
afausti Sep 19, 2024
172f4b6
Make sure all connector offsets are set to oldest by default
afausti Sep 9, 2024
b6fefe5
Split auxtel connector
afausti Sep 9, 2024
3a6e3ec
Split eas connector
afausti Sep 14, 2024
b232082
Enable debug for all connectors
afausti Sep 19, 2024
991b4ba
Run mtmount and m1m3 with one replica
afausti Sep 19, 2024
ea2e7f9
Merge pull request #3697 from lsst-sqre/tickets/DM-46203
afausti Sep 19, 2024
509e359
Increase readiness probe initial delay
afausti Sep 19, 2024
c71224c
Merge pull request #3698 from lsst-sqre/tickets/DM-46408
afausti Sep 19, 2024
99d61ef
Fix naming of OpenID Connect client secrets
rra Sep 20, 2024
9f0a30c
Merge pull request #3699 from lsst-sqre/tickets/DM-46380
rra Sep 20, 2024
8ed7d46
Run m1m3 and mtmount connectors with one replica
afausti Sep 20, 2024
207add1
Make Butler client/server default on IDF int/dev dp02
dhirving Sep 11, 2024
2b8c4a9
Update Telegraf image
afausti Sep 20, 2024
f0b4eb4
Enable debug logs for all connectors
afausti Sep 20, 2024
dfaf226
Merge pull request #3701 from lsst-sqre/tickets/DM-46250
afausti Sep 20, 2024
84c4b4c
cm-service: move initial deployment from prod to dev vcluster
fritzm Sep 23, 2024
a6164c8
Merge pull request #3703 from lsst-sqre/u/fritzm/cm-service-to-dev
fritzm Sep 23, 2024
e0310aa
Update Helm release argo-workflows to v0.42.3
renovate[bot] Sep 23, 2024
b81297c
Update Helm release telegraf to v1.8.54
renovate[bot] Sep 23, 2024
a492c69
Update Helm release telegraf-ds to v1.1.34
renovate[bot] Sep 23, 2024
f56eda3
Update Helm release argo-cd to v7.6.1
renovate[bot] Sep 23, 2024
1944777
Merge pull request #3707 from lsst-sqre/renovate/argo-cd-7.x
rra Sep 23, 2024
b48136c
Merge pull request #3706 from lsst-sqre/renovate/telegraf-ds-1.x
rra Sep 23, 2024
756667a
Merge pull request #3705 from lsst-sqre/renovate/telegraf-1.x
rra Sep 23, 2024
6d275f1
Merge pull request #3704 from lsst-sqre/renovate/argo-workflows-0.x
rra Sep 23, 2024
6ae010f
Update Helm release connect to v1.16.0
renovate[bot] Sep 23, 2024
e274713
Merge pull request #3708 from lsst-sqre/renovate/connect-1.x
rra Sep 23, 2024
954a47f
Merge pull request #3702 from lsst-sqre/tickets/DM-46286
dhirving Sep 23, 2024
ed68f98
Add ghostwriter
athornton Sep 3, 2024
d031bef
no slack alerts ghostwriter/idfdev
athornton Sep 3, 2024
d43325d
Add delegated scopes
athornton Sep 3, 2024
e1fbc45
Add dummy config.yaml
athornton Sep 3, 2024
c0107da
try 'notebook' style delegation
athornton Sep 3, 2024
dfdb68a
Add path for tutorial nb rewrite
athornton Sep 3, 2024
d51328a
Add top-level ingresses to ghostwriter
athornton Sep 3, 2024
7846443
Try adding a hook for ghostwriter
athornton Sep 4, 2024
4ecd1f2
Add query target for ghostwriter
athornton Sep 4, 2024
f3b772a
separate multiple ingresses
athornton Sep 4, 2024
388f8bb
Add exec:notebook to delegated portal scope, so they can hit '/querie…
athornton Sep 4, 2024
7dd7939
Add tutorial-on-demand to ghostwriter
athornton Sep 9, 2024
f0a70ee
Add system-test to ghostwriter
athornton Sep 11, 2024
dd8a560
Add generic github notebook hook
athornton Sep 12, 2024
62d4ebb
Move to Ghostwriter release version, enable in int/prod
athornton Sep 23, 2024
4f49b48
BTS: Change ack for CSC producers.
mareuter Sep 23, 2024
ea700e3
Rename secret.yaml -> secrets.yaml (ghostwriter)
athornton Sep 23, 2024
1b85b51
Merge pull request #3711 from lsst-sqre/tickets/DM-45250-3
mareuter Sep 23, 2024
f220731
Fix context deadline exceeded error
afausti Sep 23, 2024
f8960fc
Merge pull request #3712 from lsst-sqre/tickets/DM-46463
afausti Sep 23, 2024
edfa77c
Update Python and pre-commit dependencies
rra Sep 23, 2024
11e360b
Remove now-unnecessary Black configuration
rra Sep 23, 2024
3926885
Merge pull request #3713 from lsst-sqre/u/rra/upgrades
rra Sep 23, 2024
22601c3
Remove notebook delegation from portal; adopt newer ghostwriter
athornton Sep 23, 2024
4e3fd28
Merge pull request #3710 from lsst-sqre/tickets/DM-46010
athornton Sep 24, 2024
d77314b
exposurelog: move site-specific environment variables to Values.env
sebastian-aranda Sep 24, 2024
1cdcf79
exposurelog: remove `PGPASSWORD` as we are setting `PGPASSFILE`
sebastian-aranda Sep 24, 2024
34f48e9
exposurelog: update butler uri on usdfdev
sebastian-aranda Sep 24, 2024
617dcd6
exposurelog: Add secretEnv to be configured site-specific
sebastian-aranda Sep 24, 2024
590794c
exposurelog: add `PGUSER` and `PGPASSWORD` to base, tucson and summit
sebastian-aranda Sep 24, 2024
5841933
exposurelog: Update appVersion to 1.3.0
sebastian-aranda Sep 24, 2024
e974ae2
Add additional test topic for testing job based processing with keda.
dspeck1 Sep 24, 2024
d1e2601
Remove acks envvar.
mareuter Sep 24, 2024
1be9bdb
Merge pull request #3716 from lsst-sqre/tickets/DM-45250-4
mareuter Sep 24, 2024
e6c9181
Merge pull request #3715 from lsst-sqre/u/dspeck/test-topic-next-visi…
dspeck1 Sep 24, 2024
911a7a8
Merge pull request #3714 from lsst-sqre/tickets/DM-44769
sebastian-aranda Sep 24, 2024
a96f9f9
nightreport: add application to usdfdev
sebastian-aranda Sep 24, 2024
6d9ddda
nightreport: update nightreport secrets
sebastian-aranda Sep 24, 2024
daae455
Merge pull request #3718 from lsst-sqre/tickets/DM-46483
sebastian-aranda Sep 24, 2024
e5d123a
Turn off siav2 app on all IDFs
stvoutsin Sep 24, 2024
4a0b35b
usdf-cm-dev: enable gf and postgres (pav)
pav511 Aug 27, 2024
d5dd8ca
cm-service: update to 0.1.2
fritzm Sep 23, 2024
0bc9516
cm-service: add back gf ingress
fritzm Sep 23, 2024
a54e2f6
Merge pull request #3720 from lsst-sqre/tickets/DM-46466
fritzm Sep 25, 2024
bebfdf1
Update sdm_schemas to v3.2.1
JeremyMcCormick Sep 25, 2024
5cccf30
Shutdown Prompt Processing LSSTComCamSim prod
hsinfang Sep 25, 2024
b49ad69
Remove .pgpass support from Prompt Processing.
kfindeisen Sep 25, 2024
f615252
Merge pull request #3723 from lsst-sqre/tickets/DM-46487
hsinfang Sep 25, 2024
cf98825
Deploy the Telegraf-based connectors at TTS
afausti Sep 24, 2024
b6e1aa2
Remove LSSTCam
afausti Sep 24, 2024
d4a0494
Add ComCam
afausti Sep 24, 2024
8d20065
Add the calibration systems back
afausti Sep 24, 2024
7e0d2e1
Merge pull request #3724 from lsst-sqre/tickets/DM-46494
kfindeisen Sep 25, 2024
1a3d1dd
Merge pull request #3717 from lsst-sqre/tickets/DM-46485
afausti Sep 25, 2024
0245908
Add lsst.obsenv namespace for telescope environments
afausti Sep 24, 2024
8a0eb43
Replicate lsst.obsenv topics to USDF
afausti Sep 25, 2024
2cdc548
Merge pull request #3725 from lsst-sqre/tickets/DM-46462
afausti Sep 25, 2024
cd153ba
Add lsst.cp namespace for summit environment
afausti Sep 26, 2024
3d7cffb
Enable replication of lsst.cp topics to USDF
afausti Sep 26, 2024
d851003
rubintv: update app version for summit and usdf production deployments
sebastian-aranda Sep 26, 2024
e35bd6c
Merge pull request #3728 from lsst-sqre/tickets/DM-46516
sebastian-aranda Sep 26, 2024
9020ded
Merge pull request #3721 from lsst-sqre/sdm-schemas-3.2.1
JeremyMcCormick Sep 26, 2024
9d90320
Merge pull request #3726 from lsst-sqre/tickets/DM-46512
afausti Sep 26, 2024
a694855
Send production-run alerts to a separate alert topic
hsinfang Sep 26, 2024
434301b
Merge pull request #3719 from lsst-sqre/tickets/DM-46493
stvoutsin Sep 26, 2024
1d251de
Merge pull request #3729 from lsst-sqre/tickets/DM-46519
hsinfang Sep 26, 2024
3a2a96b
Deploy Squarebot with interaction support
jonathansick Sep 23, 2024
398e88a
Add block actions topic config for Squarebot
jonathansick Sep 23, 2024
2d5c872
Add block-actions Kafka topic
jonathansick Sep 24, 2024
4e83820
Configure the block actions topic for templatebot
jonathansick Sep 24, 2024
d5a10d8
Add Slack view submission topic for squarebot
jonathansick Sep 25, 2024
4f21b85
Update to Squarebot 0.10.0
jonathansick Sep 26, 2024
9e2c919
Merge pull request #3709 from lsst-sqre/tickets/DM-46427
jonathansick Sep 26, 2024
195a440
nightreport: fix summit site_id
sebastian-aranda Sep 27, 2024
0bb33e3
Deploy Prompt Processing 4.5.1 for LATISS.
kfindeisen Sep 27, 2024
a526dbe
Document block IDs for LATISS Prompt Processing.
kfindeisen Sep 27, 2024
630e649
Merge pull request #3734 from lsst-sqre/tickets/DM-46540
sebastian-aranda Sep 27, 2024
9aeed3a
Merge pull request #3735 from lsst-sqre/u/kfindeisen/deploy-4.5.1
kfindeisen Sep 27, 2024
4152d49
chore(deps): update confluentinc/cp-kafka-rest docker tag to v7.7.1
renovate[bot] Sep 16, 2024
69f063b
Merge pull request #3681 from lsst-sqre/renovate/confluentinc-cp-kafk…
afausti Sep 27, 2024
0ed022a
Change metric_batch_size for USDF M1M3 telegraf connector.
mareuter Sep 27, 2024
57d1b01
Merge pull request #3736 from lsst-sqre/u/mareuter/usdf_m1m3_telegraf…
mareuter Sep 27, 2024
562ebc4
Add a named template for generating the Telegraf configmap
afausti Sep 27, 2024
fab2946
Merge pull request #3738 from lsst-sqre/tickets/DM-46550
afausti Sep 28, 2024
ff7d4f9
Add ConsDb to Tucson TestStand environment
Vebop Sep 30, 2024
53be775
Add configuration for ConsDb on TTS
Vebop Sep 30, 2024
9328a5c
Add secrets update for ConsDb
Vebop Sep 30, 2024
c3e6e35
Add secrets update for ConsDb
Vebop Sep 30, 2024
e3052ea
Update Kafka to 3.7, add kraft, and remove zookeeper
bsmartradio Sep 16, 2024
2d5a5e0
Add LATISS topic
bsmartradio Sep 26, 2024
e72e772
Merge pull request #3733 from lsst-sqre/tickets/DM-46236
bsmartradio Sep 30, 2024
e7c65f8
Merge pull request #3741 from lsst-sqre/tickets/dm-46557
Vebop Sep 30, 2024
e3491f4
Summit: Update nublado mounts for LSSTComCam.
mareuter Sep 24, 2024
d6be52e
Merge pull request #3727 from lsst-sqre/tickets/DM-46488
mareuter Oct 1, 2024
cc608fe
Fix ComCam mounts in UWS.
mareuter Oct 1, 2024
e77519d
Merge pull request #3744 from lsst-sqre/tickets/DM-46488-1
mareuter Oct 1, 2024
3c2bbdf
Bump nublado version
athornton Oct 1, 2024
72cb2ae
rapid-analysis: add application to summit environment
sebastian-aranda Oct 1, 2024
fa9606f
Merge pull request #3745 from lsst-sqre/tickets/DM-46587
sebastian-aranda Oct 1, 2024
5fd65f1
Merge pull request #3746 from lsst-sqre/u/ajt/nublado720
athornton Oct 1, 2024
7db5ced
Makes metric_batch_size=1000 default everywhere
afausti Oct 1, 2024
c54221a
Use image with the race condition fix everywhere
afausti Oct 1, 2024
4f201c1
rapid-analysis: slight typo fix for AWS credentials that comes from r…
sebastian-aranda Oct 1, 2024
9b10a5d
Merge pull request #3747 from lsst-sqre/tickets/DM-46587
sebastian-aranda Oct 2, 2024
8d6b15d
exposurelog: update ComCam nfs mounts
sebastian-aranda Oct 2, 2024
b77bd8f
rubintv: add application to TTS
sebastian-aranda Oct 2, 2024
fe5c78e
Merge pull request #3749 from lsst-sqre/tickets/DM-46577
sebastian-aranda Oct 2, 2024
a6ed075
rapid-analysis: add application to TTS
sebastian-aranda Oct 2, 2024
493db90
Merge pull request #3750 from lsst-sqre/tickets/DM-46597
sebastian-aranda Oct 2, 2024
35ba222
Merge eas and auxtel connectors back
afausti Oct 1, 2024
3c8b6ff
Merge branch 'main' into tickets/DM-46595
afausti Oct 2, 2024
1a1f1e9
Merge pull request #3751 from lsst-sqre/tickets/DM-46595
afausti Oct 2, 2024
0db6769
Merge pull request #3748 from lsst-sqre/tickets/DM-46596
sebastian-aranda Oct 2, 2024
1edc467
gmegias usdf-rsp-dev argocd
pav511 Oct 2, 2024
f7e5742
Merge pull request #3753 from lsst-sqre/u/pav/INC0467144
pav511 Oct 2, 2024
b17071a
salnikov usdf-rsp-dev argocd
pav511 Oct 2, 2024
83e87c5
Merge pull request #3754 from lsst-sqre/u/pav/INC0467032
pav511 Oct 2, 2024
8085525
Add initial version of ppdb-replication application
JeremyMcCormick Sep 26, 2024
39a44dd
Merge pull request #3722 from lsst-sqre/tickets/DM-46257
JeremyMcCormick Oct 2, 2024
4a75b88
DM-45522 sasquatch: App metrics events in Sasquatch
fajpunk Sep 30, 2024
0946a80
Merge pull request #3737 from lsst-sqre/tickets/DM-45522/sasquatch2
fajpunk Oct 2, 2024
8624841
Adjust telegraf resources on TTS
afausti Oct 2, 2024
f21231b
Adjust configuration for EAS at TTS
afausti Oct 2, 2024
387a58f
Merge pull request #3756 from lsst-sqre/u/afausti/telegraf-resources
afausti Oct 3, 2024
eb97619
Merge branch 'lsst-sqre:main' into main
gpfrancis Oct 3, 2024
35 changes: 0 additions & 35 deletions .github/workflows/dependencies.yaml

This file was deleted.

4 changes: 2 additions & 2 deletions .pre-commit-config.yaml
@@ -14,7 +14,7 @@ repos:
  - -c=.yamllint.yml

  - repo: https://github.com/python-jsonschema/check-jsonschema
- rev: 0.29.1
+ rev: 0.29.2
  hooks:
  - id: check-jsonschema
  files: ^applications/.*/secrets(-[^./-]+)?\.yaml
@@ -46,7 +46,7 @@ repos:
  - --template-files=../helm-docs.md.gotmpl

  - repo: https://github.com/astral-sh/ruff-pre-commit
- rev: v0.5.7
+ rev: v0.6.7
  hooks:
  - id: ruff
  args: [--fix, --exit-non-zero-on-fix]
16 changes: 8 additions & 8 deletions Makefile
@@ -41,20 +41,20 @@ update-deps:
  pip install --upgrade pip uv
  uv pip install --upgrade pre-commit
  pre-commit autoupdate
- uv pip compile --upgrade --generate-hashes \
- --output-file requirements/main.txt requirements/main.in
- uv pip compile --upgrade --generate-hashes \
+ uv pip compile --upgrade --universal --generate-hashes \
+ --output-file requirements/main.txt pyproject.toml
+ uv pip compile --upgrade --universal --generate-hashes \
  --output-file requirements/dev.txt requirements/dev.in
- uv pip compile --upgrade --generate-hashes \
+ uv pip compile --upgrade --universal --generate-hashes \
  --output-file requirements/tox.txt requirements/tox.in

  # Useful for testing against a Git version of Safir.
  .PHONY: update-deps-no-hashes
  update-deps-no-hashes:
  pip install --upgrade uv
- uv pip compile --upgrade \
- --output-file requirements/main.txt requirements/main.in
- uv pip compile --upgrade \
+ uv pip compile --upgrade --universal \
+ --output-file requirements/main.txt pyproject.toml
+ uv pip compile --upgrade --universal \
  --output-file requirements/dev.txt requirements/dev.in
- uv pip compile --upgrade \
+ uv pip compile --upgrade --universal \
  --output-file requirements/tox.txt requirements/tox.in
9 changes: 6 additions & 3 deletions applications/alert-stream-broker/README.md
@@ -72,12 +72,17 @@ Alert transmission to community brokers
| alert-stream-broker.kafka.storage.size | string | `"1000Gi"` | Size of the backing storage disk for each of the Kafka brokers. |
| alert-stream-broker.kafka.storage.storageClassName | string | `"standard"` | Name of a StorageClass to use when requesting persistent volumes. |
| alert-stream-broker.kafka.version | string | `"3.4.0"` | Version of Kafka to deploy. |
| alert-stream-broker.kafkaController.enabled | bool | `false` | Enable Kafka Controller |
| alert-stream-broker.kafkaController.resources | object | See `values.yaml` | Kubernetes requests and limits for the Kafka Controller |
| alert-stream-broker.kafkaController.storage.size | string | `"20Gi"` | Size of the backing storage disk for each of the Kafka controllers |
| alert-stream-broker.kafkaController.storage.storageClassName | string | `""` | Name of a StorageClass to use when requesting persistent volumes |
| alert-stream-broker.kafkaExporter | object | `{"enableSaramaLogging":false,"enabled":false,"groupRegex":".*","logLevel":"warning","topicRegex":".*"}` | Kafka JMX Exporter for more detailed diagnostic metrics. |
| alert-stream-broker.kafkaExporter.enableSaramaLogging | bool | `false` | Enable Sarama logging |
| alert-stream-broker.kafkaExporter.enabled | bool | `false` | Enable Kafka exporter. |
| alert-stream-broker.kafkaExporter.groupRegex | string | `".*"` | Consumer groups to monitor |
| alert-stream-broker.kafkaExporter.logLevel | string | `"warning"` | Log level for Sarama logging |
| alert-stream-broker.kafkaExporter.topicRegex | string | `".*"` | Kafka topics to monitor |
| alert-stream-broker.kraft | bool | `true` | |
| alert-stream-broker.maxBytesRetained | string | `"100000000000"` | Maximum number of bytes for the replay topic, per partition, per replica. Default is 100GB, but should be lower to not fill storage. |
| alert-stream-broker.maxMillisecondsRetained | string | `"5259492000"` | Maximum amount of time to save alerts in the replay topic, in milliseconds. The default, 5259492000, is about 61 days (7 days would be 604800000). |
| alert-stream-broker.nameOverride | string | `""` | |
@@ -95,10 +100,8 @@ Alert transmission to community brokers
| alert-stream-broker.users[0].readonlyTopics | list | `["alert-stream","alerts-simulated","alert-stream-test"]` | A list of topics that the user should get read-only access to. |
| alert-stream-broker.users[0].username | string | `"rubin-testing"` | The username for the user that should be created. |
| alert-stream-broker.vaultSecretsPath | string | `""` | Path to the secret resource in Vault |
| alert-stream-broker.zookeeper.replicas | int | `3` | Number of Zookeeper replicas to run. |
| alert-stream-broker.zookeeper.storage.size | string | `"1000Gi"` | Size of the backing storage disk for each of the Zookeeper instances. |
| alert-stream-broker.zookeeper.storage.storageClassName | string | `"standard"` | Name of a StorageClass to use when requesting persistent volumes. |
| alert-stream-schema-registry.clusterName | string | `"alert-broker"` | Strimzi "cluster name" of the broker to use as a backend. |
| alert-stream-schema-registry.compatibilityLevel | string | `"None"` | |
| alert-stream-schema-registry.hostname | string | `"usdf-alert-schemas-dev.slac.stanford.edu"` | Hostname for an ingress which sends traffic to the Schema Registry. |
| alert-stream-schema-registry.name | string | `"alert-schema-registry"` | Name used by the registry, and by its users. |
| alert-stream-schema-registry.port | int | `8081` | Port where the registry is listening. NOTE: Not actually configurable in strimzi-registry-operator, so this basically cannot be changed. |
@@ -29,12 +29,17 @@ Kafka broker cluster for distributing alerts
| kafka.storage.size | string | `"1000Gi"` | Size of the backing storage disk for each of the Kafka brokers. |
| kafka.storage.storageClassName | string | `"standard"` | Name of a StorageClass to use when requesting persistent volumes. |
| kafka.version | string | `"3.4.0"` | Version of Kafka to deploy. |
| kafkaController.enabled | bool | `false` | Enable Kafka Controller |
| kafkaController.resources | object | See `values.yaml` | Kubernetes requests and limits for the Kafka Controller |
| kafkaController.storage.size | string | `"20Gi"` | Size of the backing storage disk for each of the Kafka controllers |
| kafkaController.storage.storageClassName | string | `""` | Name of a StorageClass to use when requesting persistent volumes |
| kafkaExporter | object | `{"enableSaramaLogging":false,"enabled":false,"groupRegex":".*","logLevel":"warning","topicRegex":".*"}` | Kafka JMX Exporter for more detailed diagnostic metrics. |
| kafkaExporter.enableSaramaLogging | bool | `false` | Enable Sarama logging |
| kafkaExporter.enabled | bool | `false` | Enable Kafka exporter. |
| kafkaExporter.groupRegex | string | `".*"` | Consumer groups to monitor |
| kafkaExporter.logLevel | string | `"warning"` | Log level for Sarama logging |
| kafkaExporter.topicRegex | string | `".*"` | Kafka topics to monitor |
| kraft | bool | `true` | |
| maxBytesRetained | string | `"100000000000"` | Maximum number of bytes for the replay topic, per partition, per replica. Default is 100GB, but should be lower to not fill storage. |
| maxMillisecondsRetained | string | `"5259492000"` | Maximum amount of time to save alerts in the replay topic, in milliseconds. The default, 5259492000, is about 61 days (7 days would be 604800000). |
| nameOverride | string | `""` | |
@@ -52,6 +57,3 @@ Kafka broker cluster for distributing alerts
| users[0].readonlyTopics | list | `["alert-stream","alerts-simulated","alert-stream-test"]` | A list of topics that the user should get read-only access to. |
| users[0].username | string | `"rubin-testing"` | The username for the user that should be created. |
| vaultSecretsPath | string | `""` | Path to the secret resource in Vault |
| zookeeper.replicas | int | `3` | Number of Zookeeper replicas to run. |
| zookeeper.storage.size | string | `"1000Gi"` | Size of the backing storage disk for each of the Zookeeper instances. |
| zookeeper.storage.storageClassName | string | `"standard"` | Name of a StorageClass to use when requesting persistent volumes. |
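
As a sanity check on the two retention defaults in the table above, the following sketch converts them to human-readable units. It is plain arithmetic in Python and not part of the chart; the helper names `ms_to_days` and `bytes_to_gb` are hypothetical.

```python
"""Convert the chart's retention defaults into human-readable units."""

MS_PER_DAY = 24 * 60 * 60 * 1000  # 86,400,000 milliseconds in one day


def ms_to_days(milliseconds: int) -> float:
    """Return the number of days represented by a millisecond count."""
    return milliseconds / MS_PER_DAY


def bytes_to_gb(num_bytes: int) -> float:
    """Return decimal gigabytes (10**9 bytes) for a byte count."""
    return num_bytes / 10**9


# Defaults copied from the values table, where they are stored as strings.
max_milliseconds_retained = int("5259492000")
max_bytes_retained = int("100000000000")

print(f"{ms_to_days(max_milliseconds_retained):.2f} days")  # 60.87 days
print(f"{bytes_to_gb(max_bytes_retained):.0f} GB per partition, per replica")  # 100 GB
print(f"{ms_to_days(604800000):.0f} days")  # 7 days, the figure quoted in the description
```

The byte limit matches its description (100 GB), while the millisecond default works out to roughly two months rather than the 7 days mentioned alongside it.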
@@ -26,6 +26,7 @@ spec:
cleanup.policy: "delete"
retention.ms: {{ .Values.maxMillisecondsRetained }} # 7 days
retention.bytes: {{ .Values.maxBytesRetained }}
compression.type: {{ .Values.topicCompression}}
# The default timestamp is the creation time of the alert.
# To get the ingestion rate, we need this to be the log
# append time, and the header will contain the producer
@@ -45,6 +46,7 @@ spec:
cleanup.policy: "delete"
retention.ms: {{ .Values.maxMillisecondsRetained }} # 7 days
retention.bytes: {{ .Values.maxBytesRetained }}
compression.type: {{ .Values.topicCompression}}
# The default timestamp is the creation time of the alert.
# To get the ingestion rate, we need this to be the log
# append time, and the header will contain the producer
@@ -64,11 +66,31 @@ spec:
cleanup.policy: "delete"
retention.ms: {{ .Values.maxMillisecondsRetained }} # 7 days
retention.bytes: {{ .Values.maxBytesRetained }}
compression.type: {{ .Values.devTopicCompression}}
compression.type: {{ .Values.topicCompression}}
# The default timestamp is the creation time of the alert.
# To get the ingestion rate, we need this to be the log
# append time, and the header will contain the producer
# timestamp instead
message.timestamp.type: 'LogAppendTime'
partitions: {{ .Values.devTopicPartitions }}
replicas: {{ .Values.devTopicReplicas }}
replicas: {{ .Values.devTopicReplicas }}
---
apiVersion: "kafka.strimzi.io/{{ .Values.strimziAPIVersion }}"
kind: KafkaTopic
metadata:
labels:
strimzi.io/cluster: "{{ .Values.clusterName }}"
name: "{{ .Values.latissTopicName}}"
spec:
config:
cleanup.policy: "delete"
retention.ms: {{ .Values.maxMillisecondsRetained }} # 7 days
retention.bytes: {{ .Values.maxBytesRetained }}
compression.type: {{ .Values.topicCompression}}
# The default timestamp is the creation time of the alert.
# To get the ingestion rate, we need this to be the log
# append time, and the header will contain the producer
# timestamp instead
message.timestamp.type: 'LogAppendTime'
partitions: {{ .Values.latissTopicPartitions }}
replicas: {{ .Values.latissTopicReplicas }}
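
The new LATISS topic follows the same pattern as the other topics in this template. Below is a minimal sketch, in Python, of the manifest the template would render. Every entry in `VALUES` is a hypothetical placeholder chosen for illustration (topic name, cluster name, partition and replica counts, compression type, API version), not the chart's actual defaults, and `render_kafka_topic` is an invented helper, not part of the chart.

```python
import json

# Hypothetical values for illustration only; the chart's real values may differ.
VALUES = {
    "strimziAPIVersion": "v1beta2",
    "clusterName": "alert-broker",
    "maxMillisecondsRetained": "5259492000",
    "maxBytesRetained": "100000000000",
    "topicCompression": "zstd",
    "latissTopicName": "alerts-latiss-example",
    "latissTopicPartitions": 2,
    "latissTopicReplicas": 3,
}


def render_kafka_topic(values: dict, prefix: str) -> dict:
    """Build a KafkaTopic manifest for the topic whose value keys start with prefix."""
    return {
        "apiVersion": f"kafka.strimzi.io/{values['strimziAPIVersion']}",
        "kind": "KafkaTopic",
        "metadata": {
            "labels": {"strimzi.io/cluster": values["clusterName"]},
            "name": values[f"{prefix}TopicName"],
        },
        "spec": {
            "config": {
                "cleanup.policy": "delete",
                "retention.ms": values["maxMillisecondsRetained"],
                "retention.bytes": values["maxBytesRetained"],
                "compression.type": values["topicCompression"],
                # Record the broker's append time so the ingestion rate can be
                # measured; the producer timestamp travels in the header instead.
                "message.timestamp.type": "LogAppendTime",
            },
            "partitions": values[f"{prefix}TopicPartitions"],
            "replicas": values[f"{prefix}TopicReplicas"],
        },
    }


latiss_topic = render_kafka_topic(VALUES, "latiss")
print(json.dumps(latiss_topic, indent=2))
```

Because only the value-key prefix changes between topics, the same helper could stand in for the dev topic by passing `"dev"` once matching `devTopic*` keys are supplied.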
@@ -1,7 +1,60 @@
apiVersion: kafka.strimzi.io/v1beta2
kind: KafkaNodePool
metadata:
name: controller
labels:
strimzi.io/cluster: {{ .Values.cluster.name }}
spec:
replicas: {{ .Values.kafka.replicas }}
roles:
- controller
storage:
type: jbod
volumes:
- id: 0
type: persistent-claim
size: {{ .Values.kafkaController.storage.size }}
class: {{ .Values.kafkaController.storage.storageClassName }}
deleteClaim: false
{{- with .Values.kafkaController.resources }}
resources:
{{- toYaml . | nindent 6 }}
{{- end }}
---
apiVersion: kafka.strimzi.io/v1beta2
kind: KafkaNodePool
metadata:
name: kafka
labels:
strimzi.io/cluster: {{ .Values.cluster.name }}
annotations:
strimzi.io/next-node-ids: "[0-99]"
spec:
replicas: {{ .Values.kafka.replicas }}
roles:
- broker
storage:
type: jbod
volumes:
- id: 0
type: persistent-claim
size: {{ .Values.kafka.storage.size }}
{{- if .Values.kafka.storage.storageClassName }}
class: {{ .Values.kafka.storage.storageClassName }}
{{- end}}
deleteClaim: false
{{- with .Values.kafka.resources }}
resources:
{{- toYaml . | nindent 6 }}
{{- end }}
---
apiVersion: kafka.strimzi.io/{{ .Values.strimziAPIVersion }}
kind: Kafka
metadata:
name: {{ .Values.cluster.name }}
annotations:
strimzi.io/kraft: enabled
strimzi.io/node-pools: enabled
spec:
{{- if .Values.kafkaExporter.enabled }}
kafkaExporter:
@@ -85,14 +138,15 @@ spec:

{{- if .Values.kafka.externalListener.brokers }}
brokers:
{{- range $idx, $broker := .Values.kafka.externalListener.brokers }}
- broker: {{ $idx }}
{{- range $broker := .Values.kafka.externalListener.brokers }}
- broker: {{ $broker.broker }}
loadBalancerIP: {{ $broker.ip }}
advertisedHost: {{ $broker.host }}
annotations: {{ toYaml $broker.annotations | nindent 16 }}
{{- end }}
advertisedPort: 9094
annotations:
annotations: {{ toYaml $broker.annotations | nindent 16 }}
{{- end }}
{{- end }}

{{- if and (.Values.kafka.externalListener.tls.enabled) (.Values.kafka.externalListener.bootstrap.host) }}
brokerCertChainAndKey:
secretName: {{ .Values.cluster.name }}-external-tls
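
The updated loop reads the broker ID from each list entry (`$broker.broker`) instead of from the list index, so entries under `kafka.externalListener.brokers` presumably need an explicit `broker` field. A sketch of what such values might look like follows; the IDs, addresses, hostnames, and annotation are placeholders, not values from this repository.

```yaml
# Hypothetical values consumed by the brokers loop; all concrete values are placeholders.
kafka:
  externalListener:
    brokers:
      - broker: 6                          # hypothetical node ID, read as $broker.broker
        ip: 192.0.2.10                     # documentation-range address, read as $broker.ip
        host: kafka-6.example.org          # placeholder, read as $broker.host
        annotations:
          example.org/placeholder: "true"  # placeholder annotation
      - broker: 7
        ip: 192.0.2.11
        host: kafka-7.example.org
        annotations:
          example.org/placeholder: "true"
```

Explicit IDs decouple the listener overrides from list order, which may matter once node IDs are assigned through a node pool rather than starting at 0.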
@@ -114,8 +168,6 @@ spec:
transaction.state.log.replication.factor: 3
transaction.state.log.min.isr: 2
message.max.bytes: 4194304 # 4 MiB (4 * 1024 * 1024 bytes). For testing purposes only.
log.message.format.version: {{ .Values.kafka.logMessageFormatVersion }}
inter.broker.protocol.version: {{ .Values.kafka.interBrokerProtocolVersion }}
ssl.client.auth: required
{{- range $key, $value := .Values.kafka.config }}
{{ $key }}: {{ $value }}
@@ -133,53 +185,6 @@ spec:
class: {{ .Values.kafka.storage.storageClassName }}
deleteClaim: false

template:
pod:
{{- if .Values.kafka.nodePool.tolerations }}
tolerations:
{{- range $tol := .Values.kafka.nodePool.tolerations }}
- key: {{ $tol.key }}
operator: "Equal"
value: {{ $tol.value }}
effect: {{ $tol.effect }}
{{- end }}
{{- end }}

{{- if .Values.kafka.nodePool.affinities }}
affinity:
nodeAffinity:
preferredDuringSchedulingIgnoredDuringExecution:
{{- range $affinity := .Values.kafka.nodePool.affinities }}
- weight: 1
preference:
matchExpressions:
- key: {{ $affinity.key }}
operator: In
values: [{{ $affinity.value }}]
{{- end }}
{{- end }}

affinity:
podAntiAffinity:
requiredDuringSchedulingIgnoredDuringExecution:
- labelSelector:
matchExpressions:
- key: "app.kubernetes.io/name"
operator: In
values:
- kafka
topologyKey: "kubernetes.io/hostname"

zookeeper:
replicas: {{ .Values.zookeeper.replicas }}
storage:
# Note that storage is configured per replica. If there are 3 replicas,
# each will get its own PersistentVolumeClaim for the configured size.
type: persistent-claim
size: {{ .Values.zookeeper.storage.size }}
class: {{ .Values.zookeeper.storage.storageClassName }}
deleteClaim: false

template:
pod:
{{- if .Values.kafka.nodePool.tolerations }}
@@ -121,17 +121,6 @@ users:
# matches.
groups: ["rubin-testing"]


zookeeper:
# -- Number of Zookeeper replicas to run.
replicas: 3

storage:
# -- Size of the backing storage disk for each of the Zookeeper instances.
size: 1000Gi
# -- Name of a StorageClass to use when requesting persistent volumes.
storageClassName: standard

tls:
subject:
# -- Organization to use in the 'Subject' field of the broker's TLS certificate.
@@ -149,6 +138,29 @@ fullnameOverride: ""

nameOverride: ""

kraft: true

kafkaController:
# -- Enable Kafka Controller
enabled: false

storage:
# -- Size of the backing storage disk for each of the Kafka controllers
size: 20Gi

# -- Name of a StorageClass to use when requesting persistent volumes
storageClassName: ""

# -- Kubernetes requests and limits for the Kafka Controller
# @default -- See `values.yaml`
resources:
requests:
memory: 32Gi
cpu: "4"
limits:
memory: 64Gi
cpu: "8"

# -- Topic used to send test alerts.
testTopicName: alert-stream-test
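
As a sketch of how an environment might override the new `kafkaController` defaults, the fragment below uses only the keys introduced in this hunk. The file it would live in, the storage class name, and the resource figures are assumptions chosen for illustration, not settings from this repository.

```yaml
# Hypothetical per-environment override; values are illustrative assumptions.
kafkaController:
  enabled: true
  storage:
    size: 20Gi                    # same as the chart default shown above
    storageClassName: standard    # assumed StorageClass name
  resources:
    requests:
      memory: 4Gi                 # hypothetical, smaller than the defaults for a test cluster
      cpu: "1"
    limits:
      memory: 8Gi
      cpu: "2"
```

Because the default `storageClassName` is an empty string, an environment would likely want to set it explicitly.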

@@ -7,6 +7,7 @@ Confluent Schema Registry for managing schema versions for the Alert Stream
| Key | Type | Default | Description |
|-----|------|---------|-------------|
| clusterName | string | `"alert-broker"` | Strimzi "cluster name" of the broker to use as a backend. |
| compatibilityLevel | string | `"None"` | |
| hostname | string | `"usdf-alert-schemas-dev.slac.stanford.edu"` | Hostname for an ingress which sends traffic to the Schema Registry. |
| name | string | `"alert-schema-registry"` | Name used by the registry, and by its users. |
| port | int | `8081` | Port where the registry is listening. NOTE: Not actually configurable in strimzi-registry-operator, so this basically cannot be changed. |
@@ -9,4 +9,5 @@ metadata:
revision: "1"
spec:
strimzi-version: {{ .Values.strimziAPIVersion }}
listener: internal
listener: internal
compatibilityLevel: none