
Add primary key to schema_history #13611

Open
wonko opened this issue Sep 17, 2024 · 3 comments
Labels
area/offloading, area/workflow-archive, good first issue, solution/suggested, type/feature

Comments

wonko commented Sep 17, 2024

Summary

It would be good to have a primary key in the creation statement for the schema_history table.

Use Cases

DB replication and integrity-checking mechanisms sometimes rely on the ability to uniquely identify a single row, which is exactly what primary keys provide.

I bumped into this while using a Percona XtraDB database deployed with the Percona Operator (not an uncommon setup: I already had it running in the cluster for another database, so I reused it for the Argo database). With the default configuration, database initialization fails on this table:

Failed to init db: Error 1105 (HY000): Percona-XtraDB-Cluster prohibits use of DML command on a table (argo.schema_history) without an explicit primary key with pxc_strict_mode = ENFORCING or MASTER

While it might feel like this adds little value to the argo-workflows project itself, it is a minor change that won't affect anything else and will make Argo easier to deploy for some users. I could open a PR if this would be accepted.

The changes would go in migrate.go: the create table statement would get a primary key (schema_version), and an additional migration line in the big migration set would run an alter-table-add-primary-key statement to add the key to an already existing table.
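
For illustration, here is a minimal sketch of the two statements this would involve, assuming schema_history has a single schema_version column; the constant names and the package clause below are hypothetical, not taken from migrate.go:

    package sqldb // illustrative package name only

    // Fresh installs: the create statement carries the key directly.
    // The exact column definition here is an assumption.
    const createSchemaHistoryWithPK = `create table if not exists schema_history (
        schema_version int not null,
        primary key (schema_version)
    )`

    // Existing installs: a follow-up migration adds the key. The same statement
    // is accepted by both MySQL and PostgreSQL.
    const addSchemaHistoryPK = `alter table schema_history add primary key (schema_version)`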


Message from the maintainers:

Love this feature request? Give it a 👍. We prioritise the proposals with the most 👍.

wonko added the type/feature label Sep 17, 2024
agilgur5 added the area/workflow-archive, area/offloading, and solution/suggested labels Sep 17, 2024
MasonM added the good first issue label Dec 15, 2024
MasonM (Contributor) commented Dec 15, 2024

Notes for anyone wanting to work on this:

  1. This will involve adding a small migration here (see the sketch after this list):

         // PostgreSQL only: convert argo_archived_workflows.workflow column to JSONB for performance and consistency with MySQL. #13779
         ternary(dbType == MySQL,
             noop{},
             ansiSQLChange(`alter table argo_archived_workflows alter column workflow set data type jsonb using workflow::jsonb`),
         ),

  2. Make sure to test with both PostgreSQL and MySQL, which you can do with the go run ./hack/db migrate tool: https://argo-workflows.readthedocs.io/en/latest/running-locally/#database-tooling
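
Following that pattern, the new entry might be as simple as a single ansiSQLChange, since the same alter statement is accepted by both MySQL and PostgreSQL, so no ternary on dbType should be needed. This is a sketch only; the exact placement in the migration list and the schema_version column name are assumptions:

    // Add an explicit primary key to schema_history so databases such as
    // Percona XtraDB with pxc_strict_mode enabled will accept it. #13611
    ansiSQLChange(`alter table schema_history add primary key (schema_version)`),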

radusora commented

I ran some tests with Percona XtraDB and noticed that adding the migration as suggested by @MasonM doesn’t fully address the issue. After the schema_history table is created, subsequent DML commands in the migrations will fail without a primary key.

Adding the primary key directly in the create table statement helps with new installations, but then a migration that adds the key for existing installations will fail on new ones, where the key already exists. Similarly, trying to remove and re-add the primary key in the migrations isn't reliable, because the key may not exist in the first place.

Given these constraints, one possible solution is to introduce conditional logic immediately after the schema_history table creation. This logic would check whether the primary key exists and add it if it doesn’t – or safely handle any duplicate key errors if it already exists. While this approach isn’t perfectly elegant (as it requires running an extra query each time), it seems like a practical compromise for both new and existing installations.

If this approach sounds acceptable, I’d be happy to submit a PR to implement it.

MasonM (Contributor) commented Jan 18, 2025

@radusora Thanks for the details. That sounds reasonable. The query select 1 from information_schema.table_constraints where constraint_type = 'PRIMARY KEY' and table_name = 'schema_history' should let you quickly check whether the primary key is already there on both MySQL and PostgreSQL (at least with the versions I tried).
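
For concreteness, a sketch of how that check-then-add step could look, using the query above. The package and function names are illustrative, error handling is simplified, and this is not the actual argo-workflows code; it also assumes schema_history has a single schema_version column, as discussed above:

    package sqldb // illustrative package name only

    import (
        "database/sql"
        "errors"
        "fmt"
    )

    // ensureSchemaHistoryPrimaryKey adds a primary key to schema_history if the
    // information_schema lookup finds none.
    func ensureSchemaHistoryPrimaryKey(db *sql.DB) error {
        var one int
        err := db.QueryRow(`select 1 from information_schema.table_constraints
            where constraint_type = 'PRIMARY KEY' and table_name = 'schema_history'`).Scan(&one)
        switch {
        case errors.Is(err, sql.ErrNoRows):
            // No primary key yet: add one. The same statement works on both
            // MySQL and PostgreSQL.
            _, err = db.Exec(`alter table schema_history add primary key (schema_version)`)
            return err
        case err != nil:
            return fmt.Errorf("checking schema_history primary key: %w", err)
        default:
            // Primary key already present; nothing to do.
            return nil
        }
    }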
