
Add notes on fixing broken schema updates #6448

Merged: 3 commits merged into main from crdb-docs on Aug 27, 2024

Conversation

@smklein (Collaborator) commented on Aug 27, 2024:

Addresses feedback on #6436

Comment on lines -131 to -135
CockroachDB's representation of the schema includes some opaque
internally-generated fields that are order dependent, like the names of
anonymous CHECK constraints. Our schema comparison tools intentionally ignore
these values. As a result, when performing schema changes, the order of new
tables and constraints should generally not be important.
@smklein (Collaborator, Author):
This section was moved lower, as it's more of an "FYI" than guidance.
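For context, here is a minimal, hypothetical example of the kind of anonymous `CHECK` constraint the excerpt refers to (the table and column are invented for illustration):

```sql
-- Hypothetical table. The CHECK constraint below is anonymous, so CockroachDB
-- assigns it an internally generated name; because that name is order
-- dependent, the schema comparison tools intentionally ignore it.
CREATE TABLE IF NOT EXISTS omicron.public.widget (
    id UUID PRIMARY KEY,
    count INT8 NOT NULL CHECK (count >= 0)
);
```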

@andrewjstone (Contributor) left a comment:

Thanks @smklein. Love the writeup about convergence.

schema/crdb/README.adoc (outdated; resolved)

1. Add a column without a `NOT NULL` constraint.
2. Ensure that all data has migrated to a non-nullable value, eventually.
3. Once all data has been migrated to a non-nullable value, add a `NOT NULL` constraint.
Contributor:

As a practical matter, can we ever be sure that all customer data has migrated this far? I guess if we established a requirement that the cluster boot/run at schema level N before being allowed to migrate to N+M, it might be possible. Which is a long way of saying that "If all data..." might be more apt than "Once all data...".

@smklein (Collaborator, Author):

Yeah, I was intentionally very handwavy about "how can you be certain the migration has completed?", because it's very dependent on your data.

If it's possible to back-fill this data by observing the database alone, this is tractable today: our current schema-updating system runs against a system with no other concurrent queries, so it should be possible to back-fill these values based on other tables and then make the conversion.

If backfilling this data requires reading data outside the database -- maybe we want to add a non-nullable field to inventory? -- then this is much harder. We basically rely on Nexus performing work after some schema updates are applied, but before the "nullability" constraint has been fully dropped. Right now, we don't really have a great way of doing this, and we'll probably need to build more mechanism to make this possible.

Collaborator:

I proposed more specific language. I think the only realistic choice for the time being is to migrate the data yourself during the update. We've done that a bunch of times.

An alternative might be to ship the update over two releases: one that adds the column and whose Nexus backfills data in the meantime; and one that makes the column required. But this doesn't work unless there's some kind of interlock to ensure that the backfill process has finished before somebody starts the second upgrade.

Another alternative is to ship an ad hoc Rust program alongside schema-updater (or a subcommand of that program) that backfills the data and maybe knows to do it between two specific schema versions. This is probably the most promising, but let's not do that if we don't have to.
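As a rough sketch of the multi-step process quoted at the top of this thread (all table and column names below are hypothetical), the three steps might look like:

```sql
-- Step 1: add the column without a NOT NULL constraint (hypothetical names).
ALTER TABLE omicron.public.widget ADD COLUMN IF NOT EXISTS serial_number TEXT;

-- Step 2: back-fill existing rows to a non-null value. How this happens is the
-- hard part discussed above; here it is shown as a simple placeholder value.
UPDATE omicron.public.widget SET serial_number = 'unknown'
    WHERE serial_number IS NULL;

-- Step 3: only once every row is known to be non-null, add the constraint.
ALTER TABLE omicron.public.widget ALTER COLUMN serial_number SET NOT NULL;
```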

contain a default to help back-fill existing rows in that table which may
exist. Without this default value, the schema upgrade might fail with
an error like `null value in column "..." violates not-null constraint`.
Unfortnately, it's possible that the schema upgrade might NOT fail with that
Collaborator:

Suggested change
Unfortnately, it's possible that the schema upgrade might NOT fail with that
Unfortunately, it's possible that the schema upgrade might NOT fail with that

@smklein (Collaborator, Author):

Updated
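
For reference, a hedged sketch of the pattern the excerpt above describes, where a `DEFAULT` back-fills rows that already exist so the `NOT NULL` constraint can be added in one statement (the table, column, and default value are invented):

```sql
-- Hypothetical example. Without the DEFAULT, this would fail on any deployment
-- that already has rows in the table, with an error like
-- `null value in column "provision_state" violates not-null constraint`.
ALTER TABLE omicron.public.widget
    ADD COLUMN IF NOT EXISTS provision_state TEXT NOT NULL DEFAULT 'active';
```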


=== Scenario-specific gotchas
If a `DEFAULT` value does not make sense, the following may be necessary
Collaborator:

I believe we've done this a few times by being willing to rewrite the data during a migration. Schema version 26 seems like an example, though it's kind of a complicated one. Schema version 54 (mentioned below) is also an example of populating a new column but it did not make the new column NOT NULL after doing it.

What about being more specific here for the easy case and less specific for the general case? Maybe something like:

If a DEFAULT value does not make sense, then you need to implement a multi-step process.

  1. Add the column without a NOT NULL constraint.
  2. Migrate existing data to a non-null value.
  3. Once all data has been migrated to a non-null value, alter the table again to add the NOT NULL constraint.

For the time being, if you can write the data migration in SQL (e.g., using a SQL UPDATE), then you can do this with a single new schema version where the second step is an UPDATE. See schema version 54 (blueprint-add-external-ip-id) for an example of this (though that one did not make the new column NOT NULL -- you'd want to do that in another step). Update the validate_data_migration() test in nexus/tests/integration_tests/schema.rs to add a test for this.

In the future when schema updates happen while the control plane is online, this may not be a tenable path because the operation may take a very long time on large tables.

If you cannot write the data migration in SQL, you would need to figure out a different way to backfill the data before you can apply the step that adds the NOT NULL constraint. This is likely a substantial project.

You don't have to take that edit. But basically, I'm worried the language that's here now isn't giving enough guidance for the common case that you can just put the UPDATE into a migration; but it also makes it sound potentially okay if the data doesn't get migrated promptly.

@smklein (Collaborator, Author):

Updated. FWIW, I do still think that the "schema migrations we can make from reading the DB alone" case will be totally tractable, even with online update (though that whole area is speculative). But yeah, totally agreed that backfill outside the DB will be much harder, and will require some back-and-forth between "Nexus running normally" and "Nexus updating schemas".

Collaborator:

Yeah -- that makes me realize that when we are doing online update, then the control plane itself can know not to start the update until the data has been backfilled.
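
To illustrate the single-schema-version approach discussed in this thread, here is a hedged sketch in which the back-fill can be written as a SQL `UPDATE` against data already in the database, split across `upN.sql` files within one schema version (all table and column names are invented, not taken from the real schema):

```sql
-- up1.sql: add the new column as nullable first (hypothetical names).
ALTER TABLE omicron.public.widget ADD COLUMN IF NOT EXISTS factory_serial TEXT;

-- up2.sql: back-fill the new column from another table that already holds the
-- data. The schema updater runs with no other concurrent queries, so this can
-- happen within the same schema version.
UPDATE omicron.public.widget AS w
    SET factory_serial = f.serial_number
    FROM omicron.public.widget_factory AS f
    WHERE w.factory_id = f.id AND w.factory_serial IS NULL;

-- up3.sql: now that every row is non-null, require the column.
ALTER TABLE omicron.public.widget ALTER COLUMN factory_serial SET NOT NULL;
```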


*already* applied it arrive at the same state. This is only necessary if it is
possible for the schema update to apply successfully in any possible
deployment. This schema update should be added like any other "new" schema update,
appended to the list of all updates, rather than re-ordering history.
Collaborator:

Suggested change
appended to the list of all updates, rather than re-ordering history.
appended to the list of all updates, rather than re-ordering history. This schema update will run on systems that deployed both versions of the earlier schema update.
. Determine whether any of the schema versions after the broken one need to change because they depended on the specific behavior that you changed to _fix_ that version.

You kind of said this later but I feel like it will be missed if it's not explicitly one of the steps we tell people to follow.

@smklein (Collaborator, Author):

Updated


* `S(bad)`: The particular `upN.sql` schema update which is "broken".
* `S(fixed)`: That same `upN.sql` file after being updated to a non-broken version.
* `S(converge)`: Some later schema update that converges the deployment to a known-good
Collaborator:

I'm a little worried here that systems running versions in the range (S(bad), S(converge)) may still be in inconsistent states... but that may be unavoidable.

@smklein (Collaborator, Author):

Yeah, agreed. Happy to cover this more if we feel like we can, but this whole area is "documenting what to do when some ambiguous bad thing has already happened".

I think our inclination towards "try to prevent the bad thing from happening via tests" (e.g. #6451) should take priority, largely because it feels more tractable to me.
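
To make the `S(bad)` / `S(fixed)` / `S(converge)` pattern concrete, here is a hedged sketch. The table, column, and breakage are invented, and it assumes (as in PostgreSQL) that `SET NOT NULL` and `SET DEFAULT` succeed even when the column already satisfies them:

```sql
-- S(bad), as originally shipped: added the column but omitted the DEFAULT and
-- the NOT NULL constraint, so deployments that ran it have a nullable column.
--   ALTER TABLE omicron.public.widget ADD COLUMN flavor TEXT;
--
-- S(fixed): the same upN.sql file, rewritten in place:
--   ALTER TABLE omicron.public.widget
--       ADD COLUMN flavor TEXT NOT NULL DEFAULT 'unknown';
--
-- S(converge): appended to the end of the list like any other new schema
-- update. It must succeed on deployments that ran either version above, so
-- each statement is effectively a no-op where the fixed state already holds.
UPDATE omicron.public.widget SET flavor = 'unknown' WHERE flavor IS NULL;
ALTER TABLE omicron.public.widget ALTER COLUMN flavor SET NOT NULL;
ALTER TABLE omicron.public.widget ALTER COLUMN flavor SET DEFAULT 'unknown';
```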

@smklein enabled auto-merge (squash) on August 27, 2024, 19:29
@smklein merged commit 8fa1da2 into main on Aug 27, 2024 (22 checks passed)
@smklein deleted the crdb-docs branch on August 27, 2024, 23:11