
Tracking: Instance Lifecycle Overhaul #3742

Open
6 of 13 tasks
smklein opened this issue Jul 21, 2023 · 3 comments
Labels
nexus Related to nexus Sled Agent Related to the Per-Sled Configuration and Management virtualization Propolis Integration & VM Management

smklein commented Jul 21, 2023

  • Updating Instance State Information within Nexus
    • "Sled Agent registering itself with Nexus" should also transfer information about the instances the sled agent knows about; this can start as an empty set. See #3633 (restart customer Instances after sled reboot) for much more detail.
    • The Sled Agent should refuse to handle instance requests until it has successfully registered itself with Nexus. This avoids a race where Nexus sends a request to a rebooting sled at the same time as the sled registers with Nexus and reports "all instances are dead now", inadvertently marking a brand-new instance as failed.
    • Nexus should look up all instances that should have been running on the sled and mark them failed.
    • Later Nexus can use an RPW to look for instances that are marked as "failed + auto_boot_on_fault", and re-provision them in the background.
    • Idea: We could plausibly update the "normal" instance provisioning workflow to rely on this RPW for provisioning, too. This would let "instance create" return much faster, leaving the work of finding an appropriate sled and starting the instance to a background task that could tolerate slower APIs to the backend.
    • Ensuring metric registration: As part of the above RPW, we should also ensure that running instances have an assignment to an oximeter collector recorded in the omicron.public.metric_producer table. When instances are stopped, that assignment needs to be removed by the cleanup portion of the same RPW.
  • Instances without Sleds
    • We need to make it possible for Instances to not have a propolis ID / sled ID, in the case that they are stopped.
    • We also have cleanup to do: the virtual resources consumed by an instance should be released when the instance is stopped but not deleted.
  • Handling Failed Instances
    • Confirm that instances can be forcefully deleted after being marked failed.
    • Plumb the sled agent API @gjcolombo mentioned for "force-stopping an instance" through the public-facing API for this failed case, to ensure that the instance is truly destroyed.
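The selection step of the auto-restart RPW described above could be sketched roughly as follows. This is an illustrative sketch only: the `Instance` struct and the `auto_boot_on_fault` field follow the wording in this issue, not Omicron's actual data model, and a real implementation would query CRDB rather than filter an in-memory list.

```rust
// Illustrative sketch of one pass of the reprovisioning RPW described
// above. Types and field names are hypothetical, not Omicron's real model.

#[derive(Debug, Clone, Copy, PartialEq)]
enum InstanceState {
    Running,
    Stopped,
    Failed,
}

#[derive(Debug, Clone)]
struct Instance {
    name: String,
    state: InstanceState,
    auto_boot_on_fault: bool,
}

/// One pass of the background task: pick out instances that are
/// Failed *and* opted in to automatic restart.
fn restart_candidates(instances: &[Instance]) -> Vec<&Instance> {
    instances
        .iter()
        .filter(|i| i.state == InstanceState::Failed && i.auto_boot_on_fault)
        .collect()
}

fn main() {
    let instances = vec![
        Instance { name: "web".into(), state: InstanceState::Failed, auto_boot_on_fault: true },
        Instance { name: "db".into(), state: InstanceState::Failed, auto_boot_on_fault: false },
        Instance { name: "cache".into(), state: InstanceState::Running, auto_boot_on_fault: true },
    ];
    // Only "web" is both Failed and opted in to auto-restart.
    for i in restart_candidates(&instances) {
        println!("would reprovision: {}", i.name);
    }
}
```

If the "normal" provisioning path were also routed through this RPW (per the idea above), the same pass would additionally pick up newly created instances awaiting placement.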
@smklein smklein added Sled Agent Related to the Per-Sled Configuration and Management nexus Related to nexus virtualization Propolis Integration & VM Management labels Jul 21, 2023
@gjcolombo
Contributor

#2315 also tracks the "instances without sleds" work. It probably depends on #2824, since starting an instance with no resource reservation is a multi-step process.

Nexus can use an RPW to look for instances that are marked as "failed + auto_boot_on_fault", and re-provision them in the background.

If we use the existing Failed state for this, we'll need to make sure that

  • instances we've marked as Failed and are restarting have definitely been torn down (e.g. what happens if a Nexus-to-sled-agent call fails and causes Nexus to move an instance to the Failed state, but the problem was transient and the instance is actually alive?)
  • instances we've marked as Failed have some prospect of being recovered (e.g. suppose an instance is Failed because Propolis's startup sequence failed, and the problem is persistent; will the RPW constantly try and fail to start the VM?)

We might decide to have different failure reasons to help us distinguish these cases.
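The "different failure reasons" idea might look roughly like the following. This is a hypothetical sketch, not an actual Omicron type: the variant names and the retry-cap policy are invented here to show how distinct reasons could gate the restart RPW differently.

```rust
// Hypothetical sketch of distinguishing failure reasons, per the
// suggestion above. Not Omicron's actual data model.

#[derive(Debug, Clone, Copy, PartialEq)]
enum FailureReason {
    /// A Nexus-to-sled-agent call failed; the VM may actually still be
    /// alive, so teardown must be confirmed before any restart.
    SledUnreachable,
    /// Propolis reported a startup error; possibly persistent, so blind
    /// retries could loop forever without a cap.
    VmmStartupFailed,
    /// The sled rebooted and the VMM is definitely gone.
    SledRebooted,
}

/// Whether the restart RPW should consider this instance at all.
fn eligible_for_auto_restart(reason: FailureReason, retries_so_far: u32, max_retries: u32) -> bool {
    match reason {
        // Teardown must be confirmed out-of-band first; skip here.
        FailureReason::SledUnreachable => false,
        // Retry, but with a cap so a persistent failure doesn't spin.
        FailureReason::VmmStartupFailed => retries_so_far < max_retries,
        // Known-clean failure: always safe to reprovision.
        FailureReason::SledRebooted => true,
    }
}

fn main() {
    assert!(eligible_for_auto_restart(FailureReason::SledRebooted, 0, 3));
    assert!(!eligible_for_auto_restart(FailureReason::SledUnreachable, 0, 3));
    assert!(!eligible_for_auto_restart(FailureReason::VmmStartupFailed, 3, 3));
    println!("ok");
}
```

This addresses both concerns above: a `SledUnreachable` instance is never blindly restarted (it may still be alive), and a persistently failing `VmmStartupFailed` instance stops being retried once it hits the cap.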


smklein commented Jul 25, 2023

See also: #2825


hawkw commented Sep 24, 2024

Most of the stuff described in "Updating Instance State Within Nexus" was implemented in a combination of #5611, #5759, and #6503. The proactive registration of sled-agents with Nexus isn't something we've done yet.

@morlandi7 morlandi7 modified the milestones: 11, 12 Sep 26, 2024
@morlandi7 morlandi7 modified the milestones: 12, 13 Nov 26, 2024