
Optimizations to improve dataset ensuring performance #7236

Merged

merged 13 commits into main from sluggish-datasets on Dec 13, 2024
Conversation

smklein (Collaborator) commented Dec 12, 2024

Major part of #7217

Optimizations made:

  • zfs get is queried once, so properties are sampled for all datasets of interest ahead of time; no subsequent processes are exec'd for datasets that need no changes.
  • The "dataset ensure" process is now concurrent

These optimizations should significantly improve the latency of the "no (or few) changes necessary" cases.
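As a rough illustration of the concurrency change (a sketch only; `DatasetConfig` and `ensure_one` are hypothetical stand-ins rather than the real sled-storage types), the ensure step can kick off every dataset at once and await them together:

// Sketch, not the PR's actual code: ensure datasets concurrently rather
// than one at a time. `DatasetConfig` and `ensure_one` are hypothetical
// stand-ins for the real sled-storage types.
use futures::future::join_all;

struct DatasetConfig {
    name: String,
}

async fn ensure_one(config: DatasetConfig) -> Result<(), String> {
    // The real implementation would compare pre-fetched properties and only
    // exec `zfs` when this dataset actually needs a change.
    println!("ensuring {}", config.name);
    Ok(())
}

async fn ensure_all(configs: Vec<DatasetConfig>) -> Vec<Result<(), String>> {
    // Start every ensure at once and wait for all of them to finish.
    join_all(configs.into_iter().map(ensure_one)).await
}

fn main() {
    let configs = vec![
        DatasetConfig { name: "example/dataset-a".to_string() },
        DatasetConfig { name: "example/dataset-b".to_string() },
    ];
    let results = futures::executor::block_on(ensure_all(configs));
    assert!(results.iter().all(|r| r.is_ok()));
}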

Optimizations still left to be made:

  • Making blueprint execution concurrent, rather than blocking one-sled-at-a-time
  • Patching the illumos_utils "Zfs::ensure_filesystem" to re-use the pre-fetched properties, and minimize re-querying

//
// If one or more datasets don't exist, we can still read stdout to
// learn about the ones that do exist.
let output = cmd.output().map_err(|err| {
smklein (Collaborator, Author):

Here's an easy example to see what's happening:

$ zfs get -o name,property,value used rpool does_not_exist
cannot open 'does_not_exist': dataset does not exist
NAME   PROPERTY  VALUE
rpool  used      256G

"Cannot open 'does_not_exist'" goes to stderr, the rest goes to stdout, and can be parsed.
Since we want to use this API to get as much info as possible about datasets which may or may not exist, it's critical to keep going, even if one or more datasets are missing.
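A minimal sketch of this "parse whatever made it to stdout" approach (illustrative only; the flags and helper names here are assumptions, not the PR's exact code):

use std::process::Command;

// Sketch: query one property for many datasets and parse stdout even if
// some datasets are missing and `zfs get` exits non-zero.
fn get_used(datasets: &[&str]) -> std::io::Result<Vec<(String, String)>> {
    let output = Command::new("zfs")
        // -H: no header, tab-separated output that is easy to split
        .args(["get", "-H", "-o", "name,value", "used"])
        .args(datasets)
        .output()?; // failing to spawn `zfs` at all is still a hard error

    // Deliberately ignore `output.status`: missing datasets only add
    // "cannot open ..." lines to stderr, while every dataset that does
    // exist is still reported on stdout.
    Ok(String::from_utf8_lossy(&output.stdout)
        .lines()
        .filter_map(|line| {
            let mut cols = line.split('\t');
            Some((cols.next()?.to_string(), cols.next()?.to_string()))
        })
        .collect())
}

fn main() -> std::io::Result<()> {
    // Mirrors the example above: one dataset exists, one does not.
    for (name, used) in get_used(&["rpool", "does_not_exist"])? {
        println!("{name}: {used}");
    }
    Ok(())
}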

Contributor:

Ignoring any errors and moving forward with parsing stdout seems pretty dicey. I assume there are lots of ways zfs get could fail that are more serious than being asked about datasets that don't exist. Do we know it's okay to ignore all of those too?

(To be clear: I'm not sure what the alternative is. I guess we could do something like "on error, try to parse stderr, and if it's only dataset does not exist logs then proceed"? Or maybe we could call zfs get without listing any datasets at all, then filter the returned list ourselves to match which?)

smklein (Collaborator, Author):

Yeah, I don't know how to proceed here. The exit code is "1", so there isn't much information to glean from this.

I looked through the zfs get flags and didn't see anything that allows this omission. I think my preference with these tools would be to rely on other mechanisms to detect problems with the datasets, and leave the listing as a best-effort thing.

Namely: If a dataset that should exist actually doesn't, for any reason, we'll try to re-create it later as a part of blueprint execution. Then we'll see errors at "ensure" time, rather than "listing" time.

> I guess we could do something like "on error, try to parse stderr, and if it's only dataset does not exist logs then proceed"?

This is possible - to be clear, would this proposal be:

  • Call zfs get on several datasets (like this PR is currently doing)
  • If we see the command fail, parse stderr
  • Only treat it as "OK" if stderr contains the lines `cannot open 'XXX': dataset does not exist` for one of our datasets?

Small quirk here -- if this failed for any datasets we're trying to query (e.g., maybe a zpool is dying?) it would stop inventory collection for the entire sled. So at the end of the day, I think we need to proceed, parsing what we can, from the set of all datasets we can observe.
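For reference, a rough sketch of that stderr check (illustrative only; this is not the approach the PR ends up taking):

// Sketch of the alternative: treat a failed `zfs get` as acceptable only if
// every non-empty stderr line is a "dataset does not exist" complaint.
fn only_missing_dataset_errors(stderr: &str) -> bool {
    stderr
        .lines()
        .filter(|line| !line.trim().is_empty())
        .all(|line| line.contains("dataset does not exist"))
}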

> Or maybe we could call zfs get without listing any datasets at all, then filter the returned list ourselves to match which?

I also considered this, and would be somewhat okay with this approach. My only beef is that it would need to be fully recursive to be able to see everything, so we'd be e.g. listing all crucible datasets every time -- and that means there could be a lot of data here.

For example, here are the results on a "non-pathological system" -- dogfood:

pilot host exec -c "zfs get -o name,value name | wc -l" {7..25}
 7  BRM27230045        ok: 108
 8  BRM44220011        ok: 4364
 9  BRM44220005        ok: 4190
10  BRM42220009        ok: 263
11  BRM42220006        ok: 4298
12  BRM42220057        ok: 4463
13  BRM42220018        ok: 21
14  BRM42220051        ok: 4281
16  BRM42220014        ok: 4397
17  BRM42220017        ok: 4286
21  BRM42220031        ok: 4315
23  BRM42220016        ok: 4379

Contributor:

> This is possible - to be clear, would this proposal be:
>
> * Call `zfs get` on several datasets (like this PR is currently doing)
>
> * If we see the command fail, parse stderr
>
> * Only treat it as "OK" if stderr contains the lines `cannot open 'XXX': dataset does not exist` for one of our datasets?
>
> Small quirk here -- if this failed for any datasets we're trying to query (e.g., maybe a zpool is dying?) it would stop inventory collection for the entire sled. So at the end of the day, I think we need to proceed, parsing what we can, from the set of all datasets we can observe.

That is what I was proposing, yes, but your quirk is a great point.

> I also considered this, and would be somewhat okay with this approach. My only beef is that it would need to be fully recursive to be able to see everything, so we'd be e.g. listing all crucible datasets every time -- and that means there could be a lot of data here.

That's certainly not awesome.

I think I'm inclined to go with what you have (i.e., ignore errors and parse what we can), although in the back of my mind I'm worried that's going to bite us eventually. Is there any chance (not on this PR) we could get changes to the zfs CLI or a ZFS API that would let us do what we want to do here with less ambiguity?

Collaborator:

Would it be possible/reasonable here to specify a parent dataset that does exist and use -r to get all its children? Then we shouldn't expect any errors.

smklein (Collaborator, Author):

> Would it be possible/reasonable here to specify a parent dataset that does exist and use -r to get all its children? Then we shouldn't expect any errors.

I think this bumps into the problem I mentioned about recursive children - we do want to know about e.g. crucible, but using -r will list all regions, and all properties on all regions, which is in the ballpark of 10,000 - 100,000 lines of output.

// Gather properties about these datasets, if they exist.
//
// This pre-fetching lets us avoid individually querying them later.
let old_datasets = Zfs::get_dataset_properties(
smklein (Collaborator, Author):

This query, as well as the logic below, is the primary optimization made by this PR.

We call zfs get once for all datasets we care about, and if no changes are necessary, that's the only process created.
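As a rough illustration of the shape of that optimization (hypothetical types, not the actual illumos_utils API): fetch properties for every dataset of interest in a single `zfs get`, then only do per-dataset work for the ones whose observed state differs from the desired config.

use std::collections::BTreeMap;

// Hypothetical stand-ins for the real types used in this PR.
struct DatasetProperties {
    quota: Option<u64>,
}
struct DatasetConfig {
    name: String,
    quota: Option<u64>,
}

// One up-front query populates `observed`; after that, only datasets that
// are missing or different need any further `zfs` invocations.
fn datasets_needing_changes<'a>(
    desired: &'a [DatasetConfig],
    observed: &BTreeMap<String, DatasetProperties>,
) -> Vec<&'a DatasetConfig> {
    desired
        .iter()
        .filter(|config| match observed.get(&config.name) {
            // Dataset exists and already matches: nothing to exec.
            Some(props) if props.quota == config.quota => false,
            // Missing or different: this one needs `zfs create` / `zfs set`.
            _ => true,
        })
        .collect()
}

fn main() {
    let desired = vec![DatasetConfig {
        name: "example/dataset-a".to_string(),
        quota: Some(1 << 30),
    }];
    let mut observed = BTreeMap::new();
    observed.insert(
        "example/dataset-a".to_string(),
        DatasetProperties { quota: Some(1 << 30) },
    );
    // Everything already matches, so no additional processes would be spawned.
    assert!(datasets_needing_changes(&desired, &observed).is_empty());
}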

smklein requested a review from jgallagher, December 12, 2024 02:08
jgallagher (Contributor) left a comment:

Thanks for picking this up! Having to go through a CLI is pretty rough to begin with; "now change how we do it so it's fast" is a tall order.

.filter(|(_prop, source)| {
// If a quota has not been set explicitly, it has a default
// source and a value of "zero". Rather than parsing the value
// as zero, it should be ignored.
Contributor:

Was this comment wrong before? (What is `source` in the two `-`/`0` cases now checked?)

smklein (Collaborator, Author):

The comment was kinda right (the default value for new datasets is zero + default), but incomplete. There are other cases where a "quota set to none" shows up.

Either way, I think this code is now more holistically correct: regardless of source, we should treat "0" / "-" as "no quota has been set" (and the same for reservation).
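A small sketch of that interpretation (hypothetical helper, not the PR's exact code): whatever the property's source, a raw value of "0" or "-" maps to "no quota/reservation set":

// Sketch: parse a quota/reservation value, treating both "0" and "-" as
// "not set" regardless of the property's source.
fn parse_quota(raw: &str) -> Option<u64> {
    match raw {
        "-" | "0" => None,
        other => other.parse::<u64>().ok(),
    }
}

fn main() {
    assert_eq!(parse_quota("-"), None);
    assert_eq!(parse_quota("0"), None);
    assert_eq!(parse_quota("1073741824"), Some(1_073_741_824));
}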

smklein (Collaborator, Author) commented Dec 12, 2024:

Thanks for all the feedback! I think I've addressed all the comments in the latest couple of commits.

smklein enabled auto-merge (squash), December 13, 2024 20:54
smklein merged commit de8bc93 into main, Dec 13, 2024
17 checks passed
smklein deleted the sluggish-datasets branch, December 13, 2024 23:14