-
Notifications
You must be signed in to change notification settings - Fork 40
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Persist fault management data across reboots #4211
Comments
citrus-it
added a commit
that referenced
this issue
Oct 5, 2023
/var/fm/fmd is where the illumos fault management system records data. We want to preserve this data across system reboots and in real time rather than via periodic data copying, so that the information is available should the system panic shortly thereafter. Fixes: #4211
citrus-it
added a commit
that referenced
this issue
Oct 5, 2023
/var/fm/fmd is where the illumos fault management system records data. We want to preserve this data across system reboots and in real time rather than via periodic data copying, so that the information is available should the system panic shortly thereafter. Fixes: #4211
citrus-it
added a commit
that referenced
this issue
Oct 5, 2023
/var/fm/fmd is where the illumos fault management system records data. We want to preserve this data across system reboots and in real time rather than via periodic data copying, so that the information is available should the system panic shortly thereafter. Fixes: #4211
citrus-it
added a commit
that referenced
this issue
Oct 5, 2023
/var/fm/fmd is where the illumos fault management system records data. We want to preserve this data across system reboots and in real time rather than via periodic data copying, so that the information is available should the system panic shortly thereafter. Fixes: #4211
Related to disk filling up: #2478 |
A few notes from testing on a bench gimlet:
|
citrus-it
added a commit
that referenced
this issue
Oct 9, 2023
/var/fm/fmd is where the illumos fault management system records data. We want to preserve this data across system reboots and in real time rather than via periodic data copying, so that the information is available should the system panic shortly thereafter. Fixes: #4211
citrus-it
added a commit
that referenced
this issue
Oct 9, 2023
`/var/fm/fmd` is where the illumos fault management system records data. We want to preserve this data across system reboots and in real time rather than via periodic data copying, so that the information is available should the system panic shortly thereafter. Fixes: #4211
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
While @wesolows was digging into stlouis#281/cs#39 he found breadcrumbs in the kernel memory of the crash dump that indicated that the fault management system had done /something/ recently but, since the fault management logs and database are on non-persistent storage on the system ramdisk, that data was lost and we can only guess what occurred.
We should back the fault management data in
/var/fm/fmd
with a dataset on the current boot disk so that the fault management history is preserved. In the future, we should likely do the same for GZ system log files and other things.Note that we have also previously seen a system fill up the root ramdisk (via
/var/fm/fmd
) due to a flood of ZFS errors so an appropriate quota should also be applied here.The text was updated successfully, but these errors were encountered: