Post-processing fails when there's an ensemble member run failure #20
Comments
Then I ran the post-process only, but it failed with a
@SorooshMani-NOAA, the same issue happened for the
@FariborzDaneshvar-NOAA It seems it might be related to permissions: see open-mpi/ompi#8510. Let me take a look at the directory where it failed to create the file.
@SorooshMani-NOAA Thanks for following up. I ran it again and, unlike the first time, all runs completed!
No problem! I think it might just be some internal Open MPI issue, or the fact that the versions don't match between PW and the Singularity image. In any case, I'm building another image with the same version using
@SorooshMani-NOAA, I was running
Also, one of the runs for the
The rest of the jobs completed successfully! Any thoughts?
@FariborzDaneshvar-NOAA I can't think of why it is failing. The only thing I can think of is permissions, but even that doesn't make sense! Unless we can find a way to reliably reproduce this error, it's hard to say why it's happening or how to fix it!
I was running the workflow for 10 ensembles of Florence (2018) with the OFCL storm track, but one of the runs (run #9) failed. Here is the content of the slurm-##.out file for that run: