-
Notifications
You must be signed in to change notification settings - Fork 31
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
saved_logwt_bs
error after completion
#112
Comments
If you can reproduce, please add a print before to see what
in file "/u/sbethapudi/.local/lib/python3.8/site-packages/ultranest/netiter.py", line 897, in combine_results This is an error I have not seen before. mpi4py/mpi4py#23 suggests you may have crossed a 2GB threshold that your MPI does not support. I guess this translates into a limit on number of live points x number of iterations, the latter is increased with the improvement loops. |
(sorry to dig up this old issue; i need to have large number of live points and am running into the 2GB threshold more frequently) This SO answer recommends to replace lower case I want to edit the Does this seem reasonable? thanks in advance. |
Have you already tried switching MPI implementation? Your cluster should allow you to select different ones. |
i just tried with a toy example where i perform a according to that SO answer, this is a inherent limitation of using lower case methods as they use Using |
You could alternatively reduce the number of bootstrap rounds. |
Description
(let me preface by saying i love this piece of code)
i am solving a 6D fitting problem using
ultranest
. Five are angles hence are wrapped parameters. One is a simple parameter.i am running it on hpc with 480 tasks using MPI.
it crashes after it has converged to the ML point.
This has happened during multiple runs with the same error.
Looking through the traceback, it is failing in the gather step.
This happened after the iteration has completed.
In my code, after i run
sampler.run
, i runstore_tree,print_results,plot_corner
.What I Did
these are my parameters for
run
.The text was updated successfully, but these errors were encountered: