-
Notifications
You must be signed in to change notification settings - Fork 95
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
cuda issues with quay.io/pangeo/ml-notebook:2023.02.27 #450
Comments
Thanks for the report and workaround @sebastian-luna-valero, ideally these tutorials would just work... but ptxas/cuda-nvcc isn't installed by default unfortunately see this readme note pangeo-docker-images/README.md Line 173 in 614419a
i did just tag a more recent image with newer versions of everything in case its helpful |
Thanks for the background! I just tested the |
Btw, we should be able to finally resolve this very soon! In practice, we could just resolve it, but we should wait for a migration trigger to avoid unforeseen issues. xref #438 |
Cool, hopefully things are working for you now with the newer version by manually running Will close this as a duplicate of #438 to better focus the conversation on that thread. |
Hi,
When running this notebook: https://www.tensorflow.org/tutorials/images/cnn
Using
quay.io/pangeo/ml-notebook:2023.02.27
, in the cell:https://www.tensorflow.org/tutorials/images/cnn#compile_and_train_the_model
I get:
which is solved with:
mamba install -c nvidia cuda-nvcc=11.8 # same version as cudatoolkit
Then I also get:
The problem is solved when I replace
optimizer='adam'
withoptimizer=tf.keras.optimizers.legacy.Adam()
in:These solutions were found in:
https://discuss.tensorflow.org/t/cant-find-libdevice-directory-cuda-dir-nvvm-libdevice/
Maybe the
ml-notebook
image could be updated to solve these issues?Best regards,
Sebastian
The text was updated successfully, but these errors were encountered: