This repository has been archived by the owner on Sep 19, 2022. It is now read-only.
The version in the title refers to PyTorch, not PyTorchJob. Let's fix the version at 1.8.0 and see how the difference is introduced.
Oh yes, sorry for the mistake. I'll change it right now.
Shuai-Xie changed the title from "PytorchJob version has an effect on the training reproduction" to "Pytorch version may have an effect on the training reproduction" on Sep 21, 2021
I am trying to figure out why Bare Metal (BM) and PyTorchJob (PJ) produce different training results in #354 (comment).
I have now found that PyTorch v1.8.0 and v1.9.0 produce different training results on both BM and PJ.
Experiment settings
BM
PJ
I built the Docker images from different versions of the PyTorch base image.
Please let me know if there is a mistake in my code. I've posted it here: https://github.com/Shuai-Xie/mnist-pytorchjob-example.
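To narrow down where two runs start to differ, one option is to log the per-step training loss of each run and compare the two sequences. Below is a minimal sketch of such a comparison. The helper name `first_divergence` and the loss values in the usage lines are hypothetical and are not taken from the linked repository; the sketch assumes each run's losses are already available as a plain list of floats.

```python
import math
from typing import Optional, Sequence


def first_divergence(
    losses_a: Sequence[float],
    losses_b: Sequence[float],
    rel_tol: float = 0.0,
    abs_tol: float = 0.0,
) -> Optional[int]:
    """Return the first step index at which two loss curves differ.

    With the default tolerances (both 0.0) the comparison is exact, which is
    what bitwise reproducibility requires. Returns None if the curves match.
    """
    for step, (a, b) in enumerate(zip(losses_a, losses_b)):
        # math.isclose is False for NaN, so a NaN loss counts as a divergence.
        if not math.isclose(a, b, rel_tol=rel_tol, abs_tol=abs_tol):
            return step
    # One run logged more steps than the other: report where the shorter ends.
    if len(losses_a) != len(losses_b):
        return min(len(losses_a), len(losses_b))
    return None


# Illustrative values only, not measured results.
run_a = [2.30, 1.95, 1.60]
run_b = [2.30, 1.95, 1.61]
print(first_divergence(run_a, run_b))
```

If the first differing step is step 0, the difference most likely comes from initialization or data ordering; a later step points more toward numerical differences in individual operations. This is only a rough diagnostic under the assumptions above.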