Learning rate for Imagenet #18

Open
Shiweiliuiiiiiii opened this issue Jun 17, 2020 · 1 comment
Labels
question (Further information is requested), wontfix (This will not be worked on)

Comments

@Shiweiliuiiiiiii

Hi Tim,

First, thank you for your code.

I noticed that the code changes the default learning rate for ImageNet in multi-GPU runs by multiplying 0.1 by the number of GPUs. Did you actually use this setting to get the performance reported in the paper? Does it improve performance only for sparse training, or for dense training as well?

Many thanks

@TimDettmers
Owner

Good catch, I was not aware of this behavior. I did not change the code any further and trained on 4 GPUs. I have not studied in detail how performance differs if this behavior is changed. It might have affected the results for both sparse and dense training, but since I do not have any data, I cannot say for sure what the effect is.
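
For reference, the scaling discussed in this thread can be sketched as below. This is a minimal illustration and not the repository's actual code: the helper name `scaled_lr` and its parameters are hypothetical, and it assumes the behavior is exactly as described in the question (a base rate of 0.1 multiplied by the GPU count).

```python
def scaled_lr(base_lr: float, num_gpus: int) -> float:
    """Scale a base learning rate linearly with the number of GPUs.

    Hypothetical helper for illustration; not taken from the repository.
    """
    if num_gpus < 1:
        raise ValueError("num_gpus must be at least 1")
    return base_lr * num_gpus


# Base rate of 0.1 from the question, 4 GPUs from the reply.
lr = scaled_lr(0.1, 4)
print(f"effective learning rate: {lr:.2f}")
```

Under that assumption, a 4-GPU run would use an effective learning rate of about 0.4 rather than 0.1, which is the difference the question is asking about.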

@TimDettmers added the question and wontfix labels on Oct 22, 2020