This repository has been archived by the owner on Jul 30, 2024. It is now read-only.
As I commented in that thread, I can only get up to 8.5 it/s (without his caching strategy); I didn't test further. He mentioned caching gave him +4 it/s, so maybe that is the trick? In my implementation, though, data loading is not a bottleneck at all (it takes about 2e-4 s to fetch a batch), so I didn't try it.
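For reference, a minimal sketch (not code from this repo) of how one might measure the per-batch fetch time quoted above. `time_batch_fetch` and the dummy loader are illustrative names; any iterable of batches works:

```python
import time

def time_batch_fetch(loader, n_batches=100):
    """Average wall-clock seconds to fetch one batch from an iterable loader.

    With a torch DataLoader, pass iter(loader); if fetch time is tiny
    (~2e-4 s), data loading is unlikely to be the training bottleneck.
    """
    it = iter(loader)
    start = time.perf_counter()
    for _ in range(n_batches):
        next(it)
    return (time.perf_counter() - start) / n_batches

# Dummy stand-in for a data loader: an endless stream of ray batches.
dummy_loader = iter(lambda: list(range(1024)), None)
avg = time_batch_fetch(dummy_loader, n_batches=100)
```

If `avg` comes out orders of magnitude below the per-iteration train time, caching the dataset would mostly save CPU work, not wall-clock time.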
Missed out on this issue. I haven't been able to follow up on that thread, as I haven't run additional experiments since. As for the setup, it was PyTorch 1.4, CUDA 10.1, Python 3.6, running on a GPU cluster — specifically, on a node with a V100 GPU.
As for the config, I took care to replicate the exact lego config file (network of 8 layers, 128 fine samples per ray, and so on). For the speed comparison, I measure the time taken up to the optimizer parameter update, not the times reported by the tqdm loop, which include TensorBoard logging, etc.
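The measurement above can be sketched as follows — a hedged, stdlib-only example (`StepTimer` is a hypothetical helper, not from the repo) that accumulates time only inside the train step, so logging and progress-bar updates don't inflate the it/s figure. On GPU you would also call `torch.cuda.synchronize()` before leaving the timed block, since CUDA kernels launch asynchronously:

```python
import time

class StepTimer:
    """Context manager that accumulates time spent inside the train step
    (forward + backward + optimizer.step()), excluding logging."""
    def __init__(self):
        self.total = 0.0
        self.count = 0

    def __enter__(self):
        self._t0 = time.perf_counter()
        return self

    def __exit__(self, *exc):
        self.total += time.perf_counter() - self._t0
        self.count += 1

    @property
    def it_per_s(self):
        return self.count / self.total if self.total else 0.0

timer = StepTimer()
for _ in range(5):
    with timer:
        # Dummy work standing in for forward/backward/optimizer.step().
        sum(i * i for i in range(10000))
    # TensorBoard logging, tqdm updates, etc. go here, outside the timer.
print(f"{timer.it_per_s:.1f} it/s")
```

Comparing this number against the tqdm-reported rate shows how much overhead the logging adds.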
May I know the exact config (hyperparameters) used to get the average speed of 14.2 it/s (on an RTX 2080 Ti, PyTorch 1.4.0, CUDA 9.2) reported here?
I couldn't get it by simply following modifications in #6.
(cc @kwea123, did you test it further?)
Thanks in advance!