-
Looking into the Scheduler code, I have several questions about lr_noise.
-
@ayasyrev the random number for this needs to be separate from the Python and other (global) library random generators, and seeded consistently across all distributed workers in distributed training scenarios, so that all workers are using the same learning rate.
`t` is for time, as it could be epochs or steps; I have (as yet unrealized) plans to transition the schedulers to step-based for a variety of reasons, mostly fine-grained warmup (per step).
The noise idea had some tweaks/improvements to explore that I didn't end up doing, hence that extra arg floating around. I'd be curious to know if you find any improvements or find the feature useful. It's (as far as I know) unpublished and unique to `timm`.
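To make the seeding point concrete, here is a minimal sketch of the idea, not timm's actual implementation; the `apply_lr_noise` helper and its `noise_seed` / `noise_std` / `noise_pct` arguments are hypothetical names chosen to mirror the style of the scheduler args. The key detail is that a dedicated `torch.Generator` is seeded from `noise_seed + t` rather than the global RNG, so every distributed worker draws the same noise at the same `t` and the perturbed learning rates stay in sync.

```python
import torch

def apply_lr_noise(lr: float, t: int, noise_seed: int = 42,
                   noise_std: float = 1.0, noise_pct: float = 0.67) -> float:
    """Perturb a scheduled learning rate with deterministic, worker-consistent noise.

    A dedicated torch.Generator is seeded from (noise_seed + t) instead of the
    global Python/torch RNGs, so the draw neither disturbs other random state
    nor diverges across distributed workers for the same time index t.
    """
    g = torch.Generator()
    g.manual_seed(noise_seed + t)  # identical on every worker for a given t
    noise = torch.randn(1, generator=g).item() * noise_std
    # Limit the relative perturbation so the LR cannot explode or go negative.
    noise = max(min(noise, noise_pct), -noise_pct)
    return lr * (1.0 + noise)

# Example: the same base LR gets the same perturbation on every worker at each t,
# whether t counts epochs or optimizer updates.
base_lr = 0.1
for t in range(5):
    print(t, round(apply_lr_noise(base_lr, t), 5))
```

Keeping `t` generic like this is also what lets the same noise logic carry over if the schedulers move from epoch-based to step-based updates.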
-
Thanks for your answer. Right now I'm preparing notebooks for timmdocs. I started with the schedulers: documenting the noise options, the recent changes, and the new schedulers from the latest commits.