Scheduler lr_noise questions. #1096

Answered by rwightman
ayasyrev asked this question in Q&A

@ayasyrev the random number generator for this needs to be separate from the Python and other (global) library random generators, and seeded consistently across all workers in distributed training, so that all workers use the same learning rate
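To illustrate the point about a separate, consistently seeded generator, here is a minimal sketch using only the Python standard library. It is not timm's actual code: `noisy_lr`, `noise_pct`, and `noise_seed` are hypothetical names chosen for illustration, and uniform noise is an assumption. The idea is that the noise depends only on the seed and `t`, so every worker computes the same value without touching the global random state.

```python
import random


def noisy_lr(base_lr: float, t: int, noise_pct: float = 0.67, noise_seed: int = 42) -> float:
    """Hypothetical helper: perturb base_lr with noise that depends only on (noise_seed, t)."""
    # Private generator: it neither reads nor advances the global `random` state,
    # so data augmentation or other code using global RNGs cannot desynchronize it.
    rng = random.Random(noise_seed + t)
    # Uniform noise in [-noise_pct, noise_pct) (an assumed noise distribution).
    noise = (2.0 * rng.random() - 1.0) * noise_pct
    return base_lr + base_lr * noise


# Two "workers" with different global seeds still agree on the learning rate.
random.seed(0)
lr_worker_a = noisy_lr(0.1, t=5)
random.seed(123)
lr_worker_b = noisy_lr(0.1, t=5)
print(lr_worker_a == lr_worker_b)
```

Because the generator is rebuilt from `noise_seed + t` on each call, there is no state to synchronize between workers.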

t is for time, as it could be epochs or steps. I have (as yet unrealized) plans to transition the schedulers to step-based for a variety of reasons, mostly fine-grained (per-step) warmup
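As a rough sketch of why the unit of `t` matters for warmup, consider a plain linear warmup. This is an assumed form for illustration, not necessarily what timm does, and `warmup_lr`, `warmup_t`, and `warmup_lr_init` are hypothetical names here. The same function gives a coarse or a fine-grained ramp depending on whether `t` counts epochs or steps.

```python
def warmup_lr(t: int, warmup_t: int, base_lr: float, warmup_lr_init: float = 0.0) -> float:
    """Hypothetical linear warmup: ramp from warmup_lr_init to base_lr over warmup_t units of t."""
    if t < warmup_t:
        return warmup_lr_init + (base_lr - warmup_lr_init) * t / warmup_t
    return base_lr


steps_per_epoch = 100  # assumed value for the example

# Epoch-based: t is the epoch index, so the LR changes only 5 times during warmup.
epoch_based = [warmup_lr(epoch, warmup_t=5, base_lr=0.1) for epoch in range(5)]

# Step-based: t is the global step, so the LR changes on each of the 500 warmup steps.
step_based = [warmup_lr(step, warmup_t=5 * steps_per_epoch, base_lr=0.1) for step in range(500)]

print(len(set(epoch_based)), len(set(step_based)))
```

With epoch-based `t`, the learning rate stays flat within each epoch and jumps at epoch boundaries; with step-based `t`, it rises a little on every update.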

the noise idea had some tweaks/improvements to explore that I didn't end up doing, hence that extra arg floating around

I'd be curious to know if you find any improvements or find the feature useful. It's (as far as I know) unpublished and unique to timm

Answer selected by ayasyrev