Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Respect config when figuring out WD decoupling
WD = weight decay, as in whether AdamW is selected. Previously we just ignored `adam_w_mode`. Based on @ofivite's suggestion.
- Loading branch information