You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository has been archived by the owner on May 1, 2023. It is now read-only.
You compute the mean and standard-deviation of the parameter once, and cache them. But it is said in the paper that not only the important parameters are updated, but also the ones corresponding to zero entries of masks. This means that the distribution of parameters are constantly changing. I also found that you only update the 'important' parameters.
Where does the code reflect the author's special update method of parameters?
The text was updated successfully, but these errors were encountered:
Sign up for freeto subscribe to this conversation on GitHub.
Already have an account?
Sign in.
You compute the mean and standard-deviation of the parameter once, and cache them. But it is said in the paper that not only the important parameters are updated, but also the ones corresponding to zero entries of masks. This means that the distribution of parameters are constantly changing. I also found that you only update the 'important' parameters.
Where does the code reflect the author's special update method of parameters?
The text was updated successfully, but these errors were encountered: