-
Notifications
You must be signed in to change notification settings - Fork 23
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Puzzles about Inconsistency between code and article #3
Comments
Yes @chen-qj, I noticed this too. Did you figure out why? |
I noticed another question. The multiplication of vectors and grad1/2 is element-wise but in the paper, it is matrix multiplication. Or I misunderstand the theory? |
The Author is not using score matching to learn the data distribution |
They are equivalent. Flatten the data into one dimension, you will find it easier to understand. |
To my understanding,$h = S_{m}(x;\theta)$ and loss2 is the first term of $J(\theta)$ , which is $v^{T}\nabla_{x}h(x;\theta)v$ . But in the code, it seems to be calculated as $v^{T}\nabla_{z}h(x;\theta)v$ .
grad1
is the estimation of scoreThe text was updated successfully, but these errors were encountered: