Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Remove an extra residual connection in DiffusionTransformer #305

Closed
wants to merge 1 commit into from

Conversation

amorehead
Copy link
Contributor

  • Removes an extra residual connection in DiffusionTransformer (not present in the paper), as below Algorithm 23 treats the output of AttentionPairBias as a separate term b_i (without a residual connection) which then forms a residual connection with the output of ConditionedTransitionBlock
    image

@lucidrains
Copy link
Owner

think this was concluded to be an error on deepminds part

#113

@amorehead
Copy link
Contributor Author

Yep, I agree. I'll close this PR for now

@amorehead amorehead closed this Oct 4, 2024
@amorehead amorehead deleted the patch-1 branch October 4, 2024 00:36
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants