Question about difference of training SDXL between kohya and diffusers #2534
SeungHwa92
started this conversation in
General
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I am working on training SDXL Lora so I review the kohya code and diffusers code.
I was checking whether kohya and diffusers have same output of text_encoder_2(hidden_states2, pool2).
hidden_state2 values are same but pool2 have different values.
I found that the way of calculating pool2 are different.
the code for calculating pool2 from kohya and diffusers belows.
Difference start from this.
kohya uses "text_encoder_2_output['last_hidden_state']" or "text_encoder_2_output[1]" for pool2
but diffusers uses "text_encoder_2_output['text_embeds']" or "text_encoder_2_output[0]" for pool2
if you have any insights for implementing this code.
Can you share your insights ?
Beta Was this translation helpful? Give feedback.
All reactions