Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Notable gap between image stylization and video stylization based on animatediff and controlnet #23

Open
mengfanShi opened this issue Sep 5, 2024 · 2 comments

Comments

@mengfanShi
Copy link

Hi, thanks for your great work! I've been exploring video stylization with animatediff yet and I noticed you might have already tried these out. I've found that the results of generated video significantly differ from image-to-image stylization, the image stylization is clear and aligns well with the style model, but with adv3, there's a notable gap. Have you ever experienced this? Additionally, when using adv3, the output becomes less smooth with more than 16 frames, resulting in flickering. Could I try your trained 48-frame model?

Thanks a lot!

@tumurzakov
Copy link
Owner

tumurzakov commented Sep 5, 2024

I'm using adv3 too, it is much more expressive then mine. 48 frame model gives more stability and removes flickering. I trained my models on 4000+ samples for 180 000 steps and my model lost possibility to imagine something.

I found out that using lora is brings quite good result. I'm training lora on all attention layers (unet and mm) on one video for 3 frames on resolution 1024x576 with adding info about frame position into embeddings for 100 epochs (for example video contain 30 frames, so i train for 1000 steps). Then i'm using it with weight something near 0.3 and it gives good result on stylisation task and removes flickering as well.

@mengfanShi
Copy link
Author

Thanks a lot for your suggestion !
I'll try to train a lora on all attentions layers.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants