Hello, I noticed that you're able to train on more than 300 frames using an A100 GPU. I'm curious about your training process - are you only training the to_q or the entire motion module?
I've been using the official AnimateDiff training script, and training on just 32 frames consumes about 30GB of VRAM. I'm wondering if you've implemented any optimizations to improve efficiency. It would be helpful if you could share some details about your training setup and any techniques you're using. Thanks!
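For concreteness, here is a minimal sketch of what I mean by "only training `to_q`" versus the whole motion module, assuming a diffusers-style UNet where motion-module parameters carry `motion_modules` / `to_q` in their names (the naming is an assumption, not something from the official script):

```python
def select_trainable_params(unet, train_only_to_q=True):
    """Freeze everything except the motion module (optionally only its to_q projections)."""
    trainable = []
    for name, param in unet.named_parameters():
        in_motion_module = "motion_modules" in name
        if in_motion_module and (not train_only_to_q or "to_q" in name):
            param.requires_grad = True
            trainable.append(param)
        else:
            param.requires_grad = False
    return trainable
```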
Right now I'm training a LoRA at 1024x576x3 and it takes 23.8 GB on my 3090.
memory-offload everything that isn't needed for training (VAE, text encoder)
precache samples (encode latents and embeddings into .pth files)
keep an eye on gradients
A rough sketch of the first two points is below.
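This is only an illustration of the offload + precache idea; the names (`vae`, `text_encoder`, `tokenizer`, `dataset`, the cache path) are placeholders and not latentflow's actual API:

```python
import os
import torch

@torch.no_grad()
def precache(dataset, vae, text_encoder, tokenizer, device="cuda"):
    """Encode latents and text embeddings once, save them to .pth files,
    then offload the VAE and text encoder so training never keeps them on GPU."""
    os.makedirs("cache", exist_ok=True)
    vae.to(device)
    text_encoder.to(device)
    for i, sample in enumerate(dataset):
        pixels = sample["pixel_values"].to(device)              # (frames, c, h, w)
        latents = vae.encode(pixels).latent_dist.sample() * 0.18215
        ids = tokenizer(sample["caption"], padding="max_length",
                        truncation=True, return_tensors="pt").input_ids.to(device)
        embeds = text_encoder(ids)[0]
        torch.save({"latents": latents.cpu(), "embeds": embeds.cpu()},
                   f"cache/sample_{i}.pth")
    # The training loop only reads the cached tensors, so these can leave the GPU.
    vae.to("cpu")
    text_encoder.to("cpu")
    torch.cuda.empty_cache()
```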
I'm using my own framework, latentflow: https://github.com/tumurzakov/latentflow. It can be hard to understand and use, but you could take a look at the training code. Maybe it will be useful for you.
Thanks for the suggestions! I'll give them a try. I've noticed that the official AnimateDiff code doesn't enable gradient checkpointing by default, and it can save a lot of GPU memory.
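Turning it on is a one-liner with diffusers (trading extra compute for much lower activation memory); here `unet` is assumed to be the model loaded by the training script:

```python
# Recompute activations during backward instead of storing them.
unet.enable_gradient_checkpointing()
# If the text encoder were also being trained, transformers exposes the analogous switch:
# text_encoder.gradient_checkpointing_enable()
```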