Train Code for VAE Used in Paper #19

Open
Ferry1231 opened this issue Aug 14, 2024 · 19 comments

@Ferry1231

Dear researchers, I have been reading your team's paper, found it incredibly insightful, and was inspired to attempt a reproduction of the work. Given the limited resources in my lab, and from a learning perspective, I plan to start by training the model on smaller datasets such as CIFAR-10. However, I've run into some difficulties with the VAE encoder and couldn't find a VAE model that fits it well.

Could you share the training code for the VAE used in the paper? Also, what does the "vae_stride" parameter mean?

Thank you for your work.

@LTH14
Owner

LTH14 commented Aug 14, 2024

Thanks for your interest! We follow the VAE training from VQGAN and LDM. Please use this codebase and follow this config.
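For reference, the LDM-style KL autoencoder is trained with a pixel reconstruction loss plus a very small KL regularization toward a standard normal prior (the full recipe also adds perceptual LPIPS and adversarial terms). Below is a minimal sketch of that core objective, assuming an LDM-style AutoencoderKL whose encode() returns a DiagonalGaussianDistribution; it is an illustration, not the authors' training code. On the "vae_stride" question: the f in config names like kl-f16 is the VAE's spatial downsampling factor, and vae_stride presumably denotes the same factor (e.g. stride 16 maps a 256×256 image to a 16×16 latent grid).

import torch.nn.functional as F

def kl_autoencoder_loss(x, vae, kl_weight=1e-6):
    # x: image batch; vae: LDM-style AutoencoderKL. The LPIPS and
    # discriminator terms used in the real training are omitted for brevity.
    posterior = vae.encode(x)              # diagonal Gaussian over latents
    z = posterior.sample()                 # reparameterized latent sample
    x_rec = vae.decode(z)                  # reconstructed images
    rec_loss = F.l1_loss(x_rec, x)         # pixel-space L1 reconstruction
    kl_loss = posterior.kl().mean()        # KL to a standard normal prior
    return rec_loss + kl_weight * kl_loss  # KL weight is tiny in LDM configs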

@LTH14
Owner

LTH14 commented Aug 14, 2024

You need to copy the AutoencoderKL class to this file in the VQGAN codebase.

@Ferry1231
Author

Thank you! I got it.

@gzhuinjune

Hello! Can the VQGAN I trained earlier for RCG be used directly here, i.e. is vae_ckpt simply the VQGAN checkpoint? Did you change any other details? I still want to switch to a floor-plan dataset, and I'm worried about mismatches, so I'd appreciate knowing every change you made: how should the RGB range and the data augmentation be set, and are there any other modifications relative to the official code? Thank you for your patience!

@gzhuinjune

Unfortunately, I never managed to get ideal results with RCG. This framework seems to have far fewer components. Thanks for your help!

@LTH14
Owner

LTH14 commented Aug 27, 2024

@gzhuinjune A major difference here is that the VAE in this paper does not rely on the "quantization" step of VQGAN. Of course, this framework can also use a VQ-based tokenizer, but a non-VQ tokenizer should work better. You can start with a commonly used non-VQ tokenizer like the one below:

from diffusers.models import AutoencoderKL
vae = AutoencoderKL.from_pretrained("stabilityai/sd-vae-ft-ema")
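For context, a hedged sketch of how such a tokenizer is typically used (the diffusers calls are the library's real API; the 0.18215 scaling factor is the standard one for the SD VAE, and the shapes assume 256×256 inputs):

import torch
from diffusers.models import AutoencoderKL

vae = AutoencoderKL.from_pretrained("stabilityai/sd-vae-ft-ema")
x = torch.randn(1, 3, 256, 256)                       # image batch in [-1, 1]
with torch.no_grad():
    z = vae.encode(x).latent_dist.sample() * 0.18215  # (1, 4, 32, 32) latents
    x_rec = vae.decode(z / 0.18215).sample            # back to (1, 3, 256, 256)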

@LTH14
Owner

LTH14 commented Aug 28, 2024

@gzhuinjune You can't use that one, because it was trained on ImageNet. Use the Stable Diffusion VAE I mentioned above: it was trained on OpenImages and generalizes much better. Try its reconstruction quality on your dataset first. Of course, if the performance is not good, you will still need to train a VAE on your own dataset.
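One quick way to run that reconstruction check, as a hedged sketch (load_your_image is a hypothetical placeholder for your own data loading; everything else uses the real diffusers API):

import torch
from diffusers.models import AutoencoderKL

vae = AutoencoderKL.from_pretrained("stabilityai/sd-vae-ft-ema").eval()
x = load_your_image()                 # hypothetical: (1, 3, H, W) in [-1, 1]
with torch.no_grad():
    x_rec = vae.decode(vae.encode(x).latent_dist.mode()).sample
mse = torch.mean((x - x_rec) ** 2)
psnr = 10 * torch.log10(4.0 / mse)    # peak-to-peak range of [-1, 1] is 2
print(f"reconstruction PSNR: {psnr:.2f} dB")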

@gzhuinjune

You referenced vqgan.py above; where is that actually used? I assume AutoencoderKL is not supposed to go in there? Sorry, I still don't understand which code in the VQGAN repo I should use for training.
[image]
And is the SD VAE you mentioned the one shown above?

@gzhuinjune

Which file should vae = AutoencoderKL.from_pretrained("stabilityai/sd-vae-ft-ema") be written in? Is it main.py? And then I copy the class into vqgan and train directly with the custom-dataset instructions from the official VQGAN repo, right?
[image]

@gzhuinjune

[image]
Are these the OpenImages pretrained weights?

@gzhuinjune

You said to copy the AutoencoderKL class into vqgan, so why is the import written as from diffusers.models import AutoencoderKL? Thanks!

@LTH14
Owner

LTH14 commented Aug 28, 2024

from diffusers.models import AutoencoderKL lets you directly use the VAE already trained for Stable Diffusion. But if you need to train your own, you have to copy AutoencoderKL (https://github.com/CompVis/latent-diffusion/blob/main/ldm/models/autoencoder.py#L285) into the VQGAN codebase and train it there.

@gzhuinjune

Hello, which class in vqgan should the copied AutoencoderKL replace? If I just paste it in, it won't be called anywhere, right? Should I still use the main.py training script example for a custom dataset, and how do I invoke the new class from main.py? Thank you for patiently answering a beginner.

@LTH14
Owner

LTH14 commented Aug 28, 2024

Copy it into taming/models/vqgan.py, then use this config. You need to change the ldm paths in that config to the corresponding paths in the vqgan codebase.
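To illustrate the path change, a hedged YAML fragment in the style of the LDM KL autoencoder configs (the field values are illustrative, not the exact config referenced above):

model:
  # was: target: ldm.models.autoencoder.AutoencoderKL
  target: taming.models.vqgan.AutoencoderKL  # points at the copied class
  params:
    embed_dim: 16  # illustrative value
    ...
# Any other ldm.* module paths in the config (e.g. the loss target) need the
# same ldm.* -> taming.* substitution, and the copied class's own ldm imports
# must be resolved inside the taming codebase as well.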

@gzhuinjune

Thank you so much, and I wish you all the best!

@gzhuinjune

I understand now.

@fengyang0317

Is there any reason for using taming-transformers instead of the latent-diffusion or stable-diffusion codebases?

@LTH14
Owner

LTH14 commented Oct 22, 2024

@fengyang0317 not really -- I just chose one.

@fengyang0317

fengyang0317 commented Oct 23, 2024

I see. Thank you so much.
