I am very interested in training a new ControlNet model. After studying the kandinsky-2-2-controlnet-depth model on Hugging Face, I found that its architecture seems to differ from the ControlNet used with traditional Stable Diffusion models.
As I understand it, the UNet in kandinsky-2-2-controlnet-depth is a modified version of the UNet in kandinsky-2-2-decoder: the "in_channels" parameter of conv_in has been changed to 8, and an additional module called "input_hint_block" has been added.
The weights and biases also differ completely from those of the kandinsky-2-2-decoder UNet.
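For reference, here is roughly how I compared the two configurations (a minimal sketch, assuming both checkpoints load as a diffusers UNet2DConditionModel from the kandinsky-community repos; the config values in the comments are what I expect to see):

```python
from diffusers import UNet2DConditionModel

# Load both UNets from the Hugging Face Hub.
decoder_unet = UNet2DConditionModel.from_pretrained(
    "kandinsky-community/kandinsky-2-2-decoder", subfolder="unet"
)
controlnet_unet = UNet2DConditionModel.from_pretrained(
    "kandinsky-community/kandinsky-2-2-controlnet-depth", subfolder="unet"
)

print(decoder_unet.config.in_channels)     # 4
print(controlnet_unet.config.in_channels)  # 8

# The extra hint pathway appears to come from the addition embedding:
# "image_hint" selects an ImageHintTimeEmbedding, whose input_hint_block
# downsamples the conditioning image to the latent resolution before it
# is concatenated with the noisy latents (hence in_channels = 8).
print(decoder_unet.config.addition_embed_type)     # "image"
print(controlnet_unet.config.addition_embed_type)  # "image_hint"
```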
My training approach is as follows: first, download the UNets from both kandinsky-2-2-controlnet-depth and kandinsky-2-2-decoder; then copy the parameters of the kandinsky-2-2-decoder UNet into the corresponding parameters of the kandinsky-2-2-controlnet-depth UNet, except for the parts whose structure differs.
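Concretely, I was planning to do the copying roughly like this (a sketch only; parameters whose names or shapes differ, i.e. conv_in and the input_hint_block, keep their original controlnet-depth values):

```python
# Continuing from the snippet above:
src_sd = decoder_unet.state_dict()
dst_sd = controlnet_unet.state_dict()

copied, skipped = 0, []
for name, tensor in src_sd.items():
    # Copy only parameters that exist in both UNets with identical shapes;
    # conv_in (8 vs. 4 input channels) and the hint embedding are skipped.
    if name in dst_sd and dst_sd[name].shape == tensor.shape:
        dst_sd[name] = tensor.clone()
        copied += 1
    else:
        skipped.append(name)

# Optionally, the first four input channels of conv_in could still be
# initialized from the decoder weights:
# dst_sd["conv_in.weight"][:, :4] = src_sd["conv_in.weight"]

controlnet_unet.load_state_dict(dst_sd)
print(f"copied {copied} tensors, skipped {len(skipped)}: {skipped[:3]} ...")
```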
Afterward, train this new UNet on the fill50k dataset.
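For the training loop itself, I imagine a single step would look roughly like this (again only a sketch: `latents`, `image_embeds`, and `hint` are placeholders that would come from the decoder's MoVQ encoder, the prior's CLIP image encoder, and the fill50k conditioning image, respectively):

```python
import torch
import torch.nn.functional as F
from diffusers import DDPMScheduler

scheduler = DDPMScheduler.from_pretrained(
    "kandinsky-community/kandinsky-2-2-decoder", subfolder="scheduler"
)

def training_step(unet, latents, image_embeds, hint):
    noise = torch.randn_like(latents)
    timesteps = torch.randint(
        0, scheduler.config.num_train_timesteps,
        (latents.shape[0],), device=latents.device,
    )
    noisy_latents = scheduler.add_noise(latents, noise, timesteps)
    # Kandinsky 2.2 routes both the image embedding and the hint through
    # added_cond_kwargs; encoder_hidden_states is not used directly.
    pred = unet(
        noisy_latents,
        timesteps,
        encoder_hidden_states=None,
        added_cond_kwargs={"image_embeds": image_embeds, "hint": hint},
    ).sample
    # The decoder UNet outputs noise plus a learned variance; only the
    # noise half enters the loss (assuming epsilon prediction).
    pred, _ = pred.split(latents.shape[1], dim=1)
    return F.mse_loss(pred.float(), noise.float())
```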
Are there any issues with this approach? I would greatly appreciate any help or suggestions.
Additionally, I was unable to find training code specifically for kandinsky-2-2-controlnet-depth. I would greatly appreciate a pointer to where it can be found, if it exists.