Add Timestep Sampling Function from SD3 Branch to SD #1668

gesen2egee · 2024-10-04T07:46:48Z

This PR introduces the timestep_sampling feature from SD3 Branch into the original SD model. The new timestep sampling options offer a more concentrated probability distribution compared to the default uniform sampling, which helps the model focus on specific aspects of learning. The new options can be used via the --timestep_sampling argument.

Default (uniform): Keeps the original uniform timestep sampling, which evenly distributes the learning steps.
Sigmoid: Behaves similarly to shift by default --discrete_flow_shift = 1.
Shift: Skews the timestep distribution, helping the model concentrate on style learning or specific objects.
Flux Shift: This option shift with the image size, similar to how FLUX behaves in larger images

Additional Parameters:

--discrete_flow_shift: By default set to 1, uses random normal distribution to sample timesteps. A rightward shift helps the model focus more on style, while a leftward shift aids in learning specific objects. The default value is 1, providing a balanced form.
--sigmoid_scale: Adjusts the shape of the sigmoid function.

While flux_shift maintains the distortion effects of FLUX, its application may vary in SD due to differences in model nature and training at fixed resolutions.

It is recommended to use the --timestep_sampling sigmoid option, combined with --soft_min_snr_gamma = 1
By rockerBOO #1068
#1068
for better results, as these settings seem to significantly improve model performance.

Suggested to merge this PR along with the soft min snr gamma PR.

Fused backward pass

Lora plus

Add "--disable_mmap_load_safetensors" parameter

Display name of error latent file

removed unnecessary `torch` import on line 115

Fix caption_separator missing in subset schema

Add caption_separator to output for subset

Accelerate: fix get_trainable_params in controlnet-llite training

Hyperparameter tracking

Make timesteps work in the standard way when Huber loss is used

…ategy

New optimizer:AdEMAMix8bit and PagedAdEMAMix8bit

…1647

…#1393

kohya-ss · 2024-10-04T11:40:48Z

It looks like this PR is trying to merge all the changes from the sd3 branch into main, please make any fixes within the sd3 branch.

gesen2egee · 2024-10-04T14:10:39Z

I got it, open #1671 and close this.

aria1th and others added 30 commits May 7, 2024 18:21

fix get_trainable_params in controlnet-llite training

793aeb9

chore: Refactor optimizer group

607e041

update readme

c1ba0b4

Merge branch 'dev' into fused-backward-pass

6dbc23c

update README for fused optimizer

f3d2cf2

update README for fused optimizer

bee8cee

Merge pull request kohya-ss#1319 from kohya-ss/fused-backward-pass

7983d3d

Fused backward pass

Merge branch 'dev' into lora-plus

e9f3a62

Merge branch 'dev' into lora-plus

e01e148

fix typo

1ffc0b3

Merge branch 'dev' into lora-plus

c6a4370

revert lora+ for lora_fa

3c8193f

update docs etc.

4419041

Merge pull request kohya-ss#1331 from kohya-ss/lora-plus

02298e3

Lora plus

Merge pull request kohya-ss#1266 from Zovjsra/feature/disable-mmap

8d1b1ac

Add "--disable_mmap_load_safetensors" parameter

update readme and help message etc.

9ddb4d7

Merge pull request kohya-ss#1278 from Cauldrath/catch_latent_error_file

7802093

Display name of error latent file

raise original error if error is occured in checking latents

3701507

update readme

39b82f2

Merge pull request kohya-ss#1291 from frodo821/patch-1

e96a521

removed unnecessary `torch` import on line 115

Merge pull request kohya-ss#1312 from rockerBOO/patch-2

1c296f7

Fix caption_separator missing in subset schema

Merge pull request kohya-ss#1313 from rockerBOO/patch-3

a384bf2

Add caption_separator to output for subset

fix create_network_from_weights doesn't work

16677da

update README

589c2aa

add prompt option '--f' for filename

153764a

support Diffusers' based SDXL LoRA key for inference

146edce

update README

2f19175

Merge pull request kohya-ss#1322 from aria1th/patch-1

0640f01

Accelerate: fix get_trainable_params in controlnet-llite training

update README and format code

e3ddd1f

Merge pull request kohya-ss#1285 from ccharest93/main

47187f7

Hyperparameter tracking

kohya-ss and others added 25 commits September 23, 2024 21:14

retain alpha in pil_resize backport kohya-ss#1619

29177d2

Merge branch 'dev' into sd3

fba7692

init

ab7b231

Merge pull request kohya-ss#1628 from recris/huber-timesteps

c1d16a7

Make timesteps work in the standard way when Huber loss is used

update README

e74f581

Merge branch 'dev' into sd3

65fb69f

delete code for cleaning

1beddd8

new block swap for FLUX.1 fine tuning

56a7bc1

fix typos

da94fd9

fix flip_aug, alpha_mask, random_crop issue in caching

bf91bea

Merge branch 'dev' into sd3

2cd6aa2

fix flip_aug, alpha_mask, random_crop issue in caching in caching str…

392e8de

…ategy

Merge pull request kohya-ss#1640 from sdbds/ademamix8bit

4296e28

New optimizer:AdEMAMix8bit and PagedAdEMAMix8bit

fix to work bitsandbytes optimizers with full path kohya-ss#1640

a94bc84

update readme

ce49ced

Merge branch 'dev' into sd3

3ebb65f

fix sample generation is not working in FLUX1 fine tuning kohya-ss#1647

a9aa526

add workaround for 'Some tensors share memory' error kohya-ss#1614

822fe57

re-fix sample generation is not working in FLUX1 split mode kohya-ss#…

1a0f5b0

…1647

adjust min/max bucket reso divisible by reso steps kohya-ss#1632

fe2aa32

update help text kohya-ss#1632

1567549

Merge branch 'dev' into sd3

d050638

fix to work linear/cosine scheduler closes kohya-ss#1651 ref kohya-ss…

012e7e6

…#1393

Merge branch 'dev' into sd3

8bea039

more_timestep_sampling

ece441e

gesen2egee closed this Oct 4, 2024

gesen2egee reopened this Oct 4, 2024

gesen2egee closed this Oct 4, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Timestep Sampling Function from SD3 Branch to SD #1668

Add Timestep Sampling Function from SD3 Branch to SD #1668

gesen2egee commented Oct 4, 2024

kohya-ss commented Oct 4, 2024

gesen2egee commented Oct 4, 2024

Add Timestep Sampling Function from SD3 Branch to SD #1668

Add Timestep Sampling Function from SD3 Branch to SD #1668

Conversation

gesen2egee commented Oct 4, 2024

Additional Parameters:

kohya-ss commented Oct 4, 2024

gesen2egee commented Oct 4, 2024