This trainer was developed by the Eden team; you can try our hosted version of it in our app. It is a highly optimized trainer that can be used both for full finetuning and for training LoRA modules on top of Stable Diffusion. It uses a single training script and loss module that works for both SDv15 and SDXL!
The outputs of this trainer are fully compatible with ComfyUI and AUTOMATIC1111; see the documentation here. A full guide on training can be found in our docs.
[Example images generated with a trained LoRA]

This trainer can be run:
- as a hosted service on our website
- as a hosted service through Replicate
- as a ComfyUI node
- as a standalone Python script
- Example workflows for running the trainer and doing inference with it can be found in `/ComfyUI_workflows`.
- Importantly, this trainer uses a ChatGPT call to clean up the auto-generated prompts and inject the trainable token. This will only work if you have a `.env` file in the root of the repo dir that contains a single line:

```
OPENAI_API_KEY=your_key_string
```
Everything will work without this, but results will be better if you set this up, especially for 'face' and 'object' modes.
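For reference, here is a minimal sketch of how such a key is typically picked up, assuming `python-dotenv`; the trainer's actual loading code may differ:

```python
import os

from dotenv import load_dotenv

# Read the .env file from the repo root into the process environment
# (a sketch of the usual python-dotenv pattern, not the trainer's exact code).
load_dotenv()

api_key = os.environ.get("OPENAI_API_KEY")
if api_key is None:
    # Without a key, the ChatGPT prompt-cleanup step cannot run.
    print("No OPENAI_API_KEY found; using the raw auto-generated prompts.")
```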
The trainer supports three training modes (see the config sketch after this list):
- style: used for learning the aesthetic style of a collection of images.
- face: used for learning a specific face (can be human, character, ...).
- object: will learn a specific object or thing featured in the training images.
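The mode is selected in the training config. Below is a minimal sketch; the key name `concept_mode` and the exact value strings are assumptions for illustration, so check `train_configs/training_args.json` for the real schema:

```python
import json

with open("train_configs/training_args.json") as f:
    args = json.load(f)

# "concept_mode" is a hypothetical key name used here for illustration only;
# the shipped config may spell it differently.
args["concept_mode"] = "face"  # or "style" / "object"

# Write the tweaked copy out as a new config for this run.
with open("train_configs/my_face_run.json", "w") as f:
    json.dump(args, f, indent=2)
```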
Install all dependencies using

```bash
pip install -r requirements.txt
```

then you can simply run

```bash
python main.py train_configs/training_args.json
```

to start a training job. Adjust the arguments inside `training_args.json` to set up a custom training job.
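If you want to keep the shipped defaults intact, one simple workflow (an assumed convention, not something the repo mandates) is to copy the config, inspect which arguments it exposes, and point `main.py` at your copy:

```python
import json
import shutil

# Copy the shipped defaults so the original file stays untouched.
shutil.copy("train_configs/training_args.json", "train_configs/my_run.json")

# List the available argument names before editing the copy by hand.
with open("train_configs/my_run.json") as f:
    args = json.load(f)
print(sorted(args.keys()))

# After editing, launch the custom job with:
#   python main.py train_configs/my_run.json
```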
You can also run this through Replicate using cog (which packages the trainer as a Docker image):
- Install Replicate's `cog`:

```bash
sudo curl -o /usr/local/bin/cog -L "https://github.com/replicate/cog/releases/latest/download/cog_$(uname -s)_$(uname -m)"
sudo chmod +x /usr/local/bin/cog
```
- Build the image with

```bash
cog build
```

- Run a test training job with

```bash
sh cog_test_train.sh
```

- You can also go into the container with

```bash
cog run /bin/bash
```
When running this trainer in native Python, you can also perform full UNet finetuning using something like (adjust to your needs):

```bash
python main.py train_configs/full_finetuning_example.json
```
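To see what the full-finetuning setup changes relative to the default LoRA config, you could diff the two shipped JSON files; the sketch below assumes both are flat JSON objects:

```python
import json

with open("train_configs/training_args.json") as f:
    lora_args = json.load(f)
with open("train_configs/full_finetuning_example.json") as f:
    ft_args = json.load(f)

# Print each top-level key whose value differs between the two configs.
for key in sorted(set(lora_args) | set(ft_args)):
    if lora_args.get(key) != ft_args.get(key):
        print(f"{key}: {lora_args.get(key)!r} -> {ft_args.get(key)!r}")
```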
Bugs:
- Pure textual inversion for SD15 does not seem to work well (although it works amazingly well for SDXL); if anyone can figure this one out, I'd be forever grateful!
- Figure out why training is ~3x slower through the ComfyUI node than when running main.py directly as a Python job.
- Fix aspect_ratio bucketing in the dataloader (see https://github.com/kohya-ss/sd-scripts)
Bigger improvements:
- Integrate Flux / SD3
- Add multi-concept training (multiple things represented by multiple tokens, trained into a single LoRA)
- Add stronger token regularization (e.g., the CelebBasis spanning basis)
- Implement Perfusion ideas (key locking with a superclass): https://research.nvidia.com/labs/par/Perfusion/
- Implement prompt-aligned personalization: https://prompt-aligned.github.io/