OnnxDiffusersUI

I’ve been helping people setup Stable Diffusion and run it on their AMD graphics card (or CPU) on Windows. I’ve also wrote a basic UI for the diffusers library version to make it more user friendly. This guide is a consolidation of what I’ve learned and hopefully will help other people setup their PC to run Stable Diffusion too.

The intent of this UI is to get people started running Stable Diffusion on Windows. As such this UI won't be as feature rich as other UI, nor will it be as fast as running Stable Diffusion on Linux and ROCm.

Example screenshot:

Credits

A lot of this document is based on other guides. I've listed them below:

Prerequisites

You'll need to have a few things prepared first:

Install Python 3.10: https://www.python.org/downloads/release/python-3108/
- NOTE: don't install the version of Python from the Microsoft Store
- make sure that you enable "add Python to environment variables/PATH" during the install
Install Git for Windows: https://git-scm.com/download/win
- just install with the default recommended settings
Have a huggingface.co account

NOTE: For Windows 10/11 you need to do an additional step. Go to Settings -> search for "Manage app execution aliases" -> disable the App Installer for "python.exe" and "python3.exe".

To check if they’re installed properly open up command prompt and run the following commands:

python --version
git --version
pip --version

There shouldn't be any "not recognized as an internal or external command" errors.

Creating a Workspace

Start by creating a folder somewhere to store your project. I named mine stable_diff.

Download the setup.bat file and save it into your stable_diff folder: https://raw.githubusercontent.com/azuritecoin/OnnxDiffusersUI/main/setup.bat
NOTE: make sure you save the file as a .bat file and not a .bat.txt file.

Open command prompt and navigate to your stable_diff folder. Once there run the setup.bat file:

cd <path to your stable_diff folder>
.\setup.bat

The setup batch file should create a virtual envrionment and install the Python packages. It will also download Python scripts from online repositories.

Activate the virtual environment:
.\virtualenv\Scripts\activate.bat

At this point you should be in your virtual environment and your prompt should have a (virtualenv) at the begining of the line. To exit the virtual environment just run deactivate at any time.

To restart the virtual environment after closing the command prompt window, cd back into the stable_diff folder and run the .\virtualenv\Scripts\activate.bat batch file again.

Logging in Using Your Token

Login to huggingface:
huggingface-cli.exe login

Go to the tokens page of the huggingface website and copy your token.

Go back to the command prompt window. Make sure there's no characters selected (see below). Press Esc to de-select.

Right click the Title Bar -> Edit -> Paste -> Press Enter. You should be logged in at this point.

NOTE: when pasting, the command prompt will look like nothing has happened. This is normal behaviour, just press enter and it should update.

Download Model and Convert to ONNX

Go to https://huggingface.co/runwayml/stable-diffusion-v1-5 and accept the terms and conditions for the model.

Option 1

Run the Python script to download and convert:
python convert_stable_diffusion_checkpoint_to_onnx.py --model_path="runwayml/stable-diffusion-v1-5" --output_path="model/stable_diffusion_onnx"

NOTE: This may take a while depending on your internet connection speed.

Option 2

Althernatively, you could download the pre-converted version of the model using git:
git clone https://huggingface.co/runwayml/stable-diffusion-v1-5 --branch onnx --single-branch model/stable_diffusion_onnx

Basic Script and Setup Check

Run the txt2img_onnx.py script and check if any images were generated in the output folder. NOTE: some warnings may show up but it should be working as long as an output image is generated:
python txt2img_onnx.py

If an image was generated and it's not just a blank image then you're ready to generate art! You can use the txt2img_onnx.py script to input your own prompt for example:
python txt2img_onnx.py --prompt="tire swing hanging from a tree" --height=512 --width=512

Running The GUI

Run the UI script and wait for everything to load:
python onnxUI.py

Once you see "Running on local URL:" open up your browser and go to "http://127.0.0.1:7860". You should be able to generate images using the web UI. To close the program, go back to the command prompt and hit ctrl-C.

Using Other Models

Models Using Diffuers

If the model is on the hugging face website and it's using the diffusers library, then you can use the same convert script from the guide. Make sure you've accepted the terms for any new model. In this example I'll use waifu-diffusion.
python convert_stable_diffusion_checkpoint_to_onnx.py --model_path="hakurei/waifu-diffusion" --output_path="model/waifu_diffusion_onnx"

Models Using `.ckpt`

If the pretrained model is a .ckpt file, then you'll need to do a two step conversion. You first will need to convert from .ckpt to diffusers, then from diffusers to ONNX.

Download the .ckpt model of your choice and put them in your stable_diff folder. Run the first conversion script, using trinart2_step115000.ckpt in this example:
python convert_original_stable_diffusion_to_diffusers.py --checkpoint_path="./trinart2_step115000.ckpt" --dump_path="./trinart2_step115000_diffusers"
Then run the second conversion script:
python convert_stable_diffusion_checkpoint_to_onnx.py --model_path="./trinart2_step115000_diffusers" --output_path="model/trinart2_step115000_onnx"
NOTE: make sure the --dump_path in the first script and the --model_path is the same folder name.

After Conversion

Once you have your newly converted model, make sure the model is saved into the model folder (create one if it doesn't exist). Once you restarted the UI, select the model from the dropdown menu on the top left.

Reduce VRAM use by having Text Encoder on CPU

The Text Encoder runs only once per generation and is not very compute intensive, but the model uses VRAM resulting in reduced available VRAM. VRAM memory pressure will reduce the speed of every iteration. It can be beneficial to run the Text Encoder on CPU instead.
You can make the UI load the Text Encoder on CPU by adding --cpu-textenc flag:
python onnxUI.py --cpu-textenc

De-allocate memory after each image generation

You can de-allocate memory after each image generation in case you want to do that. Might be useful for very low VRAM.

You can de-allocate memory after each generation by adding --release-memory-after-generation flag: python onnxUI.py --release-memory-after-generation

De-allocate memory when changing pipelines

You can de-allocate memory when swapping pipelines (txt2img, img2img, inpaint). With low VRAM sometimes you may want to do this to prevent slowdowns or OOM errors that may occur from having multiple loaded at once. With more than 8GB of VRAM this is possibly not needed.

You can de-allocate memory when swapping pipelines by adding --release-memory-on-change flag: python onnxUI.py --release-memory-on-change

Running Stable Diffusion on CPUs

If you don't have a graphics card with enough VRAM or you only have onboard graphics, you can still run Stable Diffusion with the CPU. Simply add a --cpu-only flag to your command line:
python txt2img_onnx.py --cpu-only

Updating onnxUI.py

If you want to update the program, download the latest setup.bat and overwrite the old one: https://raw.githubusercontent.com/azuritecoin/OnnxDiffusersUI/main/setup.bat

Then run the following:
.\setup.bat -update

NOTE: if you're updating from diffusers v0.5.1 and below, you will need to re-convert your models.

Inpainting Fix

If inpainting does not work for you, please follow these steps from de_inferno#6407 on discord to fix it.
- Within: virtualenv\lib\site-packages\diffusers\pipelines\stable_diffusion\pipeline_onnx_stable_diffusion_inpaint_legacy.py
- Find (likely on line 402): sample=latent_model_input, timestep=np.array([t]), encoder_hidden_states=prompt_embeds
- Replace with: sample=latent_model_input, timestep=np.array([t], dtype="float32"), encoder_hidden_states=prompt_embeds

Name		Name	Last commit message	Last commit date
Latest commit History 189 Commits
images		images
modules		modules
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENCE.txt		LICENCE.txt
README.md		README.md
lpw_pipe.py		lpw_pipe.py
onnxUI.py		onnxUI.py
requirements.txt		requirements.txt
setup.bat		setup.bat
txt2img_onnx.py		txt2img_onnx.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

OnnxDiffusersUI

Credits

Prerequisites

Creating a Workspace

Logging in Using Your Token

Download Model and Convert to ONNX

Option 1

Option 2

Basic Script and Setup Check

Running The GUI

Using Other Models

Models Using Diffuers

Models Using `.ckpt`

After Conversion

Reduce VRAM use by having Text Encoder on CPU

De-allocate memory after each image generation

De-allocate memory when changing pipelines

Running Stable Diffusion on CPUs

Updating onnxUI.py

Inpainting Fix

About

Releases

Packages

Languages

License

Amblyopius/OnnxDiffusersUI

Folders and files

Latest commit

History

Repository files navigation

OnnxDiffusersUI

Credits

Prerequisites

Creating a Workspace

Logging in Using Your Token

Download Model and Convert to ONNX

Option 1

Option 2

Basic Script and Setup Check

Running The GUI

Using Other Models

Models Using Diffuers

Models Using .ckpt

After Conversion

Reduce VRAM use by having Text Encoder on CPU

De-allocate memory after each image generation

De-allocate memory when changing pipelines

Running Stable Diffusion on CPUs

Updating onnxUI.py

Inpainting Fix

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Models Using `.ckpt`

Packages