Stable Diffusion API Wrapper

Fast API Endpoint that supports LoRA integration with local Image Generation!

Dependencies

NVIDIA GPU with at least 12GB of VRAM (with a batch size of 1)
NVIDIA drivers installed + NVIDIA Container Toolkit installed

How to use

Starting the Software

Docker

As of right now, I do not have this working dockerized. That may come in the future.

Bare Metal

Setup

First thing we need to do is assign the following ENV variables:

BASE_DIRECTORY: This is the base directory of the repository.
- This is used when finding LoRAs and PipelineFiles
SELECTED_MODEL: This is the default Model to use if not provided.
SELECTED_VIDEO_MODEL: This WILL be used for IMG2Video. The code is here and working - I just found it was not a very good experience for my usecase', but can be used with the code in the deprectated API V1 file

Adding Models

The purpose of this repository was to learn how I can create image generation pipelines, without the need for external network connectivity.
I really didn't want to spend money and wanted to do it all on my own¯\_( ͡° ͜ʖ ͡°)_/¯

To add a model, we do one of the following:

Via API (Easiest)

You will need network connectivity to download as the model is pulled from HuggingFace Registry.

We can make a PUT request to /download/ containing the name of the Model.

{
"model_name": "stabilityai/stable-video-diffusion-img2vid"
}

This will download the model into the MODEL_DIRECTORY folder.
A successful message looks like:

{
  "Status": "Downloaded"
}

You can now see this model in the /models/ api call.

Manually Offline

Use this method if you want to install a predownloaded .safetensors file.

Download the local .safetensors file to the MODEL_DIRECTORY
Make a POST request to /export/safetensor, containing the model name (with or without .safetensors).

{
"safetensor_name": "my_local_safetensor.safetensors"
}

The Pipeline will not save this safetensor as a model folder, for local use. It will also delete the safetensors file.

Starting API

To start the API, we use fastapi run or fastapi dev.
I found fastapi run works better, as dev seems to freeze sometimes.

FAQ

What is `base_lora` and `contextual_lora`?

These are fields used to help generate images using multiple LoRAs.
An Example of a base_lora may be a LoRA used as a style.

i.e. A LoRA used for creating images in specific art styles.

The contextual_lora would be something that would modify something in the picture.
An example of this could be a LoRA for fixing a finger count, or better image-quality.

Why is negative prompt prefilled?

This is a canned negative prompt to help better generate pictures, feel free to remove and change for your use cases!

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
api_packages		api_packages
factories		factories
helpers		helpers
models		models
routers		routers
routes		routes
views		views
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
README.md		README.md
dep_ai_img_to_img.py		dep_ai_img_to_img.py
image_ai.py		image_ai.py
main.py		main.py
requirements.txt		requirements.txt
settings.py		settings.py
video_ai.py		video_ai.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Stable Diffusion API Wrapper

Dependencies

How to use

Starting the Software

Docker

Bare Metal

Setup

Adding Models

Via API (Easiest)

Manually Offline

Starting API

FAQ

What is `base_lora` and `contextual_lora`?

Why is negative prompt prefilled?

Happy Hacking!

About

Releases

Packages

Languages

AlexScotland/StableDiffusionAPI

Folders and files

Latest commit

History

Repository files navigation

Stable Diffusion API Wrapper

Dependencies

How to use

Starting the Software

Docker

Bare Metal

Setup

Adding Models

Via API (Easiest)

Manually Offline

Starting API

FAQ

What is base_lora and contextual_lora?

Why is negative prompt prefilled?

Happy Hacking!

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

What is `base_lora` and `contextual_lora`?

Packages