Skip to content

Latest commit

 

History

History
141 lines (94 loc) · 7.27 KB

README.md

File metadata and controls

141 lines (94 loc) · 7.27 KB

Image Artisan XL

Image Artisan XL is the ultimate desktop application for creating amazing images with the power of artificial intelligence.

With Image Artisan XL, you can unleash your creativity and generate new images from scratch using simple text prompts. Whether you want to create realistic landscapes, fantasy creatures, abstract art, or anything else you can imagine, Image Artisan XL has it all. Image Artisan XL is powered by Stable Diffusion XL, the best open source image model developed by Stability AI.

No matter what you want to create, Image Artisan XL and Stable Diffusion XL will make it happen. Don’t miss this opportunity to get Image Artisan XL today and discover the endless possibilities of AI image creation.

Motivation

I wanted to make a software for Stable Diffusion that doesn’t need web APIs that make you run a separate service and open a web browser, which can be a hassle sometimes.

This software is for the people who like to use desktop apps for editing images and videos, not for the ones who want to automate everything or follow the latest fads in Stable Diffusion. If that’s what you’re looking for, there are other projects that can do that better for you.

Another cool thing about this software is that it has a simple installation process, unlike most other solutions. You just need to download the installer and follow the instructions, and you’re good to go. No need to worry about dependencies, compatibility issues, or complex configurations (only Windows for now).

Architecture

Image Artisan XL is a desktop application that uses PyQt6 as its graphical user interface. It mostly relies on the Diffusers library for generating images, using node graphs under the hood that are designed to meet the requirements of multithreading and real-time user interface updates.

All the models used in Image Artisan XL are loaded in half-point precision (FP16) whenever possible. This allows for faster inference and memory savings, since using them at full precision makes no difference to most people. When using Stable Diffusion XL models, it is possible to use under 10GB of Video RAM, although there may be some specific cases with VAE decoding where it goes over this limit.

It is highly recommended to use the included VAE with the FP16 fix, since the VAE in the base model and most of the shared ones need to be used in full precision to avoid generating black images.

Features

  • Run Stable Diffusion XL models to generate images.
  • User-friendly interface for easy image generation.
  • Will completely run offline after the first installation.
  • Powerfull features only avalaible as a desktop application.
  • Easy sharing of models and Lora's metadata since the information its stored in each model, including sample image, sample generation, triggers and tags for filtering.
  • Latent Consistency Models (LCM) and LoRAs for fast inference.
  • Segmind Stable Diffusion (SSD-1B and Vega) models for VRAM savings.
  • SDXL Turbo supported.
  • ControlNet.
  • T2I Adapters
  • Multiple IP Adapters with multiple images.
  • Dataset creation and management.
  • Dreambooth LoRA Training (more to come).
  • Resume training.
  • Image bucketing for training.

Limitations

  • Only runs with Stable Diffusion XL models.
  • It has the default CLIP 75 token limitation for the prompts.

You can read why here.

Hardware requirements

  • NVidia GPU with support for Cuda 11.8.
  • 16 GB RAM for a 12 VRAM or more GPU.
  • 24 GB RAM for lower than 12 VRAM cards (cpu offloading).
  • 4 GB Minimum VRAM with sequential cpu offloading (takes more than 4x times to generate though).

Planned features

  • Image to image.
  • Inpainting.
  • Outpainting.
  • Upscaling.
  • Fine tune your own model.
  • Train dreambooth LoRAs (Kohya)
  • Train LoRAs (diffusers and Kohya).
  • Nodes UI (under the hood already uses them).

Installation

Windows installer

Please download the most recent installer from the releases section. This version is exclusively for x64 and, while it has only been tested on Windows 11, it should also be compatible with Windows 10.

The final installed application without models will take around 6 GB of space, it will be installed under a directory named ZCode/ImageArtisanXL and you can uninstall later from the Windows installed apps menu. It will create a shortcut icon on your desktop named "Image Artisan XL".

The application will have a self contained version of python and environment so you don't have to worry about installing python or having conflicts with other installations or packages.

Once installed, on the first run you will be asked to choose the directories for the models, you can use the defaults or point them to a place you already have with models. Later you can change them on the fly if you want to use separate model directories.

Documentation

Coming soon...

Manual install

Linux

python -m venv .venv --prompt ImageArtisanXL
source .venv/bin/activate
pip install torch torchvision --index-url https://download.pytorch.org/whl/cu121
pip install .

Then run with the command python -m iartisan.

Windows

python -m venv .venv --prompt ImageArtisanXL
.\.venv\Scripts\activate
pip install torch torchvision --index-url https://download.pytorch.org/whl/cu121
pip install .

Then run with the command python -m iartisanxl.

Developer

Use this instead pip install -e .[dev] to install.

License

Image Artisan XL is licensed under the MIT License. You can find the full text of the license here.

Acknowledgements

Special thanks to:

HarroweD for his LoRA Harrlogos XL which was used to make the splashscreen.
artificialguybr for his LoRA Icons.Redmond which was used to make the app icon.

The creators of InvokeAI, ComfyUI, Stable Diffusion web UI and sd-scripts used for inspiration.

All the contributtors to this projects: taesd, Diffusers, Transformers, Peft and of course all the other big libraries that made this project possible.

Most vectors and icons by SVG Repo