Skip to content

Latest commit

 

History

History
230 lines (174 loc) · 7.1 KB

README_EN.md

File metadata and controls

230 lines (174 loc) · 7.1 KB

genai-tag-db-tools

Overview

genai-tag-db-tools is a database tool designed to centrally manage tags used in image generation AI. It enables unified handling of tag information utilized across various platforms and formats.

The main objectives are as follows:

  • Integrated management of tags, their translations, frequency of use, and relevance
  • Filtering and statistical analysis by associating tag types with specific formats
  • A GUI for browsing and updating tag data
  • Support for launching via CLI and using as a module from other projects

Key Features

  • Tag Management: Registering new tags, updates, setting aliases, recommended tags, etc.
  • Tag Referencing: Keyword search, translation lookup, usage counts, and statistics by tag type and format
  • GUI Provision: Launch the GUI from the CLI command for intuitive browsing and updating of the tag database
  • Modular Usage: Import and use the database operations and tag management logic in other projects

Installation

Requirements

  • Python 3.12 or higher
  • Verified on Windows 11 (other OS not tested)

Installation Steps

  1. Create a virtual environment (optional):

    py -3.12 -m venv venv
    venv\Scripts\Activate.ps1
  2. Install genai-tag-db-tools:

    pip install genai-tag-db-tools

    Or install directly from the GitHub repository:

    pip install git+https://github.com/NEXTAltair/genai-tag-db-tools.git

Uninstallation

pip uninstall genai-tag-db-tools

Usage

Launching the GUI

After installation, you can start the GUI using the following command:

genai-tag-db-tools

Or run as a Python module:

python -m genai_tag_db_tools

Using in Other Projects

You can import genai_tag_db_tools into other projects and utilize database operations and tag management functionality.

from genai_tag_db_tools import some_module

# Example usage of tag management features in another project

Project Structure (Example)

genai-tag-db-tools/
├── genai_tag_db_tools/      # Main package
│   ├── data/                # Data management
│   │   ├── migrations/      # Database migrations
│   │   ├── tags_v3.db      # Tag database v3
│   │   ├── tags_v4.db      # Tag database v4
│   │   └── database_schema.py
│   ├── db/                  # Database operations
│   ├── gui/                 # GUI components
│   │   ├── designer/        # UI definition files (.ui/.py)
│   │   ├── widgets/         # Various widgets
│   │   └── windows/         # Main window
│   ├── services/            # Application services
│   ├── utils/              # Utilities
│   └── main.py             # Entry point
├── docs/                   # Documentation
├── tests/                  # Test code
│   ├── gui/               # GUI tests
│   ├── unit/             # Unit tests
│   └── resource/         # Test resources
├── tools/                 # Tool scripts
├── pyproject.toml        # Project configuration
└── README.md

Database Overview

SQLite is primarily used to manage tag data. The following is an example ER diagram of entities and their relationships.

ER Diagram

erDiagram
    TAGS ||--o{ TAG_TRANSLATIONS : has
    TAGS ||--o{ TAG_STATUS : has
    TAGS ||--o{ TAG_USAGE_COUNTS : tracks
    TAG_FORMATS ||--o{ TAG_STATUS : defines
    TAG_FORMATS ||--o{ TAG_USAGE_COUNTS : tracks
    TAG_TYPE_FORMAT_MAPPING ||--o{ TAG_STATUS : maps
    TAG_TYPE_NAME ||--o{ TAG_TYPE_FORMAT_MAPPING : references

    TAGS {
        int tag_id PK "primary key"
        string source_tag
        datetime created_at
        datetime updated_at
        string tag
    }

    TAG_TRANSLATIONS {
        int translation_id PK "primary key"
        int tag_id FK "references TAGS(tag_id)"
        string language
        string translation
        datetime created_at
        datetime updated_at
    }

    TAG_FORMATS {
        int format_id PK "primary key"
        string format_name
        string description
    }

    TAG_TYPE_NAME {
        int type_name_id PK "primary key"
        string type_name
        string description
    }

    TAG_TYPE_FORMAT_MAPPING {
        int format_id PK, FK "references TAG_FORMATS(format_id)"
        int type_id PK "part of composite key"
        int type_name_id FK "references TAG_TYPE_NAME(type_name_id)"
        string description
    }

    TAG_USAGE_COUNTS {
        int tag_id PK, FK "references TAGS(tag_id)"
        int format_id PK, FK "references TAG_FORMATS(format_id)"
        int count
        datetime created_at
        datetime updated_at
    }

    TAG_STATUS {
        int tag_id PK, FK "references TAGS(tag_id)"
        int format_id PK, FK "references TAG_FORMATS(format_id)"
        int type_id FK "part of mapping"
        boolean alias
        int preferred_tag_id FK "references TAGS(tag_id)"
        datetime created_at
        datetime updated_at
    }
Loading

Table Relationships

  • TAGS: Basic tag information
  • TAG_TRANSLATIONS: Tag translation information (dependent on TAGS)
  • TAG_FORMATS: Definitions of tag formats
  • TAG_TYPE_NAME: Definitions of tag types
  • TAG_TYPE_FORMAT_MAPPING: Mapping between formats and types
  • TAG_USAGE_COUNTS: Format-specific usage counts for tags
  • TAG_STATUS: Manages tag statuses (aliases, recommended tags, etc.)

Data Sources

References and utilized data sources:

  1. DominikDoom/a1111-sd-webui-tagcomplete: CSV-based tag data that served as the base for tags.db
  2. Japanese translations by applemango: Japanese translations of the CSV tag data
  3. Japanese translations of CSV tag data by “としあき”
  4. AngelBottomless/danbooru-2023-sqlite-fixed-7110548: Database for danbooru tags
  5. hearmeneigh/e621-rising-v3-preliminary-data: Database of e621 and rule34 tags
  6. p1atdev/danbooru-ja-tag-pair-20241015: Japanese translation database for danbooru tags
  7. toynya/Z3D-E621-Convnext: CSV tags from e621 tagger convnext model #TODO: Not yet integrated
  8. Updated danbooru.csv (2024-10-16) for WebUI Tag Autocomplete: Updated danbooru.csv as of October 16, 2024 for WebUI Tag Autocomplete #TODO: Not yet integrated

For Developers

Running Tests

pytest

Display coverage:

pytest --cov=genai_tag_db_tools --cov-report=term-missing

Code Style

Use black, ruff, and pyright for code quality and type checks. Refer to pyproject.toml for details.

black genai_tag_db_tools
ruff check genai_tag_db_tools
pyright

License

This project is released under the MIT License. See LICENSE for details.