v1.0.0
What's new:
- data exploration tab
- model head training tab
- introducing tasks
- spatial embeddings support
- augmentations support
- active learning features and suggestions
- duplicates search
- lots of other utilities
An interactive, semi-automated annotation tool. It supports image-text pair annotation, tagging, and classification tasks, and leverages pretrained CLIP-like and vision embedders to provide EDA insights and speed up annotation. You can train simple classification heads on top of the embedders and explore your model's behaviour.
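To make the "heads on top of embedders" idea concrete, here is a minimal sketch, assuming a PyTorch backend; all names and sizes are illustrative, not the tool's actual code:

```python
# Illustrative sketch only: a small classification head over frozen,
# precomputed embeddings (dimensions are hypothetical).
import torch
import torch.nn as nn

embedding_dim = 512   # e.g. a CLIP ViT-B/32 image embedding
num_classes = 3       # a hypothetical multiclass task

head = nn.Sequential(
    nn.Linear(embedding_dim, 128),
    nn.ReLU(),
    nn.Linear(128, num_classes),
)

embeddings = torch.randn(8, embedding_dim)  # stand-in for precomputed embeddings
logits = head(embeddings)                   # shape: (8, num_classes)
```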
Please make sure to export your session at milestones; I will add import functionality soon. The code works fine in my tests, but data loss is frustrating! All CRUD operations sync to disk by design.
Feel free to customize the layout, shortcuts, model architectures, etc. All questions, bug reports, and collaborations are welcome via GitHub Issues and Pull Requests!
Tested on Windows 10/11 with a 1920x1080 screen; it should work quite stably even with deviations from this setup.
- GPU with 8 GB of memory (potentially optional)
- Python >= 3.10 (older versions may raise syntax errors)
Run
conda env create -f environment.yml
conda activate txt2img_ann
python main.py
Refer to Tools -> Info in each view.
- Go to Project -> Select Folder to start your session.
- Use the annotation UI (main window) to add text labels and tags. Tags are retained for later reuse but can be deleted.
- Use Tools -> Info for more details.
- In Project -> Session info you can compute session stats.
- In Project -> Manage tasks you can add a classification task based on the tags retained earlier. It can be binary or multiclass; multi-select the appropriate tags and mode (one possible tag-to-label scheme is sketched after this list).
- In Project -> Manage embedder you can add Hugging Face models by name or from disk (a loading sketch follows this list). See core.embedding.py for architecture docs.
- Precompute embeddings for as many images as you like, with the options of your choice. You can also precompute only the images annotated so far by pressing Precompute active.
- Navigate to the clustering view and run clustering on your selected task using the embeddings you just computed (see the clustering sketch after this list).
- Click on the plot to explore the data points in that region.
- Use Tools -> Info for more details here as well.
- With a data sample active, you can override its class within the scope of your classification task. Use the arrow or number keys to see the effect.
- Navigate to the training view and create a new model for this task using one of the predefined templates. You can create new templates from existing models, or customize further by adding support for your own models in core.model.py.
- Fit your model on your data; you can modify the validation split in Project -> Manage tasks (a training sketch follows this list).
- Now you can view your predictions in the clustering view as well; just select the appropriate model.
- If implemented for your model type, press Enter/Ctrl+Enter to show the next/previous suggested image to annotate (one possible suggestion strategy is sketched after this list).
- Repeat the model training and annotation until you are satisfied with the performance!
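As referenced in the Manage tasks step above, one way tag-to-label derivation could work is shown below. This is an assumed semantics for illustration, not the tool's actual implementation; all names are hypothetical:

```python
# Assumed semantics, for illustration only: each tag selected for a
# multiclass task becomes a class; an image's label is the first selected
# tag it carries (None if it carries none).
selected_tags = ["cat", "dog", "bird"]   # tags multi-selected in Manage tasks
tag_to_class = {t: i for i, t in enumerate(selected_tags)}

image_tags = {"001.jpg": ["cat", "indoor"], "002.jpg": ["dog"]}
labels = {
    img: next((tag_to_class[t] for t in tags if t in tag_to_class), None)
    for img, tags in image_tags.items()
}
print(labels)  # {'001.jpg': 0, '002.jpg': 1}
```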
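For the Manage embedder step, this is roughly what loading a CLIP-like model by name or from disk looks like with Hugging Face transformers. It is an illustration only; the tool's own loading code lives in core.embedding.py:

```python
# Illustration: load a CLIP-like embedder by Hugging Face name or local path
# and compute an image embedding. "example.jpg" is a placeholder file.
import torch
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

name_or_path = "openai/clip-vit-base-patch32"   # or a local directory on disk
model = CLIPModel.from_pretrained(name_or_path)
processor = CLIPProcessor.from_pretrained(name_or_path)

image = Image.open("example.jpg")
inputs = processor(images=image, return_tensors="pt")
with torch.no_grad():
    image_emb = model.get_image_features(**inputs)  # shape: (1, 512)
```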
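The clustering step conceptually boils down to something like the following sketch, assuming scikit-learn-style tooling (not the actual implementation):

```python
# Sketch: cluster precomputed embeddings and project them to 2D for plotting.
import numpy as np
from sklearn.cluster import KMeans
from sklearn.decomposition import PCA

embeddings = np.random.rand(200, 512)   # stand-in for precomputed embeddings

cluster_ids = KMeans(n_clusters=5, n_init=10).fit_predict(embeddings)
coords = PCA(n_components=2).fit_transform(embeddings)  # (200, 2) plot positions
```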
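The fitting step with a validation split might look roughly like this. It is a hedged sketch with stand-in data; the real templates and architectures are defined in core.model.py:

```python
# Sketch: train a head on precomputed embeddings with a held-out validation split.
import torch
import torch.nn as nn

X = torch.randn(100, 512)        # stand-in for precomputed embeddings
y = torch.randint(0, 3, (100,))  # stand-in task labels

split = int(0.8 * len(X))        # e.g. an 80/20 train/validation split
X_train, y_train = X[:split], y[:split]
X_val, y_val = X[split:], y[split:]

head = nn.Linear(512, 3)
optimizer = torch.optim.Adam(head.parameters(), lr=1e-3)
loss_fn = nn.CrossEntropyLoss()

for _ in range(20):              # a few full-batch epochs
    optimizer.zero_grad()
    loss_fn(head(X_train), y_train).backward()
    optimizer.step()

with torch.no_grad():
    val_acc = (head(X_val).argmax(dim=1) == y_val).float().mean()
print(f"validation accuracy: {val_acc:.2f}")
```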
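Finally, the "suggested image" behaviour can be imagined as an uncertainty ranking like the one below. This shows one common active-learning strategy (predictive entropy); the actual criterion depends on the model type:

```python
# Sketch: rank unlabeled images by predictive entropy, most uncertain first.
import torch

logits = torch.randn(50, 3)      # stand-in head outputs for unlabeled images
probs = torch.softmax(logits, dim=1)
entropy = -(probs * probs.clamp_min(1e-9).log()).sum(dim=1)
suggested_order = entropy.argsort(descending=True)  # indices to suggest in order
```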
Feel free to raise issues asking for features or explanations. Contributions are also highly welcome!
- full export/import support
- BLIP annotation (it did not impress me at first, but it may be useful at scale and when made tag-informed)
- support for direct training on images