Dataset Enhancement with Instance-Level Augmentations

This is an official repository for the paper

Dataset Enhancement with Instance-Level Augmentations
Orest Kupyn, Christian Rupprecht
ECCV 2024

Instance Augmentation method augment images by redrawing individual objects in the scene retaining their original shape. This allows training with the unchanged class label (e.g. class, segmentation, detection, etc.). The generations are highly diverse and match the scene composition

Original	Augmented	Augmented

Augmented Datasets

This repository contains links to several augmented datasets that can be used for various computer vision tasks, such as object detection, instance segmentation, and saliency detection.

Datasets

COCO Augmented Car:
- Link: https://thor.robots.ox.ac.uk/instance-augmentation/COCO_Augmented_Car.tar
- Description: An anonymized version of the COCO dataset, focusing on the "car" class.
COCO Augmented People:
- Link: https://thor.robots.ox.ac.uk/instance-augmentation/COCO_Augmented_People.tar
- Description: An anonymized verison of the COCO dataset, focusing on the "person" class.
COCO Augmented:
- Link: https://thor.robots.ox.ac.uk/instance-augmentation/COCO_Augmented.tar
- Description: An augmented version of the entire COCO dataset.
DUTS Augmented:
- Link: https://thor.robots.ox.ac.uk/instance-augmentation/DUTS_Augmented.tar
- Description: An augmented version of the DUTS dataset, which is commonly used for saliency detection.
DUTS SDXL (Experimental):
- Link: https://thor.robots.ox.ac.uk/instance-augmentation/DUTS_SDXL.tar
- Description: A larger, augmented version of the DUTS dataset.
SHA512 Checksums:
- Link: https://thor.robots.ox.ac.uk/instance-augmentation/SHA512SUMS
- Description: A file containing the SHA512 checksums for the above augmented datasets, which can be used to verify the integrity of the downloaded files.

Please note that these augmented datasets are provided for research purposes. If you plan to use these datasets in your projects, make sure to follow the appropriate licensing and citation requirements.

Installation

The code uses Python 3.8.

Create a Conda virtual environment and Install The Package:

Make sure you have Conda installed.

make env

Run Test for the Package:

make pytest

Run on a folder of images:

An example is available in tests/test_pipeline.py - test_end_to_end

To predict instance masks:

from instance_augmentation.pseudolabel_dataset import create_annotations

create_annotations("path_to_image_folder", "path_to_save_results", dataset_type="custom", class_names=["dog", "cat", "any_other_classes"])

To generate augmented images:

from instance_augmentation.pipeline.dataset_generator import DatasetGenerator
from instance_augmentation.pipeline.readers import CustomDatasetReader

reader = CustomDatasetReader("path_to_image_folder", {}, "path_to_save_results/annotations.json")
dataset_generator = DatasetGenerator.from_params(
        dataset_reader=reader,
        save_folder="path_to_save_results",
        preprocessing="resize",
        target_image_size=1024,
        base_inpainting_model="SG161222/RealVisXL_V3.0",
        generator="inpaint_sdxl_adapter",
        num_samples=1,
        num_inference_steps=20,
        control_methods=["t2i_depth", "t2i_sketch"],
        control_weights=[0.9, 0.5],
    )
    dataset_generator.run()

To apply augmentations:

import os
import cv2
import glob
from instance_augmentation.augment import Augmenter

augmenter = Augmenter("path_to_save_results", p=1.0)
for image_path in glob.glob("path_to_image_folder/*"):
    image_name = os.path.split(image_path)[1]
    original_image = cv2.cvtColor(cv2.imread(image_path), cv2.COLOR_BGR2RGB)
    augmented_image = augmenter.augment_image(original_image, image_name)

Citation

If you use the the method or this code - implicitly or explicitly - for your research projects, please cite the following paper:

@article{kupyn2024dataset,
    title = {Dataset Enhancement with Instance-Level Augmentations},
    author = {Kupyn, Orest and Rupprecht, Christian},
    journal = {arXiv preprint arXiv:2406.08249},
    year = {2024}
  }

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
examples		examples
images		images
instance_augmentation		instance_augmentation
.flake8		.flake8
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
environment.yml		environment.yml
mypy.ini		mypy.ini
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Dataset Enhancement with Instance-Level Augmentations

Augmented Datasets

Datasets

Installation

Create a Conda virtual environment and Install The Package:

Run Test for the Package:

Run on a folder of images:

Citation

About

Releases

Packages

Languages

License

KupynOrest/instance_augmentation

Folders and files

Latest commit

History

Repository files navigation

Dataset Enhancement with Instance-Level Augmentations

Augmented Datasets

Datasets

Installation

Create a Conda virtual environment and Install The Package:

Run Test for the Package:

Run on a folder of images:

Citation

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages