GitHub - istiakshihab/automated-retail-checkout-aicity22: Code for the VISTA, 3rd place solution to AICITY22 Track 4

VISTA: Vision Transformer enhanced by U-Net and Image Colorfulness Frame Filtration for Automatic Retail Checkout

This repository contains the code for our CVPR 2022 Workshop paper, "VISTA: Vision Transformer enhanced by U-Net and Image Colorfulness Frame Filtration for Automatic Retail Checkout". The method achieved 3rd place in the AI City Challenge 2022 Track 4: Multi-Class Product Counting & Recognition for Automated Retail Checkout.

[arXiv]

Figure 1. Illustration of the overall segmentation and classification pipeline

1. Specification of dependencies

This code requires Python 3.8.12 and PyTorch 1.8.2. Run pip install -r requirements.txt to install all the dependencies.
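Before training, it can help to confirm that the active environment matches the versions stated above. The following is a minimal sketch, not part of the repository: the helper names (`python_matches`, `torch_matches`, `installed_torch_version`) are hypothetical, and it assumes that matching on Python's major.minor version is sufficient.

```python
import sys
from importlib import metadata

# Versions stated in this README (Python 3.8.12, PyTorch 1.8.2).
EXPECTED_PYTHON = (3, 8)
EXPECTED_TORCH = "1.8.2"


def python_matches(version_info=sys.version_info, expected=EXPECTED_PYTHON):
    """Return True if the interpreter's major.minor matches the expected one."""
    return tuple(version_info[:2]) == tuple(expected)


def torch_matches(installed, expected=EXPECTED_TORCH):
    """Compare an installed torch version string against the expected release.

    Local build tags such as "+cu111" are ignored (an assumption of this sketch).
    """
    if installed is None:
        return False
    return installed.split("+")[0] == expected


def installed_torch_version():
    """Return the installed torch version, or None if torch is not installed."""
    try:
        return metadata.version("torch")
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    print("Python OK:", python_matches())
    print("PyTorch OK:", torch_matches(installed_torch_version()))
```

Running the script after `pip install -r requirements.txt` prints whether each check passed. A mismatch does not necessarily mean the code will fail, only that the environment differs from the one described here.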

2a. Segmentation training code

See training/segmentation for details.

2b. Classification training code

See training/classification for details.

3. Inference code

After completing steps 2a and 2b, make sure both the segmentation and the classification models are present in the test/models directory. Then see README.md for details.
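A quick check that the models directory is populated can catch a missing checkpoint before inference starts. This is a hedged sketch only: the helper names are hypothetical, and because this README does not state the model file names or formats, the check simply counts regular files in test/models.

```python
from pathlib import Path


def list_model_files(models_dir="test/models"):
    """Return the sorted names of regular files found in models_dir.

    An empty list means the directory is missing or holds no files.
    """
    path = Path(models_dir)
    if not path.is_dir():
        return []
    return sorted(p.name for p in path.iterdir() if p.is_file())


def models_ready(models_dir="test/models", minimum=2):
    """Return True if at least `minimum` files exist in models_dir.

    The default of 2 assumes one segmentation and one classification model;
    any other file in the directory (e.g. a placeholder) is also counted.
    """
    return len(list_model_files(models_dir)) >= minimum


if __name__ == "__main__":
    print("Found:", list_model_files())
    print("Ready for inference:", models_ready())
```

Run it from the repository root so that the relative path test/models resolves correctly.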

4. Citation

A citation entry will be added here.

Acknowledgements

We thank the AI City Challenge 2022 organizers for making the data available for use. We also thank Giga Tech Ltd. for funding this work.

