A synthetic benchmark database for scene text removal is now released by Deep Learning and Vision Computing Lab of South China University of Technology. The database can be downloaded through the following links:
- Yunpan : (link: https://pan.baidu.com/s/1wwBwgm-n2A7iykoD0i37iQ PASSWORD: vk8f) (Size = 6.3G).
- Google Driver: (link: https://drive.google.com/open?id=1l_yJm1vWV7TF7vDcaVa7FqZLfW7ASYeo) (Size = 6.3G).
The training set consists of a total of 8000 images and the test set contains 800 images; all the training and test samples are resized to 512 × 512. The code for generating synthetic dataset and more synthetic text images as described in “ Synthetic Data for Text localisation in Natural Images"(CVPR2016)can be found in (https://github.com/ankush-me/SynthText).
For more details, please refer to our arxiv paper.
Please consider to cite our paper when you use our database:
@article{zhang2019EnsNet,
title = {EnsNet: Ensconce Text in the Wild},
author = {Shuaitao Zhang∗, Yuliang Liu∗, Lianwen Jin†, Yaoxiong Huang, Songxuan Lai
joural = {AAAI}
year = {2019}
}
Suggestions and opinions of dataset of this dataset (both positive and negative) are greatly welcome. Please contact the authors by sending email to [email protected].
The synthetic database can be only used for non-commercial research purpose.
For commercial purpose usage. Please Dr. Lianwen Jin: [email protected].
Copyright 2018, Deep Learning and Vision Computing Lab, South China University of Teacnology.http://www.dlvc-lab.net