A Pytorch implementation of DB-Text paper
Make awesome things that matter.
- Modify some configuration in config.yaml
make train
make test-all
- For evaluation metric, please refer to MegReader repository
# iou-based Pascal
make ioueval
# overlap-based DetEval
make deteval
Heatmap | Polygon | Rotated rectangle |
---|---|---|
Image origin | Text-line detected |
---|---|
# for TotalText dataset
make deteval
Method | image size | init lr | b-thresh | p-thresh | unclip ratio | Precision | Recall | F-measure |
---|---|---|---|---|---|---|---|---|
TotalText-resnet18-fcn (word-level) | 640 | 0.005 | 0.25 | 0.50 | 1.50 | 0.70 | 0.64 | 0.67 |
CTW1500-resnet18-fcn (line-level) | 640 | 0.005 | 0.25 | 0.50 | 1.50 | 0.83 | 0.66 | 0.74 |
- Support datasets
- TotalText
- ICDAR2015
- SCUT-CTW1500
- MSRA-TD500
- COCO-Text
- Synthtext
- ArT2019 (included Total-Text, SCUT-CTW1500 and Baidu Curved Scene Text dataset)
- Pytorch-lightning
- Model serving with Torchserve
- Metric callback (P/R/F1)
- IoU-based metric (P/R/F1 - Pascal)
- Overlap-based metric (P/R/F1 - DetEval)
- Model quantization
- Model pruning
- Docker / docker-compose
- ONNX, TensorRT
- Text recognition model
I got a lot of code from DBNet.pytorch, thanks to @WenmuZhou