Scene Text Detection
Scene text detection is a computer vision task that involves automatically identifying and localizing text in natural images or videos. The goal is to develop algorithms that robustly detect text and mark it with bounding boxes in uncontrolled, complex scenes containing, for example, street signs, billboards, or license plates.
Source: ContourNet: Taking a Further Step toward Accurate Arbitrary-shaped Scene Text Detection
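To make the "localize text with bounding boxes" idea concrete, here is a minimal sketch of how a predicted box can be compared against a ground-truth box using intersection over union (IoU). It assumes axis-aligned boxes given as `(x_min, y_min, x_max, y_max)` and a 0.5 IoU threshold; both are simplifications for illustration, since arbitrary-shaped benchmarks annotate text with polygons and each benchmark defines its own protocol. The names `iou` and `count_matches` are hypothetical helpers, not part of any benchmark toolkit.

```python
from typing import List, Tuple

# Assumption: axis-aligned boxes as (x_min, y_min, x_max, y_max).
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def count_matches(preds: List[Box], gts: List[Box], threshold: float = 0.5) -> int:
    """Greedily match each prediction to at most one unused ground-truth box.

    A prediction counts as a hit when its best IoU with a not-yet-matched
    ground-truth box is at least `threshold` (an assumed value).
    """
    used = set()
    hits = 0
    for pred in preds:
        best_index, best_score = -1, threshold
        for index, gt in enumerate(gts):
            if index in used:
                continue
            score = iou(pred, gt)
            if score >= best_score:
                best_index, best_score = index, score
        if best_index >= 0:
            used.add(best_index)
            hits += 1
    return hits
```

For example, `count_matches([(0, 0, 10, 10), (50, 50, 60, 60)], [(1, 1, 10, 10)])` yields one hit: the first prediction overlaps the ground truth with IoU 0.81, and the second has nothing left to match.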
Datasets: ICDAR 2015, Total-Text, MSRA-TD500, SCUT-CTW1500, ICDAR 2013, ICDAR 2017 MLT, COCO-Text, IC19-Art, IC19-ReCTs
Benchmark Results
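The tables below report Precision, F-Measure, and H-Mean. H-Mean is the harmonic mean of precision and recall, which is the same quantity as the F-measure (F1). As a rough sketch of how these numbers relate, the hypothetical helper below computes all three from a count of correct detections; it assumes the true-positive count has already been obtained by some matching protocol, and it returns fractions in [0, 1], whereas the table values appear to be percentages.

```python
from typing import Tuple


def precision_recall_f(true_positives: int, num_predictions: int,
                       num_ground_truth: int) -> Tuple[float, float, float]:
    """Return (precision, recall, F-measure) as fractions in [0, 1].

    Empty prediction or ground-truth sets yield 0.0 for the affected
    ratio (a convention assumed here to avoid division by zero).
    """
    precision = true_positives / num_predictions if num_predictions else 0.0
    recall = true_positives / num_ground_truth if num_ground_truth else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    # Harmonic mean of precision and recall.
    f_measure = 2 * precision * recall / (precision + recall)
    return precision, recall, f_measure
```

With 90 correct detections out of 100 predictions and 120 ground-truth instances, this gives a precision of 0.9, a recall of 0.75, and an F-measure of about 0.818. Because the harmonic mean penalizes imbalance, a model ranked by Precision alone is not directly comparable with one ranked by F-Measure.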

ICDAR 2015

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | TextFuseNet (ResNeXt-101) | F-Measure | 92.23 | — | Unverified |
| 2 | CharNet H-88 (multi-scale) | F-Measure | 91.55 | — | Unverified |
| 3 | CharNet H-88 (single-scale) | F-Measure | 90.97 | — | Unverified |
| 4 | CharNet H-50 (multi-scale) | F-Measure | 90.16 | — | Unverified |
| 5 | SBD | F-Measure | 90.1 | — | Unverified |
| 6 | CharNet H-57 (multi-scale) | F-Measure | 90.06 | — | Unverified |
| 7 | FOTS MS | F-Measure | 89.84 | — | Unverified |
| 8 | CharNet H-50 (single-scale) | F-Measure | 89.7 | — | Unverified |
| 9 | CharNet H-57 (single-scale) | F-Measure | 89.66 | — | Unverified |
| 10 | PMTD | F-Measure | 89.33 | — | Unverified |

Total-Text

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MixNet | F-Measure | 90.5 | — | Unverified |
| 2 | SRFormer (ResNet-50) | F-Measure | 90 | — | Unverified |
| 3 | DPText-DETR (ResNet-50) | F-Measure | 89 | — | Unverified |
| 4 | TextFuseNet (ResNeXt-101) | F-Measure | 87.5 | — | Unverified |
| 5 | FAST-B-800 | F-Measure | 87.5 | — | Unverified |
| 6 | I3CL + SSL (ResNet-50) | F-Measure | 86.9 | — | Unverified |
| 7 | CharNet H-88 (multi-scale) | F-Measure | 86.5 | — | Unverified |
| 8 | FAST-B-640 | F-Measure | 86.4 | — | Unverified |
| 9 | DBNet++ (ResNet-50) (800) | F-Measure | 86 | — | Unverified |
| 10 | FAST-B-512 | F-Measure | 85.8 | — | Unverified |

MSRA-TD500

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MixNet | F-Measure | 89.4 | — | Unverified |
| 2 | FAST-B-736 | F-Measure | 87.3 | — | Unverified |
| 3 | DBNet++ (ResNet-50) (736) | F-Measure | 87.2 | — | Unverified |
| 4 | FAST-S-736 | F-Measure | 86.4 | — | Unverified |
| 5 | DBNet++ (ResNet-18) (736) | F-Measure | 85.1 | — | Unverified |
| 6 | FAST-T-736 | F-Measure | 84.9 | — | Unverified |
| 7 | DB-ResNet-50 (736) | F-Measure | 84.9 | — | Unverified |
| 8 | FAST-T-512 | F-Measure | 84.5 | — | Unverified |
| 9 | PAN | F-Measure | 84.1 | — | Unverified |
| 10 | CRAFT | F-Measure | 82.9 | — | Unverified |

SCUT-CTW1500

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MixNet | F-Measure | 89.8 | — | Unverified |
| 2 | SRFormer (ResNet-50) | F-Measure | 89.6 | — | Unverified |
| 3 | DPText-DETR (ResNet-50) | F-Measure | 88.8 | — | Unverified |
| 4 | TextFuseNet (ResNeXt-101) | F-Measure | 87.4 | — | Unverified |
| 5 | I3CL + SSL | F-Measure | 86.5 | — | Unverified |
| 6 | PAN | F-Measure | 85 | — | Unverified |
| 7 | FAST-B-640 | F-Measure | 84.2 | — | Unverified |
| 8 | PAN-640 | F-Measure | 83.7 | — | Unverified |
| 9 | CRAFT | F-Measure | 83.5 | — | Unverified |
| 10 | DB-ResNet50 (1024) | F-Measure | 83.4 | — | Unverified |

ICDAR 2013

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | CRAFT | Precision | 97.4 | — | Unverified |
| 2 | TextFuseNet (ResNeXt-101) | F-Measure | 94.61 | — | Unverified |
| 3 | SPCNET | F-Measure | 92.1 | — | Unverified |
| 4 | Mask TextSpotter | F-Measure | 91.7 | — | Unverified |
| 5 | WordSup (VGG16-synth-icdar) | F-Measure | 90.34 | — | Unverified |
| 6 | STN-OCR | F-Measure | 90.3 | — | Unverified |
| 7 | PixelLink+VGG16 2s MS | F-Measure | 88.1 | — | Unverified |
| 8 | TextBoxes++_MS | F-Measure | 88 | — | Unverified |
| 9 | Corner Localization (multi-scale) | F-Measure | 88 | — | Unverified |
| 10 | Corner-based Region Proposals | F-Measure | 87.6 | — | Unverified |

ICDAR 2017 MLT

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PMTD* | Precision | 84.42 | — | Unverified |
| 2 | Corner Localization (single-scale) | Precision | 83.8 | — | Unverified |
| 3 | SBD | Precision | 82.75 | — | Unverified |
| 4 | FOTS MS | Precision | 81.86 | — | Unverified |
| 5 | CharNet H-88 | Precision | 81.27 | — | Unverified |
| 6 | FOTS | Precision | 80.95 | — | Unverified |
| 7 | SPCNET | Precision | 80.6 | — | Unverified |
| 8 | CRAFT | Precision | 80.6 | — | Unverified |
| 9 | PAN | Precision | 80 | — | Unverified |
| 10 | GNNets | Precision | 79.63 | — | Unverified |

COCO-Text

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Corner-based Region Proposals | F-Measure | 59.1 | — | Unverified |
| 2 | TextBoxes++_MS | F-Measure | 58.72 | — | Unverified |
| 3 | EAST + VGG16 | F-Measure | 39.45 | — | Unverified |
| 4 | SSTD | F-Measure | 37 | — | Unverified |
| 5 | WordSup (VGG16-synth-coco) | F-Measure | 36.8 | — | Unverified |
| 6 | Yao et al. | F-Measure | 33.31 | — | Unverified |

IC19-Art

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MixNet | H-Mean | 79.7 | — | Unverified |
| 2 | SRFormer (ResNet-50) | H-Mean | 79.3 | — | Unverified |
| 3 | TextFuseNet (ResNeXt-101) | H-Mean | 78.6 | — | Unverified |
| 4 | DPText-DETR (ResNet-50) | H-Mean | 78.1 | — | Unverified |

IC19-ReCTs

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | BDN | F-Measure | 93.36 | — | Unverified |