SOTAVerified

Scene Text Detection

Scene Text Detection is a computer vision task that involves automatically identifying and localizing text within natural images or videos. The goal of scene text detection is to develop algorithms that can robustly detect and and label text with bounding boxes in uncontrolled and complex environments, such as street signs, billboards, or license plates.

Source: ContourNet: Taking a Further Step toward Accurate Arbitrary-shaped Scene Text Detection

Papers

Showing 101150 of 213 papers

TitleStatusHype
Explore Faster Localization Learning For Scene Text Detection0
FC2RN: A Fully Convolutional Corner Refinement Network for Accurate Multi-Oriented Scene Text Detection0
Few Could Be Better Than All: Feature Sampling and Grouping for Scene Text Detection0
FPDIoU Loss: A Loss Function for Efficient Bounding Box Regression of Rotated Object Detection0
Fused Text Segmentation Networks for Multi-oriented Scene Text Detection0
GA-DAN: Geometry-Aware Domain Adaptation Network for Scene Text Detection and Recognition0
Geometry-Aware Scene Text Detection With Instance Transformation Network0
ICDAR2019 Robust Reading Challenge on Multi-lingual Scene Text Detection and Recognition -- RRC-MLT-20190
Image Processing Based Scene-Text Detection and Recognition with Tesseract0
IncepText: A New Inception-Text Module with Deformable PSROI Pooling for Multi-Oriented Scene Text Detection0
Incidental Scene Text Understanding: Recent Progresses on ICDAR 2015 Robust Reading Competition Challenge 40
Kernel Adaptive Convolution for Scene Text Detection via Distance Map Prediction0
KhmerST: A Low-Resource Khmer Scene Text Detection and Recognition Benchmark0
Language Matters: A Weakly Supervised Vision-Language Pre-training Approach for Scene Text Detection and Spotting0
Predictive Ensemble Learning with Application to Scene Text Detection0
PuzzleNet: Scene Text Detection by Segment Context Graph Learning0
Real-time Scene Text Detection Based on Global Level and Word Level Features0
Region Prompt Tuning: Fine-grained Scene Text Detection Utilizing Region Text Prompt0
ReLaText: Exploiting Visual Relationships for Arbitrary-Shaped Scene Text Detection with Graph Convolutional Networks0
Robust Handwriting Recognition with Limited and Noisy Data0
Robust Text Detection in Natural Scene Images0
Rotation-Sensitive Regression for Oriented Scene Text Detection0
RSCA: Real-time Segmentation-based Context-Aware Scene Text Detection0
Running Event Visualization using Videos from Multiple Cameras0
A method for detecting text of arbitrary shapes in natural scenes that improves text spotting0
Scene Text Detection for Augmented Reality -- Character Bigram Approach to reduce False Positive Rate0
Scene Text Detection via Holistic, Multi-Channel Prediction0
Scene Text Detection with Scribble Lines0
Scene Text Detection with Selected Anchor0
Scene Text Eraser0
Scene Text Synthesis for Efficient and Effective Deep Network Training0
SEE: Towards Semi-SupervisedEnd-to-End Scene Text Recognition0
Selective Feature Connection Mechanism: Concatenating Multi-layer CNN Features with a Feature Selector0
Self-Training for Domain Adaptive Scene Text Detection0
Separate Scene Text Detector for Unseen Scripts is Not All You Need0
Sequential Deformation for Accurate Scene Text Detection0
Shift Variance in Scene Text Detection0
Smart Library: Identifying Books in a Library using Richly Supervised Deep Scene Text Reading0
Spotlight Text Detector: Spotlight on Candidate Regions Like a Camera0
Strokelets: A Learned Multi-Scale Representation for Scene Text Recognition0
StrokeNet: Stroke Assisted and Hierarchical Graph Reasoning Networks0
Symmetry-Based Text Line Detection in Natural Scenes0
Text-Attentional Convolutional Neural Networks for Scene Text Detection0
Text-attentional convolutional neural network for scene text detection0
TextContourNet: a Flexible and Effective Framework for Improving Scene Text Detection Architecture with a Multi-task Cascade0
TextDCT: Arbitrary-Shaped Text Detection via Discrete Cosine Transform Mask0
Text Flow: A Unified Text Detection System in Natural Scene Images0
TextFormer: A Query-based End-to-End Text Spotter with Mixed Supervision0
Text Grouping Adapter: Adapting Pre-trained Text Detector for Layout Analysis0
Text Growing on Leaf0
Show:102550
← PrevPage 3 of 5Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1TextFuseNet (ResNeXt-101)F-Measure92.23Unverified
2CharNet H-88 (multi-scale)F-Measure91.55Unverified
3CharNet H-88 (single-scale)F-Measure90.97Unverified
4CharNet H-50 (multi-scale)F-Measure90.16Unverified
5SBDF-Measure90.1Unverified
6CharNet H-57 (multi-scale)F-Measure90.06Unverified
7FOTS MSF-Measure89.84Unverified
8CharNet H-50 (single-scale)F-Measure89.7Unverified
9CharNet H-57 (single-scale)F-Measure89.66Unverified
10PMTDF-Measure89.33Unverified
#ModelMetricClaimedVerifiedStatus
1MixNetF-Measure90.5Unverified
2SRFormer (ResNet-50)F-Measure90Unverified
3DPText-DETR (ResNet-50)F-Measure89Unverified
4TextFuseNet (ResNeXt-101)F-Measure87.5Unverified
5FAST-B-800F-Measure87.5Unverified
6I3CL + SSL(ResNet-50)F-Measure86.9Unverified
7CharNet H-88 (multi-scale)F-Measure86.5Unverified
8FAST-B-640F-Measure86.4Unverified
9DBNet++ (ResNet-50) (800)F-Measure86Unverified
10FAST-B-512F-Measure85.8Unverified
#ModelMetricClaimedVerifiedStatus
1MixNetF-Measure89.4Unverified
2FAST-B-736F-Measure87.3Unverified
3DBNet++ (ResNet-50) (736)F-Measure87.2Unverified
4FAST-S-736F-Measure86.4Unverified
5DBNet++ (ResNet-18) (736)F-Measure85.1Unverified
6FAST-T-736F-Measure84.9Unverified
7DB-ResNet-50 (736)F-Measure84.9Unverified
8FAST-T-512F-Measure84.5Unverified
9PANF-Measure84.1Unverified
10CRAFTF-Measure82.9Unverified
#ModelMetricClaimedVerifiedStatus
1MixNetF-Measure89.8Unverified
2SRFormer (ResNet-50)F-Measure89.6Unverified
3DPText-DETR (ResNet50)F-Measure88.8Unverified
4TextFuseNet (ResNeXt-101)F-Measure87.4Unverified
5I3CL + SSLF-Measure86.5Unverified
6PANF-Measure85Unverified
7FAST-B-640F-Measure84.2Unverified
8PAN-640F-Measure83.7Unverified
9CRAFTF-Measure83.5Unverified
10DB-ResNet50 (1024)F-Measure83.4Unverified
#ModelMetricClaimedVerifiedStatus
1CRAFTPrecision97.4Unverified
2TextFuseNet (ResNeXt-101)F-Measure94.61Unverified
3SPCNETF-Measure92.1Unverified
4Mask TextSpotterF-Measure91.7Unverified
5WordSup (VGG16-synth-icdar)F-Measure90.34Unverified
6STN-OCRF-Measure90.3Unverified
7PixelLink+VGG16 2s MSF-Measure88.1Unverified
8TextBoxes++_MSF-Measure88Unverified
9Corner Localization (multi-scale)F-Measure88Unverified
10Corner-based Region ProposalsF-Measure87.6Unverified
#ModelMetricClaimedVerifiedStatus
1PMTD*Precision84.42Unverified
2Corner Localization (single-scale)Precision83.8Unverified
3SBDPrecision82.75Unverified
4FOTS MSPrecision81.86Unverified
5CharNet H-88Precision81.27Unverified
6FOTSPrecision80.95Unverified
7SPCNETPrecision80.6Unverified
8CRAFTPrecision80.6Unverified
9PANPrecision80Unverified
10GNNetsPrecision79.63Unverified
#ModelMetricClaimedVerifiedStatus
1Corner-based Region ProposalsF-Measure59.1Unverified
2TextBoxes++_MSF-Measure58.72Unverified
3EAST + VGG16F-Measure39.45Unverified
4SSTDF-Measure37Unverified
5WordSup (VGG16-synth-coco)F-Measure36.8Unverified
6Yao et al.F-Measure33.31Unverified
#ModelMetricClaimedVerifiedStatus
1MixNetH-Mean79.7Unverified
2SRFormer (ResNet-50)H-Mean79.3Unverified
3TextFuseNet (ResNeXt-101)H-Mean78.6Unverified
4DPText-DETR (ResNet-50)H-Mean78.1Unverified
#ModelMetricClaimedVerifiedStatus
1BDNF-Measure93.36Unverified