SOTAVerified

Scene Text Detection

Scene Text Detection is a computer vision task that involves automatically identifying and localizing text within natural images or videos. The goal is to develop algorithms that can robustly detect and label text with bounding boxes in uncontrolled and complex environments, for example text on street signs, billboards, or license plates.

Source: ContourNet: Taking a Further Step toward Accurate Arbitrary-shaped Scene Text Detection

Papers

Showing 51–100 of 213 papers

| Title | Status | Hype |
|---|---|---|
| Comprehensive Benchmark Datasets for Amharic Scene Text Detection and Recognition | | 0 |
| A Novel Scene Text Detection Algorithm Based On Convolutional Neural Network | | 0 |
| An Anchor-Free Region Proposal Network for Faster R-CNN based Text Detection Approaches | | 0 |
| Domain Adaptive Scene Text Detection via Subcategorization | | 0 |
| Adaptive Segmentation Network for Scene Text Detection | | 0 |
| Adaptive Adversarial Attack on Scene Text Recognition | | 0 |
| DPTNet: A Dual-Path Transformer Architecture for Scene Text Detection | | 0 |
| Character Proposal Network for Robust Text Extraction | | 0 |
| All You Need Is Boundary: Toward Arbitrary-Shaped Text Spotting | | 0 |
| DGST : Discriminator Guided Scene Text detector | | 0 |
| Efficient Scene Text Detection with Textual Attention Tower | | 0 |
| EK-Net:Real-time Scene Text Detection with Expand Kernel Distance | | 0 |
| Detection and Rectification of Arbitrary Shaped Scene Texts by using Text Keypoints and Links | | 0 |
| Characterness: An Indicator of Text in the Wild | | 0 |
| Real-time Scene Text Detection Based on Global Level and Word Level Features | | 0 |
| All you need is a second look: Towards Tighter Arbitrary shape text detection | | 0 |
| MENTOR: Multilingual tExt detectioN TOward leaRning by analogy | | 0 |
| PuzzleNet: Scene Text Detection by Segment Context Graph Learning | | 0 |
| Mask R-CNN with Pyramid Attention Network for Scene Text Detection | | 0 |
| Look More Than Once: An Accurate Detector for Text of Arbitrary Shapes | | 0 |
| Mask is All You Need: Rethinking Mask R-CNN for Dense and Arbitrary-Shaped Scene Text Detection | | 0 |
| Predictive Ensemble Learning with Application to Scene Text Detection | | 0 |
| Learning to Predict More Accurate Text Instances for Scene Text Detection | | 0 |
| Deformable Kernel Expansion Model for Efficient Arbitrary-shaped Scene Text Detection | | 0 |
| Canny Text Detector: Fast and Robust Scene Text Localization Algorithm | | 0 |
| Large Scale Scene Text Verification with Guided Attention | | 0 |
| Language Matters: A Weakly Supervised Vision-Language Pre-training Approach for Scene Text Detection and Spotting | | 0 |
| KhmerST: A Low-Resource Khmer Scene Text Detection and Recognition Benchmark | | 0 |
| Accurate Scene Text Detection through Border Semantics Awareness and Bootstrapping | | 0 |
| Deep Scene Text Detection with Connected Component Proposals | | 0 |
| Incidental Scene Text Understanding: Recent Progresses on ICDAR 2015 Robust Reading Competition Challenge 4 | | 0 |
| IncepText: A New Inception-Text Module with Deformable PSROI Pooling for Multi-Oriented Scene Text Detection | | 0 |
| Deep Residual Text Detection Network for Scene Text | | 0 |
| BPDO:Boundary Points Dynamic Optimization for Arbitrary Shape Scene Text Detection | | 0 |
| Image Processing Based Scene-Text Detection and Recognition with Tesseract | | 0 |
| Kernel Adaptive Convolution for Scene Text Detection via Distance Map Prediction | | 0 |
| ICDAR2019 Robust Reading Challenge on Multi-lingual Scene Text Detection and Recognition -- RRC-MLT-2019 | | 0 |
| Deep Learning Based Vehicle Tracking System Using License Plate Detection And Recognition | | 0 |
| Deep Direct Regression for Multi-Oriented Scene Text Detection | | 0 |
| Learning Markov Clustering Networks for Scene Text Detection | | 0 |
| Learning Robust Feature Representations for Scene Text Detection | | 0 |
| Learning Shape-Aware Embedding for Scene Text Detection | | 0 |
| Attention-based Feature Decomposition-Reconstruction Network for Scene Text Detection | | 0 |
| Location-Aware Feature Selection Text Detection Network | | 0 |
| A Human Eye-based Text Color Scheme Generation Method for Image Synthesis | | 0 |
| TextCohesion: Detecting Text for Arbitrary Shapes | | 0 |
| Oriented Objects as pairs of Middle Lines | | 0 |
| MSR: Multi-Scale Shape Regression for Scene Text Detection | | 0 |
| Aggregated Text Transformer for Scene Text Detection | | 0 |
| MT: Multi-Perspective Feature Learning Network for Scene Text Detection | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | TextFuseNet (ResNeXt-101) | F-Measure | 92.23 | | Unverified |
| 2 | CharNet H-88 (multi-scale) | F-Measure | 91.55 | | Unverified |
| 3 | CharNet H-88 (single-scale) | F-Measure | 90.97 | | Unverified |
| 4 | CharNet H-50 (multi-scale) | F-Measure | 90.16 | | Unverified |
| 5 | SBD | F-Measure | 90.1 | | Unverified |
| 6 | CharNet H-57 (multi-scale) | F-Measure | 90.06 | | Unverified |
| 7 | FOTS MS | F-Measure | 89.84 | | Unverified |
| 8 | CharNet H-50 (single-scale) | F-Measure | 89.7 | | Unverified |
| 9 | CharNet H-57 (single-scale) | F-Measure | 89.66 | | Unverified |
| 10 | PMTD | F-Measure | 89.33 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MixNet | F-Measure | 90.5 | | Unverified |
| 2 | SRFormer (ResNet-50) | F-Measure | 90 | | Unverified |
| 3 | DPText-DETR (ResNet-50) | F-Measure | 89 | | Unverified |
| 4 | TextFuseNet (ResNeXt-101) | F-Measure | 87.5 | | Unverified |
| 5 | FAST-B-800 | F-Measure | 87.5 | | Unverified |
| 6 | I3CL + SSL(ResNet-50) | F-Measure | 86.9 | | Unverified |
| 7 | CharNet H-88 (multi-scale) | F-Measure | 86.5 | | Unverified |
| 8 | FAST-B-640 | F-Measure | 86.4 | | Unverified |
| 9 | DBNet++ (ResNet-50) (800) | F-Measure | 86 | | Unverified |
| 10 | FAST-B-512 | F-Measure | 85.8 | | Unverified |
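
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MixNet | F-Measure | 89.4 | | Unverified |
| 2 | FAST-B-736 | F-Measure | 87.3 | | Unverified |
| 3 | DBNet++ (ResNet-50) (736) | F-Measure | 87.2 | | Unverified |
| 4 | FAST-S-736 | F-Measure | 86.4 | | Unverified |
| 5 | DBNet++ (ResNet-18) (736) | F-Measure | 85.1 | | Unverified |
| 6 | DB-ResNet-50 (736) | F-Measure | 84.9 | | Unverified |
| 7 | FAST-T-736 | F-Measure | 84.9 | | Unverified |
| 8 | FAST-T-512 | F-Measure | 84.5 | | Unverified |
| 9 | PAN | F-Measure | 84.1 | | Unverified |
| 10 | CRAFT | F-Measure | 82.9 | | Unverified |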
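
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MixNet | F-Measure | 89.8 | | Unverified |
| 2 | SRFormer (ResNet-50) | F-Measure | 89.6 | | Unverified |
| 3 | DPText-DETR (ResNet50) | F-Measure | 88.8 | | Unverified |
| 4 | TextFuseNet (ResNeXt-101) | F-Measure | 87.4 | | Unverified |
| 5 | I3CL + SSL | F-Measure | 86.5 | | Unverified |
| 6 | PAN | F-Measure | 85 | | Unverified |
| 7 | FAST-B-640 | F-Measure | 84.2 | | Unverified |
| 8 | PAN-640 | F-Measure | 83.7 | | Unverified |
| 9 | CRAFT | F-Measure | 83.5 | | Unverified |
| 10 | DB-ResNet50 (1024) | F-Measure | 83.4 | | Unverified |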

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | CRAFT | Precision | 97.4 | | Unverified |
| 2 | TextFuseNet (ResNeXt-101) | F-Measure | 94.61 | | Unverified |
| 3 | SPCNET | F-Measure | 92.1 | | Unverified |
| 4 | Mask TextSpotter | F-Measure | 91.7 | | Unverified |
| 5 | WordSup (VGG16-synth-icdar) | F-Measure | 90.34 | | Unverified |
| 6 | STN-OCR | F-Measure | 90.3 | | Unverified |
| 7 | PixelLink+VGG16 2s MS | F-Measure | 88.1 | | Unverified |
| 8 | TextBoxes++_MS | F-Measure | 88 | | Unverified |
| 9 | Corner Localization (multi-scale) | F-Measure | 88 | | Unverified |
| 10 | Corner-based Region Proposals | F-Measure | 87.6 | | Unverified |
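
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PMTD* | Precision | 84.42 | | Unverified |
| 2 | Corner Localization (single-scale) | Precision | 83.8 | | Unverified |
| 3 | SBD | Precision | 82.75 | | Unverified |
| 4 | FOTS MS | Precision | 81.86 | | Unverified |
| 5 | CharNet H-88 | Precision | 81.27 | | Unverified |
| 6 | FOTS | Precision | 80.95 | | Unverified |
| 7 | CRAFT | Precision | 80.6 | | Unverified |
| 8 | SPCNET | Precision | 80.6 | | Unverified |
| 9 | PAN | Precision | 80 | | Unverified |
| 10 | GNNets | Precision | 79.63 | | Unverified |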
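
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Corner-based Region Proposals | F-Measure | 59.1 | | Unverified |
| 2 | TextBoxes++_MS | F-Measure | 58.72 | | Unverified |
| 3 | EAST + VGG16 | F-Measure | 39.45 | | Unverified |
| 4 | SSTD | F-Measure | 37 | | Unverified |
| 5 | WordSup (VGG16-synth-coco) | F-Measure | 36.8 | | Unverified |
| 6 | Yao et al. | F-Measure | 33.31 | | Unverified |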
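
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MixNet | H-Mean | 79.7 | | Unverified |
| 2 | SRFormer (ResNet-50) | H-Mean | 79.3 | | Unverified |
| 3 | TextFuseNet (ResNeXt-101) | H-Mean | 78.6 | | Unverified |
| 4 | DPText-DETR (ResNet-50) | H-Mean | 78.1 | | Unverified |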

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | BDN | F-Measure | 93.36 | | Unverified |
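
The leaderboards report Precision, F-Measure, and H-Mean without showing how such scores are derived from detected boxes. The sketch below is one common way to compute them, not the evaluation protocol of any specific benchmark listed here. It assumes axis-aligned boxes, greedy one-to-one matching, and an IoU threshold of 0.5; real benchmarks often use quadrilaterals or polygons and their own matching rules. The function names `iou` and `detection_scores` are illustrative, not taken from any library. F-Measure is the harmonic mean of precision and recall, which is also what H-Mean usually denotes in text detection evaluations.

```python
from typing import List, Tuple

# Assumed box format for this sketch: (x_min, y_min, x_max, y_max).
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def detection_scores(
    predicted: List[Box],
    ground_truth: List[Box],
    iou_threshold: float = 0.5,  # assumed threshold, not from the source
) -> Tuple[float, float, float]:
    """Return (precision, recall, f_measure) as fractions in [0, 1]."""
    matched = set()  # indices of ground-truth boxes already claimed
    true_positives = 0
    for pred in predicted:
        best_index, best_iou = -1, 0.0
        for index, truth in enumerate(ground_truth):
            if index in matched:
                continue
            overlap = iou(pred, truth)
            if overlap > best_iou:
                best_index, best_iou = index, overlap
        # Greedy matching: each ground-truth box can be matched only once.
        if best_index >= 0 and best_iou >= iou_threshold:
            matched.add(best_index)
            true_positives += 1

    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(ground_truth) if ground_truth else 0.0
    total = precision + recall
    f_measure = 2 * precision * recall / total if total > 0 else 0.0
    return precision, recall, f_measure


if __name__ == "__main__":
    preds = [(0, 0, 10, 10), (20, 20, 30, 30), (50, 50, 60, 60)]
    truths = [(0, 0, 10, 10), (20, 20, 30, 31), (100, 100, 110, 110), (200, 200, 210, 210)]
    p, r, f = detection_scores(preds, truths)
    print(f"precision={p:.4f} recall={r:.4f} f_measure={f:.4f}")
```

The leaderboard values appear to be these fractions scaled to percentages, so an F-measure of 0.9223 would be listed as 92.23.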