SOTAVerified

Text Spotting

Scene Text Spotting is the combination of Scene Text Detection and Scene Text Recognition in an end-to-end manner. It is the ability to read natural text in the wild.

Papers

Showing 51100 of 112 papers

TitleStatusHype
E^2VTS: Energy-Efficient Video Text Spotting from Unmanned Aerial VehiclesCode0
FastTextSpotter: A High-Efficiency Transformer for Multilingual Scene Text SpottingCode0
ICDAR2019 Robust Reading Challenge on Arbitrary-Shaped Text (RRC-ArT)Code0
Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary ShapesCode0
FOTS: Fast Oriented Text Spotting with a Unified NetworkCode0
WeCromCL: Weakly Supervised Cross-Modality Contrastive Learning for Transcription-only Supervised Text SpottingCode0
Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary ShapesCode0
Word length-aware text spotting: Enhancing detection and recognition in dense text image0
A3S: Adversarial learning of semantic representations for Scene-Text Spotting0
All You Need Is Boundary: Toward Arbitrary-Shaped Text Spotting0
A pooling based scene text proposal technique for scene text reading in the wild0
Arbitrary Reading Order Scene Text Spotter with Local Semantics Guidance0
ARTS: Eliminating Inconsistency between Text Detection and Recognition with Auto-Rectification Text Spotter0
Attention Where It Matters: Rethinking Visual Document Understanding with Selective Region Concentration0
Beyond the Mud: Datasets and Benchmarks for Computer Vision in Off-Road Racing0
Block-level Text Spotting with LLMs0
Cascaded Segmentation-Detection Networks for Word-Level Text Spotting0
Character Region Attention For Text Spotting0
CLII: Visual-Text Inpainting via Cross-Modal Predictive Interaction0
CLIP is Almost All You Need: Towards Parameter-Efficient Scene Text Retrieval without OCR0
Context-Free TextSpotter for Real-Time and Mobile End-to-End Text Detection and Recognition0
Deep Neural Network for Semantic-based Text Recognition in Images0
DEER: Detection-agnostic End-to-End Recognizer for Scene Text Spotting0
Deformation Robust Text Spotting with Geometric Prior0
Diving into the Depths of Spotting Text in Multi-Domain Noisy Scenes0
Mixed Text Recognition with Efficient Parameter Fine-Tuning and Transformer0
Dynamic Low-Resolution Distillation for Cost-Efficient End-to-End Text Spotting0
Efficiently Leveraging Linguistic Priors for Scene Text Spotting0
You Only Recognize Once: Towards Fast Video Text Spotting0
Enhanced Characterness for Text Detection in the Wild0
Ensemble Learning for Vietnamese Scene Text Spotting in Urban Environments0
Hear the Scene: Audio-Enhanced Text Spotting0
HIP: Hierarchical Point Modeling and Pre-training for Visual Information Extraction0
ICDAR 2021 Competition on Scene Video Text Spotting0
ICDAR 2023 Video Text Reading Competition for Dense and Small Text0
Inductive Visual Localisation: Factorised Training for Superior Generalisation0
Inverse-like Antagonistic Scene Text Spotting via Reading-Order Estimation and Dynamic Sampling0
LOGO: Video Text Spotting with Language Collaboration and Glyph Perception Model0
MANGO: A Mask Attention Guided One-Stage Scene Text Spotter0
Modeling Entities as Semantic Points for Visual Information Extraction in the Wild0
OmniParser: A Unified Framework for Text Spotting, Key Information Extraction and Table Recognition0
OmniParser: A Unified Framework for Text Spotting Key Information Extraction and Table Recognition0
OmniParser V2: Structured-Points-of-Thought for Unified Visual Text Parsing and Its Generality to Multimodal Large Language Models0
VGTS: Visually Guided Text Spotting for Novel Categories in Historical Manuscripts0
Reading Text in the Wild with Convolutional Neural Networks0
A method for detecting text of arbitrary shapes in natural scenes that improves text spotting0
Scene Text Detection for Augmented Reality -- Character Bigram Approach to reduce False Positive Rate0
Text-Aware Image Restoration with Diffusion Models0
TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with Pre-trained Language Model0
Text Detection & Recognition in the Wild for Robot Localization0
Show:102550
← PrevPage 2 of 3Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1UNITSF-measure (%) - Strong Lexicon89Unverified
2DeepSolo (ViTAEv2-S, TextOCR)F-measure (%) - Strong Lexicon88.1Unverified
3DeepSolo(ResNet-50, TextOCR)F-measure (%) - Strong Lexicon88Unverified
4DeepSolo(ResNet-50)F-measure (%) - Strong Lexicon86.8Unverified
5SRTSF-measure (%) - Strong Lexicon85.6Unverified
6TESTRF-measure (%) - Strong Lexicon85.2Unverified
7A3SF-measure (%) - Strong Lexicon84.8Unverified
8GLASSF-measure (%) - Strong Lexicon84.7Unverified
9SwinTextSpotterF-measure (%) - Strong Lexicon83.9Unverified
10FOTSF-measure (%) - Strong Lexicon83.6Unverified
#ModelMetricClaimedVerifiedStatus
1DeepSolo (ViTAEv2-S, TextOCR)F-measure (%) - No Lexicon83.6Unverified
2DeepSolo (ResNet-50, TextOCR)F-measure (%) - No Lexicon82.5Unverified
3DeepSolo (ResNet-50)F-measure (%) - No Lexicon79.7Unverified
4A3SF-measure (%) - No Lexicon79.4Unverified
5UNITSF-measure (%) - No Lexicon78.7Unverified
6GLASSF-measure (%) - No Lexicon76.6Unverified
7DEERF-measure (%) - No Lexicon74.8Unverified
8SwinTextSpotterF-measure (%) - No Lexicon74.3Unverified
9TESTRF-measure (%) - No Lexicon73.3Unverified
10MANGOF-measure (%) - No Lexicon72.9Unverified
#ModelMetricClaimedVerifiedStatus
1A3SF-measure (%) - No Lexicon64.4Unverified
2DeepSolo (ResNet-50)F-measure (%) - No Lexicon64.2Unverified
3SPTSF-measure (%) - No Lexicon63.6Unverified
4ABINet++F-measure (%) - No Lexicon60.2Unverified
5TPSNetF-measure (%) - No Lexicon59.7Unverified
6MANGOF-measure (%) - No Lexicon58.9Unverified
7ABCNet v2F-measure (%) - No Lexicon57.5Unverified
8TextPerceptronF-measure (%) - No Lexicon57Unverified
9TESTRF-measure (%) - No Lexicon56Unverified
10SwinTextSpotterF-measure (%) - No Lexicon51.8Unverified
#ModelMetricClaimedVerifiedStatus
1DeepSolo (ViTAEv2-S, TextOCR)F-measure (%) - No Lexicon68.8Unverified
2DeepSolo (ResNet-50, TextOCR)F-measure (%) - No Lexicon64.6Unverified
3SwinTextSpotterF-measure (%) - No Lexicon55.4Unverified
4DeepSolo (ResNet-50)F-measure (%) - No Lexicon48.5Unverified
5MaskTextSpotter v2F-measure (%) - No Lexicon39Unverified
6SPTSF-measure (%) - No Lexicon38.3Unverified
7ABCNet v2F-measure (%) - No Lexicon34.5Unverified
8TESTRF-measure (%) - No Lexicon34.2Unverified
9ABCNetF-measure (%) - No Lexicon22.2Unverified