SOTAVerified

Text Spotting

Scene Text Spotting is the combination of Scene Text Detection and Scene Text Recognition in an end-to-end manner. It is the ability to read natural text in the wild.

Papers

Showing 51100 of 112 papers

TitleStatusHype
WeCromCL: Weakly Supervised Cross-Modality Contrastive Learning for Transcription-only Supervised Text SpottingCode0
You Only Recognize Once: Towards Fast Video Text SpottingCode0
InstructOCR: Instruction Boosting Scene Text SpottingCode0
A Feasible Framework for Arbitrary-Shaped Scene Text RecognitionCode0
Single Shot Self-Reliant Scene Text Spotter by Decoupled yet Collaborative Detection and RecognitionCode0
Harnessing the Power of Multi-Lingual Datasets for Pre-training: Towards Enhancing Text Spotting PerformanceCode0
Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary ShapesCode0
Dynamic Low-Resolution Distillation for Cost-Efficient End-to-End Text SpottingCode0
Text Perceptron: Towards End-to-End Arbitrary-Shaped Text SpottingCode0
Open Images V5 Text Annotation and Yet Another Mask Text SpotterCode0
Extremely Low-light Image Enhancement with Scene Text RestorationCode0
FastTextSpotter: A High-Efficiency Transformer for Multilingual Scene Text SpottingCode0
OmniParser V2: Structured-Points-of-Thought for Unified Visual Text Parsing and Its Generality to Multimodal Large Language ModelsCode0
Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary ShapesCode0
Word length-aware text spotting: Enhancing detection and recognition in dense text image0
A3S: Adversarial learning of semantic representations for Scene-Text Spotting0
All You Need Is Boundary: Toward Arbitrary-Shaped Text Spotting0
A pooling based scene text proposal technique for scene text reading in the wild0
Arbitrary Reading Order Scene Text Spotter with Local Semantics Guidance0
ARTS: Eliminating Inconsistency between Text Detection and Recognition with Auto-Rectification Text Spotter0
Attention Where It Matters: Rethinking Visual Document Understanding with Selective Region Concentration0
Beyond the Mud: Datasets and Benchmarks for Computer Vision in Off-Road Racing0
Block-level Text Spotting with LLMs0
Cascaded Segmentation-Detection Networks for Word-Level Text Spotting0
Character Region Attention For Text Spotting0
CLII: Visual-Text Inpainting via Cross-Modal Predictive Interaction0
CLIP is Almost All You Need: Towards Parameter-Efficient Scene Text Retrieval without OCR0
Context-Free TextSpotter for Real-Time and Mobile End-to-End Text Detection and Recognition0
Deep Neural Network for Semantic-based Text Recognition in Images0
DEER: Detection-agnostic End-to-End Recognizer for Scene Text Spotting0
Deformation Robust Text Spotting with Geometric Prior0
Diving into the Depths of Spotting Text in Multi-Domain Noisy Scenes0
Mixed Text Recognition with Efficient Parameter Fine-Tuning and Transformer0
Efficiently Leveraging Linguistic Priors for Scene Text Spotting0
Enhanced Characterness for Text Detection in the Wild0
Ensemble Learning for Vietnamese Scene Text Spotting in Urban Environments0
Hear the Scene: Audio-Enhanced Text Spotting0
HIP: Hierarchical Point Modeling and Pre-training for Visual Information Extraction0
ICDAR 2021 Competition on Scene Video Text Spotting0
ICDAR 2023 Video Text Reading Competition for Dense and Small Text0
Inductive Visual Localisation: Factorised Training for Superior Generalisation0
Inverse-like Antagonistic Scene Text Spotting via Reading-Order Estimation and Dynamic Sampling0
LOGO: Video Text Spotting with Language Collaboration and Glyph Perception Model0
Modeling Entities as Semantic Points for Visual Information Extraction in the Wild0
VGTS: Visually Guided Text Spotting for Novel Categories in Historical Manuscripts0
Reading Text in the Wild with Convolutional Neural Networks0
A method for detecting text of arbitrary shapes in natural scenes that improves text spotting0
Scene Text Detection for Augmented Reality -- Character Bigram Approach to reduce False Positive Rate0
Text-Aware Image Restoration with Diffusion Models0
TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with Pre-trained Language Model0
Show:102550
← PrevPage 2 of 3Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1UNITSF-measure (%) - Strong Lexicon89Unverified
2DeepSolo (ViTAEv2-S, TextOCR)F-measure (%) - Strong Lexicon88.1Unverified
3DeepSolo(ResNet-50, TextOCR)F-measure (%) - Strong Lexicon88Unverified
4DeepSolo(ResNet-50)F-measure (%) - Strong Lexicon86.8Unverified
5SRTSF-measure (%) - Strong Lexicon85.6Unverified
6TESTRF-measure (%) - Strong Lexicon85.2Unverified
7A3SF-measure (%) - Strong Lexicon84.8Unverified
8GLASSF-measure (%) - Strong Lexicon84.7Unverified
9SwinTextSpotterF-measure (%) - Strong Lexicon83.9Unverified
10FOTSF-measure (%) - Strong Lexicon83.6Unverified
#ModelMetricClaimedVerifiedStatus
1DeepSolo (ViTAEv2-S, TextOCR)F-measure (%) - No Lexicon83.6Unverified
2DeepSolo (ResNet-50, TextOCR)F-measure (%) - No Lexicon82.5Unverified
3DeepSolo (ResNet-50)F-measure (%) - No Lexicon79.7Unverified
4A3SF-measure (%) - No Lexicon79.4Unverified
5UNITSF-measure (%) - No Lexicon78.7Unverified
6GLASSF-measure (%) - No Lexicon76.6Unverified
7DEERF-measure (%) - No Lexicon74.8Unverified
8SwinTextSpotterF-measure (%) - No Lexicon74.3Unverified
9TESTRF-measure (%) - No Lexicon73.3Unverified
10MANGOF-measure (%) - No Lexicon72.9Unverified
#ModelMetricClaimedVerifiedStatus
1A3SF-measure (%) - No Lexicon64.4Unverified
2DeepSolo (ResNet-50)F-measure (%) - No Lexicon64.2Unverified
3SPTSF-measure (%) - No Lexicon63.6Unverified
4ABINet++F-measure (%) - No Lexicon60.2Unverified
5TPSNetF-measure (%) - No Lexicon59.7Unverified
6MANGOF-measure (%) - No Lexicon58.9Unverified
7ABCNet v2F-measure (%) - No Lexicon57.5Unverified
8TextPerceptronF-measure (%) - No Lexicon57Unverified
9TESTRF-measure (%) - No Lexicon56Unverified
10SwinTextSpotterF-measure (%) - No Lexicon51.8Unverified
#ModelMetricClaimedVerifiedStatus
1DeepSolo (ViTAEv2-S, TextOCR)F-measure (%) - No Lexicon68.8Unverified
2DeepSolo (ResNet-50, TextOCR)F-measure (%) - No Lexicon64.6Unverified
3SwinTextSpotterF-measure (%) - No Lexicon55.4Unverified
4DeepSolo (ResNet-50)F-measure (%) - No Lexicon48.5Unverified
5MaskTextSpotter v2F-measure (%) - No Lexicon39Unverified
6SPTSF-measure (%) - No Lexicon38.3Unverified
7ABCNet v2F-measure (%) - No Lexicon34.5Unverified
8TESTRF-measure (%) - No Lexicon34.2Unverified
9ABCNetF-measure (%) - No Lexicon22.2Unverified