SOTAVerified

Text Spotting

Scene Text Spotting is the combination of Scene Text Detection and Scene Text Recognition in an end-to-end manner. It is the ability to read natural text in the wild.

Papers

Showing 51100 of 112 papers

TitleStatusHype
Deep Neural Network for Semantic-based Text Recognition in Images0
DEER: Detection-agnostic End-to-End Recognizer for Scene Text Spotting0
Deformation Robust Text Spotting with Geometric Prior0
Diving into the Depths of Spotting Text in Multi-Domain Noisy Scenes0
Mixed Text Recognition with Efficient Parameter Fine-Tuning and Transformer0
Efficiently Leveraging Linguistic Priors for Scene Text Spotting0
Enhanced Characterness for Text Detection in the Wild0
Ensemble Learning for Vietnamese Scene Text Spotting in Urban Environments0
Hear the Scene: Audio-Enhanced Text Spotting0
HIP: Hierarchical Point Modeling and Pre-training for Visual Information Extraction0
ICDAR 2021 Competition on Scene Video Text Spotting0
ICDAR 2023 Video Text Reading Competition for Dense and Small Text0
Inductive Visual Localisation: Factorised Training for Superior Generalisation0
Inverse-like Antagonistic Scene Text Spotting via Reading-Order Estimation and Dynamic Sampling0
LOGO: Video Text Spotting with Language Collaboration and Glyph Perception Model0
Modeling Entities as Semantic Points for Visual Information Extraction in the Wild0
VGTS: Visually Guided Text Spotting for Novel Categories in Historical Manuscripts0
Reading Text in the Wild with Convolutional Neural Networks0
A method for detecting text of arbitrary shapes in natural scenes that improves text spotting0
Scene Text Detection for Augmented Reality -- Character Bigram Approach to reduce False Positive Rate0
Text-Aware Image Restoration with Diffusion Models0
TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with Pre-trained Language Model0
Text Detection & Recognition in the Wild for Robot Localization0
TextDragon: An End-to-End Framework for Arbitrary Shaped Text Spotting0
TextFormer: A Query-based End-to-End Text Spotter with Mixed Supervision0
Textual Visual Semantic Dataset for Text Spotting0
Towards End-to-End Text Spotting in Natural Scenes0
Towards End-to-end Text Spotting with Convolutional Recurrent Neural Networks0
Towards Unconstrained End-to-End Text Spotting0
Towards Weakly-Supervised Text Spotting using a Multi-Task Transformer0
Using Object Information for Spotting Text0
Video text tracking for dense and small text based on pp-yoloe-r and sort algorithm0
Watermark Text Pattern Spotting in Document Images0
1st Place Solution to ICDAR 2021 RRC-ICTEXT End-to-end Text Spotting and Aesthetic Assessment on Integrated Circuit0
STEP -- Towards Structured Scene-Text SpottingCode0
GloTSFormer: Global Video Text Spotting TransformerCode0
FOTS: Fast Oriented Text Spotting with a Unified NetworkCode0
WeCromCL: Weakly Supervised Cross-Modality Contrastive Learning for Transcription-only Supervised Text SpottingCode0
E^2VTS: Energy-Efficient Video Text Spotting from Unmanned Aerial VehiclesCode0
A Feasible Framework for Arbitrary-Shaped Scene Text RecognitionCode0
Dynamic Low-Resolution Distillation for Cost-Efficient End-to-End Text SpottingCode0
Visual Re-ranking with Natural Language Understanding for Text SpottingCode0
FastTextSpotter: A High-Efficiency Transformer for Multilingual Scene Text SpottingCode0
Extremely Low-light Image Enhancement with Scene Text RestorationCode0
OmniParser: A Unified Framework for Text Spotting, Key Information Extraction and Table RecognitionCode0
OmniParser: A Unified Framework for Text Spotting Key Information Extraction and Table RecognitionCode0
OmniParser V2: Structured-Points-of-Thought for Unified Visual Text Parsing and Its Generality to Multimodal Large Language ModelsCode0
Open Images V5 Text Annotation and Yet Another Mask Text SpotterCode0
Text Perceptron: Towards End-to-End Arbitrary-Shaped Text SpottingCode0
Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary ShapesCode0
Show:102550
← PrevPage 2 of 3Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1UNITSF-measure (%) - Strong Lexicon89Unverified
2DeepSolo (ViTAEv2-S, TextOCR)F-measure (%) - Strong Lexicon88.1Unverified
3DeepSolo(ResNet-50, TextOCR)F-measure (%) - Strong Lexicon88Unverified
4DeepSolo(ResNet-50)F-measure (%) - Strong Lexicon86.8Unverified
5SRTSF-measure (%) - Strong Lexicon85.6Unverified
6TESTRF-measure (%) - Strong Lexicon85.2Unverified
7A3SF-measure (%) - Strong Lexicon84.8Unverified
8GLASSF-measure (%) - Strong Lexicon84.7Unverified
9SwinTextSpotterF-measure (%) - Strong Lexicon83.9Unverified
10FOTSF-measure (%) - Strong Lexicon83.6Unverified
#ModelMetricClaimedVerifiedStatus
1DeepSolo (ViTAEv2-S, TextOCR)F-measure (%) - No Lexicon83.6Unverified
2DeepSolo (ResNet-50, TextOCR)F-measure (%) - No Lexicon82.5Unverified
3DeepSolo (ResNet-50)F-measure (%) - No Lexicon79.7Unverified
4A3SF-measure (%) - No Lexicon79.4Unverified
5UNITSF-measure (%) - No Lexicon78.7Unverified
6GLASSF-measure (%) - No Lexicon76.6Unverified
7DEERF-measure (%) - No Lexicon74.8Unverified
8SwinTextSpotterF-measure (%) - No Lexicon74.3Unverified
9TESTRF-measure (%) - No Lexicon73.3Unverified
10MANGOF-measure (%) - No Lexicon72.9Unverified
#ModelMetricClaimedVerifiedStatus
1A3SF-measure (%) - No Lexicon64.4Unverified
2DeepSolo (ResNet-50)F-measure (%) - No Lexicon64.2Unverified
3SPTSF-measure (%) - No Lexicon63.6Unverified
4ABINet++F-measure (%) - No Lexicon60.2Unverified
5TPSNetF-measure (%) - No Lexicon59.7Unverified
6MANGOF-measure (%) - No Lexicon58.9Unverified
7ABCNet v2F-measure (%) - No Lexicon57.5Unverified
8TextPerceptronF-measure (%) - No Lexicon57Unverified
9TESTRF-measure (%) - No Lexicon56Unverified
10SwinTextSpotterF-measure (%) - No Lexicon51.8Unverified
#ModelMetricClaimedVerifiedStatus
1DeepSolo (ViTAEv2-S, TextOCR)F-measure (%) - No Lexicon68.8Unverified
2DeepSolo (ResNet-50, TextOCR)F-measure (%) - No Lexicon64.6Unverified
3SwinTextSpotterF-measure (%) - No Lexicon55.4Unverified
4DeepSolo (ResNet-50)F-measure (%) - No Lexicon48.5Unverified
5MaskTextSpotter v2F-measure (%) - No Lexicon39Unverified
6SPTSF-measure (%) - No Lexicon38.3Unverified
7ABCNet v2F-measure (%) - No Lexicon34.5Unverified
8TESTRF-measure (%) - No Lexicon34.2Unverified
9ABCNetF-measure (%) - No Lexicon22.2Unverified