SOTAVerified

Text Spotting

Scene text spotting combines scene text detection and scene text recognition in a single end-to-end system: it localizes text in natural images and transcribes it, which amounts to reading text in the wild.

Papers

Showing 1–25 of 112 papers

| Title | Status | Hype |
| --- | --- | --- |
| TextMonkey: An OCR-Free Large Multimodal Model for Understanding Document | Code | 5 |
| VimTS: A Unified Video and Image Text Spotter for Enhancing the Cross-domain Generalization | Code | 2 |
| Bridging the Gap Between End-to-End and Two-Step Text Spotting | Code | 2 |
| Hierarchical Text Spotter for Joint Text Spotting and Layout Analysis | Code | 2 |
| DeepSolo++: Let Transformer Decoder with Explicit Points Solo for Multilingual Text Spotting | Code | 2 |
| DeepSolo: Let Transformer Decoder with Explicit Points Solo for Text Spotting | Code | 2 |
| SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition | Code | 2 |
| GoMatching++: Parameter- and Data-Efficient Arbitrary-Shaped Video Text Spotting and Benchmarking | Code | 1 |
| SemiETS: Integrating Spatial and Content Consistencies for Semi-Supervised End-to-end Text Spotting | Code | 1 |
| TextInPlace: Indoor Visual Place Recognition in Repetitive Structures with Scene Text Spotting and Verification | Code | 1 |
| DNTextSpotter: Arbitrary-Shaped Scene Text Spotting via Improved Denoising Training | Code | 1 |
| SwinTextSpotter v2: Towards Better Synergy for Scene Text Spotting | Code | 1 |
| GoMatching: A Simple Baseline for Video Text Spotting via Long and Short Term Matching | Code | 1 |
| Parrot Captions Teach CLIP to Spot Text | Code | 1 |
| ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer | Code | 1 |
| FlowText: Synthesizing Realistic Scene Text Video with Optical Flow Estimation | Code | 1 |
| Scalable Mask Annotation for Video Text Spotting | Code | 1 |
| Towards Unified Scene Text Spotting based on Sequence Generation | Code | 1 |
| Filtering, Distillation, and Hard Negatives for Vision-Language Pre-Training | Code | 1 |
| SPTS v2: Single-Point Scene Text Spotting | Code | 1 |
| ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting | Code | 1 |
| GLASS: Global to Local Attention for Scene-Text Spotting | Code | 1 |
| Text Spotting Transformers | Code | 1 |
| End-to-End Video Text Spotting with Transformer | Code | 1 |
| SPTS: Single-Point Text Spotting | Code | 1 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | UNITS | F-measure (%) - Strong Lexicon | 89 | | Unverified |
| 2 | DeepSolo (ViTAEv2-S, TextOCR) | F-measure (%) - Strong Lexicon | 88.1 | | Unverified |
| 3 | DeepSolo (ResNet-50, TextOCR) | F-measure (%) - Strong Lexicon | 88 | | Unverified |
| 4 | DeepSolo (ResNet-50) | F-measure (%) - Strong Lexicon | 86.8 | | Unverified |
| 5 | SRTS | F-measure (%) - Strong Lexicon | 85.6 | | Unverified |
| 6 | TESTR | F-measure (%) - Strong Lexicon | 85.2 | | Unverified |
| 7 | A3S | F-measure (%) - Strong Lexicon | 84.8 | | Unverified |
| 8 | GLASS | F-measure (%) - Strong Lexicon | 84.7 | | Unverified |
| 9 | SwinTextSpotter | F-measure (%) - Strong Lexicon | 83.9 | | Unverified |
| 10 | FOTS | F-measure (%) - Strong Lexicon | 83.6 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | DeepSolo (ViTAEv2-S, TextOCR) | F-measure (%) - No Lexicon | 83.6 | | Unverified |
| 2 | DeepSolo (ResNet-50, TextOCR) | F-measure (%) - No Lexicon | 82.5 | | Unverified |
| 3 | DeepSolo (ResNet-50) | F-measure (%) - No Lexicon | 79.7 | | Unverified |
| 4 | A3S | F-measure (%) - No Lexicon | 79.4 | | Unverified |
| 5 | UNITS | F-measure (%) - No Lexicon | 78.7 | | Unverified |
| 6 | GLASS | F-measure (%) - No Lexicon | 76.6 | | Unverified |
| 7 | DEER | F-measure (%) - No Lexicon | 74.8 | | Unverified |
| 8 | SwinTextSpotter | F-measure (%) - No Lexicon | 74.3 | | Unverified |
| 9 | TESTR | F-measure (%) - No Lexicon | 73.3 | | Unverified |
| 10 | MANGO | F-measure (%) - No Lexicon | 72.9 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | A3S | F-measure (%) - No Lexicon | 64.4 | | Unverified |
| 2 | DeepSolo (ResNet-50) | F-measure (%) - No Lexicon | 64.2 | | Unverified |
| 3 | SPTS | F-measure (%) - No Lexicon | 63.6 | | Unverified |
| 4 | ABINet++ | F-measure (%) - No Lexicon | 60.2 | | Unverified |
| 5 | TPSNet | F-measure (%) - No Lexicon | 59.7 | | Unverified |
| 6 | MANGO | F-measure (%) - No Lexicon | 58.9 | | Unverified |
| 7 | ABCNet v2 | F-measure (%) - No Lexicon | 57.5 | | Unverified |
| 8 | TextPerceptron | F-measure (%) - No Lexicon | 57 | | Unverified |
| 9 | TESTR | F-measure (%) - No Lexicon | 56 | | Unverified |
| 10 | SwinTextSpotter | F-measure (%) - No Lexicon | 51.8 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | DeepSolo (ViTAEv2-S, TextOCR) | F-measure (%) - No Lexicon | 68.8 | | Unverified |
| 2 | DeepSolo (ResNet-50, TextOCR) | F-measure (%) - No Lexicon | 64.6 | | Unverified |
| 3 | SwinTextSpotter | F-measure (%) - No Lexicon | 55.4 | | Unverified |
| 4 | DeepSolo (ResNet-50) | F-measure (%) - No Lexicon | 48.5 | | Unverified |
| 5 | MaskTextSpotter v2 | F-measure (%) - No Lexicon | 39 | | Unverified |
| 6 | SPTS | F-measure (%) - No Lexicon | 38.3 | | Unverified |
| 7 | ABCNet v2 | F-measure (%) - No Lexicon | 34.5 | | Unverified |
| 8 | TESTR | F-measure (%) - No Lexicon | 34.2 | | Unverified |
| 9 | ABCNet | F-measure (%) - No Lexicon | 22.2 | | Unverified |
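
The metric in these tables, F-measure, is the harmonic mean of precision and recall. The page does not say how a prediction is matched to the ground truth, so the following is only a minimal sketch that starts from match counts that have already been computed. The function name `f_measure` and the example counts are hypothetical and are not taken from any entry above. In text spotting, a prediction usually counts as a true positive only if it is both localized correctly and transcribed correctly, and a lexicon setting such as "Strong Lexicon" or "No Lexicon" changes how the transcription is matched, but the exact rules depend on the benchmark.

```python
def f_measure(true_positives: int, num_predictions: int, num_ground_truth: int) -> float:
    """Return the F-measure in percent from raw match counts.

    true_positives:   predictions matched to a ground-truth word
                      (the matching rule itself is benchmark-specific
                      and is assumed to have been applied already)
    num_predictions:  total words predicted by the model
    num_ground_truth: total annotated words
    """
    if num_predictions == 0 or num_ground_truth == 0:
        return 0.0
    precision = true_positives / num_predictions
    recall = true_positives / num_ground_truth
    if precision + recall == 0:
        return 0.0
    # Harmonic mean of precision and recall, scaled to a percentage.
    return 100.0 * 2 * precision * recall / (precision + recall)


# Hypothetical counts for illustration only.
score = f_measure(true_positives=80, num_predictions=100, num_ground_truth=100)
print(round(score, 1))  # 80.0
```

Because the harmonic mean is pulled toward the lower of the two values, a model cannot reach a high F-measure by over-predicting (high recall, low precision) or by predicting only its most confident words (high precision, low recall).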