SOTAVerified

Scene Text Recognition

See Scene Text Detection for leaderboards in this task.

Papers

Showing 201250 of 269 papers

TitleStatusHype
Parallel Scale-wise Attention Network for Effective Scene Text Recognition0
Char-Net: A Character-Aware Neural Network for Distorted Scene Text Recognition0
PingAn-VCGroup's Solution for ICDAR 2021 Competition on Scientific Table Image Recognition to Latex0
Portmanteauing Features for Scene Text Recognition0
Why You Should Try the Real Data for the Scene Text RecognitionCode0
A Comprehensive Study on Deep Learning-based Methods for Sign Language RecognitionCode0
A Feasible Framework for Arbitrary-Shaped Scene Text RecognitionCode0
A Multi-Object Rectified Attention Network for Scene Text RecognitionCode0
AON: Towards Arbitrarily-Oriented Text RecognitionCode0
A Holistic Representation Guided Attention Network for Scene Text RecognitionCode0
ASTER: An Attentional Scene Text Recognizer with Flexible RectificationCode0
Bidirectional Scene Text Recognition with a Single DecoderCode0
Boosting Semi-Supervised Scene Text Recognition via Viewing and SummarizingCode0
Connectionist Temporal Classification with Maximum Entropy RegularizationCode0
Context Perception Parallel Decoder for Scene Text RecognitionCode0
Revisiting Classification Perspective on Scene Text RecognitionCode0
Decoder Pre-Training with only Text for Scene Text RecognitionCode0
DocXChain: A Powerful Open-Source Toolchain for Document Parsing and BeyondCode0
EventSTR: A Benchmark Dataset and Baselines for Event Stream based Scene Text RecognitionCode0
Focus-Enhanced Scene Text Recognition with Deformable ConvolutionsCode0
Focus on the Whole Character: Discriminative Character Modeling for Scene Text RecognitionCode0
FOTS: Fast Oriented Text Spotting with a Unified NetworkCode0
Geometric Perception based Efficient Text RecognitionCode0
ICDAR2019 Robust Reading Challenge on Arbitrary-Shaped Text (RRC-ArT)Code0
Improving Text Proposals for Scene Images with Fully Convolutional NetworksCode0
Instruction-Guided Scene Text RecognitionCode0
IPAD: Iterative, Parallel, and Diffusion-based Network for Scene Text RecognitionCode0
KISS: Keeping It Simple for Scene Text RecognitionCode0
Levenshtein OCRCode0
LISTER: Neighbor Decoding for Length-Insensitive Scene Text RecognitionCode0
Look Back Again: Dual Parallel Attention Network for Accurate and Robust Scene Text RecognitionCode0
Masked and Permuted Implicit Context Learning for Scene Text RecognitionCode0
Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary ShapesCode0
MCTBench: Multimodal Cognition towards Text-Rich Visual Scenes BenchmarkCode0
MOoSE: Multi-Orientation Sharing Experts for Open-set Scene Text RecognitionCode0
Multi-Granularity Prediction for Scene Text RecognitionCode0
Multi-Granularity Prediction with Learnable Fusion for Scene Text RecognitionCode0
NRTR: A No-Recurrence Sequence-to-Sequence Model For Scene Text RecognitionCode0
On Recognizing Texts of Arbitrary Shapes with 2D Self-AttentionCode0
OTE: Exploring Accurate Scene Text Recognition Using One TokenCode0
Out of Length Text Recognition with Sub-String MatchingCode0
Pedestrian-Synthesis-GAN: Generating Pedestrian Data in Real Scene and BeyondCode0
Platypus: A Generalized Specialist Model for Reading Text in Various FormsCode0
Pyramid Mask Text DetectorCode0
Reading Between the Lanes: Text VideoQA on the RoadCode0
Reading Scene Text in Deep Convolutional SequencesCode0
Reciprocal Feature Learning via Explicit and Implicit Tasks in Scene Text RecognitionCode0
Relational Contrastive Learning and Masked Image Modeling for Scene Text RecognitionCode0
Representation and Correlation Enhanced Encoder-Decoder Framework for Scene Text RecognitionCode0
RewriteNet: Reliable Scene Text Editing with Implicit Decomposition of Text Contents and StylesCode0
Show:102550
← PrevPage 5 of 6Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CLIP4STR-L*Accuracy99.42Unverified
2DTrOCR 105MAccuracy99.4Unverified
3CLIP4STR-L (DataComp-1B)Accuracy99Unverified
4CLIP4STR-LAccuracy98.5Unverified
5MGP-STRAccuracy98.5Unverified
6CCD-ViT-Small(ARD_2.8M)Accuracy98.3Unverified
7CLIP4STR-BAccuracy98.3Unverified
8CCD-ViT-Base(ARD_2.8M)Accuracy98.3Unverified
9MATRNAccuracy97.9Unverified
10S-GTRAccuracy97.8Unverified
#ModelMetricClaimedVerifiedStatus
1CLIP4STR-H (DFN-5B)Accuracy99.1Unverified
2DTrOCR 105MAccuracy98.9Unverified
3CLIP4STR-B*Accuracy98.76Unverified
4CLIP4STR-L (DataComp-1B)Accuracy98.6Unverified
5MGP-STRAccuracy98.6Unverified
6CLIP4STR-LAccuracy98.5Unverified
7CPPDAccuracy98.5Unverified
8CLIP4STR-BAccuracy98.3Unverified
9CCD-ViT-Base(ARD_2.8M)Accuracy97.8Unverified
10CCD-ViT-Small(ARD_2.8M)Accuracy96.4Unverified
#ModelMetricClaimedVerifiedStatus
1DTrOCR 105MAccuracy93.5Unverified
2CLIP4STR-L*Accuracy92.6Unverified
3CPPDAccuracy91.7Unverified
4CLIP4STR-L (DataComp-1B)Accuracy91.4Unverified
5MGP-STRAccuracy90.9Unverified
6CLIP4STR-LAccuracy90.8Unverified
7CLIP4STR-BAccuracy90.6Unverified
8SIGA_SAccuracy87.6Unverified
9S-GTRAccuracy87.3Unverified
10MATRNAccuracy86.6Unverified
#ModelMetricClaimedVerifiedStatus
1CPPDAccuracy99.7Unverified
2CLIP4STR-L (DataComp-1B)Accuracy99.7Unverified
3CLIP4STR-B*Accuracy99.65Unverified
4MGP-STRAccuracy99.31Unverified
5CLIP4STR-BAccuracy99.3Unverified
6DTrOCR 105MAccuracy99.1Unverified
7CLIP4STR-LAccuracy99Unverified
8CCD-ViT-Base(ARD_2.8M)Accuracy98.3Unverified
9CCD-ViT-Small(ARD_2.8M)Accuracy98.3Unverified
10CCD-ViT-Tiny(ARD_2.8M)Accuracy95.8Unverified
#ModelMetricClaimedVerifiedStatus
1CLIP4STR-L (DataComp-1B)Accuracy99.6Unverified
2DTrOCR 105MAccuracy99.6Unverified
3CLIP4STR-B (DataComp-1B)Accuracy99.5Unverified
4CLIP4STR-LAccuracy99.5Unverified
5CPPDAccuracy99.3Unverified
6CLIP4STR-BAccuracy99.2Unverified
7MGP-STRAccuracy98.8Unverified
8CCD-ViT-Base(ARD_2.8M)Accuracy98Unverified
9CCD-ViT-Small(ARD_2.8M)Accuracy98Unverified
10S-GTRAccuracy97.5Unverified
#ModelMetricClaimedVerifiedStatus
1DTrOCR 105MAccuracy98.6Unverified
2MGP-STRAccuracy98.3Unverified
3CLIP4STR-L*Accuracy98.13Unverified
4CLIP4STR-L (DataComp-1B)Accuracy98.1Unverified
5CLIP4STR-LAccuracy97.4Unverified
6CLIP4STR-BAccuracy97.2Unverified
7CPPDAccuracy96.7Unverified
8CCD-ViT-BaseAccuracy96.1Unverified
9CCD-ViT-SmallAccuracy92.7Unverified
10CCD-ViT-TinyAccuracy91.6Unverified
#ModelMetricClaimedVerifiedStatus
1Yet Another Text RecognizerAccuracy97.1Unverified
2SIGA_TAccuracy97Unverified
3SATRNAccuracy96.7Unverified
4DANAccuracy95Unverified
5SAFLAccuracy95Unverified
6CSTRAccuracy94.8Unverified
7Baek et al.Accuracy94.4Unverified
8ViTSTRAccuracy94.3Unverified
9AONAccuracy91.5Unverified
10RAREAccuracy90.1Unverified
#ModelMetricClaimedVerifiedStatus
1CLIP4STR-H (DFN-5B)1:1 Accuracy90.9Unverified
2CLIP4STR-L (DataComp-1B)1:1 Accuracy90.6Unverified
3CLIP4STR-L1:1 Accuracy88.8Unverified
4CLIP4STR-B1:1 Accuracy87Unverified
5CCD-ViT-Base1:1 Accuracy86Unverified
#ModelMetricClaimedVerifiedStatus
1CLIP4STR-L (DataComp-1B)Accuracy (%)86.4Unverified
2CLIP4STR-LAccuracy (%)85.9Unverified
3CLIP4STR-BAccuracy (%)85.8Unverified
4MGP-STRAccuracy (%)85.5Unverified
#ModelMetricClaimedVerifiedStatus
1CLIP4STR-L1:1 Accuracy81.9Unverified
2MGP-STR1:1 Accuracy81.7Unverified
3CLIP4STR-B1:1 Accuracy81.1Unverified
#ModelMetricClaimedVerifiedStatus
1CLIP4STR-L1:1 Accuracy82.7Unverified
2CLIP4STR-B1:1 Accuracy79.8Unverified
3CCD-ViT-Base1:1 Accuracy77.3Unverified
#ModelMetricClaimedVerifiedStatus
1CLIP4STR-L (DataComp-1B)Accuracy (%)92.2Unverified
2MGP-STRAccuracy (%)91Unverified
3CLIP4STR-BAccuracy (%)86.8Unverified
#ModelMetricClaimedVerifiedStatus
1ABINet-LV+TPS++Accuracy97.8Unverified
#ModelMetricClaimedVerifiedStatus
1MLDGAverage Accuracy19.02Unverified
#ModelMetricClaimedVerifiedStatus
1ABINet-LV+TPS++Accuracy89.6Unverified