SOTAVerified

Scene Text Recognition

See Scene Text Detection for leaderboards in this task.

Papers

Showing 151200 of 269 papers

TitleStatusHype
Transfer Learning for Scene Text Recognition in Indian Languages0
Towards Boosting the Accuracy of Non-Latin Scene Text RecognitionCode0
SAFL: A Self-Attention Scene Text Recognizer with Focal LossCode0
Visual-Semantic Transformer for Scene Text Recognition0
Decoupling Visual-Semantic Feature Learning for Robust Scene Text Recognition0
Utilizing Resource-Rich Language Datasets for End-to-End Scene Text Recognition in Resource-Poor Languages0
TRIG: Transformer-Based Text Recognizer with Initial Embedding Guidance0
Oracle Teacher: Leveraging Target Information for Better Knowledge Distillation of CTC Models0
Ultra Light OCR Competition Technical Report0
Scene Text Image Super-Resolution via Parallelly Contextual Attention NetworkCode0
Sharp Attention for Sequence to Sequence Learning0
IFR: Iterative Fusion Based Recognizer For Low Quality Scene Text Recognition0
Look Back Again: Dual Parallel Attention Network for Accurate and Robust Scene Text RecognitionCode0
Why You Should Try the Real Data for the Scene Text RecognitionCode0
Text is Text, No Matter What: Unifying Text Recognition using Knowledge Distillation0
RewriteNet: Reliable Scene Text Editing with Implicit Decomposition of Text Contents and StylesCode0
Scene Text recognition with Full Normalization0
Scene Text Telescope: Text-Focused Scene Image Super-ResolutionCode0
Representation and Correlation Enhanced Encoder-Decoder Framework for Scene Text RecognitionCode0
I2C2W: Image-to-Character-to-Word Transformers for Accurate Scene Text Recognition0
STRIDE : Scene Text Recognition In-Device0
Reciprocal Feature Learning via Explicit and Implicit Tasks in Scene Text RecognitionCode0
PingAn-VCGroup's Solution for ICDAR 2021 Competition on Scientific Table Image Recognition to Latex0
Parallel Scale-wise Attention Network for Effective Scene Text Recognition0
Benchmarking Scene Text Recognition in Devanagari, Telugu and Malayalam0
FEDS -- Filtered Edit Distance Surrogate0
Revisiting Classification Perspective on Scene Text RecognitionCode0
Efficient Online ML API Selection for Multi-Label Classification Tasks0
On Calibration of Scene-Text Recognition Models0
Boosting High-Level Vision with Joint Compression Artifacts Reduction and Super-Resolution0
Hamming OCR: A Locality Sensitive Hashing Neural Network for Scene Text Recognition0
Exploring Font-independent Features for Scene Text Recognition0
Variational Connectionist Temporal Classification0
A Comprehensive Study on Deep Learning-based Methods for Sign Language RecognitionCode0
FedOCR: Communication-Efficient Federated Learning for Scene Text Recognition0
RobustScanner: Dynamically Enhancing Positional Clues for Robust Text RecognitionCode0
Learning Surrogates via Deep Embedding0
Using Human Psychophysics to Evaluate Generalization in Scene Text Recognition Models0
Text Recognition in Real Scenarios with a Few Labeled Samples0
What Machines See Is Not What They Get: Fooling Scene Text Recognition Models With Adversarial Text Images0
SPIN: Structure-Preserving Inner Offset Network for Scene Text RecognitionCode0
On Vocabulary Reliance in Scene Text Recognition0
ReADS: A Rectified Attentional Double Supervised Network for Scene Text Recognition0
Towards Accurate Scene Text Recognition with Semantic Reasoning NetworksCode0
Scene Text Recognition via Transformer0
Refined Gate: A Simple and Effective Gating Mechanism for Recurrent Units0
A New Perspective for Flexible Feature Gathering in Scene Text Recognition Via Character Anchor Pooling0
GTC: Guided Training of CTC Towards Efficient and Accurate Scene Text Recognition0
Scene Text Recognition With Finer Grid Rectification0
TextScanner: Reading Characters in Order for Robust Scene Text Recognition0
Show:102550
← PrevPage 4 of 6Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CLIP4STR-L*Accuracy99.42Unverified
2DTrOCR 105MAccuracy99.4Unverified
3CLIP4STR-L (DataComp-1B)Accuracy99Unverified
4MGP-STRAccuracy98.5Unverified
5CLIP4STR-LAccuracy98.5Unverified
6CLIP4STR-BAccuracy98.3Unverified
7CCD-ViT-Base(ARD_2.8M)Accuracy98.3Unverified
8CCD-ViT-Small(ARD_2.8M)Accuracy98.3Unverified
9MATRNAccuracy97.9Unverified
10S-GTRAccuracy97.8Unverified
#ModelMetricClaimedVerifiedStatus
1CLIP4STR-H (DFN-5B)Accuracy99.1Unverified
2DTrOCR 105MAccuracy98.9Unverified
3CLIP4STR-B*Accuracy98.76Unverified
4MGP-STRAccuracy98.6Unverified
5CLIP4STR-L (DataComp-1B)Accuracy98.6Unverified
6CLIP4STR-LAccuracy98.5Unverified
7CPPDAccuracy98.5Unverified
8CLIP4STR-BAccuracy98.3Unverified
9CCD-ViT-Base(ARD_2.8M)Accuracy97.8Unverified
10CCD-ViT-Small(ARD_2.8M)Accuracy96.4Unverified
#ModelMetricClaimedVerifiedStatus
1DTrOCR 105MAccuracy93.5Unverified
2CLIP4STR-L*Accuracy92.6Unverified
3CPPDAccuracy91.7Unverified
4CLIP4STR-L (DataComp-1B)Accuracy91.4Unverified
5MGP-STRAccuracy90.9Unverified
6CLIP4STR-LAccuracy90.8Unverified
7CLIP4STR-BAccuracy90.6Unverified
8SIGA_SAccuracy87.6Unverified
9S-GTRAccuracy87.3Unverified
10MATRNAccuracy86.6Unverified
#ModelMetricClaimedVerifiedStatus
1CPPDAccuracy99.7Unverified
2CLIP4STR-L (DataComp-1B)Accuracy99.7Unverified
3CLIP4STR-B*Accuracy99.65Unverified
4MGP-STRAccuracy99.31Unverified
5CLIP4STR-BAccuracy99.3Unverified
6DTrOCR 105MAccuracy99.1Unverified
7CLIP4STR-LAccuracy99Unverified
8CCD-ViT-Base(ARD_2.8M)Accuracy98.3Unverified
9CCD-ViT-Small(ARD_2.8M)Accuracy98.3Unverified
10CCD-ViT-Tiny(ARD_2.8M)Accuracy95.8Unverified
#ModelMetricClaimedVerifiedStatus
1DTrOCR 105MAccuracy99.6Unverified
2CLIP4STR-L (DataComp-1B)Accuracy99.6Unverified
3CLIP4STR-LAccuracy99.5Unverified
4CLIP4STR-B (DataComp-1B)Accuracy99.5Unverified
5CPPDAccuracy99.3Unverified
6CLIP4STR-BAccuracy99.2Unverified
7MGP-STRAccuracy98.8Unverified
8CCD-ViT-Base(ARD_2.8M)Accuracy98Unverified
9CCD-ViT-Small(ARD_2.8M)Accuracy98Unverified
10S-GTRAccuracy97.5Unverified
#ModelMetricClaimedVerifiedStatus
1DTrOCR 105MAccuracy98.6Unverified
2MGP-STRAccuracy98.3Unverified
3CLIP4STR-L*Accuracy98.13Unverified
4CLIP4STR-L (DataComp-1B)Accuracy98.1Unverified
5CLIP4STR-LAccuracy97.4Unverified
6CLIP4STR-BAccuracy97.2Unverified
7CPPDAccuracy96.7Unverified
8CCD-ViT-BaseAccuracy96.1Unverified
9CCD-ViT-SmallAccuracy92.7Unverified
10CCD-ViT-TinyAccuracy91.6Unverified
#ModelMetricClaimedVerifiedStatus
1Yet Another Text RecognizerAccuracy97.1Unverified
2SIGA_TAccuracy97Unverified
3SATRNAccuracy96.7Unverified
4DANAccuracy95Unverified
5SAFLAccuracy95Unverified
6CSTRAccuracy94.8Unverified
7Baek et al.Accuracy94.4Unverified
8ViTSTRAccuracy94.3Unverified
9AONAccuracy91.5Unverified
10RAREAccuracy90.1Unverified
#ModelMetricClaimedVerifiedStatus
1CLIP4STR-H (DFN-5B)1:1 Accuracy90.9Unverified
2CLIP4STR-L (DataComp-1B)1:1 Accuracy90.6Unverified
3CLIP4STR-L1:1 Accuracy88.8Unverified
4CLIP4STR-B1:1 Accuracy87Unverified
5CCD-ViT-Base1:1 Accuracy86Unverified
#ModelMetricClaimedVerifiedStatus
1CLIP4STR-L (DataComp-1B)Accuracy (%)86.4Unverified
2CLIP4STR-LAccuracy (%)85.9Unverified
3CLIP4STR-BAccuracy (%)85.8Unverified
4MGP-STRAccuracy (%)85.5Unverified
#ModelMetricClaimedVerifiedStatus
1CLIP4STR-L1:1 Accuracy81.9Unverified
2MGP-STR1:1 Accuracy81.7Unverified
3CLIP4STR-B1:1 Accuracy81.1Unverified
#ModelMetricClaimedVerifiedStatus
1CLIP4STR-L1:1 Accuracy82.7Unverified
2CLIP4STR-B1:1 Accuracy79.8Unverified
3CCD-ViT-Base1:1 Accuracy77.3Unverified
#ModelMetricClaimedVerifiedStatus
1CLIP4STR-L (DataComp-1B)Accuracy (%)92.2Unverified
2MGP-STRAccuracy (%)91Unverified
3CLIP4STR-BAccuracy (%)86.8Unverified
#ModelMetricClaimedVerifiedStatus
1ABINet-LV+TPS++Accuracy97.8Unverified
#ModelMetricClaimedVerifiedStatus
1MLDGAverage Accuracy19.02Unverified
#ModelMetricClaimedVerifiedStatus
1ABINet-LV+TPS++Accuracy89.6Unverified