SOTAVerified

Scene Text Recognition

See Scene Text Detection for leaderboards in this task.

Papers

Showing 201250 of 269 papers

TitleStatusHype
A Feasible Framework for Arbitrary-Shaped Scene Text RecognitionCode0
Bidirectional Scene Text Recognition with a Single DecoderCode0
KISS: Keeping It Simple for Scene Text RecognitionCode0
Improving Long Handwritten Text Line Recognition with Convolutional Multi-way Associative Memory0
Scene Text Recognition with Temporal Convolutional Encoder0
On Recognizing Texts of Arbitrary Shapes with 2D Self-AttentionCode0
ICDAR2019 Robust Reading Challenge on Arbitrary-Shaped Text (RRC-ArT)Code0
Running Event Visualization using Videos from Multiple Cameras0
Focus-Enhanced Scene Text Recognition with Deformable ConvolutionsCode0
Adaptive Embedding Gate for Attention-Based Scene Text Recognition0
Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary ShapesCode0
Symmetry-constrained Rectification Network for Scene Text Recognition0
2D-CTC for Scene Text Recognition0
A Hardware-Oriented and Memory-Efficient Method for CTC Decoding0
FACLSTM: ConvLSTM with Focused Attention for Scene Text Recognition0
A Holistic Representation Guided Attention Network for Scene Text RecognitionCode0
Pyramid Mask Text DetectorCode0
Scene Text Synthesis for Efficient and Effective Deep Network Training0
SAFE: Scale Aware Feature Encoder for Scene Text Recognition0
A Multi-Object Rectified Attention Network for Scene Text RecognitionCode0
Recurrent Calibration Network for Irregular Text Recognition0
ESIR: End-to-end Scene Text Recognition via Iterative Image Rectification0
Simultaneous Recognition of Horizontal and Vertical Text in Natural Images0
Connectionist Temporal Classification with Maximum Entropy RegularizationCode0
Visual Re-ranking with Natural Language Understanding for Text SpottingCode0
Cursive Scene Text Analysis by Deep Convolutional Linear Pyramids0
Scene Text Recognition from Two-Dimensional Perspective0
Synthetically Supervised Feature Learning for Scene Text Recognition0
Double Supervised Network with Attention Mechanism for Scene Text Recognition0
Adaptive Adversarial Attack on Scene Text Recognition0
ASTER: An Attentional Scene Text Recognizer with Flexible RectificationCode0
Multilingual Scene Character Recognition System using Sparse Auto-Encoder for Efficient Local Features Representation in Bag of Features0
NRTR: A No-Recurrence Sequence-to-Sequence Model For Scene Text RecognitionCode0
SCAN: Sliding Convolutional Attention Network for Scene Text Recognition0
Edit Probability for Scene Text Recognition0
Char-Net: A Character-Aware Neural Network for Distorted Scene Text Recognition0
Pedestrian-Synthesis-GAN: Generating Pedestrian Data in Real Scene and BeyondCode0
TextBoxes++: A Single-Shot Oriented Scene Text DetectorCode0
FOTS: Fast Oriented Text Spotting with a Unified NetworkCode0
SEE: Towards Semi-SupervisedEnd-to-End Scene Text Recognition0
SEE: Towards Semi-Supervised End-to-End Scene Text RecognitionCode0
AON: Towards Arbitrarily-Oriented Text RecognitionCode0
Unconstrained Scene Text and Video Text Recognition for Arabic Script0
Total-Text: A Comprehensive Dataset for Scene Text Detection and RecognitionCode0
AdaDNNs: Adaptive Ensemble of Deep Neural Networks for Scene Text Recognition0
Reading Scene Text with Attention Convolutional Sequence Modeling0
Focusing Attention: Towards Accurate Text Recognition in Natural Images0
Scene Text Recognition with Sliding Convolutional Character Models0
STN-OCR: A single Neural Network for Text Detection and Text RecognitionCode0
Visual attention models for scene text recognition0
Show:102550
← PrevPage 5 of 6Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CLIP4STR-L*Accuracy99.42Unverified
2DTrOCR 105MAccuracy99.4Unverified
3CLIP4STR-L (DataComp-1B)Accuracy99Unverified
4MGP-STRAccuracy98.5Unverified
5CLIP4STR-LAccuracy98.5Unverified
6CLIP4STR-BAccuracy98.3Unverified
7CCD-ViT-Base(ARD_2.8M)Accuracy98.3Unverified
8CCD-ViT-Small(ARD_2.8M)Accuracy98.3Unverified
9MATRNAccuracy97.9Unverified
10S-GTRAccuracy97.8Unverified
#ModelMetricClaimedVerifiedStatus
1CLIP4STR-H (DFN-5B)Accuracy99.1Unverified
2DTrOCR 105MAccuracy98.9Unverified
3CLIP4STR-B*Accuracy98.76Unverified
4MGP-STRAccuracy98.6Unverified
5CLIP4STR-L (DataComp-1B)Accuracy98.6Unverified
6CLIP4STR-LAccuracy98.5Unverified
7CPPDAccuracy98.5Unverified
8CLIP4STR-BAccuracy98.3Unverified
9CCD-ViT-Base(ARD_2.8M)Accuracy97.8Unverified
10CCD-ViT-Small(ARD_2.8M)Accuracy96.4Unverified
#ModelMetricClaimedVerifiedStatus
1DTrOCR 105MAccuracy93.5Unverified
2CLIP4STR-L*Accuracy92.6Unverified
3CPPDAccuracy91.7Unverified
4CLIP4STR-L (DataComp-1B)Accuracy91.4Unverified
5MGP-STRAccuracy90.9Unverified
6CLIP4STR-LAccuracy90.8Unverified
7CLIP4STR-BAccuracy90.6Unverified
8SIGA_SAccuracy87.6Unverified
9S-GTRAccuracy87.3Unverified
10MATRNAccuracy86.6Unverified
#ModelMetricClaimedVerifiedStatus
1CPPDAccuracy99.7Unverified
2CLIP4STR-L (DataComp-1B)Accuracy99.7Unverified
3CLIP4STR-B*Accuracy99.65Unverified
4MGP-STRAccuracy99.31Unverified
5CLIP4STR-BAccuracy99.3Unverified
6DTrOCR 105MAccuracy99.1Unverified
7CLIP4STR-LAccuracy99Unverified
8CCD-ViT-Base(ARD_2.8M)Accuracy98.3Unverified
9CCD-ViT-Small(ARD_2.8M)Accuracy98.3Unverified
10CCD-ViT-Tiny(ARD_2.8M)Accuracy95.8Unverified
#ModelMetricClaimedVerifiedStatus
1DTrOCR 105MAccuracy99.6Unverified
2CLIP4STR-L (DataComp-1B)Accuracy99.6Unverified
3CLIP4STR-LAccuracy99.5Unverified
4CLIP4STR-B (DataComp-1B)Accuracy99.5Unverified
5CPPDAccuracy99.3Unverified
6CLIP4STR-BAccuracy99.2Unverified
7MGP-STRAccuracy98.8Unverified
8CCD-ViT-Base(ARD_2.8M)Accuracy98Unverified
9CCD-ViT-Small(ARD_2.8M)Accuracy98Unverified
10S-GTRAccuracy97.5Unverified
#ModelMetricClaimedVerifiedStatus
1DTrOCR 105MAccuracy98.6Unverified
2MGP-STRAccuracy98.3Unverified
3CLIP4STR-L*Accuracy98.13Unverified
4CLIP4STR-L (DataComp-1B)Accuracy98.1Unverified
5CLIP4STR-LAccuracy97.4Unverified
6CLIP4STR-BAccuracy97.2Unverified
7CPPDAccuracy96.7Unverified
8CCD-ViT-BaseAccuracy96.1Unverified
9CCD-ViT-SmallAccuracy92.7Unverified
10CCD-ViT-TinyAccuracy91.6Unverified
#ModelMetricClaimedVerifiedStatus
1Yet Another Text RecognizerAccuracy97.1Unverified
2SIGA_TAccuracy97Unverified
3SATRNAccuracy96.7Unverified
4DANAccuracy95Unverified
5SAFLAccuracy95Unverified
6CSTRAccuracy94.8Unverified
7Baek et al.Accuracy94.4Unverified
8ViTSTRAccuracy94.3Unverified
9AONAccuracy91.5Unverified
10RAREAccuracy90.1Unverified
#ModelMetricClaimedVerifiedStatus
1CLIP4STR-H (DFN-5B)1:1 Accuracy90.9Unverified
2CLIP4STR-L (DataComp-1B)1:1 Accuracy90.6Unverified
3CLIP4STR-L1:1 Accuracy88.8Unverified
4CLIP4STR-B1:1 Accuracy87Unverified
5CCD-ViT-Base1:1 Accuracy86Unverified
#ModelMetricClaimedVerifiedStatus
1CLIP4STR-L (DataComp-1B)Accuracy (%)86.4Unverified
2CLIP4STR-LAccuracy (%)85.9Unverified
3CLIP4STR-BAccuracy (%)85.8Unverified
4MGP-STRAccuracy (%)85.5Unverified
#ModelMetricClaimedVerifiedStatus
1CLIP4STR-L1:1 Accuracy81.9Unverified
2MGP-STR1:1 Accuracy81.7Unverified
3CLIP4STR-B1:1 Accuracy81.1Unverified
#ModelMetricClaimedVerifiedStatus
1CLIP4STR-L1:1 Accuracy82.7Unverified
2CLIP4STR-B1:1 Accuracy79.8Unverified
3CCD-ViT-Base1:1 Accuracy77.3Unverified
#ModelMetricClaimedVerifiedStatus
1CLIP4STR-L (DataComp-1B)Accuracy (%)92.2Unverified
2MGP-STRAccuracy (%)91Unverified
3CLIP4STR-BAccuracy (%)86.8Unverified
#ModelMetricClaimedVerifiedStatus
1ABINet-LV+TPS++Accuracy97.8Unverified
#ModelMetricClaimedVerifiedStatus
1MLDGAverage Accuracy19.02Unverified
#ModelMetricClaimedVerifiedStatus
1ABINet-LV+TPS++Accuracy89.6Unverified