| Efficient and Accurate Scene Text Recognition with Cascaded-Transformers | Mar 24, 2025 | DecoderScene Text Recognition | —Unverified | 0 |
| Linguistics-aware Masked Image Modeling for Self-supervised Scene Text Recognition | Mar 24, 2025 | Contrastive LearningScene Text Recognition | CodeCode Available | 1 |
| Accurate Scene Text Recognition with Efficient Model Scaling and Cloze Self-Distillation | Mar 20, 2025 | DecoderScene Text Recognition | —Unverified | 0 |
| A Context-Driven Training-Free Network for Lightweight Scene Text Segmentation and Recognition | Mar 19, 2025 | Scene Text RecognitionText Detection | —Unverified | 0 |
| EventSTR: A Benchmark Dataset and Baselines for Event Stream based Scene Text Recognition | Feb 13, 2025 | Large Language ModelScene Text Recognition | CodeCode Available | 0 |
| Billet Number Recognition Based on Test-Time Adaptation | Feb 13, 2025 | Scene Text RecognitionTest-time Adaptation | —Unverified | 0 |
| Ocean-OCR: Towards General OCR Application via a Vision-Language Model | Jan 26, 2025 | document understandingLanguage Modeling | CodeCode Available | 1 |
| Arbitrary Reading Order Scene Text Spotter with Local Semantics Guidance | Dec 13, 2024 | Scene Text RecognitionText Spotting | —Unverified | 0 |
| TextSSR: Diffusion-based Data Synthesis for Scene Text Recognition | Dec 2, 2024 | Image GenerationOptical Character Recognition (OCR) | CodeCode Available | 2 |
| SVTRv2: CTC Beats Encoder-Decoder Models in Scene Text Recognition | Nov 24, 2024 | DecoderOptical Character Recognition (OCR) | CodeCode Available | 0 |
| Boosting Semi-Supervised Scene Text Recognition via Viewing and Summarizing | Nov 23, 2024 | Contrastive LearningScene Text Recognition | CodeCode Available | 0 |
| Relational Contrastive Learning and Masked Image Modeling for Scene Text Recognition | Nov 18, 2024 | Contrastive LearningRepresentation Learning | CodeCode Available | 0 |
| MCTBench: Multimodal Cognition towards Text-Rich Visual Scenes Benchmark | Oct 15, 2024 | FairnessScene Text Recognition | CodeCode Available | 0 |
| Stratified Domain Adaptation: A Progressive Self-Training Approach for Scene Text Recognition | Oct 13, 2024 | Domain AdaptationOptical Character Recognition (OCR) | CodeCode Available | 1 |
| Text Image Generation for Low-Resource Languages with Dual Translation Learning | Sep 26, 2024 | DiversityImage Generation | —Unverified | 0 |
| One Model for Two Tasks: Cooperatively Recognizing and Recovering Low-Resolution Scene Text Images by Iterative Mutual Guidance | Sep 22, 2024 | Image Super-ResolutionScene Text Recognition | —Unverified | 0 |
| Scene-Text Grounding for Text-Based Video Question Answering | Sep 22, 2024 | 2kContrastive Learning | CodeCode Available | 1 |
| VL-Reader: Vision and Language Reconstructor is an Effective Scene Text Recognizer | Sep 18, 2024 | DecoderScene Text Recognition | —Unverified | 0 |
| Platypus: A Generalized Specialist Model for Reading Text in Various Forms | Aug 27, 2024 | Handwritten Text RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 |
| Decoder Pre-Training with only Text for Scene Text Recognition | Aug 11, 2024 | DecoderScene Text Recognition | CodeCode Available | 0 |
| LEGO: Self-Supervised Representation Learning for Scene Text Images | Aug 4, 2024 | Representation LearningScene Text Recognition | —Unverified | 0 |
| MOoSE: Multi-Orientation Sharing Experts for Open-set Scene Text Recognition | Jul 26, 2024 | Mixture-of-ExpertsScene Text Recognition | CodeCode Available | 0 |
| Out of Length Text Recognition with Sub-String Matching | Jul 17, 2024 | Scene Text Recognition | CodeCode Available | 0 |
| Focus on the Whole Character: Discriminative Character Modeling for Scene Text Recognition | Jul 8, 2024 | Scene Text Recognition | CodeCode Available | 0 |
| A General Framework for Jersey Number Recognition in Sports Video | May 22, 2024 | Jersey Number RecognitionScene Text Recognition | CodeCode Available | 2 |