| Linguistics-aware Masked Image Modeling for Self-supervised Scene Text Recognition | Mar 24, 2025 | Contrastive LearningScene Text Recognition | CodeCode Available | 1 |
| Efficient and Accurate Scene Text Recognition with Cascaded-Transformers | Mar 24, 2025 | DecoderScene Text Recognition | —Unverified | 0 |
| Accurate Scene Text Recognition with Efficient Model Scaling and Cloze Self-Distillation | Mar 20, 2025 | DecoderScene Text Recognition | —Unverified | 0 |
| A Context-Driven Training-Free Network for Lightweight Scene Text Segmentation and Recognition | Mar 19, 2025 | Scene Text RecognitionText Detection | —Unverified | 0 |
| EventSTR: A Benchmark Dataset and Baselines for Event Stream based Scene Text Recognition | Feb 13, 2025 | Large Language ModelScene Text Recognition | CodeCode Available | 0 |
| Billet Number Recognition Based on Test-Time Adaptation | Feb 13, 2025 | Scene Text RecognitionTest-time Adaptation | —Unverified | 0 |
| Ocean-OCR: Towards General OCR Application via a Vision-Language Model | Jan 26, 2025 | document understandingLanguage Modeling | CodeCode Available | 1 |
| Arbitrary Reading Order Scene Text Spotter with Local Semantics Guidance | Dec 13, 2024 | Scene Text RecognitionText Spotting | —Unverified | 0 |
| TextSSR: Diffusion-based Data Synthesis for Scene Text Recognition | Dec 2, 2024 | Image GenerationOptical Character Recognition (OCR) | CodeCode Available | 2 |
| SVTRv2: CTC Beats Encoder-Decoder Models in Scene Text Recognition | Nov 24, 2024 | DecoderOptical Character Recognition (OCR) | CodeCode Available | 0 |
| Boosting Semi-Supervised Scene Text Recognition via Viewing and Summarizing | Nov 23, 2024 | Contrastive LearningScene Text Recognition | CodeCode Available | 0 |
| Relational Contrastive Learning and Masked Image Modeling for Scene Text Recognition | Nov 18, 2024 | Contrastive LearningRepresentation Learning | CodeCode Available | 0 |
| MCTBench: Multimodal Cognition towards Text-Rich Visual Scenes Benchmark | Oct 15, 2024 | FairnessScene Text Recognition | CodeCode Available | 0 |
| Stratified Domain Adaptation: A Progressive Self-Training Approach for Scene Text Recognition | Oct 13, 2024 | Domain AdaptationOptical Character Recognition (OCR) | CodeCode Available | 1 |
| Text Image Generation for Low-Resource Languages with Dual Translation Learning | Sep 26, 2024 | DiversityImage Generation | —Unverified | 0 |
| One Model for Two Tasks: Cooperatively Recognizing and Recovering Low-Resolution Scene Text Images by Iterative Mutual Guidance | Sep 22, 2024 | Image Super-ResolutionScene Text Recognition | —Unverified | 0 |
| Scene-Text Grounding for Text-Based Video Question Answering | Sep 22, 2024 | 2kContrastive Learning | CodeCode Available | 1 |
| VL-Reader: Vision and Language Reconstructor is an Effective Scene Text Recognizer | Sep 18, 2024 | DecoderScene Text Recognition | —Unverified | 0 |
| Platypus: A Generalized Specialist Model for Reading Text in Various Forms | Aug 27, 2024 | Handwritten Text RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 |
| Decoder Pre-Training with only Text for Scene Text Recognition | Aug 11, 2024 | DecoderScene Text Recognition | CodeCode Available | 0 |
| LEGO: Self-Supervised Representation Learning for Scene Text Images | Aug 4, 2024 | Representation LearningScene Text Recognition | —Unverified | 0 |
| MOoSE: Multi-Orientation Sharing Experts for Open-set Scene Text Recognition | Jul 26, 2024 | Mixture-of-ExpertsScene Text Recognition | CodeCode Available | 0 |
| Out of Length Text Recognition with Sub-String Matching | Jul 17, 2024 | Scene Text Recognition | CodeCode Available | 0 |
| Focus on the Whole Character: Discriminative Character Modeling for Scene Text Recognition | Jul 8, 2024 | Scene Text Recognition | CodeCode Available | 0 |
| A General Framework for Jersey Number Recognition in Sports Video | May 22, 2024 | Jersey Number RecognitionScene Text Recognition | CodeCode Available | 2 |
| The First Swahili Language Scene Text Detection and Recognition Dataset | May 19, 2024 | Information RetrievalScene Text Detection | CodeCode Available | 0 |
| HAAP: Vision-context Hierarchical Attention Autoregressive with Adaptive Permutation for Scene Text Recognition | May 15, 2024 | DiversityLanguage Modeling | —Unverified | 0 |
| Self-Supervised Pre-training with Symmetric Superimposition Modeling for Scene Text Recognition | May 9, 2024 | Contrastive LearningScene Text Recognition | CodeCode Available | 1 |
| Choose What You Need: Disentangled Representation Learning for Scene Text Recognition, Removal and Editing | May 7, 2024 | DecoderRepresentation Learning | —Unverified | 0 |
| Mixed Text Recognition with Efficient Parameter Fine-Tuning and Transformer | Apr 19, 2024 | DecoderOptical Character Recognition | —Unverified | 0 |
| JSTR: Judgment Improves Scene Text Recognition | Apr 9, 2024 | Scene Text Recognition | —Unverified | 0 |
| Efficient scene text image super-resolution with semantic guidance | Mar 20, 2024 | Image Super-ResolutionScene Text Recognition | CodeCode Available | 1 |
| IndicSTR12: A Dataset for Indic Scene Text Recognition | Mar 12, 2024 | BenchmarkingScene Text Recognition | —Unverified | 0 |
| Open-Vocabulary Scene Text Recognition via Pseudo-Image Labeling and Margin Loss | Mar 12, 2024 | Image InpaintingOptical Character Recognition (OCR) | —Unverified | 0 |
| Efficiently Leveraging Linguistic Priors for Scene Text Spotting | Feb 27, 2024 | Scene Text RecognitionText Detection | —Unverified | 0 |
| Sequential Visual and Semantic Consistency for Semi-supervised Text Recognition | Feb 24, 2024 | Scene Text RecognitionSemantic Similarity | —Unverified | 0 |
| Class-Aware Mask-Guided Feature Refinement for Scene Text Recognition | Feb 21, 2024 | DiversityScene Text Recognition | CodeCode Available | 1 |
| Lumos : Empowering Multimodal LLMs with Scene Text Recognition | Feb 12, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Instruction-Guided Scene Text Recognition | Jan 31, 2024 | Question AnsweringScene Text Recognition | CodeCode Available | 0 |
| Text Image Inpainting via Global Structure-Guided Diffusion Models | Jan 26, 2024 | Image InpaintingScene Text Recognition | CodeCode Available | 2 |
| CMFN: Cross-Modal Fusion Network for Irregular Scene Text Recognition | Jan 18, 2024 | PositionScene Text Recognition | —Unverified | 0 |
| SVIPTR: Fast and Efficient Scene Text Recognition with Vision Permutable Extractor | Jan 18, 2024 | DecoderScene Text Recognition | CodeCode Available | 1 |
| An Empirical Study of Scaling Law for Scene Text Recognition | Jan 1, 2024 | Optical Character Recognition (OCR)Scene Text Recognition | CodeCode Available | 2 |
| OTE: Exploring Accurate Scene Text Recognition Using One Token | Jan 1, 2024 | DecoderScene Text Recognition | CodeCode Available | 0 |
| Choose What You Need: Disentangled Representation Learning for Scene Text Recognition Removal and Editing | Jan 1, 2024 | DecoderRepresentation Learning | —Unverified | 0 |
| An Empirical Study of Scaling Law for OCR | Dec 29, 2023 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 1 |
| IPAD: Iterative, Parallel, and Diffusion-based Network for Scene Text Recognition | Dec 19, 2023 | Conditional Text GenerationDecoder | CodeCode Available | 0 |
| Cross-Lingual Learning in Multilingual Scene Text Recognition | Dec 17, 2023 | Scene Text Recognition | CodeCode Available | 1 |
| STR-Cert: Robustness Certification for Deep Text Recognition on Deep Learning Pipelines and Vision Transformers | Nov 28, 2023 | Scene Text Recognition | —Unverified | 0 |
| Recognition-Guided Diffusion Model for Scene Text Image Super-Resolution | Nov 22, 2023 | DenoisingDiversity | —Unverified | 0 |