| An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition | Jul 21, 2015 | Optical Character Recognition (OCR), Scene Text Recognition | Code Available | 4 |
| TextSSR: Diffusion-based Data Synthesis for Scene Text Recognition | Dec 2, 2024 | Image Generation, Optical Character Recognition (OCR) | Code Available | 2 |
| A General Framework for Jersey Number Recognition in Sports Video | May 22, 2024 | Jersey Number Recognition, Scene Text Recognition | Code Available | 2 |
| Text Image Inpainting via Global Structure-Guided Diffusion Models | Jan 26, 2024 | Image Inpainting, Scene Text Recognition | Code Available | 2 |
| An Empirical Study of Scaling Law for Scene Text Recognition | Jan 1, 2024 | Optical Character Recognition (OCR), Scene Text Recognition | Code Available | 2 |
| Chinese Text Recognition with A Pre-Trained CLIP-Like Model Through Image-IDS Aligning | Sep 3, 2023 | Scene Text Recognition | Code Available | 2 |
| Orientation-Independent Chinese Text Recognition in Scene Images | Sep 3, 2023 | Benchmarking, Image Reconstruction | Code Available | 2 |
| DTrOCR: Decoder-only Transformer for Optical Character Recognition | Aug 30, 2023 | Decoder, Handwritten Text Recognition | Code Available | 2 |
| Revisiting Scene Text Recognition: A Data Perspective | Jul 17, 2023 | Scene Text Recognition | Code Available | 2 |
| Scene Text Recognition with Permuted Autoregressive Sequence Models | Jul 14, 2022 | Language Modeling, Language Modelling | Code Available | 2 |
| GIT: A Generative Image-to-text Transformer for Vision and Language | May 27, 2022 | Decoder, Image Captioning | Code Available | 2 |
| Linguistics-aware Masked Image Modeling for Self-supervised Scene Text Recognition | Mar 24, 2025 | Contrastive Learning, Scene Text Recognition | Code Available | 1 |
| Ocean-OCR: Towards General OCR Application via a Vision-Language Model | Jan 26, 2025 | document understanding, Language Modeling | Code Available | 1 |
| Stratified Domain Adaptation: A Progressive Self-Training Approach for Scene Text Recognition | Oct 13, 2024 | Domain Adaptation, Optical Character Recognition (OCR) | Code Available | 1 |
| Scene-Text Grounding for Text-Based Video Question Answering | Sep 22, 2024 | 2k, Contrastive Learning | Code Available | 1 |
| Self-Supervised Pre-training with Symmetric Superimposition Modeling for Scene Text Recognition | May 9, 2024 | Contrastive Learning, Scene Text Recognition | Code Available | 1 |
| Efficient scene text image super-resolution with semantic guidance | Mar 20, 2024 | Image Super-Resolution, Scene Text Recognition | Code Available | 1 |
| Class-Aware Mask-Guided Feature Refinement for Scene Text Recognition | Feb 21, 2024 | Diversity, Scene Text Recognition | Code Available | 1 |
| SVIPTR: Fast and Efficient Scene Text Recognition with Vision Permutable Extractor | Jan 18, 2024 | Decoder, Scene Text Recognition | Code Available | 1 |
| An Empirical Study of Scaling Law for OCR | Dec 29, 2023 | Optical Character Recognition, Optical Character Recognition (OCR) | Code Available | 1 |
| Cross-Lingual Learning in Multilingual Scene Text Recognition | Dec 17, 2023 | Scene Text Recognition | Code Available | 1 |
| Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer | Nov 22, 2023 | Diversity, In-Context Learning | Code Available | 1 |
| Scene Text Image Super-resolution based on Text-conditional Diffusion Models | Nov 16, 2023 | Image Generation, Image Super-Resolution | Code Available | 1 |
| Exploring OCR Capabilities of GPT-4V(ision) : A Quantitative and In-depth Evaluation | Oct 25, 2023 | Handwritten Text Recognition, Key Information Extraction | Code Available | 1 |
| Scene Text Recognition Models Explainability Using Local Features | Oct 14, 2023 | Prediction, Scene Text Recognition | Code Available | 1 |