| Ocean-OCR: Towards General OCR Application via a Vision-Language Model | Jan 26, 2025 | document understandingLanguage Modeling | CodeCode Available | 1 |
| Data Augmentation for Scene Text Recognition | Aug 16, 2021 | Data AugmentationImage Augmentation | CodeCode Available | 1 |
| Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features | Nov 30, 2021 | Scene Text Recognition | CodeCode Available | 1 |
| Decoupled Attention Network for Text Recognition | Dec 21, 2019 | DecoderHandwritten Text Recognition | CodeCode Available | 1 |
| Convolutional Neural Networks with Gated Recurrent Connections | Jun 5, 2021 | object-detectionObject Detection | CodeCode Available | 1 |
| Multimodal Semi-Supervised Learning for Text Recognition | May 8, 2022 | Language ModellingRepresentation Learning | CodeCode Available | 1 |
| Dictionary-Guided Scene Text Recognition | Jun 19, 2021 | Scene Text DetectionScene Text Recognition | CodeCode Available | 1 |
| TPS++: Attention-Enhanced Thin-Plate Spline for Scene Text Recognition | May 9, 2023 | Optical Character Recognition (OCR)Scene Text Recognition | CodeCode Available | 1 |
| SimAN: Exploring Self-Supervised Representation Learning of Scene Text via Similarity-Aware Normalization | Mar 20, 2022 | Common Sense ReasoningContrastive Learning | CodeCode Available | 1 |
| Reading and Writing: Discriminative and Generative Modeling for Self-Supervised Text Recognition | Jul 1, 2022 | Contrastive LearningScene Text Recognition | CodeCode Available | 1 |
| Primitive Representation Learning for Scene Text Recognition | May 10, 2021 | DecoderRepresentation Learning | CodeCode Available | 1 |
| Masked Vision-Language Transformers for Scene Text Recognition | Nov 9, 2022 | DecoderScene Text Recognition | CodeCode Available | 1 |
| Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition | Mar 11, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| AutoSTR: Efficient Backbone Search for Scene Text Recognition | Mar 14, 2020 | DeblurringDiversity | CodeCode Available | 1 |
| PIMNet: A Parallel, Iterative and Mimicking Network for Scene Text Recognition | Sep 9, 2021 | DecoderScene Text Recognition | CodeCode Available | 1 |
| Linguistic More: Taking a Further Step toward Efficient and Accurate Scene Text Recognition | May 9, 2023 | Scene Text Recognition | CodeCode Available | 1 |
| Self-supervised Implicit Glyph Attention for Text Recognition | Mar 7, 2022 | Scene Text RecognitionText Segmentation | CodeCode Available | 1 |
| Efficient scene text image super-resolution with semantic guidance | Mar 20, 2024 | Image Super-ResolutionScene Text Recognition | CodeCode Available | 1 |
| B-Spline Texture Coefficients Estimator for Screen Content Image Super-Resolution | Jan 1, 2023 | Image Super-ResolutionScene Text Recognition | CodeCode Available | 1 |
| Pushing the Performance Limit of Scene Text Recognizer without Human Annotation | Apr 16, 2022 | Scene Text Recognition | CodeCode Available | 1 |
| CDistNet: Perceiving Multi-Domain Character Distance for Robust Text Recognition | Nov 22, 2021 | PositionScene Text Recognition | CodeCode Available | 1 |
| PlugNet: Degradation Aware Scene Text Recognition Supervised by a Pluggable Super-Resolution Unit | Aug 1, 2020 | DiversityMulti-Task Learning | CodeCode Available | 1 |
| CentripetalText: An Efficient Text Instance Representation for Scene Text Detection | Jul 13, 2021 | regressionScene Text Detection | CodeCode Available | 1 |
| Exploring OCR Capabilities of GPT-4V(ision) : A Quantitative and In-depth Evaluation | Oct 25, 2023 | Handwritten Text RecognitionKey Information Extraction | CodeCode Available | 1 |
| Scene Text Recognition Models Explainability Using Local Features | Oct 14, 2023 | PredictionScene Text Recognition | CodeCode Available | 1 |
| Relational Contrastive Learning for Scene Text Recognition | Aug 1, 2023 | Contrastive LearningRepresentation Learning | CodeCode Available | 1 |
| Stratified Domain Adaptation: A Progressive Self-Training Approach for Scene Text Recognition | Oct 13, 2024 | Domain AdaptationOptical Character Recognition (OCR) | CodeCode Available | 1 |
| Self-supervised Character-to-Character Distillation for Text Recognition | Nov 1, 2022 | Data AugmentationRepresentation Learning | CodeCode Available | 1 |
| An Empirical Study of Scaling Law for OCR | Dec 29, 2023 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 1 |
| Linguistics-aware Masked Image Modeling for Self-supervised Scene Text Recognition | Mar 24, 2025 | Contrastive LearningScene Text Recognition | CodeCode Available | 1 |
| From Two to One: A New Scene Text Recognizer with Visual Language Modeling Network | Aug 22, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting | Nov 19, 2022 | BlockingLanguage Modeling | CodeCode Available | 1 |
| UnrealText: Synthesizing Realistic Scene Text Images from the Unreal World | Mar 24, 2020 | Image GenerationScene Text Detection | CodeCode Available | 1 |
| Double Supervised Network with Attention Mechanism for Scene Text Recognition | Aug 2, 2018 | Scene Text Recognition | —Unverified | 0 |
| A fine-grained approach to scene text script identification | Feb 24, 2016 | Scene Text RecognitionText Detection | —Unverified | 0 |
| One Model for Two Tasks: Cooperatively Recognizing and Recovering Low-Resolution Scene Text Images by Iterative Mutual Guidance | Sep 22, 2024 | Image Super-ResolutionScene Text Recognition | —Unverified | 0 |
| Mixed Text Recognition with Efficient Parameter Fine-Tuning and Transformer | Apr 19, 2024 | DecoderOptical Character Recognition | —Unverified | 0 |
| DiffusionSTR: Diffusion Model for Scene Text Recognition | Jun 29, 2023 | Image to textmodel | —Unverified | 0 |
| Benchmarking Scene Text Recognition in Devanagari, Telugu and Malayalam | Apr 9, 2021 | BenchmarkingScene Text Recognition | —Unverified | 0 |
| Diffusion in the Dark: A Diffusion Model for Low-Light Text Recognition | Mar 7, 2023 | Image ReconstructionScene Text Recognition | —Unverified | 0 |
| Augmented Transformers with Adaptive n-grams Embedding for Multilingual Scene Text Recognition | Feb 28, 2023 | Language IdentificationScene Text Recognition | —Unverified | 0 |
| Deep Learning based Isolated Arabic Scene Character Recognition | Apr 22, 2017 | Deep LearningScene Text Recognition | —Unverified | 0 |
| Decoupling Visual-Semantic Feature Learning for Robust Scene Text Recognition | Nov 24, 2021 | DecoderScene Text Recognition | —Unverified | 0 |
| Accurate Scene Text Recognition with Efficient Model Scaling and Cloze Self-Distillation | Mar 20, 2025 | DecoderScene Text Recognition | —Unverified | 0 |
| LEGO: Self-Supervised Representation Learning for Scene Text Images | Aug 4, 2024 | Representation LearningScene Text Recognition | —Unverified | 0 |
| On Vocabulary Reliance in Scene Text Recognition | May 8, 2020 | Scene Text Recognition | —Unverified | 0 |
| Adaptive Embedding Gate for Attention-Based Scene Text Recognition | Aug 26, 2019 | DecoderScene Text Recognition | —Unverified | 0 |
| Cursive Scene Text Analysis by Deep Convolutional Linear Pyramids | Sep 27, 2018 | object-detectionObject Detection | —Unverified | 0 |
| Arbitrary Reading Order Scene Text Spotter with Local Semantics Guidance | Dec 13, 2024 | Scene Text RecognitionText Spotting | —Unverified | 0 |
| IndicSTR12: A Dataset for Indic Scene Text Recognition | Mar 12, 2024 | BenchmarkingScene Text Recognition | —Unverified | 0 |