| An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition | Jul 21, 2015 | Optical Character Recognition (OCR)Scene Text Recognition | CodeCode Available | 4 | 5 |
| Chinese Text Recognition with A Pre-Trained CLIP-Like Model Through Image-IDS Aligning | Sep 3, 2023 | Scene Text Recognition | CodeCode Available | 2 | 5 |
| Text Image Inpainting via Global Structure-Guided Diffusion Models | Jan 26, 2024 | Image InpaintingScene Text Recognition | CodeCode Available | 2 | 5 |
| Revisiting Scene Text Recognition: A Data Perspective | Jul 17, 2023 | Scene Text Recognition | CodeCode Available | 2 | 5 |
| TextSSR: Diffusion-based Data Synthesis for Scene Text Recognition | Dec 2, 2024 | Image GenerationOptical Character Recognition (OCR) | CodeCode Available | 2 | 5 |
| An Empirical Study of Scaling Law for Scene Text Recognition | Jan 1, 2024 | Optical Character Recognition (OCR)Scene Text Recognition | CodeCode Available | 2 | 5 |
| A General Framework for Jersey Number Recognition in Sports Video | May 22, 2024 | Jersey Number RecognitionScene Text Recognition | CodeCode Available | 2 | 5 |
| GIT: A Generative Image-to-text Transformer for Vision and Language | May 27, 2022 | DecoderImage Captioning | CodeCode Available | 2 | 5 |
| Scene Text Recognition with Permuted Autoregressive Sequence Models | Jul 14, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| Orientation-Independent Chinese Text Recognition in Scene Images | Sep 3, 2023 | BenchmarkingImage Reconstruction | CodeCode Available | 2 | 5 |
| DTrOCR: Decoder-only Transformer for Optical Character Recognition | Aug 30, 2023 | DecoderHandwritten Text Recognition | CodeCode Available | 2 | 5 |
| PlugNet: Degradation Aware Scene Text Recognition Supervised by a Pluggable Super-Resolution Unit | Aug 1, 2020 | DiversityMulti-Task Learning | CodeCode Available | 1 | 5 |
| Pushing the Performance Limit of Scene Text Recognizer without Human Annotation | Apr 16, 2022 | Scene Text Recognition | CodeCode Available | 1 | 5 |
| Relational Contrastive Learning for Scene Text Recognition | Aug 1, 2023 | Contrastive LearningRepresentation Learning | CodeCode Available | 1 | 5 |
| Gaussian Constrained Attention Network for Scene Text Recognition | Oct 19, 2020 | Scene Text Recognition | CodeCode Available | 1 | 5 |
| IterVM: Iterative Vision Modeling Module for Scene Text Recognition | Apr 6, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| PIMNet: A Parallel, Iterative and Mimicking Network for Scene Text Recognition | Sep 9, 2021 | DecoderScene Text Recognition | CodeCode Available | 1 | 5 |
| SCATTER: Selective Context Attentional Scene Text Recognizer | Mar 25, 2020 | Irregular Text RecognitionScene Text Recognition | CodeCode Available | 1 | 5 |
| Meta Self-Learning for Multi-Source Domain Adaptation: A Benchmark | Aug 24, 2021 | Domain AdaptationMeta-Learning | CodeCode Available | 1 | 5 |
| Multimodal Semi-Supervised Learning for Text Recognition | May 8, 2022 | Language ModellingRepresentation Learning | CodeCode Available | 1 | 5 |
| Class-Aware Mask-Guided Feature Refinement for Scene Text Recognition | Feb 21, 2024 | DiversityScene Text Recognition | CodeCode Available | 1 | 5 |
| Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer | Nov 22, 2023 | DiversityIn-Context Learning | CodeCode Available | 1 | 5 |
| Accurate, Data-Efficient, Unconstrained Text Recognition with Convolutional Neural Networks | Dec 31, 2018 | Handwriting RecognitionLicense Plate Recognition | CodeCode Available | 1 | 5 |
| Arabic Scene Text Recognition in the Deep Learning Era: Analysis on A Novel Dataset | Jul 27, 2021 | Scene Text RecognitionScene Understanding | CodeCode Available | 1 | 5 |
| An Empirical Study of Scaling Law for OCR | Dec 29, 2023 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 1 | 5 |
| On the Cross-dataset Generalization in License Plate Recognition | Jan 2, 2022 | Data AugmentationLicense Plate Detection | CodeCode Available | 1 | 5 |
| AutoSTR: Efficient Backbone Search for Scene Text Recognition | Mar 14, 2020 | DeblurringDiversity | CodeCode Available | 1 | 5 |
| Primitive Representation Learning for Scene Text Recognition | May 10, 2021 | DecoderRepresentation Learning | CodeCode Available | 1 | 5 |
| Reading and Writing: Discriminative and Generative Modeling for Self-Supervised Text Recognition | Jul 1, 2022 | Contrastive LearningScene Text Recognition | CodeCode Available | 1 | 5 |
| Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition | Mar 11, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features | Nov 30, 2021 | Scene Text Recognition | CodeCode Available | 1 | 5 |
| Looking and Listening: Audio Guided Text Recognition | Jun 6, 2023 | DecoderScene Text Recognition | CodeCode Available | 1 | 5 |
| CentripetalText: An Efficient Text Instance Representation for Scene Text Detection | Jul 13, 2021 | regressionScene Text Detection | CodeCode Available | 1 | 5 |
| CDistNet: Perceiving Multi-Domain Character Distance for Robust Text Recognition | Nov 22, 2021 | PositionScene Text Recognition | CodeCode Available | 1 | 5 |
| Dictionary-Guided Scene Text Recognition | Jun 19, 2021 | Scene Text DetectionScene Text Recognition | CodeCode Available | 1 | 5 |
| Masked Vision-Language Transformers for Scene Text Recognition | Nov 9, 2022 | DecoderScene Text Recognition | CodeCode Available | 1 | 5 |
| Decoupled Attention Network for Text Recognition | Dec 21, 2019 | DecoderHandwritten Text Recognition | CodeCode Available | 1 | 5 |
| B-Spline Texture Coefficients Estimator for Screen Content Image Super-Resolution | Jan 1, 2023 | Image Super-ResolutionScene Text Recognition | CodeCode Available | 1 | 5 |
| Self-supervised Implicit Glyph Attention for Text Recognition | Mar 7, 2022 | Scene Text RecognitionText Segmentation | CodeCode Available | 1 | 5 |
| ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting | Nov 19, 2022 | BlockingLanguage Modeling | CodeCode Available | 1 | 5 |
| Linguistic More: Taking a Further Step toward Efficient and Accurate Scene Text Recognition | May 9, 2023 | Scene Text Recognition | CodeCode Available | 1 | 5 |
| CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model | May 23, 2023 | DecoderLanguage Modeling | CodeCode Available | 1 | 5 |
| Data Augmentation for Scene Text Recognition | Aug 16, 2021 | Data AugmentationImage Augmentation | CodeCode Available | 1 | 5 |
| Efficient scene text image super-resolution with semantic guidance | Mar 20, 2024 | Image Super-ResolutionScene Text Recognition | CodeCode Available | 1 | 5 |
| MRN: Multiplexed Routing Network for Incremental Multilingual Text Recognition | May 24, 2023 | Continual LearningIncremental Learning | CodeCode Available | 1 | 5 |
| Exploring OCR Capabilities of GPT-4V(ision) : A Quantitative and In-depth Evaluation | Oct 25, 2023 | Handwritten Text RecognitionKey Information Extraction | CodeCode Available | 1 | 5 |
| From Two to One: A New Scene Text Recognizer with Visual Language Modeling Network | Aug 22, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Convolutional Neural Networks with Gated Recurrent Connections | Jun 5, 2021 | object-detectionObject Detection | CodeCode Available | 1 | 5 |
| Cross-Lingual Learning in Multilingual Scene Text Recognition | Dec 17, 2023 | Scene Text Recognition | CodeCode Available | 1 | 5 |
| Linguistics-aware Masked Image Modeling for Self-supervised Scene Text Recognition | Mar 24, 2025 | Contrastive LearningScene Text Recognition | CodeCode Available | 1 | 5 |