| Recognition-Guided Diffusion Model for Scene Text Image Super-Resolution | Nov 22, 2023 | Denoising, Diversity | Unverified | 0 |
| Scene Text Image Super-resolution based on Text-conditional Diffusion Models | Nov 16, 2023 | Image Generation, Image Super-Resolution | Code Available | 1 |
| Exploring OCR Capabilities of GPT-4V(ision) : A Quantitative and In-depth Evaluation | Oct 25, 2023 | Handwritten Text Recognition, Key Information Extraction | Code Available | 1 |
| DocXChain: A Powerful Open-Source Toolchain for Document Parsing and Beyond | Oct 19, 2023 | Document AI, Document Layout Analysis | Code Available | 0 |
| Scene Text Recognition Models Explainability Using Local Features | Oct 14, 2023 | Prediction, Scene Text Recognition | Code Available | 1 |
| Symmetrical Linguistic Feature Distillation with CLIP for Scene Text Recognition | Oct 8, 2023 | Image to text, Optical Character Recognition (OCR) | Code Available | 1 |
| Towards Large-scale Building Attribute Mapping using Crowdsourced Images: Scene Text Recognition on Flickr and Problems to be Solved | Sep 14, 2023 | Attribute, Scene Text Recognition | Code Available | 0 |
| Orientation-Independent Chinese Text Recognition in Scene Images | Sep 3, 2023 | Benchmarking, Image Reconstruction | Code Available | 2 |
| Chinese Text Recognition with A Pre-Trained CLIP-Like Model Through Image-IDS Aligning | Sep 3, 2023 | Scene Text Recognition | Code Available | 2 |
| DTrOCR: Decoder-only Transformer for Optical Character Recognition | Aug 30, 2023 | Decoder, Handwritten Text Recognition | Code Available | 2 |
| LISTER: Neighbor Decoding for Length-Insensitive Scene Text Recognition | Aug 24, 2023 | Decoder, Scene Text Recognition | Code Available | 0 |
| Show Me the World in My Language: Establishing the First Baseline for Scene-Text to Scene-Text Translation | Aug 6, 2023 | Machine Translation, Scene Text Editing | Code Available | 1 |
| Relational Contrastive Learning for Scene Text Recognition | Aug 1, 2023 | Contrastive Learning, Representation Learning | Code Available | 1 |
| Multi-Granularity Prediction with Learnable Fusion for Scene Text Recognition | Jul 25, 2023 | Language Modelling, Optical Character Recognition (OCR) | Code Available | 0 |
| Context Perception Parallel Decoder for Scene Text Recognition | Jul 23, 2023 | Decoder, Language Modelling | Code Available | 0 |
| Towards Robust Scene Text Image Super-resolution via Explicit Location Enhancement | Jul 19, 2023 | Image Super-Resolution, LEMMA | Code Available | 1 |
| Revisiting Scene Text Recognition: A Data Perspective | Jul 17, 2023 | Scene Text Recognition | Code Available | 2 |
| Reading Between the Lanes: Text VideoQA on the Road | Jul 8, 2023 | Question Answering, Scene Text Recognition | Code Available | 0 |
| DiffusionSTR: Diffusion Model for Scene Text Recognition | Jun 29, 2023 | Image to text, model | Unverified | 0 |
| Weakly Supervised Scene Text Generation for Low-resource Languages | Jun 25, 2023 | Scene Text Recognition, Text Generation | Unverified | 0 |
| Looking and Listening: Audio Guided Text Recognition | Jun 6, 2023 | Decoder, Scene Text Recognition | Code Available | 1 |
| Masked and Permuted Implicit Context Learning for Scene Text Recognition | May 25, 2023 | Decoder, Language Modeling | Code Available | 0 |
| MRN: Multiplexed Routing Network for Incremental Multilingual Text Recognition | May 24, 2023 | Continual Learning, Incremental Learning | Code Available | 1 |
| CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model | May 23, 2023 | Decoder, Language Modeling | Code Available | 1 |
| Linguistic More: Taking a Further Step toward Efficient and Accurate Scene Text Recognition | May 9, 2023 | Scene Text Recognition | Code Available | 1 |