| Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer | Nov 22, 2023 | DiversityIn-Context Learning | CodeCode Available | 1 |
| Scene Text Image Super-resolution based on Text-conditional Diffusion Models | Nov 16, 2023 | Image GenerationImage Super-Resolution | CodeCode Available | 1 |
| Exploring OCR Capabilities of GPT-4V(ision) : A Quantitative and In-depth Evaluation | Oct 25, 2023 | Handwritten Text RecognitionKey Information Extraction | CodeCode Available | 1 |
| DocXChain: A Powerful Open-Source Toolchain for Document Parsing and Beyond | Oct 19, 2023 | Document AIDocument Layout Analysis | CodeCode Available | 0 |
| Scene Text Recognition Models Explainability Using Local Features | Oct 14, 2023 | PredictionScene Text Recognition | CodeCode Available | 1 |
| Symmetrical Linguistic Feature Distillation with CLIP for Scene Text Recognition | Oct 8, 2023 | Image to textOptical Character Recognition (OCR) | CodeCode Available | 1 |
| Towards Large-scale Building Attribute Mapping using Crowdsourced Images: Scene Text Recognition on Flickr and Problems to be Solved | Sep 14, 2023 | AttributeScene Text Recognition | CodeCode Available | 0 |
| Orientation-Independent Chinese Text Recognition in Scene Images | Sep 3, 2023 | BenchmarkingImage Reconstruction | CodeCode Available | 2 |
| Chinese Text Recognition with A Pre-Trained CLIP-Like Model Through Image-IDS Aligning | Sep 3, 2023 | Scene Text Recognition | CodeCode Available | 2 |
| DTrOCR: Decoder-only Transformer for Optical Character Recognition | Aug 30, 2023 | DecoderHandwritten Text Recognition | CodeCode Available | 2 |
| LISTER: Neighbor Decoding for Length-Insensitive Scene Text Recognition | Aug 24, 2023 | DecoderScene Text Recognition | CodeCode Available | 0 |
| Show Me the World in My Language: Establishing the First Baseline for Scene-Text to Scene-Text Translation | Aug 6, 2023 | Machine TranslationScene Text Editing | CodeCode Available | 1 |
| Relational Contrastive Learning for Scene Text Recognition | Aug 1, 2023 | Contrastive LearningRepresentation Learning | CodeCode Available | 1 |
| Multi-Granularity Prediction with Learnable Fusion for Scene Text Recognition | Jul 25, 2023 | Language ModellingOptical Character Recognition (OCR) | CodeCode Available | 0 |
| Context Perception Parallel Decoder for Scene Text Recognition | Jul 23, 2023 | DecoderLanguage Modelling | CodeCode Available | 0 |
| Towards Robust Scene Text Image Super-resolution via Explicit Location Enhancement | Jul 19, 2023 | Image Super-ResolutionLEMMA | CodeCode Available | 1 |
| Revisiting Scene Text Recognition: A Data Perspective | Jul 17, 2023 | Scene Text Recognition | CodeCode Available | 2 |
| Reading Between the Lanes: Text VideoQA on the Road | Jul 8, 2023 | Question AnsweringScene Text Recognition | CodeCode Available | 0 |
| DiffusionSTR: Diffusion Model for Scene Text Recognition | Jun 29, 2023 | Image to textmodel | —Unverified | 0 |
| Weakly Supervised Scene Text Generation for Low-resource Languages | Jun 25, 2023 | Scene Text RecognitionText Generation | —Unverified | 0 |
| Looking and Listening: Audio Guided Text Recognition | Jun 6, 2023 | DecoderScene Text Recognition | CodeCode Available | 1 |
| Masked and Permuted Implicit Context Learning for Scene Text Recognition | May 25, 2023 | DecoderLanguage Modeling | CodeCode Available | 0 |
| MRN: Multiplexed Routing Network for Incremental Multilingual Text Recognition | May 24, 2023 | Continual LearningIncremental Learning | CodeCode Available | 1 |
| CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model | May 23, 2023 | DecoderLanguage Modeling | CodeCode Available | 1 |
| Linguistic More: Taking a Further Step toward Efficient and Accurate Scene Text Recognition | May 9, 2023 | Scene Text Recognition | CodeCode Available | 1 |
| TPS++: Attention-Enhanced Thin-Plate Spline for Scene Text Recognition | May 9, 2023 | Optical Character Recognition (OCR)Scene Text Recognition | CodeCode Available | 1 |
| Scene Text Recognition with Image-Text Matching-guided Dictionary | May 8, 2023 | Image-text matchingLanguage Modeling | —Unverified | 0 |
| Improving Scene Text Recognition for Character-Level Long-Tailed Distribution | Mar 31, 2023 | Scene Text Recognition | —Unverified | 0 |
| Context-Aware Selective Label Smoothing for Calibrating Sequence Recognition Model | Mar 13, 2023 | Decision MakingScene Text Recognition | —Unverified | 0 |
| Diffusion in the Dark: A Diffusion Model for Low-Light Text Recognition | Mar 7, 2023 | Image ReconstructionScene Text Recognition | —Unverified | 0 |
| Augmented Transformers with Adaptive n-grams Embedding for Multilingual Scene Text Recognition | Feb 28, 2023 | Language IdentificationScene Text Recognition | —Unverified | 0 |
| Geometric Perception based Efficient Text Recognition | Feb 8, 2023 | Scene Text Recognition | CodeCode Available | 0 |
| CLIPTER: Looking at the Bigger Picture in Scene Text Recognition | Jan 18, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| B-Spline Texture Coefficients Estimator for Screen Content Image Super-Resolution | Jan 1, 2023 | Image Super-ResolutionScene Text Recognition | CodeCode Available | 1 |
| ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting | Nov 19, 2022 | BlockingLanguage Modeling | CodeCode Available | 1 |
| Portmanteauing Features for Scene Text Recognition | Nov 9, 2022 | Scene Text Recognition | —Unverified | 0 |
| Masked Vision-Language Transformers for Scene Text Recognition | Nov 9, 2022 | DecoderScene Text Recognition | CodeCode Available | 1 |
| Pure Transformer with Integrated Experts for Scene Text Recognition | Nov 9, 2022 | DecoderScene Text Recognition | —Unverified | 0 |
| Self-supervised Character-to-Character Distillation for Text Recognition | Nov 1, 2022 | Data AugmentationRepresentation Learning | CodeCode Available | 1 |
| Scene Text Recognition with Semantics | Oct 19, 2022 | Scene Text Recognition | —Unverified | 0 |
| Scene Text Image Super-Resolution via Content Perceptual Loss and Criss-Cross Transformer Blocks | Oct 13, 2022 | Image ReconstructionImage Super-Resolution | —Unverified | 0 |
| Reading Chinese in Natural Scenes with a Bag-of-Radicals Prior | Oct 5, 2022 | Scene Text Recognition | —Unverified | 0 |
| Out-of-Vocabulary Challenge Report | Sep 14, 2022 | Optical Character RecognitionOptical Character Recognition (OCR) | —Unverified | 0 |
| Levenshtein OCR | Sep 8, 2022 | Imitation LearningOptical Character Recognition (OCR) | CodeCode Available | 0 |
| Multi-Granularity Prediction for Scene Text Recognition | Sep 8, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Scene Text Recognition with Single-Point Decoding Network | Sep 5, 2022 | Scene Text Recognition | —Unverified | 0 |
| Vision-Language Adaptive Mutual Decoder for OOV-STR | Sep 2, 2022 | DecoderLanguage Modeling | —Unverified | 0 |
| 1st Place Solution to ECCV 2022 Challenge on Out of Vocabulary Scene Text Understanding: End-to-End Recognition of Out of Vocabulary Words | Sep 1, 2022 | Autonomous DrivingScene Text Recognition | —Unverified | 0 |
| Toward Understanding WordArt: Corner-Guided Transformer for Scene Text Recognition | Jul 31, 2022 | Scene Text Recognition | CodeCode Available | 1 |
| Optimal Boxes: Boosting End-to-End Scene Text Recognition by Adjusting Annotated Bounding Boxes via Reinforcement Learning | Jul 25, 2022 | Domain AdaptationOptical Character Recognition (OCR) | —Unverified | 0 |