| Symmetrical Linguistic Feature Distillation with CLIP for Scene Text Recognition | Oct 8, 2023 | Image to text, Optical Character Recognition (OCR) | Code Available | 1 |
| Show Me the World in My Language: Establishing the First Baseline for Scene-Text to Scene-Text Translation | Aug 6, 2023 | Machine Translation, Scene Text Editing | Code Available | 1 |
| Relational Contrastive Learning for Scene Text Recognition | Aug 1, 2023 | Contrastive Learning, Representation Learning | Code Available | 1 |
| Towards Robust Scene Text Image Super-resolution via Explicit Location Enhancement | Jul 19, 2023 | Image Super-Resolution, LEMMA | Code Available | 1 |
| Looking and Listening: Audio Guided Text Recognition | Jun 6, 2023 | Decoder, Scene Text Recognition | Code Available | 1 |
| MRN: Multiplexed Routing Network for Incremental Multilingual Text Recognition | May 24, 2023 | Continual Learning, Incremental Learning | Code Available | 1 |
| CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model | May 23, 2023 | Decoder, Language Modeling | Code Available | 1 |
| Linguistic More: Taking a Further Step toward Efficient and Accurate Scene Text Recognition | May 9, 2023 | Scene Text Recognition | Code Available | 1 |
| TPS++: Attention-Enhanced Thin-Plate Spline for Scene Text Recognition | May 9, 2023 | Optical Character Recognition (OCR), Scene Text Recognition | Code Available | 1 |
| B-Spline Texture Coefficients Estimator for Screen Content Image Super-Resolution | Jan 1, 2023 | Image Super-Resolution, Scene Text Recognition | Code Available | 1 |
| ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting | Nov 19, 2022 | Blocking, Language Modeling | Code Available | 1 |
| Masked Vision-Language Transformers for Scene Text Recognition | Nov 9, 2022 | Decoder, Scene Text Recognition | Code Available | 1 |
| Self-supervised Character-to-Character Distillation for Text Recognition | Nov 1, 2022 | Data Augmentation, Representation Learning | Code Available | 1 |
| Toward Understanding WordArt: Corner-Guided Transformer for Scene Text Recognition | Jul 31, 2022 | Scene Text Recognition | Code Available | 1 |
| Reading and Writing: Discriminative and Generative Modeling for Self-Supervised Text Recognition | Jul 1, 2022 | Contrastive Learning, Scene Text Recognition | Code Available | 1 |
| Multimodal Semi-Supervised Learning for Text Recognition | May 8, 2022 | Language Modelling, Representation Learning | Code Available | 1 |
| Pushing the Performance Limit of Scene Text Recognizer without Human Annotation | Apr 16, 2022 | Scene Text Recognition | Code Available | 1 |
| IterVM: Iterative Vision Modeling Module for Scene Text Recognition | Apr 6, 2022 | Language Modeling, Language Modelling | Code Available | 1 |
| SimAN: Exploring Self-Supervised Representation Learning of Scene Text via Similarity-Aware Normalization | Mar 20, 2022 | Common Sense Reasoning, Contrastive Learning | Code Available | 1 |
| Training Protocol Matters: Towards Accurate Scene Text Recognition via Training Protocol Searching | Mar 13, 2022 | CPU, GPU | Code Available | 1 |
| Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement | Mar 9, 2022 | Document Enhancement, Image Enhancement | Code Available | 1 |
| Self-supervised Implicit Glyph Attention for Text Recognition | Mar 7, 2022 | Scene Text Recognition, Text Segmentation | Code Available | 1 |
| On the Cross-dataset Generalization in License Plate Recognition | Jan 2, 2022 | Data Augmentation, License Plate Detection | Code Available | 1 |
| Visual Semantics Allow for Textual Reasoning Better in Scene Text Recognition | Dec 24, 2021 | Language Modeling, Language Modelling | Code Available | 1 |
| Text Gestalt: Stroke-Aware Scene Text Image Super-Resolution | Dec 13, 2021 | Image Super-Resolution, Scene Text Recognition | Code Available | 1 |