| SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild | Jan 6, 2025 | Attribute, Optical Character Recognition | Unverified | 0 |
| Geometry Restoration and Dewarping of Camera-Captured Document Images | Jan 6, 2025 | Optical Character Recognition, Optical Character Recognition (OCR) | Code Available | 1 |
| Efficient Video-Based ALPR System Using YOLO and Visual Rhythm | Jan 4, 2025 | License Plate Recognition, Optical Character Recognition | Code Available | 0 |
| Embedding Similarity Guided License Plate Super Resolution | Jan 2, 2025 | License Plate Recognition, Optical Character Recognition | Unverified | 0 |
| CLIP is Almost All You Need: Towards Parameter-Efficient Scene Text Retrieval without OCR | Jan 1, 2025 | All, Optical Character Recognition | Unverified | 0 |
| OCRBench v2: An Improved Benchmark for Evaluating Large Multimodal Models on Visual Text Localization and Reasoning | Dec 31, 2024 | Benchmarking, Logical Reasoning | Code Available | 4 |
| Optical Character Recognition using Convolutional Neural Networks for Ashokan Brahmi Inscriptions | Dec 29, 2024 | Data Augmentation, Image Segmentation | Unverified | 0 |
| Do Current Video LLMs Have Strong OCR Abilities? A Preliminary Study | Dec 29, 2024 | Motion Detection, Optical Character Recognition | Code Available | 0 |
| ERPA: Efficient RPA Model Integrating OCR and LLMs for Intelligent Document Processing | Dec 24, 2024 | Optical Character Recognition, Optical Character Recognition (OCR) | Unverified | 0 |
| Leveraging Deep Learning with Multi-Head Attention for Accurate Extraction of Medicine from Handwritten Prescriptions | Dec 24, 2024 | Optical Character Recognition | Unverified | 0 |