| Lost in OCR Translation? Vision-Based Approaches to Robust Document Retrieval | May 8, 2025 | Computational Efficiency, Optical Character Recognition | Unverified | 0 |
| DOTA: Deformable Optimized Transformer Architecture for End-to-End Text Recognition with Retrieval-Augmented Generation | May 7, 2025 | Optical Character Recognition, Optical Character Recognition (OCR) | Unverified | 0 |
| Automated Parsing of Engineering Drawings for Structured Information Extraction Using a Fine-tuned Document Understanding Transformer | May 2, 2025 | document understanding, Hallucination | Unverified | 0 |
| Evaluating Menu OCR and Translation: A Benchmark for Aligning Human and Automated Evaluations in Large Vision-Language Models | Apr 16, 2025 | document understanding, Layout Design | Code Available | 0 |
| Consensus Entropy: Harnessing Multi-VLM Agreement for Self-Verifying and Self-Improving OCR | Apr 15, 2025 | Optical Character Recognition, Optical Character Recognition (OCR) | Unverified | 0 |
| Relation-Rich Visual Document Generator for Visual Information Extraction | Apr 14, 2025 | Diversity, document understanding | Code Available | 0 |
| NoTeS-Bank: Benchmarking Neural Transcription and Search for Scientific Notes Understanding | Apr 12, 2025 | Benchmarking, Document AI | Unverified | 0 |
| Towards Calibration Enhanced Network by Inverse Adversarial Attack | Apr 8, 2025 | Adversarial Attack, Optical Character Recognition | Unverified | 0 |
| Context-Independent OCR with Multimodal LLMs: Effects of Image Resolution and Visual Complexity | Mar 31, 2025 | Image Captioning, Optical Character Recognition | Unverified | 0 |
| TFIC: End-to-End Text-Focused Image Compression for Coding for Machines | Mar 25, 2025 | Image Compression, Optical Character Recognition | Unverified | 0 |