| How Do Large Vision-Language Models See Text in Image? Unveiling the Distinctive Role of OCR Heads | May 21, 2025 | Optical Character Recognition (OCR) | Unverified | 0 |
| Every Pixel Tells a Story: End-to-End Urdu Newspaper OCR | May 20, 2025 | Articles, Image Super-Resolution | Unverified | 0 |
| Low-Resource Language Processing: An OCR-Driven Summarization and Translation Pipeline | May 16, 2025 | Abstractive Text Summarization, Language Modeling | Code Available | 0 |
| PsOCR: Benchmarking Large Multimodal Models for Optical Character Recognition in Low-resource Pashto Language | May 15, 2025 | Benchmarking, Optical Character Recognition | Code Available | 0 |
| A document processing pipeline for the construction of a dataset for topic modeling based on the judgments of the Italian Supreme Court | May 13, 2025 | Diversity, Document Layout Analysis | Unverified | 0 |
| Reproducibility, Replicability, and Insights into Visual Document Retrieval with Late Interaction | May 12, 2025 | Optical Character Recognition (OCR) | Code Available | 0 |
| Development of a WAZOBIA-Named Entity Recognition System | May 10, 2025 | Machine Translation, named-entity-recognition | Unverified | 0 |
| Toward Advancing License Plate Super-Resolution in Real-World Scenarios: A Dataset and Benchmark | May 9, 2025 | License Plate Recognition, Optical Character Recognition | Code Available | 0 |
| Arrow-Guided VLM: Enhancing Flowchart Understanding via Arrow Direction Encoding | May 9, 2025 | Optical Character Recognition (OCR) | Code Available | 0 |
| ChemRxivQuest: A Curated Chemistry Question-Answer Database Extracted from ChemRxiv Preprints | May 8, 2025 | Optical Character Recognition (OCR) | Unverified | 0 |