| A document processing pipeline for the construction of a dataset for topic modeling based on the judgments of the Italian Supreme Court | May 13, 2025 | DiversityDocument Layout Analysis | —Unverified | 0 |
| Reproducibility, Replicability, and Insights into Visual Document Retrieval with Late Interaction | May 12, 2025 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 |
| Development of a WAZOBIA-Named Entity Recognition System | May 10, 2025 | Machine Translationnamed-entity-recognition | —Unverified | 0 |
| Arrow-Guided VLM: Enhancing Flowchart Understanding via Arrow Direction Encoding | May 9, 2025 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 |
| Toward Advancing License Plate Super-Resolution in Real-World Scenarios: A Dataset and Benchmark | May 9, 2025 | License Plate RecognitionOptical Character Recognition | CodeCode Available | 0 |
| Lost in OCR Translation? Vision-Based Approaches to Robust Document Retrieval | May 8, 2025 | Computational EfficiencyOptical Character Recognition | —Unverified | 0 |
| ChemRxivQuest: A Curated Chemistry Question-Answer Database Extracted from ChemRxiv Preprints | May 8, 2025 | Optical Character RecognitionOptical Character Recognition (OCR) | —Unverified | 0 |
| DOTA: Deformable Optimized Transformer Architecture for End-to-End Text Recognition with Retrieval-Augmented Generation | May 7, 2025 | Optical Character RecognitionOptical Character Recognition (OCR) | —Unverified | 0 |
| Automated Parsing of Engineering Drawings for Structured Information Extraction Using a Fine-tuned Document Understanding Transformer | May 2, 2025 | document understandingHallucination | —Unverified | 0 |
| Evaluating Menu OCR and Translation: A Benchmark for Aligning Human and Automated Evaluations in Large Vision-Language Models | Apr 16, 2025 | document understandingLayout Design | CodeCode Available | 0 |