| Patchfinder: Leveraging Visual Language Models for Accurate Information Retrieval using Model Uncertainty | Dec 3, 2024 | Information Retrieval, Optical Character Recognition | Unverified | 0 |
| OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation | Dec 3, 2024 | Optical Character Recognition, Optical Character Recognition (OCR) | Code Available | 2 |
| AI-assisted summary of suicide risk Formulation | Nov 29, 2024 | Optical Character Recognition | Unverified | 0 |
| Towards Accessible Learning: Deep Learning-Based Potential Dysgraphia Detection and OCR for Potentially Dysgraphic Handwriting | Nov 18, 2024 | Diagnostic, Optical Character Recognition | Unverified | 0 |
| DriveThru: a Document Extraction Platform and Benchmark Datasets for Indonesian Local Language Archives | Nov 14, 2024 | Optical Character Recognition, Optical Character Recognition (OCR) | Code Available | 0 |
| M3DocRAG: Multi-modal Retrieval is What You Need for Multi-page Multi-document Understanding | Nov 7, 2024 | document understanding, Optical Character Recognition | Unverified | 0 |
| TAP-VL: Text Layout-Aware Pre-training for Enriched Vision-Language Models | Nov 7, 2024 | Optical Character Recognition, Optical Character Recognition (OCR) | Unverified | 0 |
| Handwriting Recognition in Historical Documents with Multimodal LLM | Oct 31, 2024 | Handwriting Recognition, Optical Character Recognition | Unverified | 0 |
| Toxicity of the Commons: Curating Open-Source Pre-Training Data | Oct 29, 2024 | Optical Character Recognition, Optical Character Recognition (OCR) | Code Available | 1 |
| Are VLMs Really Blind | Oct 29, 2024 | Language Modeling, Language Modelling | Code Available | 0 |