| CLOCR-C: Context Leveraging OCR Correction with Pre-trained Language Models | Aug 30, 2024 | Articlesnamed-entity-recognition | CodeCode Available | 0 |
| Can Visual Language Models Replace OCR-Based Visual Question Answering Pipelines in Production? A Case Study in Retail | Aug 28, 2024 | Optical Character RecognitionOptical Character Recognition (OCR) | —Unverified | 0 |
| A Permuted Autoregressive Approach to Word-Level Recognition for Urdu Digital Text | Aug 27, 2024 | Data AugmentationOptical Character Recognition | —Unverified | 0 |
| FastTextSpotter: A High-Efficiency Transformer for Multilingual Scene Text Spotting | Aug 27, 2024 | BenchmarkingDecoder | CodeCode Available | 0 |
| Knowledge Discovery in Optical Music Recognition: Enhancing Information Retrieval with Instance Segmentation | Aug 27, 2024 | Information RetrievalInstance Segmentation | —Unverified | 0 |
| Ancient but Digitized: Developing Handwritten Optical Character Recognition for East Syriac Script Through Creating KHAMIS Dataset | Aug 24, 2024 | Optical Character RecognitionOptical Character Recognition (OCR) | —Unverified | 0 |
| Vintern-1B: An Efficient Multimodal Large Language Model for Vietnamese | Aug 22, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Large Language Models for Page Stream Segmentation | Aug 21, 2024 | DecoderOptical Character Recognition | —Unverified | 0 |
| Revisiting Multi-Modal LLM Evaluation | Aug 9, 2024 | Chart UnderstandingOptical Character Recognition | —Unverified | 0 |
| Handwritten Code Recognition for Pen-and-Paper CS Education | Aug 7, 2024 | HallucinationLanguage Modeling | CodeCode Available | 0 |