| Efficient Video-Based ALPR System Using YOLO and Visual Rhythm | Jan 4, 2025 | License Plate RecognitionOptical Character Recognition | CodeCode Available | 0 |
| Multi-modal Page Stream Segmentation with Convolutional Neural Networks | Sep 27, 2019 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 |
| ChemScraper: Leveraging PDF Graphics Instructions for Molecular Diagram Parsing | Nov 20, 2023 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 |
| Advancing Multilingual Handwritten Numeral Recognition with Attention-driven Transfer Learning | Mar 18, 2024 | Handwritten Digit RecognitionOptical Character Recognition | CodeCode Available | 0 |
| MultiOCR-QA: Dataset for Evaluating Robustness of LLMs in Question Answering on Multilingual OCR Texts | Feb 24, 2025 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 |
| Multi-Page Document Visual Question Answering using Self-Attention Scoring Mechanism | Apr 29, 2024 | document understandingGPU | CodeCode Available | 0 |
| MultiQG-TI: Towards Question Generation from Multi-modal Sources | Jul 7, 2023 | Image to textOptical Character Recognition | CodeCode Available | 0 |
| ASTER: An Attentional Scene Text Recognizer with Flexible Rectification | Jun 25, 2018 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 |
| Efficient Multi-domain Text Recognition Deep Neural Network Parameterization with Residual Adapters | Jan 1, 2024 | Multi-Task LearningOptical Character Recognition | CodeCode Available | 0 |
| Efficient License Plate Recognition in Videos Using Visual Rhythm and Accumulative Line Analysis | Jan 8, 2025 | License Plate DetectionLicense Plate Recognition | CodeCode Available | 0 |
| ChemGrapher: Optical Graph Recognition of Chemical Compounds by Deep Learning | Feb 23, 2020 | ArticlesDeep Learning | CodeCode Available | 0 |
| NASS-AI: Towards Digitization of Parliamentary Bills using Document Level Embedding and Bidirectional Long Short-Term Memory | Oct 2, 2019 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 |
| E2TIMT: Efficient and Effective Modal Adapter for Text Image Machine Translation | May 9, 2023 | DecoderMachine Translation | CodeCode Available | 0 |
| Noisy Parallel Data Alignment | Jan 23, 2023 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 |
| Post-OCR parsing: building simple and robust parser via BIO tagging | Sep 14, 2019 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 |
| Post-OCR Text Correction for Bulgarian Historical Documents | Aug 31, 2024 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 |
| DuoSearch: A Novel Search Engine for Bulgarian Historical Documents | May 30, 2023 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 |
| Character decomposition to resolve class imbalance problem in Hangul OCR | Aug 12, 2022 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 |
| Object detection deep learning networks for Optical Character Recognition | May 1, 2019 | Deep LearningDocument Classification | CodeCode Available | 0 |
| Chandojnanam: A Sanskrit Meter Identification and Utilization System | Sep 29, 2022 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 |
| Binary Document Image Super Resolution for Improved Readability and OCR Performance | Dec 6, 2018 | Image Super-ResolutionInformation Retrieval | CodeCode Available | 0 |
| BaDLAD: A Large Multi-Domain Bengali Document Layout Analysis Dataset | Mar 9, 2023 | BenchmarkingDeep Learning | CodeCode Available | 0 |
| DriveThru: a Document Extraction Platform and Benchmark Datasets for Indonesian Local Language Archives | Nov 14, 2024 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 |
| An Evaluation of DNN Architectures for Page Segmentation of Historical Newspapers | Apr 15, 2020 | Optical Character RecognitionOptical Character Recognition (OCR) | CodeCode Available | 0 |
| PsOCR: Benchmarking Large Multimodal Models for Optical Character Recognition in Low-resource Pashto Language | May 15, 2025 | BenchmarkingOptical Character Recognition | CodeCode Available | 0 |
| Do Current Video LLMs Have Strong OCR Abilities? A Preliminary Study | Dec 29, 2024 | Motion DetectionOptical Character Recognition | CodeCode Available | 0 |