| MaViLS, a Benchmark Dataset for Video-to-Slide Alignment, Assessing Baseline Accuracy with a Multimodal Alignment Algorithm Leveraging Speech, OCR, and Visual Features | Sep 25, 2024 | Optical Character Recognition, Optical Character Recognition (OCR) | Code Available | 0 | 5 |
| License Plate Detection and Recognition in Unconstrained Scenarios | Sep 1, 2018 | License Plate Detection, License Plate Recognition | Code Available | 0 | 5 |
| ChemGrapher: Optical Graph Recognition of Chemical Compounds by Deep Learning | Feb 23, 2020 | Articles, Deep Learning | Code Available | 0 | 5 |
| Advancing Multilingual Handwritten Numeral Recognition with Attention-driven Transfer Learning | Mar 18, 2024 | Handwritten Digit Recognition, Optical Character Recognition | Code Available | 0 | 5 |
| Judge a Book by its Cover: Investigating Multi-Modal LLMs for Multi-Page Handwritten Document Transcription | Feb 27, 2025 | Handwritten Text Recognition, HTR | Code Available | 0 | 5 |
| A model of diffuse Galactic Radio Emission from 10 MHz to 100 GHz | Feb 12, 2008 | Optical Character Recognition, Optical Character Recognition (OCR) | Code Available | 0 | 5 |
| It Takes Two to Tango: Combining Visual and Textual Information for Detecting Duplicate Video-Based Bug Reports | Jan 22, 2021 | Optical Character Recognition, Optical Character Recognition (OCR) | Code Available | 0 | 5 |
| Are VLMs Really Blind | Oct 29, 2024 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| ChemScraper: Leveraging PDF Graphics Instructions for Molecular Diagram Parsing | Nov 20, 2023 | Optical Character Recognition, Optical Character Recognition (OCR) | Code Available | 0 | 5 |
| Arrow-Guided VLM: Enhancing Flowchart Understanding via Arrow Direction Encoding | May 9, 2025 | Optical Character Recognition, Optical Character Recognition (OCR) | Code Available | 0 | 5 |
| LILA-BOTI : Leveraging Isolated Letter Accumulations By Ordering Teacher Insights for Bangla Handwriting Recognition | May 23, 2022 | Handwriting Recognition, Knowledge Distillation | Code Available | 0 | 5 |
| IDPL-PFOD2: A New Large-Scale Dataset for Printed Farsi Optical Character Recognition | Dec 2, 2023 | Optical Character Recognition, Printed Text Recognition | Code Available | 0 | 5 |
| iExam: A Novel Online Exam Monitoring and Analysis System Based on Face Detection and Recognition | Jun 27, 2022 | Face Detection, Face Recognition | Code Available | 0 | 5 |
| Alleviating Digitization Errors in Named Entity Recognition for Historical Documents | Nov 1, 2020 | named-entity-recognition, Named Entity Recognition | Code Available | 0 | 5 |
| Character decomposition to resolve class imbalance problem in Hangul OCR | Aug 12, 2022 | Optical Character Recognition, Optical Character Recognition (OCR) | Code Available | 0 | 5 |
| memorAIs: an Optical Character Recognition and Rule-Based Medication Intake Reminder-Generating Solution | Dec 11, 2023 | Friction, Optical Character Recognition | Code Available | 0 | 5 |
| CLOCR-C: Context Leveraging OCR Correction with Pre-trained Language Models | Aug 30, 2024 | Articles, named-entity-recognition | Code Available | 0 | 5 |
| ASTER: An Attentional Scene Text Recognizer with Flexible Rectification | Jun 25, 2018 | Optical Character Recognition, Optical Character Recognition (OCR) | Code Available | 0 | 5 |
| High-Throughput Phenotyping using Computer Vision and Machine Learning | Jul 8, 2024 | Image Segmentation, Optical Character Recognition | Code Available | 0 | 5 |
| Chandojnanam: A Sanskrit Meter Identification and Utilization System | Sep 29, 2022 | Optical Character Recognition, Optical Character Recognition (OCR) | Code Available | 0 | 5 |
| A Study of Autoregressive Decoders for Multi-Tasking in Computer Vision | Mar 30, 2023 | Decoder, Multi-Task Learning | Code Available | 0 | 5 |
| Handwritten Code Recognition for Pen-and-Paper CS Education | Aug 7, 2024 | Hallucination, Language Modeling | Code Available | 0 | 5 |
| AON: Towards Arbitrarily-Oriented Text Recognition | Nov 12, 2017 | Decoder, Optical Character Recognition | Code Available | 0 | 5 |
| Multi-Page Document Visual Question Answering using Self-Attention Scoring Mechanism | Apr 29, 2024 | document understanding, GPU | Code Available | 0 | 5 |
| LMV-RPA: Large Model Voting-based Robotic Process Automation | Dec 23, 2024 | Optical Character Recognition, Optical Character Recognition (OCR) | Code Available | 0 | 5 |