Transfer Learning Approach for Railway Technical Map (RTM) Component Identification May 21, 2024 Management object-detection
— Unverified 0GeoContrastNet: Contrastive Key-Value Edge Learning for Language-Agnostic Document Understanding May 6, 2024 Contrastive Learning document understanding
Code Code Available 0Callico: a Versatile Open-Source Document Image Annotation Platform May 2, 2024 Document Layout Analysis HTR
— Unverified 0CREPE: Coordinate-Aware End-to-End Document Parser May 1, 2024 document understanding Optical Character Recognition (OCR)
— Unverified 0DELINE8K: A Synthetic Data Pipeline for the Semantic Segmentation of Historical Documents Apr 30, 2024 8k Diversity
Code Code Available 0Multi-Page Document Visual Question Answering using Self-Attention Scoring Mechanism Apr 29, 2024 document understanding GPU
Code Code Available 0How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites Apr 25, 2024 4k Language Modeling
— Unverified 0Mixed Text Recognition with Efficient Parameter Fine-Tuning and Transformer Apr 19, 2024 Decoder Optical Character Recognition
— Unverified 0Improvement in Semantic Address Matching using Natural Language Processing Apr 17, 2024 Optical Character Recognition (OCR)
— Unverified 0ViTextVQA: A Large-Scale Visual Question Answering Dataset for Evaluating Vietnamese Text Comprehension in Images Apr 16, 2024 Multimodal Deep Learning Optical Character Recognition (OCR)
Code Code Available 0TC-OCR: TableCraft OCR for Efficient Detection & Recognition of Table Structure & Content Apr 16, 2024 Information Retrieval Knowledge Graphs
— Unverified 0MathWriting: A Dataset For Handwritten Mathematical Expression Recognition Apr 16, 2024 Form Optical Character Recognition (OCR)
— Unverified 0Resilience of Large Language Models for Noisy Instructions Apr 15, 2024 Automatic Speech Recognition Optical Character Recognition
— Unverified 0Convolution-based Probability Gradient Loss for Semantic Segmentation Apr 10, 2024 Optical Character Recognition (OCR) Semantic Segmentation
Code Code Available 0Making Old Kurdish Publications Processable by Augmenting Available Optical Character Recognition Engines Apr 9, 2024 Optical Character Recognition Optical Character Recognition (OCR)
— Unverified 0HAMMR: HierArchical MultiModal React agents for generic VQA Apr 8, 2024 Optical Character Recognition (OCR) Question Answering
— Unverified 0Design and Development of a Framework For Stroke-Based Handwritten Gujarati Font Generation Apr 4, 2024 Font Generation Optical Character Recognition (OCR)
— Unverified 0Optical Text Recognition in Nepali and Bengali: A Transformer-based Approach Apr 3, 2024 Decoder Machine Translation
— Unverified 0RealKIE: Five Novel Datasets for Enterprise Key Information Extraction Mar 29, 2024 Key Information Extraction Optical Character Recognition (OCR)
— Unverified 0The Solution for the ICCV 2023 1st Scientific Figure Captioning Challenge Mar 26, 2024 Caption Generation Image Captioning
— Unverified 0SciCapenter: Supporting Caption Composition for Scientific Figures with Machine-Generated Captions and Ratings Mar 26, 2024 Optical Character Recognition (OCR)
— Unverified 0Grammatical vs Spelling Error Correction: An Investigation into the Responsiveness of Transformer-based Language Models using BART and MarianMT Mar 25, 2024 Optical Character Recognition (OCR) speech-recognition
— Unverified 0Refining Text-to-Image Generation: Towards Accurate Training-Free Glyph-Enhanced Image Generation Mar 25, 2024 Image Generation Optical Character Recognition (OCR)
— Unverified 0Chart-based Reasoning: Transferring Capabilities from LLMs to VLMs Mar 19, 2024 Chart Question Answering Optical Character Recognition (OCR)
— Unverified 0mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding Mar 19, 2024 document understanding Optical Character Recognition (OCR)
— Unverified 0Financial Table Extraction in Image Documents Mar 18, 2024 Image Segmentation Optical Character Recognition (OCR)
— Unverified 0OCR is All you need: Importing Multi-Modality into Image-based Defect Detection System Mar 18, 2024 All Decision Making
— Unverified 0Advancing Multilingual Handwritten Numeral Recognition with Attention-driven Transfer Learning Mar 18, 2024 Handwritten Digit Recognition Optical Character Recognition
Code Code Available 0Advanced Knowledge Extraction of Physical Design Drawings, Translation and conversion to CAD formats using Deep Learning Mar 17, 2024 Edge Detection Line Detection
— Unverified 0TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with Pre-trained Language Model Mar 15, 2024 Language Modeling Language Modelling
— Unverified 0Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation Mar 14, 2024 Image to text Optical Character Recognition (OCR)
— Unverified 0Adversarial Training with OCR Modality Perturbation for Scene-Text Visual Question Answering Mar 14, 2024 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 0Rich Semantic Knowledge Enhanced Large Language Models for Few-shot Chinese Spell Checking Mar 13, 2024 Chinese Spell Checking In-Context Learning
— Unverified 0Open-Vocabulary Scene Text Recognition via Pseudo-Image Labeling and Margin Loss Mar 12, 2024 Image Inpainting Optical Character Recognition (OCR)
— Unverified 0The future of document indexing: GPT and Donut revolutionize table of content processing Mar 12, 2024 Language Modeling Language Modelling
— Unverified 0Multimodal Transformer for Comics Text-Cloze Mar 6, 2024 Language Modeling Language Modelling
— Unverified 0LOCR: Location-Guided Transformer for Optical Character Recognition Mar 4, 2024 Marketing Optical Character Recognition
— Unverified 0Large Language Models for Simultaneous Named Entity Extraction and Spelling Correction Mar 1, 2024 Decoder Optical Character Recognition
— Unverified 0Advancing Generative Model Evaluation: A Novel Algorithm for Realistic Image Synthesis and Comparison in OCR System Feb 27, 2024 Image Generation Optical Character Recognition (OCR)
— Unverified 0Representing Online Handwriting for Recognition in Large Vision-Language Models Feb 23, 2024 Handwriting Recognition Optical Character Recognition
— Unverified 0Syntactic Language Change in English and German: Metrics, Parsers, and Convergences Feb 18, 2024 Optical Character Recognition (OCR) Sentence
Code Code Available 0Beyond the Mud: Datasets and Benchmarks for Computer Vision in Off-Road Racing Feb 12, 2024 Optical Character Recognition Optical Character Recognition (OCR)
— Unverified 0Segmentation-free Connectionist Temporal Classification loss based OCR Model for Text Captcha Classification Feb 8, 2024 CAPTCHA Detection Classification
— Unverified 0Enhancement of Bengali OCR by Specialized Models and Advanced Techniques for Diverse Document Types Feb 7, 2024 Optical Character Recognition (OCR) Table Recognition
— Unverified 0ExTTNet: A Deep Learning Algorithm for Extracting Table Texts from Invoice Images Feb 3, 2024 Optical Character Recognition Optical Character Recognition (OCR)
— Unverified 0From Training-Free to Adaptive: Empirical Insights into MLLMs' Understanding of Detection Information Jan 31, 2024 Hallucination object-detection
— Unverified 0Improving OCR Quality in 19th Century Historical Documents Using a Combined Machine Learning Based Approach Jan 15, 2024 Optical Character Recognition (OCR)
— Unverified 0Efficient Multi-domain Text Recognition Deep Neural Network Parameterization with Residual Adapters Jan 1, 2024 Multi-Task Learning Optical Character Recognition
Code Code Available 0Bidirectional Trained Tree-Structured Decoder for Handwritten Mathematical Expression Recognition Dec 31, 2023 Decoder Language Modeling
— Unverified 0Chaurah: A Smart Raspberry Pi based Parking System Dec 28, 2023 Optical Character Recognition Optical Character Recognition (OCR)
— Unverified 0