Advancements and Challenges in Arabic Optical Character Recognition: A Comprehensive Survey Dec 19, 2023 Articles Optical Character Recognition
— Unverified 0TDeLTA: A Light-weight and Robust Table Detection Method based on Learning Text Arrangement Dec 18, 2023 Optical Character Recognition (OCR) Table Detection
— Unverified 0Information Extraction from Unstructured data using Augmented-AI and Computer Vision Dec 15, 2023 Optical Character Recognition (OCR)
— Unverified 0Polar-Doc: One-Stage Document Dewarping with Multi-Scope Constraints under Polar Representation Dec 13, 2023 Optical Character Recognition (OCR)
— Unverified 0Multimodal Sentiment Analysis: Perceived vs Induced Sentiments Dec 12, 2023 Multimodal Sentiment Analysis Optical Character Recognition (OCR)
— Unverified 0UPOCR: Towards Unified Pixel-Level OCR Interface Dec 5, 2023 Decoder Optical Character Recognition
— Unverified 0Enhancing Vehicle Entrance and Parking Management: Deep Learning Solutions for Efficiency and Security Dec 5, 2023 Face Detection License Plate Recognition
— Unverified 0Pipeline Enabling Zero-shot Classification for Bangla Handwritten Grapheme Dec 1, 2023 Bangla Text Detection Classification
— Unverified 0Vulnerability Analysis of Transformer-based Optical Character Recognition to Adversarial Attacks Nov 28, 2023 Adversarial Attack Optical Character Recognition
— Unverified 0Automatic Recognition of Learning Resource Category in a Digital Library Nov 28, 2023 document-image-classification Document Image Classification
Code Code Available 0Optimization of Image Processing Algorithms for Character Recognition in Cultural Typewritten Documents Nov 27, 2023 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 0SUT: a new multi-purpose synthetic dataset for Farsi document image analysis Nov 27, 2023 Document Classification document-image-classification
Code Code Available 0Similar Document Template Matching Algorithm Nov 21, 2023 Fraud Detection Optical Character Recognition (OCR)
— Unverified 0ChemScraper: Leveraging PDF Graphics Instructions for Molecular Diagram Parsing Nov 20, 2023 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 0DocPedia: Unleashing the Power of Large Multimodal Model in the Frequency Domain for Versatile Document Understanding Nov 20, 2023 document understanding Language Modeling
— Unverified 0Efficient End-to-End Visual Document Understanding with Rationale Distillation Nov 16, 2023 document understanding Image to text
— Unverified 0DECDM: Document Enhancement using Cycle-Consistent Diffusion Models Nov 16, 2023 Data Augmentation Denoising
— Unverified 0Multiple-Question Multiple-Answer Text-VQA Nov 15, 2023 Decoder Denoising
— Unverified 0Reading Between the Mud: A Challenging Motorcycle Racer Number Dataset Nov 14, 2023 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 0What Large Language Models Bring to Text-rich VQA? Nov 13, 2023 Image Comprehension Optical Character Recognition (OCR)
— Unverified 0DONUT-hole: DONUT Sparsification by Harnessing Knowledge and Optimizing Learning Efficiency Nov 9, 2023 document understanding Key Information Extraction
— Unverified 0On Manipulating Scene Text in the Wild with Diffusion Models Nov 1, 2023 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 0DCQA: Document-Level Chart Question Answering towards Complex Reasoning and Common-Sense Understanding Oct 29, 2023 Answer Generation Chart Question Answering
Code Code Available 0PHD: Pixel-Based Language Modeling of Historical Documents Oct 22, 2023 Language Modeling Language Modelling
Code Code Available 0MultiCoNER v2: a Large Multilingual dataset for Fine-grained and Noisy Named Entity Recognition Oct 20, 2023 named-entity-recognition Named Entity Recognition
— Unverified 0DocXChain: A Powerful Open-Source Toolchain for Document Parsing and Beyond Oct 19, 2023 Document AI Document Layout Analysis
— Unverified 0EfficientOCR: An Extensible, Open-Source Package for Efficiently Digitizing World Knowledge Oct 16, 2023 Image Retrieval Language Modeling
— Unverified 0Towards reducing hallucination in extracting information from financial reports using Large Language Models Oct 16, 2023 Hallucination Optical Character Recognition
— Unverified 0Exploring Sparse Spatial Relation in Graph Inference for Text-Based VQA Oct 13, 2023 Graph Learning Object
— Unverified 0Invisible Threats: Backdoor Attack in OCR Systems Oct 12, 2023 Backdoor Attack Optical Character Recognition
— Unverified 0Solution for SMART-101 Challenge of ICCV Multi-modal Algorithmic Reasoning Task 2023 Oct 10, 2023 Decoder object-detection
— Unverified 0Constructing Image-Text Pair Dataset from Books Oct 3, 2023 Image-text Retrieval Optical Character Recognition (OCR)
— Unverified 0Comprehensive Overview of Named Entity Recognition: Models, Domain-Specific Applications and Challenges Sep 25, 2023 named-entity-recognition Named Entity Recognition
— Unverified 0Order-preserving Consistency Regularization for Domain Adaptation and Generalization Sep 23, 2023 Data Augmentation Domain Adaptation
Code Code Available 0STEP -- Towards Structured Scene-Text Spotting Sep 5, 2023 Optical Character Recognition (OCR) Scene Text Detection
Code Code Available 0Bengali Document Layout Analysis -- A YOLOV8 Based Ensembling Approach Sep 2, 2023 Data Augmentation Document Layout Analysis
— Unverified 0Separate and Locate: Rethink the Text in Text-based Visual Question Answering Aug 31, 2023 Optical Character Recognition (OCR) Position
Code Code Available 0Enhancing OCR Performance through Post-OCR Models: Adopting Glyph Embedding for Improved Correction Aug 29, 2023 Optical Character Recognition (OCR)
— Unverified 0Vision Grid Transformer for Document Layout Analysis Aug 29, 2023 Document AI Document Layout Analysis
— Unverified 0Optimal Projections for Discriminative Dictionary Learning using the JL-lemma Aug 27, 2023 Dictionary Learning Dimensionality Reduction
Code Code Available 0Bengali Document Layout Analysis with Detectron2 Aug 26, 2023 Data Augmentation Document Layout Analysis
— Unverified 0DISGO: Automatic End-to-End Evaluation for Scene Text OCR Aug 25, 2023 Machine Translation Optical Character Recognition
— Unverified 0American Stories: A Large-Scale Structured Text Dataset of Historical U.S. Newspapers Aug 24, 2023 Articles Language Modeling
— Unverified 0CNN based Cuneiform Sign Detection Learned from Annotated 3D Renderings and Mapped Photographs with Illumination Augmentation Aug 22, 2023 Optical Character Recognition (OCR)
— Unverified 0OCR Language Models with Custom Vocabularies Aug 18, 2023 Decoder Language Modeling
— Unverified 0FashionLOGO: Prompting Multimodal Large Language Models for Fashion Logo Embeddings Aug 17, 2023 Image Retrieval Logo Recognition
Code Code Available 0Training BERT Models to Carry Over a Coding System Developed on One Corpus to Another Aug 7, 2023 Domain Adaptation Optical Character Recognition (OCR)
— Unverified 0Making the V in Text-VQA Matter Aug 1, 2023 Optical Character Recognition (OCR) TextVQA
— Unverified 0Toward Zero-shot Character Recognition: A Gold Standard Dataset with Radical-level Annotations Aug 1, 2023 Denoising Image Denoising
— Unverified 0Optimizing the Neural Network Training for OCR Error Correction of Historical Hebrew Texts Jul 30, 2023 Optical Character Recognition Optical Character Recognition (OCR)
— Unverified 0